Markus Schmitz, Florian Menz, Ruben Grunau, Nils Mandischer, Mathias Hüsing, Burkhard Corves
{"title":"Robot Cooking—Transferring Observations into a Planning Language: An Automated Approach in the Field of Cooking","authors":"Markus Schmitz, Florian Menz, Ruben Grunau, Nils Mandischer, Mathias Hüsing, Burkhard Corves","doi":"10.3390/eng4040143","DOIUrl":null,"url":null,"abstract":"The recognition of human activities from video sequences and their transformation into a machine-readable form is a challenging task, which is the subject of many studies. The goal of this project is to develop an automated method for analyzing, identifying and processing motion capture data into a planning language. This is performed in a cooking scenario by recording the pose of the acting hand. First, predefined side actions are detected in the dataset using classification. The remaining frames are then clustered into main actions. Using this information, the known initial positions and virtual object tracking, a machine-readable planning domain definition language (PDDL) is generated.","PeriodicalId":10630,"journal":{"name":"Comput. Chem. Eng.","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Comput. Chem. Eng.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/eng4040143","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The recognition of human activities from video sequences and their transformation into a machine-readable form is a challenging task, which is the subject of many studies. The goal of this project is to develop an automated method for analyzing, identifying and processing motion capture data into a planning language. This is performed in a cooking scenario by recording the pose of the acting hand. First, predefined side actions are detected in the dataset using classification. The remaining frames are then clustered into main actions. Using this information, the known initial positions and virtual object tracking, a machine-readable planning domain definition language (PDDL) is generated.