Pub Date : 2024-06-10DOI: 10.1007/s10015-024-00955-6
Philipp Eisele, Sajid Nisar, Franz Haas
This research introduces the novel “Kraken-Gear” mechanism, emphasizing the advantages of additive polymer 3D printing in high-ratio gearbox systems for lightweight robotic applications, such as surgical instruments. The innovative kinematic solution provides high torsional system stiffness, substantial gear ratios, and backlash-free transmission. Leveraging the “hot lithography” additive manufacturing method ensures precise and warp-free gearbox components. Targeting medical technology, the gearbox meets stringent requirements: backlash-free, minimal vibration, high precision, and torque, with minimized weight for ergonomic comfort and fatigue mitigation. Computational simulations assess forces and stresses, highlighting the potential of additive manufacturing for cost-effective and functionally efficient gearbox fabrication. Nevertheless, careful material selection remains imperative for optimal functionality, especially in demanding medical applications. In summary, this research underscores a promising approach to gearbox fabrication, emphasizing the critical role of material selection and simulation-based assessments for optimal performance.
{"title":"A conceptual examination of an additive manufactured high-ratio coaxial gearbox","authors":"Philipp Eisele, Sajid Nisar, Franz Haas","doi":"10.1007/s10015-024-00955-6","DOIUrl":"10.1007/s10015-024-00955-6","url":null,"abstract":"<div><p>This research introduces the novel “Kraken-Gear” mechanism, emphasizing the advantages of additive polymer 3D printing in high-ratio gearbox systems for lightweight robotic applications, such as surgical instruments. The innovative kinematic solution provides high torsional system stiffness, substantial gear ratios, and backlash-free transmission. Leveraging the “hot lithography” additive manufacturing method ensures precise and warp-free gearbox components. Targeting medical technology, the gearbox meets stringent requirements: backlash-free, minimal vibration, high precision, and torque, with minimized weight for ergonomic comfort and fatigue mitigation. Computational simulations assess forces and stresses, highlighting the potential of additive manufacturing for cost-effective and functionally efficient gearbox fabrication. Nevertheless, careful material selection remains imperative for optimal functionality, especially in demanding medical applications. In summary, this research underscores a promising approach to gearbox fabrication, emphasizing the critical role of material selection and simulation-based assessments for optimal performance.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 3","pages":"416 - 422"},"PeriodicalIF":0.8,"publicationDate":"2024-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10015-024-00955-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141361326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-07DOI: 10.1007/s10015-024-00953-8
Masahito Takano, Kosuke Oiwa, Akio Nozawa
Thermal skin images are used to evaluate the physiological and psychological states of patients. To implement remote daily health monitoring, we attempted to assess subjective health conditions using thermal face images. In our previous study, we constructed an anomaly detection model to detect poor health conditions; the area under the receiver operating characteristic curve of the anomaly-detection model was 0.70. However, how the spatial distribution of facial skin temperature changes in response to subjective health conditions remains unclear. In this study, we statistically analyzed the acquired thermal face images to investigate the effect of subjective health conditions on facial skin temperature distribution. As a result of the comparison between health conditions, we confirmed that typically the average temperatures and left-right asymmetry in some regions of the face were significantly higher in poor health conditions than in good health conditions.
{"title":"Effect of subjective health conditions on facial skin temperature distribution: a 1-year statistical analysis among four participants","authors":"Masahito Takano, Kosuke Oiwa, Akio Nozawa","doi":"10.1007/s10015-024-00953-8","DOIUrl":"10.1007/s10015-024-00953-8","url":null,"abstract":"<div><p>Thermal skin images are used to evaluate the physiological and psychological states of patients. To implement remote daily health monitoring, we attempted to assess subjective health conditions using thermal face images. In our previous study, we constructed an anomaly detection model to detect poor health conditions; the area under the receiver operating characteristic curve of the anomaly-detection model was 0.70. However, how the spatial distribution of facial skin temperature changes in response to subjective health conditions remains unclear. In this study, we statistically analyzed the acquired thermal face images to investigate the effect of subjective health conditions on facial skin temperature distribution. As a result of the comparison between health conditions, we confirmed that typically the average temperatures and left-right asymmetry in some regions of the face were significantly higher in poor health conditions than in good health conditions.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 3","pages":"381 - 388"},"PeriodicalIF":0.8,"publicationDate":"2024-06-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141373545","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-05DOI: 10.1007/s10015-024-00954-7
Satoshi Hirata, Yutaka Sakai
Reinforcement learning is a mathematical framework for learning better choices through trial-and-error. Recent studies revealed that reinforcement learning is applicable to animal behavior and cognition. However, applying reinforcement learning to animal behavior sometimes encounters difficulties because the information sources utilized by animals to make choices are often unknown, whereas this is identified as the “state” in the reinforcement learning framework. We sought to identify possible state settings including non-standard formulations suitable for explaining data from past chimpanzee studies. Although chimpanzees’ performance in a serial learning task was inconsistent with standard reinforcement learning formulations, we found that the combination of state-independent choice making and state-dependent evaluation produced consistent results. Exploration of state settings in reinforcement learning may shed new light on animal learning processes.
{"title":"Inferring source of learning by chimpanzees in cognitive tasks using reinforcement learning theory","authors":"Satoshi Hirata, Yutaka Sakai","doi":"10.1007/s10015-024-00954-7","DOIUrl":"10.1007/s10015-024-00954-7","url":null,"abstract":"<div><p>Reinforcement learning is a mathematical framework for learning better choices through trial-and-error. Recent studies revealed that reinforcement learning is applicable to animal behavior and cognition. However, applying reinforcement learning to animal behavior sometimes encounters difficulties because the information sources utilized by animals to make choices are often unknown, whereas this is identified as the “state” in the reinforcement learning framework. We sought to identify possible state settings including non-standard formulations suitable for explaining data from past chimpanzee studies. Although chimpanzees’ performance in a serial learning task was inconsistent with standard reinforcement learning formulations, we found that the combination of state-independent choice making and state-dependent evaluation produced consistent results. Exploration of state settings in reinforcement learning may shed new light on animal learning processes.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 3","pages":"398 - 403"},"PeriodicalIF":0.8,"publicationDate":"2024-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141382692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-31DOI: 10.1007/s10015-024-00952-9
Liang Guo
With the growing population of English learners, how to improve the efficiency of English learning has become a focus of research. This article focuses on automatic error-checking in English short text translation. The Transformer model was enhanced by combining with the bidirectional gated recurrent unit (BiGRU) algorithm to create a dual-encoder model that better captures information within input sequences. Experiments were then conducted on different corpora. The improved Transformer model obtained a ({text{F}}_{0.5}) of 59.09 on CoNLL-2014 and 61.05 Google-bilingual evaluation understudy (GLEU) on JFLEG, both of which were better than the other methods compared. The case analysis showed that the improved Transformer model accurately found errors in short text translation. The findings indicate that the proposed approach is reliable in the automatic error-checking of English short text translation and can be applied in practice.
{"title":"Research on automatic error-checking in English short text translation by a neural network algorithm","authors":"Liang Guo","doi":"10.1007/s10015-024-00952-9","DOIUrl":"10.1007/s10015-024-00952-9","url":null,"abstract":"<div><p>With the growing population of English learners, how to improve the efficiency of English learning has become a focus of research. This article focuses on automatic error-checking in English short text translation. The Transformer model was enhanced by combining with the bidirectional gated recurrent unit (BiGRU) algorithm to create a dual-encoder model that better captures information within input sequences. Experiments were then conducted on different corpora. The improved Transformer model obtained a <span>({text{F}}_{0.5})</span> of 59.09 on CoNLL-2014 and 61.05 Google-bilingual evaluation understudy (GLEU) on JFLEG, both of which were better than the other methods compared. The case analysis showed that the improved Transformer model accurately found errors in short text translation. The findings indicate that the proposed approach is reliable in the automatic error-checking of English short text translation and can be applied in practice.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 3","pages":"423 - 429"},"PeriodicalIF":0.8,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142415366","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-27DOI: 10.1007/s10015-024-00951-w
Xiaoyan Lei
Real-time English speech translation is useful in numerous situations, including business and travel. The goal of this research is to improve real-time English speech translation efficacy. Initially, filter bank (FBank) features were extracted from English speech. Subsequently, an enhanced Transformer model was introduced, incorporating a causal convolution module in the front end of the encoder to capture English speech features with location information. The performance of the optimized model in translating English speech to different target languages was tested using the MuST-C dataset. The results revealed differences in translation results for different target languages using the improved Transformer. The highest bilingual evaluation understudy (BLEU) score was observed for Spanish text at 20.84, while Russian text obtained the lowest score of 10.56. The average BLEU score was 18.51, with an average lag time delay of 1202.33 ms. Compared to the conventional Transformer model, the improved model exhibited higher BLEU scores, lower time delay, and optimal performance when utilizing a convolutional kernel size of 3 × 3. The results demonstrate the dependability of the improved Transformer model in real-time English speech translation, highlighting its practical usefulness.
{"title":"Real-time translation of English speech through speech feature extraction","authors":"Xiaoyan Lei","doi":"10.1007/s10015-024-00951-w","DOIUrl":"10.1007/s10015-024-00951-w","url":null,"abstract":"<div><p>Real-time English speech translation is useful in numerous situations, including business and travel. The goal of this research is to improve real-time English speech translation efficacy. Initially, filter bank (FBank) features were extracted from English speech. Subsequently, an enhanced Transformer model was introduced, incorporating a causal convolution module in the front end of the encoder to capture English speech features with location information. The performance of the optimized model in translating English speech to different target languages was tested using the MuST-C dataset. The results revealed differences in translation results for different target languages using the improved Transformer. The highest bilingual evaluation understudy (BLEU) score was observed for Spanish text at 20.84, while Russian text obtained the lowest score of 10.56. The average BLEU score was 18.51, with an average lag time delay of 1202.33 ms. Compared to the conventional Transformer model, the improved model exhibited higher BLEU scores, lower time delay, and optimal performance when utilizing a convolutional kernel size of 3 × 3. The results demonstrate the dependability of the improved Transformer model in real-time English speech translation, highlighting its practical usefulness.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 3","pages":"410 - 415"},"PeriodicalIF":0.8,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142414281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-24DOI: 10.1007/s10015-024-00950-x
Natee Chirachongcharoen, Sajid Nisar
Gait-type recognition is important for robotic exoskeletons and walking-assistance devices to adjust their output according to the users’ needs. However, the growing trend of using machine learning (ML) models is both labor- and data-intensive, which makes it practically less attractive for application in exoskeletons and wearable-assistive devices. This research aims to devise a fuzzy-based gait recognition algorithm that requires minimum training data (only 40 cycles for each of the 5 gait types) and adapts to new users without having the need of pre-training for each of them. The proposed algorithm uses the fuzzy logic system (FLS) and Welford’s (variance computation) method to enhance the adaptability by adjusting the rules for gait-type recognition and fine-tuning them in real time for every new user without requiring a specific prior training. Simulation-based evaluation of the proposed algorithm shows a gait-type recognition accuracy of 63.0%, an improvement of 36.8% over the non-adaptive fuzzy-based recognition algorithm. Moreover, the results show that the proposed algorithm outperforms the popular ML methods (support vector machine, Naive Bayes classifier, and logistic regression) when subjected to limited gait-cycles data and no prior training is provided.
步态类型识别对于机器人外骨骼和步行辅助设备根据用户需求调整输出非常重要。然而,使用机器学习(ML)模型是一种日益增长的趋势,既耗费大量人力又耗费大量数据,这使得它在外骨骼和可穿戴辅助设备中的应用实际上并不那么有吸引力。本研究旨在设计一种基于模糊的步态识别算法,该算法只需最少的训练数据(5 种步态类型中每种只需 40 个循环),并能适应新用户,无需对每个用户进行预训练。所提出的算法使用模糊逻辑系统(FLS)和维尔福(方差计算)方法,通过调整步态类型识别规则来增强适应性,并针对每个新用户进行实时微调,而无需特定的预先训练。对拟议算法的仿真评估显示,步态类型识别准确率为 63.0%,比非自适应模糊识别算法提高了 36.8%。此外,结果表明,在有限的步态周期数据和不提供事先训练的情况下,所提出的算法优于流行的 ML 方法(支持向量机、Naive Bayes 分类器和逻辑回归)。
{"title":"Human gait-type recognition without pre-training: an adaptive fuzzy-based approach for locomotion-assistance devices","authors":"Natee Chirachongcharoen, Sajid Nisar","doi":"10.1007/s10015-024-00950-x","DOIUrl":"10.1007/s10015-024-00950-x","url":null,"abstract":"<div><p>Gait-type recognition is important for robotic exoskeletons and walking-assistance devices to adjust their output according to the users’ needs. However, the growing trend of using machine learning (ML) models is both labor- and data-intensive, which makes it practically less attractive for application in exoskeletons and wearable-assistive devices. This research aims to devise a fuzzy-based gait recognition algorithm that requires minimum training data (only 40 cycles for each of the 5 gait types) and adapts to new users without having the need of pre-training for each of them. The proposed algorithm uses the fuzzy logic system (FLS) and Welford’s (variance computation) method to enhance the adaptability by adjusting the rules for gait-type recognition and fine-tuning them in real time for every new user without requiring a specific prior training. Simulation-based evaluation of the proposed algorithm shows a gait-type recognition accuracy of 63.0%, an improvement of 36.8% over the non-adaptive fuzzy-based recognition algorithm. Moreover, the results show that the proposed algorithm outperforms the popular ML methods (support vector machine, Naive Bayes classifier, and logistic regression) when subjected to limited gait-cycles data and no prior training is provided.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 3","pages":"389 - 397"},"PeriodicalIF":0.8,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141102062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In the near future, autonomous vehicles will be able to share information with other vehicles via communication, enabling appropriate traffic control approaches. A new approach to traffic control using fully autonomous driving involves the realization of a flat interchange. Unlike conventional approaches, this study focused on roads and interchanges that do not assume lanes. Specifically, we propose an interchange flow control approach to traffic control using a virtual wall (VW), which acquires and shares the initial position, destination, and speed of all vehicles entering an interchange in a two-dimensional space where vehicles can move freely, and then realizes appropriate control based on this information. Each vehicle individually calculates the shortest path to avoid the VW, thereby realizing a safe and rational path selection. In this study, a genetic algorithm was used to determine the location of the VW. The effectiveness of the proposed method was evaluated using simulations, and the results showed that compared to manual deployment in the roundabout form, the proposed method using VWs reduced the total path length and the number of collisions to zero. In addition, when comparing the case where VWs were deployed in common for all vehicles and the case where VWs were deployed individually for each vehicle, it was shown that the total path length was shorter when individual VWs were deployed.
{"title":"Interchange flow control with dynamic obstacles optimized using genetic algorithms—a concept of virtual walls","authors":"Junya Hoshino, Yuki Itoh, Ryuma Saotome, Tomohiro Harada, Kenji Matsuda, Tenta Suzuki, Mao Tobisawa, Kaito Kumagae, Johei Matsuoka, Toshinori Kagawa, Kiyohiko Hattori","doi":"10.1007/s10015-024-00946-7","DOIUrl":"10.1007/s10015-024-00946-7","url":null,"abstract":"<div><p>In the near future, autonomous vehicles will be able to share information with other vehicles via communication, enabling appropriate traffic control approaches. A new approach to traffic control using fully autonomous driving involves the realization of a flat interchange. Unlike conventional approaches, this study focused on roads and interchanges that do not assume lanes. Specifically, we propose an interchange flow control approach to traffic control using a virtual wall (VW), which acquires and shares the initial position, destination, and speed of all vehicles entering an interchange in a two-dimensional space where vehicles can move freely, and then realizes appropriate control based on this information. Each vehicle individually calculates the shortest path to avoid the VW, thereby realizing a safe and rational path selection. In this study, a genetic algorithm was used to determine the location of the VW. The effectiveness of the proposed method was evaluated using simulations, and the results showed that compared to manual deployment in the roundabout form, the proposed method using VWs reduced the total path length and the number of collisions to zero. In addition, when comparing the case where VWs were deployed in common for all vehicles and the case where VWs were deployed individually for each vehicle, it was shown that the total path length was shorter when individual VWs were deployed.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 2","pages":"230 - 241"},"PeriodicalIF":0.8,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140663845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-05DOI: 10.1007/s10015-024-00944-9
Kazuteru Miyazaki, Masaaki Ida
Character-level convolutional neural networks (CLCNNs) are commonly used to classify textual data. CLCNN is used as a more versatile tool. For natural language recognition, after decomposing a sentence into character units, each unit is converted into a corresponding character code (e.g., Unicode values) and the code is input into the CLCNN network. Thus, sentences can be treated like images. We have previously applied a CLCNN to verify whether a university’s diploma and/or curriculum policies are well written. In this study, we experimentally confirm the effectiveness of CLCNN using tweet data. In particular, we focus on the effect of the number of units on performance using the following two types of data; one is a real and public tweet dataset on the reputation of a cell phone, and the other is the NTCIR-13 MedWeb task, which consists of pseudo-tweet data and is a well-known collection of tests for multi-label problems. Results of experiments conducted by varying the number of units in the all-coupled layer confirmed the agreement of the results with the theorem introduced in the Amari’s book (Amari in Mathematical Science New Development of Information Geometry, For Senior & Graduate Courses. SAIENSU-SHA Co., 2014). Furthermore, in the NTCIR-13 MedWeb task, we analyze two kinds of experiments, the effects of kernel size and weight perturbation. The results of the difference in the kernel size suggest the existence of an optimal kernel size for sentence comprehension. The results of perturbations to the convolutional layer and pooling layer indicate the possibility of relationship between the numbers of degrees of freedom and network parameters.
字符级卷积神经网络(CLCNN)通常用于对文本数据进行分类。CLCNN 是一种用途更为广泛的工具。在自然语言识别中,将句子分解为字符单元后,将每个单元转换为相应的字符代码(如 Unicode 值),然后将代码输入 CLCNN 网络。因此,句子可以像图像一样处理。此前,我们曾将 CLCNN 用于验证一所大学的文凭和/或课程政策是否编写得当。在本研究中,我们使用推文数据对 CLCNN 的有效性进行了实验验证。具体而言,我们使用以下两种数据重点研究了单元数对性能的影响:一种是关于手机声誉的真实公开推文数据集,另一种是 NTCIR-13 MedWeb 任务,该任务由伪推文数据组成,是众所周知的多标签问题测试集合。通过改变全耦合层中的单元数量进行的实验结果证实,实验结果与阿马里在其著作《阿马里在数学科学中的新发展:信息几何》(Amari in Mathematical Science New Development of Information Geometry, For Senior & Graduate Courses.SAIENSU-SHA Co., 2014)。此外,在 NTCIR-13 MedWeb 任务中,我们分析了两种实验,即核大小和权重扰动的影响。内核大小差异的结果表明,句子理解存在最佳内核大小。对卷积层和池化层的扰动结果表明,自由度数与网络参数之间可能存在关系。
{"title":"Performance evaluation of character-level CNNs using tweet data and analysis for weight perturbations","authors":"Kazuteru Miyazaki, Masaaki Ida","doi":"10.1007/s10015-024-00944-9","DOIUrl":"10.1007/s10015-024-00944-9","url":null,"abstract":"<div><p>Character-level convolutional neural networks (CLCNNs) are commonly used to classify textual data. CLCNN is used as a more versatile tool. For natural language recognition, after decomposing a sentence into character units, each unit is converted into a corresponding character code (e.g., Unicode values) and the code is input into the CLCNN network. Thus, sentences can be treated like images. We have previously applied a CLCNN to verify whether a university’s diploma and/or curriculum policies are well written. In this study, we experimentally confirm the effectiveness of CLCNN using tweet data. In particular, we focus on the effect of the number of units on performance using the following two types of data; one is a real and public tweet dataset on the reputation of a cell phone, and the other is the NTCIR-13 MedWeb task, which consists of pseudo-tweet data and is a well-known collection of tests for multi-label problems. Results of experiments conducted by varying the number of units in the all-coupled layer confirmed the agreement of the results with the theorem introduced in the Amari’s book (Amari in Mathematical Science New Development of Information Geometry, For Senior & Graduate Courses. SAIENSU-SHA Co., 2014). Furthermore, in the NTCIR-13 MedWeb task, we analyze two kinds of experiments, the effects of kernel size and weight perturbation. The results of the difference in the kernel size suggest the existence of an optimal kernel size for sentence comprehension. The results of perturbations to the convolutional layer and pooling layer indicate the possibility of relationship between the numbers of degrees of freedom and network parameters.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 2","pages":"266 - 273"},"PeriodicalIF":0.8,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140738951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-03DOI: 10.1007/s10015-024-00945-8
Isamu Bungo, Tomohiro Hayakawa, Toshiyuki Yasuda
In recent years, many studies have used Convolutional Neural Networks (CNN) as an approach to automate bin picking tasks by robots. In these previous studies, all types of objects in a bin were used as training data to optimize CNN parameters. Therefore, as the number of types of objects in a bin increases under the condition that the total number of training data is retained, CNN is not sufficiently optimized. In this study, we propose a learning method of CNN to achieve a bin picking task for multi-types of objects. Unlike previous learning method using multi-types of objects, we use a single type of object with a well-designed shape to obtain training data. It is true that when an object is used for training, then the learning for the corresponding object proceeds very well. However, the training does not contribute so much to the leaning of other types of objects. we expect the CNN to learn many types of grasping methods simultaneously by the training data of the single well-designed object. In that case, the total number of training data for each type of objects in a bin can be retained even if the number of types of objects increases. To verify the idea, we construct 12 different CNN models which are trained by different types of objects. Through simulations and robot experiments, bin picking tasks to pick multi-types of objects were performed using those CNN models. As a result, the training method which uses a complex-shaped object achieved higher grasping success rate than the training method which uses a primitive-shaped object. Moreover, the training method which uses a complex-shaped object achieved higher grasping success rate than the previous training method which uses all types of objects in a bin.
{"title":"Investigation of a single object shape for efficient learning in bin picking of multiple types of objects","authors":"Isamu Bungo, Tomohiro Hayakawa, Toshiyuki Yasuda","doi":"10.1007/s10015-024-00945-8","DOIUrl":"10.1007/s10015-024-00945-8","url":null,"abstract":"<div><p>In recent years, many studies have used Convolutional Neural Networks (CNN) as an approach to automate bin picking tasks by robots. In these previous studies, all types of objects in a bin were used as training data to optimize CNN parameters. Therefore, as the number of types of objects in a bin increases under the condition that the total number of training data is retained, CNN is not sufficiently optimized. In this study, we propose a learning method of CNN to achieve a bin picking task for multi-types of objects. Unlike previous learning method using multi-types of objects, we use a single type of object with a well-designed shape to obtain training data. It is true that when an object is used for training, then the learning for the corresponding object proceeds very well. However, the training does not contribute so much to the leaning of other types of objects. we expect the CNN to learn many types of grasping methods simultaneously by the training data of the single well-designed object. In that case, the total number of training data for each type of objects in a bin can be retained even if the number of types of objects increases. To verify the idea, we construct 12 different CNN models which are trained by different types of objects. Through simulations and robot experiments, bin picking tasks to pick multi-types of objects were performed using those CNN models. As a result, the training method which uses a complex-shaped object achieved higher grasping success rate than the training method which uses a primitive-shaped object. Moreover, the training method which uses a complex-shaped object achieved higher grasping success rate than the previous training method which uses all types of objects in a bin.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 2","pages":"372 - 379"},"PeriodicalIF":0.8,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140747642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper proposes a simple method based on a novel use of elitism to increase the population size of artificial creatures while minimizing evaluation cost. This can contribute to preventing premature convergence of the population. We propose the “Excessive Elitism (EE)” method by modifying elitism in HyperNEAT (Hypercube-based NeuroEvolution of Augmenting Topologies), which is an evolutionary algorithm commonly used to evolve genotype [i.e., Compositional Pattern Producing Network (CPPN)] of artificial creatures. In EE, the evaluated fitness of best-fit individuals will be succeeded and reused instead of being re-evaluated during subsequent fitness evaluations, thereby reducing the evaluation cost if the elite size is excessive. Notably, EE also disables speciation and fitness sharing, serving to simplify the population structure and reduce complexity. In a 3D multi-agent environment, we evolved the morphology and behavior of artificial creatures with a simple target approach task. We assumed a baseline case (EE (2, 20)) in which a small population size was used due to the strong limitation of the evaluation cost and adopted a normal small elite size. This often led to premature convergence of the population to suboptimal individuals who could not reach the target. However, with the application of EE, the population was capable of evolving to reach the target, maintaining an evaluation cost comparable to EE (2, 20). We demonstrate that EE method serves as a simpler alternative to speciation for diversity preservation, capable of enhancing both the average and optimal fitness of a population, thus preventing premature convergence at a minimal evaluation cost. Further research in complex environments is required to fully uncover the potential and limitations of this method.
本文提出了一种基于新颖的精英主义的简单方法,以增加人工生物的种群数量,同时最大限度地降低评估成本。这有助于防止种群过早收敛。我们通过修改 HyperNEAT(基于超立方体的增强拓扑神经进化算法)中的精英主义,提出了 "过度精英主义(EE)"方法,HyperNEAT 是一种进化算法,常用于进化人工生物的基因型[即组合模式生成网络(CPPN)]。在 EE 中,最合适个体的适配度评估结果将被继承和重用,而不是在后续适配度评估过程中重新评估,从而在精英规模过大时降低评估成本。值得注意的是,EE 还禁止了物种分化和适应度共享,从而简化了种群结构并降低了复杂性。在三维多代理环境中,我们通过一个简单的目标接近任务来进化人工生物的形态和行为。我们假设了一种基线情况(EE (2, 20)),在这种情况下,由于评估成本的强烈限制,我们使用了较小的种群规模,并采用了正常的小精英规模。这往往会导致群体过早趋同于无法达到目标的次优个体。然而,应用 EE 后,种群能够不断进化以达到目标,并保持与 EE 相当的评估成本(2, 20)。我们证明,EE 方法是物种多样性保护的一种更简单的替代方法,它能够提高种群的平均和最佳适应性,从而以最小的评估成本防止过早趋同。要充分发掘这种方法的潜力和局限性,还需要在复杂环境中开展进一步研究。
{"title":"Effects of excessive elitism on the evolution of artificial creatures with NEAT","authors":"Siti Aisyah Binti Jaafar, Reiji Suzuki, Satoru Komori, Takaya Arita","doi":"10.1007/s10015-024-00948-5","DOIUrl":"10.1007/s10015-024-00948-5","url":null,"abstract":"<div><p>This paper proposes a simple method based on a novel use of elitism to increase the population size of artificial creatures while minimizing evaluation cost. This can contribute to preventing premature convergence of the population. We propose the “Excessive Elitism (EE)” method by modifying elitism in HyperNEAT (Hypercube-based NeuroEvolution of Augmenting Topologies), which is an evolutionary algorithm commonly used to evolve genotype [i.e., Compositional Pattern Producing Network (CPPN)] of artificial creatures. In EE, the evaluated fitness of best-fit individuals will be succeeded and reused instead of being re-evaluated during subsequent fitness evaluations, thereby reducing the evaluation cost if the elite size is excessive. Notably, EE also disables speciation and fitness sharing, serving to simplify the population structure and reduce complexity. In a 3D multi-agent environment, we evolved the morphology and behavior of artificial creatures with a simple target approach task. We assumed a baseline case (EE (2, 20)) in which a small population size was used due to the strong limitation of the evaluation cost and adopted a normal small elite size. This often led to premature convergence of the population to suboptimal individuals who could not reach the target. However, with the application of EE, the population was capable of evolving to reach the target, maintaining an evaluation cost comparable to EE (2, 20). We demonstrate that EE method serves as a simpler alternative to speciation for diversity preservation, capable of enhancing both the average and optimal fitness of a population, thus preventing premature convergence at a minimal evaluation cost. Further research in complex environments is required to fully uncover the potential and limitations of this method.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 2","pages":"286 - 297"},"PeriodicalIF":0.8,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140750133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}