Artificial Life and Robotics最新文献_第3页

A conceptual examination of an additive manufactured high-ratio coaxial gearbox 增材制造高传动比同轴齿轮箱的概念研究

IF 0.8 Q4 ROBOTICS

Artificial Life and Robotics

Pub Date : 2024-06-10 DOI: 10.1007/s10015-024-00955-6

Philipp Eisele, Sajid Nisar, Franz Haas

This research introduces the novel “Kraken-Gear” mechanism, emphasizing the advantages of additive polymer 3D printing in high-ratio gearbox systems for lightweight robotic applications, such as surgical instruments. The innovative kinematic solution provides high torsional system stiffness, substantial gear ratios, and backlash-free transmission. Leveraging the “hot lithography” additive manufacturing method ensures precise and warp-free gearbox components. Targeting medical technology, the gearbox meets stringent requirements: backlash-free, minimal vibration, high precision, and torque, with minimized weight for ergonomic comfort and fatigue mitigation. Computational simulations assess forces and stresses, highlighting the potential of additive manufacturing for cost-effective and functionally efficient gearbox fabrication. Nevertheless, careful material selection remains imperative for optimal functionality, especially in demanding medical applications. In summary, this research underscores a promising approach to gearbox fabrication, emphasizing the critical role of material selection and simulation-based assessments for optimal performance.

这项研究介绍了新颖的 "Kraken-Gear "机构，强调了增材聚合物三维打印技术在轻型机器人应用（如手术器械）的高传动比齿轮箱系统中的优势。创新的运动学解决方案提供了高扭转系统刚度、大齿轮比和无间隙传动。利用 "热光刻 "快速成型制造方法，可确保齿轮箱部件精确无翘曲。齿轮箱以医疗技术为目标，满足了严格的要求：无间隙、振动小、精度高、扭矩大、重量轻，符合人体工程学的舒适性和减轻疲劳。计算模拟对力和应力进行了评估，凸显了增材制造在成本效益和功能效率方面的潜力。尽管如此，为实现最佳功能，尤其是在要求苛刻的医疗应用中，谨慎选择材料仍是当务之急。总之，这项研究强调了一种很有前景的齿轮箱制造方法，强调了材料选择和基于模拟的评估对实现最佳性能的关键作用。

{"title":"A conceptual examination of an additive manufactured high-ratio coaxial gearbox","authors":"Philipp Eisele, Sajid Nisar, Franz Haas","doi":"10.1007/s10015-024-00955-6","DOIUrl":"10.1007/s10015-024-00955-6","url":null,"abstract":"<div><p>This research introduces the novel “Kraken-Gear” mechanism, emphasizing the advantages of additive polymer 3D printing in high-ratio gearbox systems for lightweight robotic applications, such as surgical instruments. The innovative kinematic solution provides high torsional system stiffness, substantial gear ratios, and backlash-free transmission. Leveraging the “hot lithography” additive manufacturing method ensures precise and warp-free gearbox components. Targeting medical technology, the gearbox meets stringent requirements: backlash-free, minimal vibration, high precision, and torque, with minimized weight for ergonomic comfort and fatigue mitigation. Computational simulations assess forces and stresses, highlighting the potential of additive manufacturing for cost-effective and functionally efficient gearbox fabrication. Nevertheless, careful material selection remains imperative for optimal functionality, especially in demanding medical applications. In summary, this research underscores a promising approach to gearbox fabrication, emphasizing the critical role of material selection and simulation-based assessments for optimal performance.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 3","pages":"416 - 422"},"PeriodicalIF":0.8,"publicationDate":"2024-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10015-024-00955-6.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141361326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Effect of subjective health conditions on facial skin temperature distribution: a 1-year statistical analysis among four participants 主观健康状况对面部皮肤温度分布的影响：对四名参与者进行的为期一年的统计分析

IF 0.8 Q4 ROBOTICS

Artificial Life and Robotics

Pub Date : 2024-06-07 DOI: 10.1007/s10015-024-00953-8

Masahito Takano, Kosuke Oiwa, Akio Nozawa

Thermal skin images are used to evaluate the physiological and psychological states of patients. To implement remote daily health monitoring, we attempted to assess subjective health conditions using thermal face images. In our previous study, we constructed an anomaly detection model to detect poor health conditions; the area under the receiver operating characteristic curve of the anomaly-detection model was 0.70. However, how the spatial distribution of facial skin temperature changes in response to subjective health conditions remains unclear. In this study, we statistically analyzed the acquired thermal face images to investigate the effect of subjective health conditions on facial skin temperature distribution. As a result of the comparison between health conditions, we confirmed that typically the average temperatures and left-right asymmetry in some regions of the face were significantly higher in poor health conditions than in good health conditions.

皮肤热图像可用于评估患者的生理和心理状态。为了实现远程日常健康监测，我们尝试利用面部热图像来评估主观健康状况。在之前的研究中，我们构建了一个异常检测模型来检测不良健康状况；异常检测模型的接收者工作特征曲线下面积为 0.70。然而，面部皮肤温度的空间分布如何随主观健康状况而变化仍不清楚。在本研究中，我们对获取的面部热图像进行了统计分析，以研究主观健康状况对面部皮肤温度分布的影响。通过对不同健康状况的比较，我们证实健康状况差的人面部某些区域的平均温度和左右不对称程度明显高于健康状况好的人。

引用次数: 0

Inferring source of learning by chimpanzees in cognitive tasks using reinforcement learning theory 利用强化学习理论推断黑猩猩在认知任务中的学习来源

IF 0.8 Q4 ROBOTICS

Artificial Life and Robotics

Pub Date : 2024-06-05 DOI: 10.1007/s10015-024-00954-7

Satoshi Hirata, Yutaka Sakai

Reinforcement learning is a mathematical framework for learning better choices through trial-and-error. Recent studies revealed that reinforcement learning is applicable to animal behavior and cognition. However, applying reinforcement learning to animal behavior sometimes encounters difficulties because the information sources utilized by animals to make choices are often unknown, whereas this is identified as the “state” in the reinforcement learning framework. We sought to identify possible state settings including non-standard formulations suitable for explaining data from past chimpanzee studies. Although chimpanzees’ performance in a serial learning task was inconsistent with standard reinforcement learning formulations, we found that the combination of state-independent choice making and state-dependent evaluation produced consistent results. Exploration of state settings in reinforcement learning may shed new light on animal learning processes.

强化学习是一个数学框架，用于通过试错学习更好的选择。最新研究表明，强化学习适用于动物行为和认知。然而，将强化学习应用于动物行为有时会遇到困难，因为动物在做出选择时所利用的信息源往往是未知的，而这在强化学习框架中被确定为 "状态"。我们试图找出可能的状态设置，包括适合解释过去黑猩猩研究数据的非标准公式。虽然黑猩猩在连续学习任务中的表现与标准强化学习公式不一致，但我们发现，与状态无关的选择决策和与状态有关的评价相结合，会产生一致的结果。对强化学习中状态设置的探索可能会为动物学习过程带来新的启示。

引用次数: 0

Research on automatic error-checking in English short text translation by a neural network algorithm 神经网络算法在英语短文翻译中的自动查错研究

IF 0.8 Q4 ROBOTICS

Artificial Life and Robotics

Pub Date : 2024-05-31 DOI: 10.1007/s10015-024-00952-9

Liang Guo

With the growing population of English learners, how to improve the efficiency of English learning has become a focus of research. This article focuses on automatic error-checking in English short text translation. The Transformer model was enhanced by combining with the bidirectional gated recurrent unit (BiGRU) algorithm to create a dual-encoder model that better captures information within input sequences. Experiments were then conducted on different corpora. The improved Transformer model obtained a ({text{F}}_{0.5}) of 59.09 on CoNLL-2014 and 61.05 Google-bilingual evaluation understudy (GLEU) on JFLEG, both of which were better than the other methods compared. The case analysis showed that the improved Transformer model accurately found errors in short text translation. The findings indicate that the proposed approach is reliable in the automatic error-checking of English short text translation and can be applied in practice.

随着英语学习者人数的不断增加，如何提高英语学习效率已成为研究的重点。本文主要研究英语短文翻译中的自动错误检查。通过结合双向门控递归单元（BiGRU）算法，Transformer 模型得到了增强，从而创建了一个双编码器模型，能更好地捕捉输入序列中的信息。然后在不同的语料库中进行了实验。改进后的 Transformer 模型在 CoNLL-2014 上的 ({{F}}_{0.5}) 得分为 59.09，在 JFLEG 上的谷歌双语评估 understudy (GLEU) 得分为 61.05，均优于所比较的其他方法。案例分析表明，改进后的 Transformer 模型能准确发现短文翻译中的错误。研究结果表明，所提出的方法在英语短文翻译的自动查错方面是可靠的，可以在实践中应用。

引用次数: 0

Real-time translation of English speech through speech feature extraction 通过语音特征提取实现英语语音的实时翻译

IF 0.8 Q4 ROBOTICS

Artificial Life and Robotics

Pub Date : 2024-05-27 DOI: 10.1007/s10015-024-00951-w

Xiaoyan Lei

Real-time English speech translation is useful in numerous situations, including business and travel. The goal of this research is to improve real-time English speech translation efficacy. Initially, filter bank (FBank) features were extracted from English speech. Subsequently, an enhanced Transformer model was introduced, incorporating a causal convolution module in the front end of the encoder to capture English speech features with location information. The performance of the optimized model in translating English speech to different target languages was tested using the MuST-C dataset. The results revealed differences in translation results for different target languages using the improved Transformer. The highest bilingual evaluation understudy (BLEU) score was observed for Spanish text at 20.84, while Russian text obtained the lowest score of 10.56. The average BLEU score was 18.51, with an average lag time delay of 1202.33 ms. Compared to the conventional Transformer model, the improved model exhibited higher BLEU scores, lower time delay, and optimal performance when utilizing a convolutional kernel size of 3 × 3. The results demonstrate the dependability of the improved Transformer model in real-time English speech translation, highlighting its practical usefulness.

实时英语语音翻译在商务和旅行等多种场合都非常有用。本研究的目标是提高实时英语语音翻译的效率。最初，从英语语音中提取滤波器库（FBank）特征。随后，引入了增强型变换器模型，在编码器前端加入了因果卷积模块，以捕捉带有位置信息的英语语音特征。我们使用 MuST-C 数据集测试了优化模型将英语语音翻译成不同目标语言的性能。结果显示，使用改进后的转换器，不同目标语言的翻译结果存在差异。西班牙语文本的双语评估劣等（BLEU）得分最高，为 20.84 分，而俄语文本的得分最低，为 10.56 分。平均 BLEU 得分为 18.51，平均延迟时间为 1202.33 毫秒。与传统的 Transformer 模型相比，改进后的模型在使用 3 × 3 的卷积核大小时，显示出更高的 BLEU 分数、更低的时延和最佳性能。这些结果证明了改进的 Transformer 模型在实时英语语音翻译中的可靠性，突出了其实用性。

{"title":"Real-time translation of English speech through speech feature extraction","authors":"Xiaoyan Lei","doi":"10.1007/s10015-024-00951-w","DOIUrl":"10.1007/s10015-024-00951-w","url":null,"abstract":"<div><p>Real-time English speech translation is useful in numerous situations, including business and travel. The goal of this research is to improve real-time English speech translation efficacy. Initially, filter bank (FBank) features were extracted from English speech. Subsequently, an enhanced Transformer model was introduced, incorporating a causal convolution module in the front end of the encoder to capture English speech features with location information. The performance of the optimized model in translating English speech to different target languages was tested using the MuST-C dataset. The results revealed differences in translation results for different target languages using the improved Transformer. The highest bilingual evaluation understudy (BLEU) score was observed for Spanish text at 20.84, while Russian text obtained the lowest score of 10.56. The average BLEU score was 18.51, with an average lag time delay of 1202.33 ms. Compared to the conventional Transformer model, the improved model exhibited higher BLEU scores, lower time delay, and optimal performance when utilizing a convolutional kernel size of 3 × 3. The results demonstrate the dependability of the improved Transformer model in real-time English speech translation, highlighting its practical usefulness.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 3","pages":"410 - 415"},"PeriodicalIF":0.8,"publicationDate":"2024-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142414281","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Human gait-type recognition without pre-training: an adaptive fuzzy-based approach for locomotion-assistance devices 无需预训练的人类步态类型识别：基于自适应模糊的运动辅助设备方法

IF 0.8 Q4 ROBOTICS

Artificial Life and Robotics

Pub Date : 2024-05-24 DOI: 10.1007/s10015-024-00950-x

Natee Chirachongcharoen, Sajid Nisar

Gait-type recognition is important for robotic exoskeletons and walking-assistance devices to adjust their output according to the users’ needs. However, the growing trend of using machine learning (ML) models is both labor- and data-intensive, which makes it practically less attractive for application in exoskeletons and wearable-assistive devices. This research aims to devise a fuzzy-based gait recognition algorithm that requires minimum training data (only 40 cycles for each of the 5 gait types) and adapts to new users without having the need of pre-training for each of them. The proposed algorithm uses the fuzzy logic system (FLS) and Welford’s (variance computation) method to enhance the adaptability by adjusting the rules for gait-type recognition and fine-tuning them in real time for every new user without requiring a specific prior training. Simulation-based evaluation of the proposed algorithm shows a gait-type recognition accuracy of 63.0%, an improvement of 36.8% over the non-adaptive fuzzy-based recognition algorithm. Moreover, the results show that the proposed algorithm outperforms the popular ML methods (support vector machine, Naive Bayes classifier, and logistic regression) when subjected to limited gait-cycles data and no prior training is provided.

步态类型识别对于机器人外骨骼和步行辅助设备根据用户需求调整输出非常重要。然而，使用机器学习（ML）模型是一种日益增长的趋势，既耗费大量人力又耗费大量数据，这使得它在外骨骼和可穿戴辅助设备中的应用实际上并不那么有吸引力。本研究旨在设计一种基于模糊的步态识别算法，该算法只需最少的训练数据（5 种步态类型中每种只需 40 个循环），并能适应新用户，无需对每个用户进行预训练。所提出的算法使用模糊逻辑系统（FLS）和维尔福（方差计算）方法，通过调整步态类型识别规则来增强适应性，并针对每个新用户进行实时微调，而无需特定的预先训练。对拟议算法的仿真评估显示，步态类型识别准确率为 63.0%，比非自适应模糊识别算法提高了 36.8%。此外，结果表明，在有限的步态周期数据和不提供事先训练的情况下，所提出的算法优于流行的 ML 方法（支持向量机、Naive Bayes 分类器和逻辑回归）。

{"title":"Human gait-type recognition without pre-training: an adaptive fuzzy-based approach for locomotion-assistance devices","authors":"Natee Chirachongcharoen, Sajid Nisar","doi":"10.1007/s10015-024-00950-x","DOIUrl":"10.1007/s10015-024-00950-x","url":null,"abstract":"<div><p>Gait-type recognition is important for robotic exoskeletons and walking-assistance devices to adjust their output according to the users’ needs. However, the growing trend of using machine learning (ML) models is both labor- and data-intensive, which makes it practically less attractive for application in exoskeletons and wearable-assistive devices. This research aims to devise a fuzzy-based gait recognition algorithm that requires minimum training data (only 40 cycles for each of the 5 gait types) and adapts to new users without having the need of pre-training for each of them. The proposed algorithm uses the fuzzy logic system (FLS) and Welford’s (variance computation) method to enhance the adaptability by adjusting the rules for gait-type recognition and fine-tuning them in real time for every new user without requiring a specific prior training. Simulation-based evaluation of the proposed algorithm shows a gait-type recognition accuracy of 63.0%, an improvement of 36.8% over the non-adaptive fuzzy-based recognition algorithm. Moreover, the results show that the proposed algorithm outperforms the popular ML methods (support vector machine, Naive Bayes classifier, and logistic regression) when subjected to limited gait-cycles data and no prior training is provided.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 3","pages":"389 - 397"},"PeriodicalIF":0.8,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141102062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Interchange flow control with dynamic obstacles optimized using genetic algorithms—a concept of virtual walls 利用遗传算法优化带有动态障碍物的互通流量控制--虚拟墙的概念

IF 0.8 Q4 ROBOTICS

Artificial Life and Robotics

Pub Date : 2024-04-24 DOI: 10.1007/s10015-024-00946-7

Junya Hoshino, Yuki Itoh, Ryuma Saotome, Tomohiro Harada, Kenji Matsuda, Tenta Suzuki, Mao Tobisawa, Kaito Kumagae, Johei Matsuoka, Toshinori Kagawa, Kiyohiko Hattori

In the near future, autonomous vehicles will be able to share information with other vehicles via communication, enabling appropriate traffic control approaches. A new approach to traffic control using fully autonomous driving involves the realization of a flat interchange. Unlike conventional approaches, this study focused on roads and interchanges that do not assume lanes. Specifically, we propose an interchange flow control approach to traffic control using a virtual wall (VW), which acquires and shares the initial position, destination, and speed of all vehicles entering an interchange in a two-dimensional space where vehicles can move freely, and then realizes appropriate control based on this information. Each vehicle individually calculates the shortest path to avoid the VW, thereby realizing a safe and rational path selection. In this study, a genetic algorithm was used to determine the location of the VW. The effectiveness of the proposed method was evaluated using simulations, and the results showed that compared to manual deployment in the roundabout form, the proposed method using VWs reduced the total path length and the number of collisions to zero. In addition, when comparing the case where VWs were deployed in common for all vehicles and the case where VWs were deployed individually for each vehicle, it was shown that the total path length was shorter when individual VWs were deployed.

在不久的将来，自动驾驶车辆将能够通过通信与其他车辆共享信息，从而实现适当的交通管制方法。利用完全自动驾驶技术进行交通控制的新方法包括实现平面交叉。与传统方法不同的是，本研究侧重于不假设车道的道路和互通式立交。具体来说，我们提出了一种使用虚拟墙（VW）进行交通控制的互通式立交流量控制方法，该方法可在车辆可自由移动的二维空间中获取并共享所有进入互通式立交的车辆的初始位置、目的地和速度，然后根据这些信息实现适当的控制。每辆车单独计算避开大众汽车的最短路径，从而实现安全合理的路径选择。本研究采用遗传算法来确定大众汽车的位置。模拟评估了所提方法的有效性，结果表明，与环岛形式下的人工部署相比，使用大众汽车的所提方法将总路径长度和碰撞次数降至零。此外，在比较所有车辆共同部署大众汽车和每辆车单独部署大众汽车的情况时，结果表明，单独部署大众汽车时的总路径长度更短。

{"title":"Interchange flow control with dynamic obstacles optimized using genetic algorithms—a concept of virtual walls","authors":"Junya Hoshino, Yuki Itoh, Ryuma Saotome, Tomohiro Harada, Kenji Matsuda, Tenta Suzuki, Mao Tobisawa, Kaito Kumagae, Johei Matsuoka, Toshinori Kagawa, Kiyohiko Hattori","doi":"10.1007/s10015-024-00946-7","DOIUrl":"10.1007/s10015-024-00946-7","url":null,"abstract":"<div><p>In the near future, autonomous vehicles will be able to share information with other vehicles via communication, enabling appropriate traffic control approaches. A new approach to traffic control using fully autonomous driving involves the realization of a flat interchange. Unlike conventional approaches, this study focused on roads and interchanges that do not assume lanes. Specifically, we propose an interchange flow control approach to traffic control using a virtual wall (VW), which acquires and shares the initial position, destination, and speed of all vehicles entering an interchange in a two-dimensional space where vehicles can move freely, and then realizes appropriate control based on this information. Each vehicle individually calculates the shortest path to avoid the VW, thereby realizing a safe and rational path selection. In this study, a genetic algorithm was used to determine the location of the VW. The effectiveness of the proposed method was evaluated using simulations, and the results showed that compared to manual deployment in the roundabout form, the proposed method using VWs reduced the total path length and the number of collisions to zero. In addition, when comparing the case where VWs were deployed in common for all vehicles and the case where VWs were deployed individually for each vehicle, it was shown that the total path length was shorter when individual VWs were deployed.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 2","pages":"230 - 241"},"PeriodicalIF":0.8,"publicationDate":"2024-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140663845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Performance evaluation of character-level CNNs using tweet data and analysis for weight perturbations 利用推文数据和权重扰动分析对字符级 CNN 进行性能评估

IF 0.8 Q4 ROBOTICS

Artificial Life and Robotics

Pub Date : 2024-04-05 DOI: 10.1007/s10015-024-00944-9

Kazuteru Miyazaki, Masaaki Ida

Character-level convolutional neural networks (CLCNNs) are commonly used to classify textual data. CLCNN is used as a more versatile tool. For natural language recognition, after decomposing a sentence into character units, each unit is converted into a corresponding character code (e.g., Unicode values) and the code is input into the CLCNN network. Thus, sentences can be treated like images. We have previously applied a CLCNN to verify whether a university’s diploma and/or curriculum policies are well written. In this study, we experimentally confirm the effectiveness of CLCNN using tweet data. In particular, we focus on the effect of the number of units on performance using the following two types of data; one is a real and public tweet dataset on the reputation of a cell phone, and the other is the NTCIR-13 MedWeb task, which consists of pseudo-tweet data and is a well-known collection of tests for multi-label problems. Results of experiments conducted by varying the number of units in the all-coupled layer confirmed the agreement of the results with the theorem introduced in the Amari’s book (Amari in Mathematical Science New Development of Information Geometry, For Senior & Graduate Courses. SAIENSU-SHA Co., 2014). Furthermore, in the NTCIR-13 MedWeb task, we analyze two kinds of experiments, the effects of kernel size and weight perturbation. The results of the difference in the kernel size suggest the existence of an optimal kernel size for sentence comprehension. The results of perturbations to the convolutional layer and pooling layer indicate the possibility of relationship between the numbers of degrees of freedom and network parameters.

字符级卷积神经网络（CLCNN）通常用于对文本数据进行分类。CLCNN 是一种用途更为广泛的工具。在自然语言识别中，将句子分解为字符单元后，将每个单元转换为相应的字符代码（如 Unicode 值），然后将代码输入 CLCNN 网络。因此，句子可以像图像一样处理。此前，我们曾将 CLCNN 用于验证一所大学的文凭和/或课程政策是否编写得当。在本研究中，我们使用推文数据对 CLCNN 的有效性进行了实验验证。具体而言，我们使用以下两种数据重点研究了单元数对性能的影响：一种是关于手机声誉的真实公开推文数据集，另一种是 NTCIR-13 MedWeb 任务，该任务由伪推文数据组成，是众所周知的多标签问题测试集合。通过改变全耦合层中的单元数量进行的实验结果证实，实验结果与阿马里在其著作《阿马里在数学科学中的新发展：信息几何》（Amari in Mathematical Science New Development of Information Geometry, For Senior & Graduate Courses.SAIENSU-SHA Co., 2014）。此外，在 NTCIR-13 MedWeb 任务中，我们分析了两种实验，即核大小和权重扰动的影响。内核大小差异的结果表明，句子理解存在最佳内核大小。对卷积层和池化层的扰动结果表明，自由度数与网络参数之间可能存在关系。

{"title":"Performance evaluation of character-level CNNs using tweet data and analysis for weight perturbations","authors":"Kazuteru Miyazaki, Masaaki Ida","doi":"10.1007/s10015-024-00944-9","DOIUrl":"10.1007/s10015-024-00944-9","url":null,"abstract":"<div><p>Character-level convolutional neural networks (CLCNNs) are commonly used to classify textual data. CLCNN is used as a more versatile tool. For natural language recognition, after decomposing a sentence into character units, each unit is converted into a corresponding character code (e.g., Unicode values) and the code is input into the CLCNN network. Thus, sentences can be treated like images. We have previously applied a CLCNN to verify whether a university’s diploma and/or curriculum policies are well written. In this study, we experimentally confirm the effectiveness of CLCNN using tweet data. In particular, we focus on the effect of the number of units on performance using the following two types of data; one is a real and public tweet dataset on the reputation of a cell phone, and the other is the NTCIR-13 MedWeb task, which consists of pseudo-tweet data and is a well-known collection of tests for multi-label problems. Results of experiments conducted by varying the number of units in the all-coupled layer confirmed the agreement of the results with the theorem introduced in the Amari’s book (Amari in Mathematical Science New Development of Information Geometry, For Senior & Graduate Courses. SAIENSU-SHA Co., 2014). Furthermore, in the NTCIR-13 MedWeb task, we analyze two kinds of experiments, the effects of kernel size and weight perturbation. The results of the difference in the kernel size suggest the existence of an optimal kernel size for sentence comprehension. The results of perturbations to the convolutional layer and pooling layer indicate the possibility of relationship between the numbers of degrees of freedom and network parameters.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 2","pages":"266 - 273"},"PeriodicalIF":0.8,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140738951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Investigation of a single object shape for efficient learning in bin picking of multiple types of objects 研究单一物体形状对多种类型物体分拣的高效学习作用

IF 0.8 Q4 ROBOTICS

Artificial Life and Robotics

Pub Date : 2024-04-03 DOI: 10.1007/s10015-024-00945-8

Isamu Bungo, Tomohiro Hayakawa, Toshiyuki Yasuda

In recent years, many studies have used Convolutional Neural Networks (CNN) as an approach to automate bin picking tasks by robots. In these previous studies, all types of objects in a bin were used as training data to optimize CNN parameters. Therefore, as the number of types of objects in a bin increases under the condition that the total number of training data is retained, CNN is not sufficiently optimized. In this study, we propose a learning method of CNN to achieve a bin picking task for multi-types of objects. Unlike previous learning method using multi-types of objects, we use a single type of object with a well-designed shape to obtain training data. It is true that when an object is used for training, then the learning for the corresponding object proceeds very well. However, the training does not contribute so much to the leaning of other types of objects. we expect the CNN to learn many types of grasping methods simultaneously by the training data of the single well-designed object. In that case, the total number of training data for each type of objects in a bin can be retained even if the number of types of objects increases. To verify the idea, we construct 12 different CNN models which are trained by different types of objects. Through simulations and robot experiments, bin picking tasks to pick multi-types of objects were performed using those CNN models. As a result, the training method which uses a complex-shaped object achieved higher grasping success rate than the training method which uses a primitive-shaped object. Moreover, the training method which uses a complex-shaped object achieved higher grasping success rate than the previous training method which uses all types of objects in a bin.

近年来，许多研究将卷积神经网络（CNN）作为机器人自动分拣垃圾的一种方法。在以往的研究中，垃圾箱中所有类型的物体都被用作优化 CNN 参数的训练数据。因此，在保留全部训练数据的条件下，随着垃圾箱中物体类型数量的增加，CNN 并没有得到充分优化。在本研究中，我们提出了一种 CNN 学习方法，以实现多类型物体的分仓任务。与以往使用多类型物体的学习方法不同，我们使用形状设计良好的单一类型物体来获取训练数据。诚然，当一个物体被用于训练时，相应物体的学习就会非常顺利。我们希望 CNN 能够通过单一设计良好的物体的训练数据同时学习多种抓取方法。在这种情况下，即使物体类型的数量增加，也可以保留一仓中每种类型物体的训练数据总数。为了验证这一想法，我们构建了 12 个不同的 CNN 模型，这些模型由不同类型的物体训练而成。通过模拟和机器人实验，我们使用这些 CNN 模型完成了多类型物品的拣选任务。结果，使用复杂形状物体的训练方法比使用原始形状物体的训练方法获得了更高的抓取成功率。此外，使用复杂形状物体的训练方法的抓取成功率也高于之前使用垃圾箱中所有类型物体的训练方法。

{"title":"Investigation of a single object shape for efficient learning in bin picking of multiple types of objects","authors":"Isamu Bungo, Tomohiro Hayakawa, Toshiyuki Yasuda","doi":"10.1007/s10015-024-00945-8","DOIUrl":"10.1007/s10015-024-00945-8","url":null,"abstract":"<div><p>In recent years, many studies have used Convolutional Neural Networks (CNN) as an approach to automate bin picking tasks by robots. In these previous studies, all types of objects in a bin were used as training data to optimize CNN parameters. Therefore, as the number of types of objects in a bin increases under the condition that the total number of training data is retained, CNN is not sufficiently optimized. In this study, we propose a learning method of CNN to achieve a bin picking task for multi-types of objects. Unlike previous learning method using multi-types of objects, we use a single type of object with a well-designed shape to obtain training data. It is true that when an object is used for training, then the learning for the corresponding object proceeds very well. However, the training does not contribute so much to the leaning of other types of objects. we expect the CNN to learn many types of grasping methods simultaneously by the training data of the single well-designed object. In that case, the total number of training data for each type of objects in a bin can be retained even if the number of types of objects increases. To verify the idea, we construct 12 different CNN models which are trained by different types of objects. Through simulations and robot experiments, bin picking tasks to pick multi-types of objects were performed using those CNN models. As a result, the training method which uses a complex-shaped object achieved higher grasping success rate than the training method which uses a primitive-shaped object. Moreover, the training method which uses a complex-shaped object achieved higher grasping success rate than the previous training method which uses all types of objects in a bin.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 2","pages":"372 - 379"},"PeriodicalIF":0.8,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140747642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Effects of excessive elitism on the evolution of artificial creatures with NEAT 过度精英化对使用 NEAT 的人造生物进化的影响

IF 0.8 Q4 ROBOTICS

Artificial Life and Robotics

Pub Date : 2024-04-03 DOI: 10.1007/s10015-024-00948-5

Siti Aisyah Binti Jaafar, Reiji Suzuki, Satoru Komori, Takaya Arita

This paper proposes a simple method based on a novel use of elitism to increase the population size of artificial creatures while minimizing evaluation cost. This can contribute to preventing premature convergence of the population. We propose the “Excessive Elitism (EE)” method by modifying elitism in HyperNEAT (Hypercube-based NeuroEvolution of Augmenting Topologies), which is an evolutionary algorithm commonly used to evolve genotype [i.e., Compositional Pattern Producing Network (CPPN)] of artificial creatures. In EE, the evaluated fitness of best-fit individuals will be succeeded and reused instead of being re-evaluated during subsequent fitness evaluations, thereby reducing the evaluation cost if the elite size is excessive. Notably, EE also disables speciation and fitness sharing, serving to simplify the population structure and reduce complexity. In a 3D multi-agent environment, we evolved the morphology and behavior of artificial creatures with a simple target approach task. We assumed a baseline case (EE (2, 20)) in which a small population size was used due to the strong limitation of the evaluation cost and adopted a normal small elite size. This often led to premature convergence of the population to suboptimal individuals who could not reach the target. However, with the application of EE, the population was capable of evolving to reach the target, maintaining an evaluation cost comparable to EE (2, 20). We demonstrate that EE method serves as a simpler alternative to speciation for diversity preservation, capable of enhancing both the average and optimal fitness of a population, thus preventing premature convergence at a minimal evaluation cost. Further research in complex environments is required to fully uncover the potential and limitations of this method.

本文提出了一种基于新颖的精英主义的简单方法，以增加人工生物的种群数量，同时最大限度地降低评估成本。这有助于防止种群过早收敛。我们通过修改 HyperNEAT（基于超立方体的增强拓扑神经进化算法）中的精英主义，提出了 "过度精英主义（EE）"方法，HyperNEAT 是一种进化算法，常用于进化人工生物的基因型[即组合模式生成网络（CPPN）]。在 EE 中，最合适个体的适配度评估结果将被继承和重用，而不是在后续适配度评估过程中重新评估，从而在精英规模过大时降低评估成本。值得注意的是，EE 还禁止了物种分化和适应度共享，从而简化了种群结构并降低了复杂性。在三维多代理环境中，我们通过一个简单的目标接近任务来进化人工生物的形态和行为。我们假设了一种基线情况（EE (2, 20)），在这种情况下，由于评估成本的强烈限制，我们使用了较小的种群规模，并采用了正常的小精英规模。这往往会导致群体过早趋同于无法达到目标的次优个体。然而，应用 EE 后，种群能够不断进化以达到目标，并保持与 EE 相当的评估成本（2, 20）。我们证明，EE 方法是物种多样性保护的一种更简单的替代方法，它能够提高种群的平均和最佳适应性，从而以最小的评估成本防止过早趋同。要充分发掘这种方法的潜力和局限性，还需要在复杂环境中开展进一步研究。

{"title":"Effects of excessive elitism on the evolution of artificial creatures with NEAT","authors":"Siti Aisyah Binti Jaafar, Reiji Suzuki, Satoru Komori, Takaya Arita","doi":"10.1007/s10015-024-00948-5","DOIUrl":"10.1007/s10015-024-00948-5","url":null,"abstract":"<div><p>This paper proposes a simple method based on a novel use of elitism to increase the population size of artificial creatures while minimizing evaluation cost. This can contribute to preventing premature convergence of the population. We propose the “Excessive Elitism (EE)” method by modifying elitism in HyperNEAT (Hypercube-based NeuroEvolution of Augmenting Topologies), which is an evolutionary algorithm commonly used to evolve genotype [i.e., Compositional Pattern Producing Network (CPPN)] of artificial creatures. In EE, the evaluated fitness of best-fit individuals will be succeeded and reused instead of being re-evaluated during subsequent fitness evaluations, thereby reducing the evaluation cost if the elite size is excessive. Notably, EE also disables speciation and fitness sharing, serving to simplify the population structure and reduce complexity. In a 3D multi-agent environment, we evolved the morphology and behavior of artificial creatures with a simple target approach task. We assumed a baseline case (EE (2, 20)) in which a small population size was used due to the strong limitation of the evaluation cost and adopted a normal small elite size. This often led to premature convergence of the population to suboptimal individuals who could not reach the target. However, with the application of EE, the population was capable of evolving to reach the target, maintaining an evaluation cost comparable to EE (2, 20). We demonstrate that EE method serves as a simpler alternative to speciation for diversity preservation, capable of enhancing both the average and optimal fitness of a population, thus preventing premature convergence at a minimal evaluation cost. Further research in complex environments is required to fully uncover the potential and limitations of this method.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"29 2","pages":"286 - 297"},"PeriodicalIF":0.8,"publicationDate":"2024-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140750133","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0