首页 > 最新文献

Artificial Life and Robotics最新文献

英文 中文
Research on rotary crane control using a neural network optimized by an improved bat algorithm 改进蝙蝠算法优化的神经网络在旋转起重机控制中的应用研究
IF 0.8 Q4 ROBOTICS Pub Date : 2025-02-20 DOI: 10.1007/s10015-025-01011-7
Hiroyuki Fujii, Kunihiko Nakazono, Naoki Oshiro, Hiroshi Kinjo

In this paper, we propose a three-layered neural network controller (NC) optimized using an improved bat algorithm (BA) for a rotary crane system. In our previous study, the simulation results showed that an NC optimized using the original BA exhibits good control and evolutionary performance. However, the simulation execution time was long. Therefore, to address this problem, we propose an improved BA that reduces the execution time. We show that the NC optimized by the improved BA exhibits the same control performance as that optimized via conventional methods. It is also shown that the time for evolutionary calculations can be reduced.

本文提出了一种基于改进蝙蝠算法优化的三层神经网络控制器(NC)。在我们之前的研究中,仿真结果表明,使用原始BA优化的NC具有良好的控制和进化性能。但是,仿真执行时间较长。因此,为了解决这个问题,我们提出了一个改进的BA,它可以减少执行时间。结果表明,改进的BA优化后的数控系统与传统方法优化后的数控系统具有相同的控制性能。还表明,进化计算的时间可以减少。
{"title":"Research on rotary crane control using a neural network optimized by an improved bat algorithm","authors":"Hiroyuki Fujii,&nbsp;Kunihiko Nakazono,&nbsp;Naoki Oshiro,&nbsp;Hiroshi Kinjo","doi":"10.1007/s10015-025-01011-7","DOIUrl":"10.1007/s10015-025-01011-7","url":null,"abstract":"<div><p>In this paper, we propose a three-layered neural network controller (NC) optimized using an improved bat algorithm (BA) for a rotary crane system. In our previous study, the simulation results showed that an NC optimized using the original BA exhibits good control and evolutionary performance. However, the simulation execution time was long. Therefore, to address this problem, we propose an improved BA that reduces the execution time. We show that the NC optimized by the improved BA exhibits the same control performance as that optimized via conventional methods. It is also shown that the time for evolutionary calculations can be reduced.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"30 3","pages":"465 - 471"},"PeriodicalIF":0.8,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145167088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Power-law distributions in an online video-sharing system and its long-term dynamics 在线视频分享系统的幂律分布及其长期动态
IF 0.8 Q4 ROBOTICS Pub Date : 2025-02-13 DOI: 10.1007/s10015-025-01007-3
Kiminori Ito, Takashi Shimada

We study the data of Japanese video-sharing platform, Niconico, which contains 21 million videos. From our analysis, the rank size distribution of video views is found to exhibit a crossover from a power law with an exponent around (-0.5) for the top (approx 10^5) movies to another power low with exponent around (-1) for the movies in the following ranks. The probability density function of video views for the bottom (90%) movies is well fitted by log-normal distribution. This implies that, while videos in the top rank regime follow a different dynamics which yields the power law, videos in the middle and low rank regime seem to be evolving according to a random multiplicative process. Furthermore, we observe temporal relaxation process of video views for 3 years. Temporal relaxation process of video views is grouped by the size of the number of video views, and averaged within each size group. Interestingly, the daily video views universally show power-law relaxation in all view size, from the top total view group ((10^6-10^7)) to the bottom group ((approx 10^2)). This indicates the existence of memory processes longer than the exponential function, which are universally independent of video size.

我们研究了日本视频分享平台Niconico的数据,该平台包含2100万个视频。从我们的分析中,我们发现视频观看的排名大小分布表现出一个交叉,从指数在(-0.5)附近的幂律(对于排名靠前的(approx 10^5)电影)到指数在(-1)附近的另一个幂律(对于排名靠后的电影)。底部(90%)影片的视频观看概率密度函数用对数正态分布很好地拟合。这意味着,虽然在最高等级制度下的视频遵循不同的动态,产生幂律,但在中低等级制度下的视频似乎是根据随机乘法过程进化的。此外,我们观察了3年视频观看的时间松弛过程。视频视图的时间松弛过程按视频视图数的大小分组,并在每个大小组内取平均值。有趣的是,每日视频观看量普遍显示,从总观看量最高的组((10^6-10^7))到总观看量最低的组((approx 10^2)),所有观看量都呈幂律松弛。这表明存在比指数函数更长的内存进程,它们普遍与视频大小无关。
{"title":"Power-law distributions in an online video-sharing system and its long-term dynamics","authors":"Kiminori Ito,&nbsp;Takashi Shimada","doi":"10.1007/s10015-025-01007-3","DOIUrl":"10.1007/s10015-025-01007-3","url":null,"abstract":"<div><p>We study the data of Japanese video-sharing platform, Niconico, which contains 21 million videos. From our analysis, the rank size distribution of video views is found to exhibit a crossover from a power law with an exponent around <span>(-0.5)</span> for the top <span>(approx 10^5)</span> movies to another power low with exponent around <span>(-1)</span> for the movies in the following ranks. The probability density function of video views for the bottom <span>(90%)</span> movies is well fitted by log-normal distribution. This implies that, while videos in the top rank regime follow a different dynamics which yields the power law, videos in the middle and low rank regime seem to be evolving according to a random multiplicative process. Furthermore, we observe temporal relaxation process of video views for 3 years. Temporal relaxation process of video views is grouped by the size of the number of video views, and averaged within each size group. Interestingly, the daily video views universally show power-law relaxation in all view size, from the top total view group (<span>(10^6-10^7)</span>) to the bottom group (<span>(approx 10^2)</span>). This indicates the existence of memory processes longer than the exponential function, which are universally independent of video size.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"30 2","pages":"325 - 331"},"PeriodicalIF":0.8,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10015-025-01007-3.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143925555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of coupling complexity in echo state networks via ordinal persistent homology 基于有序持久同调的回波状态网络耦合复杂性分析
IF 0.8 Q4 ROBOTICS Pub Date : 2025-02-13 DOI: 10.1007/s10015-025-01010-8
Taichi Haruna

We study coupling complexity in multivariate time series generated by echo state networks subject to i.i.d. input signals using the ordinal persistent index as a coupling complexity measure. Coupling complexity is a notion of complexity focusing on the relations among components of a given system. Given a time segment of a multivariate time series, its ordinal persistent index is defined by taking the persistent homology of a filtered simplicial complex reflecting similarity among the ordinal patterns of individual time series. As the strength of input signals increases, the dynamics of echo state networks shift from asynchronous ones to more synchronized ones. We show that the original ordinal persistent index cannot capture such change in the synchronization behavior, but a generalized version of the ordinal persistent index is sensitive to the change: the latter takes relatively high values between the two extremes, namely when the strength of input signals to the echo state networks is within a certain range of intermediate values.

本文采用有序持久指数作为耦合复杂度度量,研究了受i.i.d输入信号影响的回声状态网络产生的多变量时间序列的耦合复杂性。耦合复杂性是关注给定系统中组件之间关系的复杂性概念。给定多变量时间序列的一个时间段,其有序持久指数通过取反映单个时间序列有序模式之间相似性的过滤简单复合体的持久同调来定义。随着输入信号强度的增加,回波状态网络的动态从异步状态向同步状态转变。我们发现,原始的有序持久索引不能捕捉同步行为的这种变化,但广义版本的有序持久索引对这种变化很敏感:后者在两个极端之间的值相对较高,即当回波状态网络的输入信号强度在一定的中间值范围内时。
{"title":"Analysis of coupling complexity in echo state networks via ordinal persistent homology","authors":"Taichi Haruna","doi":"10.1007/s10015-025-01010-8","DOIUrl":"10.1007/s10015-025-01010-8","url":null,"abstract":"<div><p>We study coupling complexity in multivariate time series generated by echo state networks subject to i.i.d. input signals using the ordinal persistent index as a coupling complexity measure. Coupling complexity is a notion of complexity focusing on the relations among components of a given system. Given a time segment of a multivariate time series, its ordinal persistent index is defined by taking the persistent homology of a filtered simplicial complex reflecting similarity among the ordinal patterns of individual time series. As the strength of input signals increases, the dynamics of echo state networks shift from asynchronous ones to more synchronized ones. We show that the original ordinal persistent index cannot capture such change in the synchronization behavior, but a generalized version of the ordinal persistent index is sensitive to the change: the latter takes relatively high values between the two extremes, namely when the strength of input signals to the echo state networks is within a certain range of intermediate values.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"30 3","pages":"417 - 423"},"PeriodicalIF":0.8,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10015-025-01010-8.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145165532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A TDA-based performance analysis for neural networks with low-bit weights 基于tda的低位权神经网络性能分析
IF 0.8 Q4 ROBOTICS Pub Date : 2025-02-12 DOI: 10.1007/s10015-025-01005-5
Yugo Ogio, Naoki Tsubone, Yuki Minami, Masato Ishikawa

Advances in neural network (NN) models and learning methods have resulted in breakthroughs in various fields. A larger NN model is more difficult to install on a computer with limited computing resources. One method for compressing NN models is to quantize the weights, in which the connection weights of the NNs are approximated with low-bit precision. The existing quantization methods for NN models can be categorized into two approaches: quantization-aware training (QAT) and post-training quantization (PTQ). In this study, we focused on the performance degradation of NN models using PTQ. This paper proposes a method for visually evaluating the performance of quantized NNs using topological data analysis (TDA). Subjecting the structure of NNs to TDA allows the performance of quantized NNs to be assessed without experiments or simulations. We developed a TDA-based evaluation method for NNs with low-bit weights by referring to previous research on a TDA-based evaluation method for NNs with high-bit weights. We also tested the TDA-based method using the MNIST dataset. Finally, we compared the performance of the quantized NNs generated by static and dynamic quantization through a visual demonstration.

神经网络(NN)模型和学习方法的进步导致了各个领域的突破。在计算资源有限的计算机上,更大的神经网络模型更难安装。一种压缩神经网络模型的方法是量化权重,其中神经网络的连接权重以低比特精度近似。现有的神经网络模型量化方法可分为量化感知训练(QAT)和训练后量化(PTQ)两种。在这项研究中,我们关注的是使用PTQ的神经网络模型的性能退化。本文提出了一种利用拓扑数据分析(TDA)直观评价量化神经网络性能的方法。将神经网络的结构置于TDA下,可以在没有实验或模拟的情况下评估量化神经网络的性能。参考前人关于基于tda的高比特权重神经网络评价方法的研究,我们开发了一种基于tda的低比特权重神经网络评价方法。我们还使用MNIST数据集测试了基于tda的方法。最后,我们通过视觉演示比较了静态和动态量化生成的量化神经网络的性能。
{"title":"A TDA-based performance analysis for neural networks with low-bit weights","authors":"Yugo Ogio,&nbsp;Naoki Tsubone,&nbsp;Yuki Minami,&nbsp;Masato Ishikawa","doi":"10.1007/s10015-025-01005-5","DOIUrl":"10.1007/s10015-025-01005-5","url":null,"abstract":"<div><p>Advances in neural network (NN) models and learning methods have resulted in breakthroughs in various fields. A larger NN model is more difficult to install on a computer with limited computing resources. One method for compressing NN models is to quantize the weights, in which the connection weights of the NNs are approximated with low-bit precision. The existing quantization methods for NN models can be categorized into two approaches: quantization-aware training (QAT) and post-training quantization (PTQ). In this study, we focused on the performance degradation of NN models using PTQ. This paper proposes a method for visually evaluating the performance of quantized NNs using topological data analysis (TDA). Subjecting the structure of NNs to TDA allows the performance of quantized NNs to be assessed without experiments or simulations. We developed a TDA-based evaluation method for NNs with low-bit weights by referring to previous research on a TDA-based evaluation method for NNs with high-bit weights. We also tested the TDA-based method using the MNIST dataset. Finally, we compared the performance of the quantized NNs generated by static and dynamic quantization through a visual demonstration.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"30 2","pages":"332 - 341"},"PeriodicalIF":0.8,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143925554","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint pairwise learning and masked language models for neural machine translation of English 英语神经机器翻译的联合两两学习和掩码语言模型
IF 0.8 Q4 ROBOTICS Pub Date : 2025-02-10 DOI: 10.1007/s10015-025-01008-2
Shuhan Yang, Qun Yang

The translation activity of language is a link and bridge for the integration of politics, economy, and culture in various countries. However, manual translation requires high quality of professional translators and takes a long time. The study attempts to introduce dual learning on the basis of traditional neural machine translation models. The improved neural machine translation model includes decoding of the source language and target language. With the help of the source language encoder, forward translation, backward backtranslation, and parallel decoding can be achieved; At the same time, adversarial training is carried out using a corpus containing noise to enhance the robustness of the model, enriching the technical and theoretical knowledge of existing neural machine translation models. The test results show that compared with the training speed of the baseline model, the training speed of the constructed model is 115 K words/s and the decoding speed is 2647 K words/s, which is 7.65 times faster than the decoding speed, and the translation quality loss is within the acceptable range. The mean bilingual evaluation score for the “two-step” training method was 16.51, an increase of 3.64 points from the lowest score, and the K-nearest-neighbor algorithm and the changing-character attack ensured the semantic integrity of noisy source language utterances to a greater extent. The translation quality of the changing character method outperformed that of the unrestricted noise attack method, with the highest bilingual evaluation study score value improving by 3.34 points and improving the robustness of the model. The translation model constructed by the study has been improved in terms of training speed and robustness performance, and is of practical use in many translation domains.

语言的翻译活动是各国政治、经济、文化相互融合的纽带和桥梁。但手工翻译对专业翻译人员的要求较高,耗时较长。本研究试图在传统神经机器翻译模型的基础上引入双重学习。改进的神经机器翻译模型包括源语言和目标语言的解码。在源语言编码器的帮助下,可以实现正向翻译、反向翻译、并行解码;同时,利用含噪声的语料库进行对抗性训练,增强模型的鲁棒性,丰富了现有神经机器翻译模型的技术和理论知识。测试结果表明,与基线模型的训练速度相比,构建模型的训练速度为115 K words/s,解码速度为2647 K words/s,比解码速度快7.65倍,翻译质量损失在可接受范围内。“两步”训练方法的双语评价平均分为16.51分,比最低分提高了3.64分,k -最近邻算法和变字符攻击在更大程度上保证了噪声源语言话语的语义完整性。变换特征方法的翻译质量优于无限制噪声攻击方法,最高双语评价研究得分值提高了3.34分,提高了模型的鲁棒性。本文构建的翻译模型在训练速度和鲁棒性方面都得到了提高,在许多翻译领域具有实际应用价值。
{"title":"Joint pairwise learning and masked language models for neural machine translation of English","authors":"Shuhan Yang,&nbsp;Qun Yang","doi":"10.1007/s10015-025-01008-2","DOIUrl":"10.1007/s10015-025-01008-2","url":null,"abstract":"<div><p>The translation activity of language is a link and bridge for the integration of politics, economy, and culture in various countries. However, manual translation requires high quality of professional translators and takes a long time. The study attempts to introduce dual learning on the basis of traditional neural machine translation models. The improved neural machine translation model includes decoding of the source language and target language. With the help of the source language encoder, forward translation, backward backtranslation, and parallel decoding can be achieved; At the same time, adversarial training is carried out using a corpus containing noise to enhance the robustness of the model, enriching the technical and theoretical knowledge of existing neural machine translation models. The test results show that compared with the training speed of the baseline model, the training speed of the constructed model is 115 K words/s and the decoding speed is 2647 K words/s, which is 7.65 times faster than the decoding speed, and the translation quality loss is within the acceptable range. The mean bilingual evaluation score for the “two-step” training method was 16.51, an increase of 3.64 points from the lowest score, and the K-nearest-neighbor algorithm and the changing-character attack ensured the semantic integrity of noisy source language utterances to a greater extent. The translation quality of the changing character method outperformed that of the unrestricted noise attack method, with the highest bilingual evaluation study score value improving by 3.34 points and improving the robustness of the model. The translation model constructed by the study has been improved in terms of training speed and robustness performance, and is of practical use in many translation domains.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"30 2","pages":"342 - 353"},"PeriodicalIF":0.8,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143925547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance evaluation of ORB-SLAM3 with quantized images 基于量化图像的ORB-SLAM3性能评价
IF 0.8 Q4 ROBOTICS Pub Date : 2025-02-07 DOI: 10.1007/s10015-025-01006-4
Siyuan Tao, Yuki Minami, Masato Ishikawa

Visual simultaneous localization and mapping (SLAM) is a critical technology for robots to perform high-precision navigation, increasing the focus among researchers to improve its accuracy. However, improvements in SLAM accuracy always come at the cost of an increased memory footprint, which limits the long-term operation of devices that operate under constrained hardware resources. Application of quantization methods is proposed as a promising solution to this problem. Since quantization can result in performance degradation, it is crucial to quantitatively evaluate the trade-off between potential degradation and memory savings to assess its practicality for visual SLAM. This paper introduces a mechanism to evaluate the influence of a quantization method on visual SLAM, and applies it to assess the impact of three different quantization methods on ORB-SLAM3. Specifically, we examine two static quantization methods and a dynamic quantization method called error diffusion, which can pseudo-preserve image shading information. The paper contributes to the conclusion that error diffusion, with controlled weight parameters in the error diffusion filter, can suppress degradation and reduce the memory footprint, demonstrating its effectiveness in dynamic environments.

视觉同步定位与绘图(SLAM)技术是机器人实现高精度导航的关键技术,其精度的提高日益受到研究人员的关注。然而,SLAM精度的提高总是以增加内存占用为代价,这限制了在受限硬件资源下运行的设备的长期运行。量化方法的应用是解决这一问题的一个有希望的方法。由于量化可能导致性能下降,因此定量评估潜在的性能下降和内存节省之间的权衡对于评估其在视觉SLAM中的实用性至关重要。本文介绍了一种量化方法对视觉SLAM影响的评估机制,并应用该机制评估了三种不同量化方法对orb -SLAM的影响。具体来说,我们研究了两种静态量化方法和一种称为误差扩散的动态量化方法,该方法可以伪保留图像阴影信息。通过对误差扩散滤波器中权值参数的控制,误差扩散可以抑制退化并减少内存占用,证明了其在动态环境中的有效性。
{"title":"Performance evaluation of ORB-SLAM3 with quantized images","authors":"Siyuan Tao,&nbsp;Yuki Minami,&nbsp;Masato Ishikawa","doi":"10.1007/s10015-025-01006-4","DOIUrl":"10.1007/s10015-025-01006-4","url":null,"abstract":"<div><p>Visual simultaneous localization and mapping (SLAM) is a critical technology for robots to perform high-precision navigation, increasing the focus among researchers to improve its accuracy. However, improvements in SLAM accuracy always come at the cost of an increased memory footprint, which limits the long-term operation of devices that operate under constrained hardware resources. Application of quantization methods is proposed as a promising solution to this problem. Since quantization can result in performance degradation, it is crucial to quantitatively evaluate the trade-off between potential degradation and memory savings to assess its practicality for visual SLAM. This paper introduces a mechanism to evaluate the influence of a quantization method on visual SLAM, and applies it to assess the impact of three different quantization methods on ORB-SLAM3. Specifically, we examine two static quantization methods and a dynamic quantization method called error diffusion, which can pseudo-preserve image shading information. The paper contributes to the conclusion that error diffusion, with controlled weight parameters in the error diffusion filter, can suppress degradation and reduce the memory footprint, demonstrating its effectiveness in dynamic environments.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"30 2","pages":"354 - 363"},"PeriodicalIF":0.8,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143925573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of human motion detection for non-verbal collaborative robot communication cue 基于非语言协作机器人交流线索的人体运动检测设计
IF 0.8 Q4 ROBOTICS Pub Date : 2025-01-30 DOI: 10.1007/s10015-024-01000-2
Wendy Cahya Kurniawan, Yeoh Wen Liang, Hiroshi Okumura, Osamu Fukuda

The integration of modern manufacturing systems has promised increased flexibility, productivity, and efficiency. In such an environment, collaboration between humans and robots in a shared workspace is essential to effectively accomplish shared tasks. Strong communication among partners is essential for collaborative efficiency. This research investigates an approach to non-verbal communication cues. The system focuses on integrating human motion detection with vision sensors. This method addresses the bias human action detection in frames and enhances the accuracy of perception as information about human activities to the robot. By interpreting spatial and temporal data, the system detects human movements through sequences of human activity frames while working together. The training and validation results confirm that the approach achieves an accuracy of 91%. The sequential testing performance showed an average detection of 83%. This research not only emphasizes the importance of advanced communication in human–robot collaboration, but also effectively promotes future developments in collaborative robotics.

现代制造系统的集成保证了灵活性、生产力和效率的提高。在这样的环境中,共享工作空间中人类和机器人之间的协作对于有效地完成共享任务至关重要。合作伙伴之间强有力的沟通对提高协作效率至关重要。本研究探讨了一种非语言交际线索的方法。该系统的重点是将人体运动检测与视觉传感器相结合。该方法解决了帧中人类动作检测的偏差,提高了感知作为人类活动信息对机器人的准确性。通过解释空间和时间数据,该系统在一起工作时通过人类活动框架序列检测人类运动。训练和验证结果表明,该方法达到了91%的准确率。序列测试性能显示平均检出率为83%。该研究不仅强调了先进通信在人机协作中的重要性,而且有效地促进了协作机器人的未来发展。
{"title":"Design of human motion detection for non-verbal collaborative robot communication cue","authors":"Wendy Cahya Kurniawan,&nbsp;Yeoh Wen Liang,&nbsp;Hiroshi Okumura,&nbsp;Osamu Fukuda","doi":"10.1007/s10015-024-01000-2","DOIUrl":"10.1007/s10015-024-01000-2","url":null,"abstract":"<div><p>The integration of modern manufacturing systems has promised increased flexibility, productivity, and efficiency. In such an environment, collaboration between humans and robots in a shared workspace is essential to effectively accomplish shared tasks. Strong communication among partners is essential for collaborative efficiency. This research investigates an approach to non-verbal communication cues. The system focuses on integrating human motion detection with vision sensors. This method addresses the bias human action detection in frames and enhances the accuracy of perception as information about human activities to the robot. By interpreting spatial and temporal data, the system detects human movements through sequences of human activity frames while working together. The training and validation results confirm that the approach achieves an accuracy of 91%. The sequential testing performance showed an average detection of 83%. This research not only emphasizes the importance of advanced communication in human–robot collaboration, but also effectively promotes future developments in collaborative robotics.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"30 1","pages":"12 - 20"},"PeriodicalIF":0.8,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143481147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial life and robotics celebrates its 30th anniversary 人工生命和机器人迎来30周年纪念
IF 0.8 Q4 ROBOTICS Pub Date : 2025-01-30 DOI: 10.1007/s10015-025-01009-1
Fumitoshi Matsuno
{"title":"Artificial life and robotics celebrates its 30th anniversary","authors":"Fumitoshi Matsuno","doi":"10.1007/s10015-025-01009-1","DOIUrl":"10.1007/s10015-025-01009-1","url":null,"abstract":"","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"30 1","pages":"1 - 2"},"PeriodicalIF":0.8,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143481146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reducing the effect of face orientation using FaceMesh landmarks in drowsiness estimation based on facial thermal images 基于人脸热图像的困倦估计中,使用FaceMesh标记减少人脸方向的影响
IF 0.8 Q4 ROBOTICS Pub Date : 2025-01-26 DOI: 10.1007/s10015-024-01001-1
Ayaka Nomura, Atsushi Yoshida, Kent Nagumo, Akio Nozawa

In this study, facial skin temperature distribution (FSTD) is focused on as a new driver monitoring index. FSTD is an autonomic index that can be measured remotely. Studies have been conducted to estimate drowsiness based on FSTD using modelng methods such as CNN, a type of deep learning, and sparse modeling, which can be trained with a small amount of data. These studies, however, only evaluated front-facing facial thermal images. FaceMesh is a model that extracts 478 3D facial feature landmarks from a 2D face image. In contrast to conventional models that extract only 68 facial feature landmarks, FaceMesh can extract facial feature landmarks for the entire face, including the cheeks, forehead, and other areas of the face that are in the blind spots. This study aims to improve the accuracy of drowsiness estimation by applying FaceMesh to automatically detect tilted faces and not including tilted images in the training data. As a result, the method proposed in this study improved drowsiness estimation accuracy by about 6% compared to the old method, which did not take face orientation into account.

本研究将面部皮肤温度分布(FSTD)作为一种新的驾驶员监测指标进行研究。FSTD是一种可以远程测量的自主指标。已经有研究使用CNN(一种深度学习)和稀疏建模(可以用少量数据训练)等建模方法基于FSTD来估计困倦。然而,这些研究只评估了正面的面部热图像。FaceMesh是一个从2D人脸图像中提取478个3D面部特征地标的模型。与传统模型只能提取68个面部特征地标不同,FaceMesh可以提取整个脸部的面部特征地标,包括脸颊、前额和其他处于盲点的面部区域。本研究旨在通过应用FaceMesh自动检测倾斜人脸,不将倾斜图像包含在训练数据中,提高困倦估计的准确性。因此,与不考虑面部朝向的旧方法相比,本研究提出的方法将困倦估计精度提高了约6%。
{"title":"Reducing the effect of face orientation using FaceMesh landmarks in drowsiness estimation based on facial thermal images","authors":"Ayaka Nomura,&nbsp;Atsushi Yoshida,&nbsp;Kent Nagumo,&nbsp;Akio Nozawa","doi":"10.1007/s10015-024-01001-1","DOIUrl":"10.1007/s10015-024-01001-1","url":null,"abstract":"<div><p>In this study, facial skin temperature distribution (FSTD) is focused on as a new driver monitoring index. FSTD is an autonomic index that can be measured remotely. Studies have been conducted to estimate drowsiness based on FSTD using modelng methods such as CNN, a type of deep learning, and sparse modeling, which can be trained with a small amount of data. These studies, however, only evaluated front-facing facial thermal images. FaceMesh is a model that extracts 478 3D facial feature landmarks from a 2D face image. In contrast to conventional models that extract only 68 facial feature landmarks, FaceMesh can extract facial feature landmarks for the entire face, including the cheeks, forehead, and other areas of the face that are in the blind spots. This study aims to improve the accuracy of drowsiness estimation by applying FaceMesh to automatically detect tilted faces and not including tilted images in the training data. As a result, the method proposed in this study improved drowsiness estimation accuracy by about 6% compared to the old method, which did not take face orientation into account.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"30 2","pages":"317 - 324"},"PeriodicalIF":0.8,"publicationDate":"2025-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://link.springer.com/content/pdf/10.1007/s10015-024-01001-1.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143925494","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An empirical evaluation of a hierarchical reinforcement learning method towards modular robot control 模块化机器人控制的层次强化学习方法的经验评价
IF 0.8 Q4 ROBOTICS Pub Date : 2025-01-23 DOI: 10.1007/s10015-025-01003-7
Sho Takeda, Satoshi Yamamori, Satoshi Yagi, Jun Morimoto

There is a growing expectation that deep reinforcement learning will enable multi-degree-of-freedom robots to acquire policies suitable for real-world applications. However, a robot system with a variety of components requires many learning trials for each different combination of robot modules. In this study, we propose a hierarchical policy design to segment tasks according to different robot components. The tasks of the multi-module robot are performed by skill sets trained on a component-by-component basis. In our learning approach, each module learns reusable skills, which are then integrated to control the whole robotic system. By adopting component-based learning and reusing previously acquired policies, we transform the action space from continuous to discrete. This transformation reduces the complexity of exploration across the entire robotic system. We validated our proposed method by applying it to a valve rotation task using a combination of a robotic arm and a robotic gripper. Evaluation based on physical simulations showed that hierarchical policy construction improved sample efficiency, achieving performance comparable to the baseline with 46.3% fewer samples.

人们越来越期望深度强化学习将使多自由度机器人能够获得适合现实世界应用的策略。然而,一个具有多种组件的机器人系统需要对机器人模块的每种不同组合进行多次学习试验。在这项研究中,我们提出了一种分层策略设计,根据不同的机器人组件来分割任务。多模块机器人的任务由逐个组件训练的技能集执行。在我们的学习方法中,每个模块学习可重复使用的技能,然后将其集成到控制整个机器人系统中。通过采用基于组件的学习和重用先前获得的策略,我们将动作空间从连续转换为离散。这种转换降低了整个机器人系统探索的复杂性。我们通过将其应用于使用机械臂和机械夹具组合的阀门旋转任务来验证我们提出的方法。基于物理模拟的评估表明,分层策略构建提高了样本效率,在样本减少46.3%的情况下实现了与基线相当的性能。
{"title":"An empirical evaluation of a hierarchical reinforcement learning method towards modular robot control","authors":"Sho Takeda,&nbsp;Satoshi Yamamori,&nbsp;Satoshi Yagi,&nbsp;Jun Morimoto","doi":"10.1007/s10015-025-01003-7","DOIUrl":"10.1007/s10015-025-01003-7","url":null,"abstract":"<div><p>There is a growing expectation that deep reinforcement learning will enable multi-degree-of-freedom robots to acquire policies suitable for real-world applications. However, a robot system with a variety of components requires many learning trials for each different combination of robot modules. In this study, we propose a hierarchical policy design to segment tasks according to different robot components. The tasks of the multi-module robot are performed by skill sets trained on a component-by-component basis. In our learning approach, each module learns reusable skills, which are then integrated to control the whole robotic system. By adopting component-based learning and reusing previously acquired policies, we transform the action space from continuous to discrete. This transformation reduces the complexity of exploration across the entire robotic system. We validated our proposed method by applying it to a valve rotation task using a combination of a robotic arm and a robotic gripper. Evaluation based on physical simulations showed that hierarchical policy construction improved sample efficiency, achieving performance comparable to the baseline with 46.3% fewer samples.</p></div>","PeriodicalId":46050,"journal":{"name":"Artificial Life and Robotics","volume":"30 2","pages":"245 - 251"},"PeriodicalIF":0.8,"publicationDate":"2025-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143925714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Life and Robotics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1