首页 > 最新文献

Cognitive Computation and Systems最新文献

英文 中文
Exploring conventional enhancement and separation methods for multi-speech enhancement in indoor environments 探索室内环境下多语音增强的常规增强和分离方法
Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-05-30 DOI: 10.1049/ccs2.12023
Yangjie Wei, Ke Zhang, Dan Wu, Zhongqi Hu

Speech enhancement is an important preprocessing step in a wide diversity of practical fields related to speech signals, and many signal-processing methods have already been proposed for speech enhancement. However, the lack of a comprehensive and quantitative evaluation of enhancement performance for multi-speech makes it difficult to choose an appropriate enhancement method for a multi-speech application. This work aims to study the implementation of several enhancement methods for multi-speech enhancement in indoor environments of T60 = 0 s and T60 = 0.3 s. Two types of enhancement approaches are proposed and compared. The first type is the basic enhancement methods, including delay-and-sum beamforming (DSB), minimum variance distortionless response (MVDR), linearly constrained minimum variance (LCMV), and independent component analysis (ICA). The second type is the robust enhancement methods, including improved MVDR and LCMV realized by eigendecomposition and diagonal loading. In addition, online enhancement performance based on the iteration of single-frame speech signals is researched, as is the comprehensive performance of various enhancement methods. The experimental results show that the enhancement effects of LCMV and ICA are relatively more stable in the case of basic enhancement methods; in the case of the improved enhancement algorithms, methods that employ diagonal loading iterations show better performance. In terms of online enhancement, DSB with frequency masking (FM) yields the best performance on the signal-to-interference ratio (SIR) and can suppress interference. The comprehensive performance test showed that LCMV and ICA yielded the best effects when there was no reverberation, while DSB with FM yielded the best SIR value when reverberation was present.

语音增强是语音信号广泛应用领域中重要的预处理步骤,针对语音增强已经提出了许多信号处理方法。然而,由于对多语音增强性能缺乏全面、定量的评价,因此难以为多语音应用选择合适的增强方法。本工作旨在研究几种增强方法在T60 = 0 s和T60 = 0.3 s室内环境下的多语音增强实现。提出并比较了两种增强方法。第一类是基本增强方法,包括延迟和波束形成(DSB)、最小方差无失真响应(MVDR)、线性约束最小方差(LCMV)和独立分量分析(ICA)。第二类是鲁棒增强方法,包括通过特征分解和对角加载实现改进的MVDR和LCMV。此外,还研究了基于单帧语音信号迭代的在线增强性能,以及各种增强方法的综合性能。实验结果表明,在基本增强方法下,LCMV和ICA的增强效果相对更稳定;在改进的增强算法中,采用对角加载迭代的方法表现出更好的性能。在在线增强方面,带频率掩蔽(FM)的DSB在信干扰比(SIR)方面的性能最好,并且可以抑制干扰。综合性能测试表明,LCMV和ICA在无混响情况下的SIR效果最好,而DSB和FM在有混响情况下的SIR效果最好。
{"title":"Exploring conventional enhancement and separation methods for multi-speech enhancement in indoor environments","authors":"Yangjie Wei,&nbsp;Ke Zhang,&nbsp;Dan Wu,&nbsp;Zhongqi Hu","doi":"10.1049/ccs2.12023","DOIUrl":"10.1049/ccs2.12023","url":null,"abstract":"<p>Speech enhancement is an important preprocessing step in a wide diversity of practical fields related to speech signals, and many signal-processing methods have already been proposed for speech enhancement. However, the lack of a comprehensive and quantitative evaluation of enhancement performance for multi-speech makes it difficult to choose an appropriate enhancement method for a multi-speech application. This work aims to study the implementation of several enhancement methods for multi-speech enhancement in indoor environments of T60 = 0 s and T60 = 0.3 s. Two types of enhancement approaches are proposed and compared. The first type is the basic enhancement methods, including delay-and-sum beamforming (DSB), minimum variance distortionless response (MVDR), linearly constrained minimum variance (LCMV), and independent component analysis (ICA). The second type is the robust enhancement methods, including improved MVDR and LCMV realized by eigendecomposition and diagonal loading. In addition, online enhancement performance based on the iteration of single-frame speech signals is researched, as is the comprehensive performance of various enhancement methods. The experimental results show that the enhancement effects of LCMV and ICA are relatively more stable in the case of basic enhancement methods; in the case of the improved enhancement algorithms, methods that employ diagonal loading iterations show better performance. In terms of online enhancement, DSB with frequency masking (FM) yields the best performance on the signal-to-interference ratio (SIR) and can suppress interference. The comprehensive performance test showed that LCMV and ICA yielded the best effects when there was no reverberation, while DSB with FM yielded the best SIR value when reverberation was present.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"3 4","pages":"307-322"},"PeriodicalIF":0.0,"publicationDate":"2021-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12023","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132352157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Augmented reality display of neurosurgery craniotomy lesions based on feature contour matching 基于特征轮廓匹配的神经外科开颅病变增强现实显示
Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-05-21 DOI: 10.1049/ccs2.12021
Hao Zhang, Qi-Yuan Sun, Zhen-Zhong Liu

Traditional neurosurgical craniotomy primarily uses two-dimensional cranial medical images to estimate the location of a patient’s intracranial lesions. Such work relies on the experience and skills of the doctor and may result in accidental injury to important intracranial physiological tissues. To help doctors more intuitively determine patient lesion information and improve the accuracy of surgical route formulation and craniotomy safety, an augmented reality method for displaying neurosurgery craniotomy lesions based on feature contour matching is proposed. This method uses threshold segmentation and region growing algorithms to reconstruct a 3-D Computed tomography image of the patient’s head. The augmented reality engine is used to adjust the reconstruction model’s relevant parameters to meet the doctor’s requirements and determine the augmented reality matching method for feature contour matching. By using the mobile terminal to align the real skull model, the virtual lesion model is displayed. Using the designed user interface, doctors can view the patient’s personal information and can zoom in, zoom out, and rotate the virtual model. Therefore, the patient’s lesions information can be visualized accurately, which provides a visual basis for preoperative preparation.

传统的神经外科开颅术主要使用二维颅医学图像来估计患者颅内病变的位置。这种工作依赖于医生的经验和技能,并可能导致颅内重要生理组织的意外损伤。为了帮助医生更直观地判断患者病变信息,提高手术路径制定的准确性和开颅安全性,提出了一种基于特征轮廓匹配的增强现实神经外科开颅病变显示方法。该方法使用阈值分割和区域增长算法重建患者头部的三维计算机断层图像。利用增强现实引擎调整重建模型的相关参数以满足医生的要求,并确定增强现实匹配方法进行特征轮廓匹配。通过移动终端对真实颅骨模型进行对齐,显示虚拟病变模型。通过设计的用户界面,医生可以查看患者的个人信息,并可以放大、缩小和旋转虚拟模型。因此,可以准确地可视化患者的病变信息,为术前准备提供视觉依据。
{"title":"Augmented reality display of neurosurgery craniotomy lesions based on feature contour matching","authors":"Hao Zhang,&nbsp;Qi-Yuan Sun,&nbsp;Zhen-Zhong Liu","doi":"10.1049/ccs2.12021","DOIUrl":"10.1049/ccs2.12021","url":null,"abstract":"<p>Traditional neurosurgical craniotomy primarily uses two-dimensional cranial medical images to estimate the location of a patient’s intracranial lesions. Such work relies on the experience and skills of the doctor and may result in accidental injury to important intracranial physiological tissues. To help doctors more intuitively determine patient lesion information and improve the accuracy of surgical route formulation and craniotomy safety, an augmented reality method for displaying neurosurgery craniotomy lesions based on feature contour matching is proposed. This method uses threshold segmentation and region growing algorithms to reconstruct a 3-D Computed tomography image of the patient’s head. The augmented reality engine is used to adjust the reconstruction model’s relevant parameters to meet the doctor’s requirements and determine the augmented reality matching method for feature contour matching. By using the mobile terminal to align the real skull model, the virtual lesion model is displayed. Using the designed user interface, doctors can view the patient’s personal information and can zoom in, zoom out, and rotate the virtual model. Therefore, the patient’s lesions information can be visualized accurately, which provides a visual basis for preoperative preparation.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"3 3","pages":"221-228"},"PeriodicalIF":0.0,"publicationDate":"2021-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12021","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129447576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of instantaneous likeability of advertisements using deep learning 利用深度学习预测广告的瞬时喜爱度
Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-05-21 DOI: 10.1049/ccs2.12022
Dipayan Saha, S.M.Mahbubur Rahman, Mohammad Tariqul Islam, M. Omair Ahmad, M.N.S. Swamy

The degree to which advertisements are successful is of prime concern for vendors in highly competitive global markets. Given the astounding growth of multimedia content on the internet, online marketing has become another form of advertising. Researchers consider advertisement likeability a major predictor of effective market penetration. An algorithm is presented to predict how much an advertisement clip will be liked with the aid of an end-to-end audiovisual feature extraction process using cognitive computing technology. Specifically, the usefulness of different spatial and time-domain deep-learning architectures such as convolutional neural and long short-term memory networks is investigated to predict the frame-by-frame instantaneous and root mean square likeability of advertisement clips. A data set named the ‘BUET Advertisement Likeness Data Set’, containing annotations of frame-wise likeability scores for various categories of advertisements, is also introduced. Experiments with the developed database show that the proposed algorithm performs better than existing methods in terms of commonly used performance indices at the expense of slightly increased computational complexity.

在竞争激烈的全球市场上,广告的成功程度是供应商最关心的问题。鉴于互联网上多媒体内容的惊人增长,在线营销已成为另一种形式的广告。研究人员认为,广告的受欢迎程度是有效市场渗透的主要预测因素。提出了一种基于认知计算的端到端视听特征提取算法来预测广告片段的受欢迎程度。具体而言,研究了不同空间和时域深度学习架构(如卷积神经网络和长短期记忆网络)在预测广告片段逐帧瞬时和均方根喜爱度方面的有用性。还介绍了一个名为“BUET广告相似性数据集”的数据集,该数据集包含对各种类别广告的逐帧喜爱度分数的注释。在开发的数据库上进行的实验表明,该算法在常用性能指标方面优于现有方法,但计算复杂度略有增加。
{"title":"Prediction of instantaneous likeability of advertisements using deep learning","authors":"Dipayan Saha,&nbsp;S.M.Mahbubur Rahman,&nbsp;Mohammad Tariqul Islam,&nbsp;M. Omair Ahmad,&nbsp;M.N.S. Swamy","doi":"10.1049/ccs2.12022","DOIUrl":"10.1049/ccs2.12022","url":null,"abstract":"<p>The degree to which advertisements are successful is of prime concern for vendors in highly competitive global markets. Given the astounding growth of multimedia content on the internet, online marketing has become another form of advertising. Researchers consider advertisement likeability a major predictor of effective market penetration. An algorithm is presented to predict how much an advertisement clip will be liked with the aid of an end-to-end audiovisual feature extraction process using cognitive computing technology. Specifically, the usefulness of different spatial and time-domain deep-learning architectures such as convolutional neural and long short-term memory networks is investigated to predict the frame-by-frame instantaneous and root mean square likeability of advertisement clips. A data set named the ‘BUET Advertisement Likeness Data Set’, containing annotations of frame-wise likeability scores for various categories of advertisements, is also introduced. Experiments with the developed database show that the proposed algorithm performs better than existing methods in terms of commonly used performance indices at the expense of slightly increased computational complexity.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"3 3","pages":"263-275"},"PeriodicalIF":0.0,"publicationDate":"2021-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12022","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122800371","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Design and research of a robotic system for ultrasonic-assisted lamellar keratoplasty 超声辅助板层角膜移植术机器人系统的设计与研究
Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-05-21 DOI: 10.1049/ccs2.12020
Jingjing Xiao, Mengqiong Li, Chiming Wang, Jun Pi, Hui He

In order to solve the problem of uncontrollable cutting depth and the rough incision edge of the cornea with manual trephine in lamellar keratoplasty, an ultrasonic-assisted corneal trephination method has been proposed for the first time in accordance with the advantage of ultrasonic vibration cutting, and the corresponding robotic system has been designed and researched. According to the traditional process of lamellar keratoplasty, the requirements of the surgical robotic system were first proposed. On this basis, the robotic system was designed and its schematic diagram was introduced. Second, the key components of the robotic body such as the eccentric adjusting mechanism and the end-effector of ultrasonic scalpel were illustrated, which can realise corneal trephination of different incision diameters without scalpel replacement. Then the operation flow chart of a robot-assisted lamellar keratoplasty was put forward. Finally, the preliminary verified experiments were performed using a grape and a porcine eyeball, respectively, in vitro with the prototype system. The results show that the robotic system can basically satisfy the operation requirements of lamellar keratoplasty. Owing to the less cutting force and smoother corneal incision edge of ultrasonic-assisted lamellar keratoplasty compared with manual trephine, it was proved to be more feasible and superior.

针对板层角膜移植术中手工钻孔机切割深度不可控和角膜切口边缘粗糙的问题,根据超声振动切割的优点,首次提出了超声辅助角膜钻孔机的方法,并对相应的机器人系统进行了设计和研究。根据板层角膜移植术的传统工艺,首先提出了手术机器人系统的要求。在此基础上,对机器人系统进行了设计,并给出了系统原理图。其次,对机器人主体的关键部件偏心调节机构和超声手术刀末端执行器进行了分析,实现了在不更换手术刀的情况下实现不同切口直径的角膜穿刺;提出了机器人辅助板层角膜移植术的手术流程。最后,用葡萄眼球和猪眼球分别在体外进行了初步验证实验。结果表明,该机器人系统基本满足板层角膜移植术的操作要求。超声辅助板层角膜移植术由于切割力小,切口边缘较手工环钻平滑,证明超声辅助板层角膜移植术更为可行和优越。
{"title":"Design and research of a robotic system for ultrasonic-assisted lamellar keratoplasty","authors":"Jingjing Xiao,&nbsp;Mengqiong Li,&nbsp;Chiming Wang,&nbsp;Jun Pi,&nbsp;Hui He","doi":"10.1049/ccs2.12020","DOIUrl":"10.1049/ccs2.12020","url":null,"abstract":"<p>In order to solve the problem of uncontrollable cutting depth and the rough incision edge of the cornea with manual trephine in lamellar keratoplasty, an ultrasonic-assisted corneal trephination method has been proposed for the first time in accordance with the advantage of ultrasonic vibration cutting, and the corresponding robotic system has been designed and researched. According to the traditional process of lamellar keratoplasty, the requirements of the surgical robotic system were first proposed. On this basis, the robotic system was designed and its schematic diagram was introduced. Second, the key components of the robotic body such as the eccentric adjusting mechanism and the end-effector of ultrasonic scalpel were illustrated, which can realise corneal trephination of different incision diameters without scalpel replacement. Then the operation flow chart of a robot-assisted lamellar keratoplasty was put forward. Finally, the preliminary verified experiments were performed using a grape and a porcine eyeball, respectively, in vitro with the prototype system. The results show that the robotic system can basically satisfy the operation requirements of lamellar keratoplasty. Owing to the less cutting force and smoother corneal incision edge of ultrasonic-assisted lamellar keratoplasty compared with manual trephine, it was proved to be more feasible and superior.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"3 4","pages":"297-306"},"PeriodicalIF":0.0,"publicationDate":"2021-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12020","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126896621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Questioning ‘what makes us human’: How audiences react to an artificial intelligence–driven show 质疑“是什么让我们成为人类”:观众对人工智能驱动的节目的反应
Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-05-12 DOI: 10.1049/ccs2.12018
Rob Eagle, Rik Lander, Phil D. Hall

I am Echoborg is promoted as ‘a show created afresh each time by the audience in conversation with an artificial intelligence (AI)’. The show demonstrates how AI in a creative and performance context can raise questions about the technology’s ethical use for persuasion and compliance, and how humans can reclaim agency. This audience study focuses on a consecutive three-night run in Bristol, UK in October 2019. The different outcomes of each show illustrate the unpredictability of audience interactions with conversational AI and how the collective dynamic of audience members shapes each performance. This study analyses (1) how I am Echoborg facilitates audience cocreation in a live performance context, (2) the show’s capacity to provoke nuanced understandings of the potential for AI and (3) the ability for intelligent technology to facilitate social interaction and group collaboration. This audience study demonstrates how the show inspires debate beyond binary conclusions (i.e. AI as good or bad) and how audiences can understand potential creative uses of AI, including as a tool for cocreating entertainment with (not just for) them.

《我是Echoborg》被宣传为“观众与人工智能(AI)对话,每次都重新创作的节目”。该节目展示了在创意和表演环境下的人工智能如何引发人们对该技术在说服和合规方面的道德使用的质疑,以及人类如何重新获得代理权。这项观众研究的重点是2019年10月在英国布里斯托尔连续三晚的演出。每场演出的不同结果说明了观众与对话式人工智能互动的不可预测性,以及观众的集体动态如何影响每场演出。本研究分析了(1)I am Echoborg如何在现场表演环境中促进观众的共同创造,(2)该节目激发对人工智能潜力的细微理解的能力,以及(3)智能技术促进社会互动和群体协作的能力。这项观众研究展示了该节目如何激发辩论,而不是二元结论(即人工智能是好是坏),以及观众如何理解人工智能的潜在创造性用途,包括作为与他们共同创造娱乐的工具(而不仅仅是为了他们)。
{"title":"Questioning ‘what makes us human’: How audiences react to an artificial intelligence–driven show","authors":"Rob Eagle,&nbsp;Rik Lander,&nbsp;Phil D. Hall","doi":"10.1049/ccs2.12018","DOIUrl":"10.1049/ccs2.12018","url":null,"abstract":"<p>I am Echoborg is promoted as ‘a show created afresh each time by the audience in conversation with an artificial intelligence (AI)’. The show demonstrates how AI in a creative and performance context can raise questions about the technology’s ethical use for persuasion and compliance, and how humans can reclaim agency. This audience study focuses on a consecutive three-night run in Bristol, UK in October 2019. The different outcomes of each show illustrate the unpredictability of audience interactions with conversational AI and how the collective dynamic of audience members shapes each performance. This study analyses (1) how I am Echoborg facilitates audience cocreation in a live performance context, (2) the show’s capacity to provoke nuanced understandings of the potential for AI and (3) the ability for intelligent technology to facilitate social interaction and group collaboration. This audience study demonstrates how the show inspires debate beyond binary conclusions (i.e. AI as good or bad) and how audiences can understand potential creative uses of AI, including as a tool for cocreating entertainment <i>with</i> (not just for) them.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"3 2","pages":"91-99"},"PeriodicalIF":0.0,"publicationDate":"2021-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12018","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125864216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Multi-attribute quantitative bearing fault diagnosis based on convolutional neural network 基于卷积神经网络的多属性轴承故障定量诊断
Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-05-04 DOI: 10.1049/ccs2.12016
Shixin Zhang, Qin Lv, Shenlin Zhang, Jianhua Shan

Existing bearing fault diagnosis methods have some disadvantages, one being that most methods cannot completely consider all specific fault attributes. Another disadvantage is that the qualitative diagnosis method considers different fault types as a whole, and qualitative diagnosis of a single fault attribute is complicated. A convolutional neural network is proposed for application in the multi-attribute quantitative bearing fault diagnosis. Multiple combinations of convolutional layers are adopted to directly extract features from one-dimensional vibration signals. In addition, a softmax layer is designed to realise the simultaneous recognition of different fault attributes. The advantage of this approach is that it can realise diagnostic results for any combination of fault attributes and corresponding types, which overcomes the disadvantage of single attribute recognition in the traditional method. The method is simple but has strong generalisation ability with average diagnostic accuracy of more than 95%. According to bearing data from Case Western Reserve University and laboratory experiments by the authors, the results verify that the method can accurately and quantitatively diagnose bearing faults.

现有的轴承故障诊断方法存在一些缺点,一是大多数方法不能完全考虑所有特定的故障属性。定性诊断方法的另一个缺点是将不同的故障类型作为一个整体来考虑,对单个故障属性进行定性诊断比较复杂。提出了一种卷积神经网络在多属性轴承故障定量诊断中的应用。采用卷积层的多重组合直接从一维振动信号中提取特征。此外,还设计了softmax层,实现了对不同故障属性的同时识别。该方法的优点是可以实现故障属性及其对应类型的任意组合的诊断结果,克服了传统方法单一属性识别的缺点。该方法简便,泛化能力强,平均诊断准确率达95%以上。根据凯斯西储大学的轴承数据和作者的实验室实验,结果验证了该方法可以准确定量地诊断轴承故障。
{"title":"Multi-attribute quantitative bearing fault diagnosis based on convolutional neural network","authors":"Shixin Zhang,&nbsp;Qin Lv,&nbsp;Shenlin Zhang,&nbsp;Jianhua Shan","doi":"10.1049/ccs2.12016","DOIUrl":"10.1049/ccs2.12016","url":null,"abstract":"<p>Existing bearing fault diagnosis methods have some disadvantages, one being that most methods cannot completely consider all specific fault attributes. Another disadvantage is that the qualitative diagnosis method considers different fault types as a whole, and qualitative diagnosis of a single fault attribute is complicated. A convolutional neural network is proposed for application in the multi-attribute quantitative bearing fault diagnosis. Multiple combinations of convolutional layers are adopted to directly extract features from one-dimensional vibration signals. In addition, a softmax layer is designed to realise the simultaneous recognition of different fault attributes. The advantage of this approach is that it can realise diagnostic results for any combination of fault attributes and corresponding types, which overcomes the disadvantage of single attribute recognition in the traditional method. The method is simple but has strong generalisation ability with average diagnostic accuracy of more than 95%. According to bearing data from Case Western Reserve University and laboratory experiments by the authors, the results verify that the method can accurately and quantitatively diagnose bearing faults.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"3 4","pages":"287-296"},"PeriodicalIF":0.0,"publicationDate":"2021-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12016","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116346485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Multi-modal broad learning for material recognition 材料识别的多模态广义学习
Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-04-21 DOI: 10.1049/ccs2.12004
Zhaoxin Wang, Huaping Liu, Xinying Xu, Fuchun Sun
Joint Fund of Science & Technology Department of Liaoning Province and State Key Laboratory of Robotics, China, Grant/Award Number: 2020‐KF‐ 22‐06 Abstract Material recognition plays an important role in the interaction between robots and the external environment. For example, household service robots need to replace humans in the home environment to complete housework, so they need to interact with daily necessities and obtain their material performance. Images provide rich visual information about objects; however, it is often difficult to apply when objects are not visually distinct. In addition, tactile signals can be used to capture multiple characteristics of objects, such as texture, roughness, softness, and friction, which provides another crucial way for perception. How to effectively integrate multi‐modal information is an urgent problem to be addressed. Therefore, a multi‐modal material recognition framework CFBRL‐KCCA for target recognition tasks is proposed in the paper. The preliminary features of each model are extracted by cascading broad learning, which is combined with the kernel canonical correlation learning, considering the differences among different models of heterogeneous data. Finally, the open dataset of household objects is evaluated. The results demonstrate that the proposed fusion algorithm provides an effective strategy for material recognition.
材料识别在机器人与外界环境的交互中起着重要的作用。例如,家庭服务机器人需要在家庭环境中代替人类完成家务,因此需要与生活用品进行交互,获取其物质性能。图像提供了关于物体的丰富视觉信息;然而,当对象在视觉上不明显时,通常很难应用。此外,触觉信号可以用来捕捉物体的多种特征,如纹理、粗糙度、柔软度和摩擦度,这为感知提供了另一种重要途径。如何有效地整合多模态信息是一个亟待解决的问题。为此,本文提出了一种用于目标识别任务的多模态材料识别框架CFBRL-KCCA。考虑到异构数据的不同模型之间的差异,采用级联广义学习和核典型相关学习相结合的方法提取每个模型的初步特征。最后,对家庭对象开放数据集进行评估。结果表明,该融合算法为材料识别提供了一种有效的策略。
{"title":"Multi-modal broad learning for material recognition","authors":"Zhaoxin Wang,&nbsp;Huaping Liu,&nbsp;Xinying Xu,&nbsp;Fuchun Sun","doi":"10.1049/ccs2.12004","DOIUrl":"10.1049/ccs2.12004","url":null,"abstract":"Joint Fund of Science & Technology Department of Liaoning Province and State Key Laboratory of Robotics, China, Grant/Award Number: 2020‐KF‐ 22‐06 Abstract Material recognition plays an important role in the interaction between robots and the external environment. For example, household service robots need to replace humans in the home environment to complete housework, so they need to interact with daily necessities and obtain their material performance. Images provide rich visual information about objects; however, it is often difficult to apply when objects are not visually distinct. In addition, tactile signals can be used to capture multiple characteristics of objects, such as texture, roughness, softness, and friction, which provides another crucial way for perception. How to effectively integrate multi‐modal information is an urgent problem to be addressed. Therefore, a multi‐modal material recognition framework CFBRL‐KCCA for target recognition tasks is proposed in the paper. The preliminary features of each model are extracted by cascading broad learning, which is combined with the kernel canonical correlation learning, considering the differences among different models of heterogeneous data. Finally, the open dataset of household objects is evaluated. The results demonstrate that the proposed fusion algorithm provides an effective strategy for material recognition.","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"3 2","pages":"123-130"},"PeriodicalIF":0.0,"publicationDate":"2021-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12004","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122481833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Research on intelligent service of customer service system 客服系统的智能服务研究
Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-04-16 DOI: 10.1049/ccs2.12012
Jinji Nie, Qi Wang, Jianbin Xiong

With the development of the wireless network, from 4G network to 5G network, people's communication quality has improved significantly and the processing requirements of operators' customer service systems will ameliorate, whereas the business undertaken by the intelligent network becomes more difficult. Customer service system, which can convey files and video, has evolved from manual to intelligent. At the same time, this system establishes a knowledge base based on the process of solving problems with customers. The customer service system can also undertake the task of process control within the enterprise. The ultimate goal is to understand the needs of customers through the knowledge base and develop corporate products based on customer data. Furthermore, this study proposes a network architecture of an intelligent customer service system to provide a reference for the construction.

随着无线网络的发展,从4G网络到5G网络,人们的通信质量显著提高,运营商客户服务系统的处理要求也将改善,而智能网络承担的业务则变得更加困难。客户服务系统已经从手动向智能发展,可以传输文件和视频。同时,该系统建立了一个基于客户解决问题过程的知识库。客户服务系统还可以承担企业内部的过程控制任务。最终目标是通过知识库了解客户的需求,并基于客户数据开发企业产品。此外,本研究还提出了智能客服系统的网络架构,为智能客服系统的建设提供参考。
{"title":"Research on intelligent service of customer service system","authors":"Jinji Nie,&nbsp;Qi Wang,&nbsp;Jianbin Xiong","doi":"10.1049/ccs2.12012","DOIUrl":"10.1049/ccs2.12012","url":null,"abstract":"<p>With the development of the wireless network, from 4G network to 5G network, people's communication quality has improved significantly and the processing requirements of operators' customer service systems will ameliorate, whereas the business undertaken by the intelligent network becomes more difficult. Customer service system, which can convey files and video, has evolved from manual to intelligent. At the same time, this system establishes a knowledge base based on the process of solving problems with customers. The customer service system can also undertake the task of process control within the enterprise. The ultimate goal is to understand the needs of customers through the knowledge base and develop corporate products based on customer data. Furthermore, this study proposes a network architecture of an intelligent customer service system to provide a reference for the construction.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"3 3","pages":"197-205"},"PeriodicalIF":0.0,"publicationDate":"2021-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12012","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122539417","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research and sustainable design of wearable sensor for clothing based on body area network 基于体域网络的服装可穿戴传感器的研究与可持续设计
Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-04-16 DOI: 10.1049/ccs2.12014
Ren Xiangfang, Shen Lei, Liu Miaomiao, Zhang Xiying, Chen Han

The body area network (BAN) is composed of every wearable device network on the body to share information and data, which is applied in medical and health, especially in the direction of intelligent clothing. A wearable device is an integrated body of multi-sensor fusion. At the same time, the multi-dimensional needs of users and the unique problems of sensors appear. How to solve the problems of wearable sensors and sustainable design is the research focus. Based on the wearable sensor in the critical factor of wearable device fusion, this paper analyses the classification, technology, and current situation of a wearable sensor, discusses the problems of a wearable sensor for BAN from the aspects of human–computer interaction experience, data accuracy, multiple interaction modes, and battery power supply, and summarizes the direction of multi-sensor fusion, compatible biosensor materials, and low power consumption and high sensitivity. The sustainable design direction of visibility design, identification of use scenarios, short-term human–computer interaction, interaction process reduction, and integration invisibility are introduced. The integration research of wearable sensors is the future trend, and it has been widely used in medical and health, intelligent clothing, wireless communication, military, automobile, and other fields.

人体区域网络(body area network, BAN)是由人体上的各个可穿戴设备组成的网络,实现信息和数据的共享,在医疗健康,尤其是智能服装方向上的应用。可穿戴设备是多传感器融合的集成体。同时,用户的多维需求和传感器的独特性问题也随之显现。如何解决可穿戴传感器的可持续性设计问题是研究的重点。基于可穿戴传感器在可穿戴设备融合中的关键因素,本文分析了可穿戴传感器的分类、技术和现状,从人机交互体验、数据精度、多种交互模式、电池供电等方面探讨了BAN可穿戴传感器存在的问题,总结了多传感器融合、兼容生物传感器材料、低功耗高灵敏度的发展方向。介绍了可视性设计、使用场景识别、短期人机交互、减少交互过程、集成不可见等可持续设计方向。可穿戴传感器的集成化研究是未来的发展趋势,已广泛应用于医疗健康、智能服装、无线通信、军事、汽车等领域。
{"title":"Research and sustainable design of wearable sensor for clothing based on body area network","authors":"Ren Xiangfang,&nbsp;Shen Lei,&nbsp;Liu Miaomiao,&nbsp;Zhang Xiying,&nbsp;Chen Han","doi":"10.1049/ccs2.12014","DOIUrl":"10.1049/ccs2.12014","url":null,"abstract":"<p>The body area network (BAN) is composed of every wearable device network on the body to share information and data, which is applied in medical and health, especially in the direction of intelligent clothing. A wearable device is an integrated body of multi-sensor fusion. At the same time, the multi-dimensional needs of users and the unique problems of sensors appear. How to solve the problems of wearable sensors and sustainable design is the research focus. Based on the wearable sensor in the critical factor of wearable device fusion, this paper analyses the classification, technology, and current situation of a wearable sensor, discusses the problems of a wearable sensor for BAN from the aspects of human–computer interaction experience, data accuracy, multiple interaction modes, and battery power supply, and summarizes the direction of multi-sensor fusion, compatible biosensor materials, and low power consumption and high sensitivity. The sustainable design direction of visibility design, identification of use scenarios, short-term human–computer interaction, interaction process reduction, and integration invisibility are introduced. The integration research of wearable sensors is the future trend, and it has been widely used in medical and health, intelligent clothing, wireless communication, military, automobile, and other fields.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"3 3","pages":"206-220"},"PeriodicalIF":0.0,"publicationDate":"2021-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12014","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133933982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Deep learning techniques-based perfection of multi-sensor fusion oriented human-robot interaction system for identification of dense organisms 基于深度学习技术的面向多传感器融合的密集生物识别人机交互系统的完善
Q4 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2021-04-16 DOI: 10.1049/ccs2.12010
Haiju Li, Chuntang Zhang, Jingwen Bo, Zhongjun Ding

For detection of dense small-target organisms with indistinct features in complex background, the efficiency and accuracy of traditional target detection methods are low. Multi-sensor fusion oriented human-robot interaction (HRI) system has facilitated biologists to process and analyse data. For this, several deep learning models based on convolutional neural network (CNN) are improved and compared to study the species and density of dense organisms in deep-sea hydrothermal vent, which are fused it with related environmental information given by position sensors and conductivity-temperature-depth (CTD) sensors, so as to perfect multi-sensor fusion oriented HRI system. Firstly, the authors combined different meta-architectures and different feature extractors, and obtained five object identification algorithms based on CNN. Then, they compared computational cost of feature extractors and weighed the pros and cons of each algorithm from mean detection speed, correlation coefficient and mean class-specific confidence score to confirm that Faster Region-based CNN (R-CNN)_InceptionNet is the best algorithm applicable to hydrothermal vent biological dataset. Finally, they calculated the cognitive accuracy of rimicaris exoculata in dense and sparse areas, which were 88.3% and 95.9% respectively, to analyse the performance of the Faster R-CNN_InceptionNet. Results show that the proposed method can be used in the multi-sensor fusion oriented HRI system for the statistics of dense organisms in complex environments.

对于复杂背景下特征模糊的密集小靶点生物,传统的目标检测方法效率和精度较低。面向多传感器融合的人机交互(HRI)系统为生物学家处理和分析数据提供了便利。为此,对几种基于卷积神经网络(CNN)的深度学习模型进行改进和比较,研究深海热液喷口中密集生物的种类和密度,并将其与位置传感器和电导率-温度-深度(CTD)传感器给出的相关环境信息融合,完善面向多传感器融合的HRI系统。首先,结合不同的元架构和不同的特征提取器,得到了5种基于CNN的目标识别算法;然后,他们比较了特征提取器的计算成本,并从平均检测速度、相关系数和平均类特异性置信度评分等方面权衡了每种算法的优缺点,确认Faster Region-based CNN (R-CNN)_InceptionNet是适用于热液喷口生物数据集的最佳算法。最后,他们计算了密集和稀疏区域的外眼小眼的认知准确率,分别为88.3%和95.9%,以分析Faster R-CNN_InceptionNet的性能。结果表明,该方法可用于多传感器融合的HRI系统中,用于复杂环境中密集生物的统计。
{"title":"Deep learning techniques-based perfection of multi-sensor fusion oriented human-robot interaction system for identification of dense organisms","authors":"Haiju Li,&nbsp;Chuntang Zhang,&nbsp;Jingwen Bo,&nbsp;Zhongjun Ding","doi":"10.1049/ccs2.12010","DOIUrl":"10.1049/ccs2.12010","url":null,"abstract":"<p>For detection of dense small-target organisms with indistinct features in complex background, the efficiency and accuracy of traditional target detection methods are low. Multi-sensor fusion oriented human-robot interaction (HRI) system has facilitated biologists to process and analyse data. For this, several deep learning models based on convolutional neural network (CNN) are improved and compared to study the species and density of dense organisms in deep-sea hydrothermal vent, which are fused it with related environmental information given by position sensors and conductivity-temperature-depth (CTD) sensors, so as to perfect multi-sensor fusion oriented HRI system. Firstly, the authors combined different meta-architectures and different feature extractors, and obtained five object identification algorithms based on CNN. Then, they compared computational cost of feature extractors and weighed the pros and cons of each algorithm from mean detection speed, correlation coefficient and mean class-specific confidence score to confirm that Faster Region-based CNN (R-CNN)_InceptionNet is the best algorithm applicable to hydrothermal vent biological dataset. Finally, they calculated the cognitive accuracy of <i>rimicaris exoculata</i> in dense and sparse areas, which were 88.3% and 95.9% respectively, to analyse the performance of the Faster R-CNN_InceptionNet. Results show that the proposed method can be used in the multi-sensor fusion oriented HRI system for the statistics of dense organisms in complex environments.</p>","PeriodicalId":33652,"journal":{"name":"Cognitive Computation and Systems","volume":"3 3","pages":"187-196"},"PeriodicalIF":0.0,"publicationDate":"2021-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/ccs2.12010","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115715360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Cognitive Computation and Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1