首页 > 最新文献

Cognitive Computation最新文献

英文 中文
An eXplainable Artificial Intelligence Methodology on Big Data Architecture 大数据架构上的易懂人工智能方法论
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-04-11 DOI: 10.1007/s12559-024-10272-6
Valerio La Gatta, Vincenzo Moscato, Marco Postiglione, Giancarlo Sperlì

Although artificial intelligence has become part of everyone’s real life, a trust crisis against such systems is occurring, thus increasing the need to explain black-box predictions, especially in the military, medical, and financial domains. Modern eXplainable Artificial Intelligence (XAI) techniques focus on benchmark datasets, but the cognitive applicability of such solutions under big data settings is still unclear due to memory or computation constraints. In this paper, we extend a model-agnostic XAI methodology, named Cluster-Aided Space Transformation for Local Explanation (CASTLE), to be able to deal with high-volume datasets. CASTLE aims to explain the black-box behavior of predictive models by combining both local (i.e., based on the input sample) and global (i.e., based on the whole scope for action of the model) information. In particular, the local explanation provides a rule-based explanation for the prediction of a target instance as well as the directions to update the likelihood of the predicted class. Our extension leverages modern big data technologies (e.g., Apache Spark) to handle the high volume, variety, and velocity of huge datasets. We have evaluated the framework on five datasets, in terms of temporal efficiency, explanation quality, and model significance. Our results indicate that the proposed approach retains the high-quality explanations associated with CASTLE while efficiently handling large datasets. Importantly, it exhibits a sub-linear, rather than exponential, dependence on dataset size, making it a scalable solution for massive datasets or in any big data scenario.

尽管人工智能已成为每个人现实生活的一部分,但针对此类系统的信任危机正在发生,因此对解释黑箱预测的需求日益增加,尤其是在军事、医疗和金融领域。现代可解释人工智能(XAI)技术侧重于基准数据集,但由于内存或计算的限制,这些解决方案在大数据环境下的认知适用性仍不明确。在本文中,我们扩展了一种与模型无关的 XAI 方法,命名为 "集群辅助空间转换局部解释(CASTLE)",以便能够处理大容量数据集。CASTLE 旨在通过结合局部(即基于输入样本)和全局(即基于模型的整个作用范围)信息来解释预测模型的黑箱行为。特别是,局部解释为目标实例的预测提供了基于规则的解释,也为更新预测类别的可能性提供了方向。我们的扩展利用了现代大数据技术(如 Apache Spark)来处理大量、多样和高速的海量数据集。我们在五个数据集上从时间效率、解释质量和模型意义等方面对该框架进行了评估。我们的结果表明,所提出的方法在高效处理大型数据集的同时,保留了与 CASTLE 相关的高质量解释。重要的是,该方法对数据集规模的依赖程度呈亚线性关系,而非指数关系,这使其成为适用于海量数据集或任何大数据场景的可扩展解决方案。
{"title":"An eXplainable Artificial Intelligence Methodology on Big Data Architecture","authors":"Valerio La Gatta, Vincenzo Moscato, Marco Postiglione, Giancarlo Sperlì","doi":"10.1007/s12559-024-10272-6","DOIUrl":"https://doi.org/10.1007/s12559-024-10272-6","url":null,"abstract":"<p>Although artificial intelligence has become part of everyone’s real life, a trust crisis against such systems is occurring, thus increasing the need to explain black-box predictions, especially in the military, medical, and financial domains. Modern eXplainable Artificial Intelligence (XAI) techniques focus on benchmark datasets, but the cognitive applicability of such solutions under big data settings is still unclear due to memory or computation constraints. In this paper, we extend a model-agnostic XAI methodology, named <i>Cluster-Aided Space Transformation for Local Explanation</i> (CASTLE), to be able to deal with high-volume datasets. CASTLE aims to explain the black-box behavior of predictive models by combining both <i>local</i> (i.e., based on the input sample) and <i>global</i> (i.e., based on the whole scope for action of the model) information. In particular, the local explanation provides a rule-based explanation for the prediction of a target instance as well as the directions to update the likelihood of the predicted class. Our extension leverages modern big data technologies (e.g., Apache Spark) to handle the high volume, variety, and velocity of huge datasets. We have evaluated the framework on five datasets, in terms of temporal efficiency, explanation quality, and model significance. Our results indicate that the proposed approach retains the high-quality explanations associated with CASTLE while efficiently handling large datasets. Importantly, it exhibits a sub-linear, rather than exponential, dependence on dataset size, making it a scalable solution for massive datasets or in any big data scenario.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140567891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Synergy of Human-Centered AI and Cyber-Physical-Social Systems for Enhanced Cognitive Situation Awareness: Applications, Challenges and Opportunities 以人为本的人工智能与网络-物理-社会系统协同增强认知态势感知:应用、挑战和机遇
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-04-11 DOI: 10.1007/s12559-024-10271-7
Saeed Hamood Alsamhi, Santosh Kumar, Ammar Hawbani, Alexey V. Shvetsov, Liang Zhao, Mohsen Guizani

This paper explores the convergence of Human-Centered AI (HCAI) and Cyber-Physical Social Systems (CPSS) in pursuing advanced Cognitive Situation Awareness (CSA). Integrating HCAI principles within CPSS fosters systems prioritizing human needs, values, and experiences, improving perception, understanding, and responsiveness to complex environments. By incorporating transparency, interpretability, and usability into Artificial Intelligence (AI) systems, the human-centered approach enhances user interaction and cooperation with intelligent systems, leading to more adaptive and efficient CPSS. The study employs a comprehensive approach to explore the intersection of HCAI and CPSS. Moreover, the paper presents case studies to illustrate real-world applications of HCAI and CPSS, such as self-driving cars and smart homes, transportation, healthcare, energy management, social media, and emergency response systems. Nevertheless, technical complexities, privacy concerns, and regulatory considerations must be addressed. The paper demonstrates the practical implications of integrating HCAI into CPSS through case studies in various domains. Furthermore, It highlights the positive impact of CSA systems such as self-driving cars, showcasing improvements in transportation. This paper contributes to advancing CSA and designing intelligent systems, promoting human–machine collaboration and societal well-being. By examining the intersection of HCAI and CPSS, this study advances research in CSA and designing intelligent systems prioritizing human needs, values, and experiences.

本文探讨了以人为本的人工智能(HCAI)与网络物理社会系统(CPSS)在追求高级认知态势感知(CSA)方面的融合。在 CPSS 中融入 HCAI 原则,可促进系统优先考虑人类的需求、价值观和体验,提高对复杂环境的感知、理解和响应能力。通过将透明度、可解释性和可用性融入人工智能(AI)系统,以人为本的方法增强了用户与智能系统的互动和合作,从而提高了 CPSS 的适应性和效率。本研究采用了一种综合方法来探索 HCAI 与 CPSS 的交叉点。此外,论文还通过案例研究来说明 HCAI 和 CPSS 在现实世界中的应用,如自动驾驶汽车和智能家居、交通、医疗保健、能源管理、社交媒体和应急响应系统。然而,技术复杂性、隐私问题和监管方面的考虑因素必须得到解决。本文通过不同领域的案例研究,展示了将 HCAI 集成到 CPSS 中的实际意义。此外,本文还强调了 CSA 系统(如自动驾驶汽车)的积极影响,展示了对交通的改善。本文有助于推进 CSA 和设计智能系统,促进人机协作和社会福祉。通过研究 HCAI 和 CPSS 的交叉点,本研究推动了 CSA 研究和优先考虑人类需求、价值观和体验的智能系统设计。
{"title":"Synergy of Human-Centered AI and Cyber-Physical-Social Systems for Enhanced Cognitive Situation Awareness: Applications, Challenges and Opportunities","authors":"Saeed Hamood Alsamhi, Santosh Kumar, Ammar Hawbani, Alexey V. Shvetsov, Liang Zhao, Mohsen Guizani","doi":"10.1007/s12559-024-10271-7","DOIUrl":"https://doi.org/10.1007/s12559-024-10271-7","url":null,"abstract":"<p>This paper explores the convergence of Human-Centered AI (HCAI) and Cyber-Physical Social Systems (CPSS) in pursuing advanced Cognitive Situation Awareness (CSA). Integrating HCAI principles within CPSS fosters systems prioritizing human needs, values, and experiences, improving perception, understanding, and responsiveness to complex environments. By incorporating transparency, interpretability, and usability into Artificial Intelligence (AI) systems, the human-centered approach enhances user interaction and cooperation with intelligent systems, leading to more adaptive and efficient CPSS. The study employs a comprehensive approach to explore the intersection of HCAI and CPSS. Moreover, the paper presents case studies to illustrate real-world applications of HCAI and CPSS, such as self-driving cars and smart homes, transportation, healthcare, energy management, social media, and emergency response systems. Nevertheless, technical complexities, privacy concerns, and regulatory considerations must be addressed. The paper demonstrates the practical implications of integrating HCAI into CPSS through case studies in various domains. Furthermore, It highlights the positive impact of CSA systems such as self-driving cars, showcasing improvements in transportation. This paper contributes to advancing CSA and designing intelligent systems, promoting human–machine collaboration and societal well-being. By examining the intersection of HCAI and CPSS, this study advances research in CSA and designing intelligent systems prioritizing human needs, values, and experiences.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140567888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Perceived Risk Index Leveraging Social Media Data: Assessing Severity of Fire on Microblogging 利用社交媒体数据的感知风险指数:评估微博火灾的严重程度
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-04-10 DOI: 10.1007/s12559-024-10266-4
Carmen De Maio, Giuseppe Fenza, Mariacristina Gallo, Vincenzo Loia, Alberto Volpe

Fires represent a significant threat to the environment, infrastructure, and human safety, often spreading rapidly with wide-ranging consequences such as economic losses and life risks. Early detection and swift response to fire outbreaks are crucial to mitigating their impact. While satellite-based monitoring is effective, it may miss brief or indoor fires. This paper introduces a novel Perceived Risk Index (PRI) that, complementing satellite data, leverages social media data to provide insights into the severity of fire events. In the light of the results of statistical analysis, the PRI incorporates the number of fire-related tweets and the associated emotional expressions to gauge the perceived risk. The index’s evaluation involves the development of a comprehensive system that collects, classifies, annotates, and correlates social media posts with satellite data, presenting the findings in an interactive dashboard. Experimental results using diverse datasets of real-fire tweets demonstrate an average best correlation of 77% between PRI and the brightness values of fires detected by satellites. This correlation extends to the real intensity of the corresponding fires, showcasing the potential of social media platforms in furnishing information for emergency response and decision-making. The proposed PRI proves to be a valuable tool for ongoing monitoring efforts, having the potential to capture data on fires missed by satellites. This contributes to the development to more effective strategies for mitigating the environmental, infrastructural, and safety impacts of fire events.

火灾对环境、基础设施和人类安全构成重大威胁,通常会迅速蔓延,造成经济损失和生命危险等广泛后果。及早发现并迅速应对火灾爆发对减轻其影响至关重要。基于卫星的监测虽然有效,但可能会错过短暂的火灾或室内火灾。本文介绍了一种新颖的感知风险指数(PRI),该指数与卫星数据互为补充,利用社交媒体数据来深入了解火灾事件的严重性。根据统计分析结果,PRI 将与火灾有关的推文数量和相关的情绪表达纳入其中,以衡量感知风险。该指数的评估包括开发一个综合系统,用于收集、分类、注释社交媒体帖子并将其与卫星数据相关联,同时在一个交互式仪表板中展示评估结果。使用各种真实火灾推文数据集的实验结果表明,PRI 与卫星探测到的火灾亮度值之间的平均最佳相关性为 77%。这种相关性延伸到了相应火灾的实际强度,展示了社交媒体平台在为应急响应和决策提供信息方面的潜力。拟议的 PRI 被证明是持续监测工作的宝贵工具,有可能捕捉到卫星遗漏的火灾数据。这有助于制定更有效的战略,减轻火灾事件对环境、基础设施和安全的影响。
{"title":"A Perceived Risk Index Leveraging Social Media Data: Assessing Severity of Fire on Microblogging","authors":"Carmen De Maio, Giuseppe Fenza, Mariacristina Gallo, Vincenzo Loia, Alberto Volpe","doi":"10.1007/s12559-024-10266-4","DOIUrl":"https://doi.org/10.1007/s12559-024-10266-4","url":null,"abstract":"<p>Fires represent a significant threat to the environment, infrastructure, and human safety, often spreading rapidly with wide-ranging consequences such as economic losses and life risks. Early detection and swift response to fire outbreaks are crucial to mitigating their impact. While satellite-based monitoring is effective, it may miss brief or indoor fires. This paper introduces a novel Perceived Risk Index (PRI) that, complementing satellite data, leverages social media data to provide insights into the severity of fire events. In the light of the results of statistical analysis, the PRI incorporates the number of fire-related tweets and the associated emotional expressions to gauge the perceived risk. The index’s evaluation involves the development of a comprehensive system that collects, classifies, annotates, and correlates social media posts with satellite data, presenting the findings in an interactive dashboard. Experimental results using diverse datasets of real-fire tweets demonstrate an average best correlation of 77% between PRI and the brightness values of fires detected by satellites. This correlation extends to the real intensity of the corresponding fires, showcasing the potential of social media platforms in furnishing information for emergency response and decision-making. The proposed PRI proves to be a valuable tool for ongoing monitoring efforts, having the potential to capture data on fires missed by satellites. This contributes to the development to more effective strategies for mitigating the environmental, infrastructural, and safety impacts of fire events.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140567887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cognitive Impairment Detection Based on Frontal Camera Scene While Performing Handwriting Tasks 基于执行手写任务时的正面相机场景的认知障碍检测
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-04-10 DOI: 10.1007/s12559-024-10279-z
Federico Candela, Santina Romeo, Marcos Faundez-Zanuy, Pau Ferrer-Ramos

Diagnosing cognitive impairment is an ongoing field of research especially in the elderly. Assessing the health status of the elderly can be a complex process that requires both subjective and objective measures. Subjective measures, such as self-reported responses to questions, can provide valuable information about a person’s experiences, feelings, and beliefs. However, from a scientific point of view, objective measures, based on quantifiable data that can be used to assess a person’s physical and cognitive functioning, are more appropriate and rigorous. The proposed system is based on the use of non-invasive instrumentation, which includes video images acquired with a frontal camera while the user performs different handwriting tasks on a Wacom tablet. We have acquired a new multimodal database of 191 elder subjects, which has been classified by human experts into healthy and cognitive impairment users by means of the standard pentagon copying test. The automatic classification was carried out using a video segmentation algorithm through the technique of shot boundary detection, in conjunction with a Transformer neural network. We obtain a multiclass classification accuracy of 77% and two-class accuracy of 83% based on frontal camera images, which basically detects head movements during handwriting tasks. Our automatic system can replicate human classification of handwritten pentagon copying test, opening a new method for cognitive impairment detection based on head movements. We also demonstrate the possibility to identifying the handwritten task performed by the user, based on frontal camera images and a Transformer neural network.

诊断认知障碍是一个持续的研究领域,尤其是对老年人。评估老年人的健康状况是一个复杂的过程,需要同时采用主观和客观的测量方法。主观测量,如对问题的自我报告,可以提供有关个人经历、感受和信念的宝贵信息。然而,从科学的角度来看,基于可量化数据的客观测量方法更为合适和严谨,这些数据可用来评估一个人的身体和认知功能。拟议的系统基于非侵入式仪器的使用,其中包括用户在 Wacom 手写板上执行不同手写任务时使用前置摄像头获取的视频图像。我们获得了一个包含 191 名老年受试者的新的多模态数据库,人类专家通过标准的五边形临摹测试将这些受试者分为健康用户和认知障碍用户。自动分类是通过镜头边界检测技术的视频分割算法,结合变形神经网络进行的。基于正面摄像头图像,我们获得了 77% 的多类分类准确率和 83% 的两类分类准确率。我们的自动系统可以复制人类对手写五边形抄写测试的分类,为基于头部运动的认知障碍检测开辟了一种新方法。我们还展示了根据正面摄像头图像和 Transformer 神经网络识别用户所执行的手写任务的可能性。
{"title":"Cognitive Impairment Detection Based on Frontal Camera Scene While Performing Handwriting Tasks","authors":"Federico Candela, Santina Romeo, Marcos Faundez-Zanuy, Pau Ferrer-Ramos","doi":"10.1007/s12559-024-10279-z","DOIUrl":"https://doi.org/10.1007/s12559-024-10279-z","url":null,"abstract":"<p>Diagnosing cognitive impairment is an ongoing field of research especially in the elderly. Assessing the health status of the elderly can be a complex process that requires both subjective and objective measures. Subjective measures, such as self-reported responses to questions, can provide valuable information about a person’s experiences, feelings, and beliefs. However, from a scientific point of view, objective measures, based on quantifiable data that can be used to assess a person’s physical and cognitive functioning, are more appropriate and rigorous. The proposed system is based on the use of non-invasive instrumentation, which includes video images acquired with a frontal camera while the user performs different handwriting tasks on a Wacom tablet. We have acquired a new multimodal database of 191 elder subjects, which has been classified by human experts into healthy and cognitive impairment users by means of the standard pentagon copying test. The automatic classification was carried out using a video segmentation algorithm through the technique of shot boundary detection, in conjunction with a Transformer neural network. We obtain a multiclass classification accuracy of 77% and two-class accuracy of 83% based on frontal camera images, which basically detects head movements during handwriting tasks. Our automatic system can replicate human classification of handwritten pentagon copying test, opening a new method for cognitive impairment detection based on head movements. We also demonstrate the possibility to identifying the handwritten task performed by the user, based on frontal camera images and a Transformer neural network.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140567720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Resilient and Intelligent Predictive Model for CPS-Enabled E-Health Applications 用于 CPS 电子健康应用的新型弹性智能预测模型
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-04-05 DOI: 10.1007/s12559-024-10278-0
Amjad Rehman, Khalid Haseeb, Teg Alam, Tanzila Saba, Gwanggil Jeon

Cyber-physical-social-systems interconnect diverse technologies and communication infrastructure to the Internet for environmental sensing and computation. They offer many real-time autonomous services for smart cities, industry, transportation, medical systems, etc. The Internet of Medical Things (IoMT) has gained the potential for developing cyber-physical system (CPS) to facilitate healthcare applications and analyze the records of patients. Such a communication paradigm is integrated into many wireless standards for managing crucial data with cloud computing. However, the limitations of low-powered resources of such healthcare infrastructures increase the complexity level of sustainable growth. Wireless connectivity in next-generation networks is another research goal due to unbalanced load distribution. Furthermore, low-powered computing devices can be easily accessible by intruders and eliminate the confidentiality of any data transmission, so privacy is another research concern for healthcare systems. Therefore, using intelligent computing, this paper proposed a novel resilient predictive model for e-health sensing. The proposed model provides an efficient CPS-enabled automated routing system by exploring the optimization process with edge intelligence. This particular solution increases the level of cooperation between communication devices with intelligent data processing and higher predictive services. Moreover, by offering a trustworthy scheme, it seeks to enhance digital communication, data aggregation, and data breach prevention. The experimental findings highlight significant outcomes of the proposed model for packet reception, network overhead, data delay, and reliability as compared to alternative solutions.

网络-物理-社会-系统将各种技术和通信基础设施与互联网互联,用于环境传感和计算。它们为智能城市、工业、交通、医疗系统等提供许多实时自主服务。医疗物联网(IoMT)具有开发网络物理系统(CPS)的潜力,以促进医疗保健应用和分析病人记录。这种通信范例已被纳入许多无线标准,用于通过云计算管理关键数据。然而,此类医疗基础设施的低功率资源限制增加了可持续发展的复杂性。由于负载分布不平衡,下一代网络中的无线连接是另一个研究目标。此外,低功耗计算设备很容易被入侵者访问,并消除任何数据传输的保密性,因此隐私问题是医疗保健系统的另一个研究关注点。因此,本文利用智能计算技术,为电子健康传感提出了一种新型弹性预测模型。所提出的模型通过利用边缘智能探索优化过程,提供了一个高效的 CPS 自动化路由系统。这一特殊解决方案通过智能数据处理和更高的预测服务提高了通信设备之间的合作水平。此外,通过提供一种可信赖的方案,它还能增强数字通信、数据聚合和数据泄露预防。与其他解决方案相比,实验结果凸显了所提模型在数据包接收、网络开销、数据延迟和可靠性方面的显著成果。
{"title":"A Novel Resilient and Intelligent Predictive Model for CPS-Enabled E-Health Applications","authors":"Amjad Rehman, Khalid Haseeb, Teg Alam, Tanzila Saba, Gwanggil Jeon","doi":"10.1007/s12559-024-10278-0","DOIUrl":"https://doi.org/10.1007/s12559-024-10278-0","url":null,"abstract":"<p>Cyber-physical-social-systems interconnect diverse technologies and communication infrastructure to the Internet for environmental sensing and computation. They offer many real-time autonomous services for smart cities, industry, transportation, medical systems, etc. The Internet of Medical Things (IoMT) has gained the potential for developing cyber-physical system (CPS) to facilitate healthcare applications and analyze the records of patients. Such a communication paradigm is integrated into many wireless standards for managing crucial data with cloud computing. However, the limitations of low-powered resources of such healthcare infrastructures increase the complexity level of sustainable growth. Wireless connectivity in next-generation networks is another research goal due to unbalanced load distribution. Furthermore, low-powered computing devices can be easily accessible by intruders and eliminate the confidentiality of any data transmission, so privacy is another research concern for healthcare systems. Therefore, using intelligent computing, this paper proposed a novel resilient predictive model for e-health sensing. The proposed model provides an efficient CPS-enabled automated routing system by exploring the optimization process with edge intelligence. This particular solution increases the level of cooperation between communication devices with intelligent data processing and higher predictive services. Moreover, by offering a trustworthy scheme, it seeks to enhance digital communication, data aggregation, and data breach prevention. The experimental findings highlight significant outcomes of the proposed model for packet reception, network overhead, data delay, and reliability as compared to alternative solutions.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140567797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent Inspection Guidance of Urethral Endoscopy Based on SLAM with Blood Vessel Attentional Features 基于血管注意特征 SLAM 的尿道内窥镜智能检查指导
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-04-04 DOI: 10.1007/s12559-024-10264-6
Jie Lin, Xiangyu Zeng, Yulong Pan, Shangqing Ren, Yige Bao

Due to small imaging range of lens, blurring by jitter in the operation process and high similarity of urethral image features observed in different positions, doctors often face challenges in conducting a quick and comprehensive microscopic examination. In this paper, we combine image processing, simultaneous localization and mapping (SLAM) and intelligent navigation technologies to build an ORB-SLAM-based auxiliary microscopy guiding system. It can automatically process real-time microscopy videos, analyze the doctor’s detection path and provide direction for areas that have not been detected, assisting the doctor in completing urethral wall detection. In this system, a generative adversarial network-based deblurring algorithm is used to deblur the urethral images before SLAM processing. We creatively propose a vascular attention-based feature extraction algorithm tailored for urethral images. This algorithm combines F3Net and U-Net networks to detect the main body and branch points of blood vessels, respectively, which demonstrates the capability to assist the SLAM system in tracking the urethra more stably. Moreover, we design the direction guidance rules to aid doctors in urethral endoscopy. The system has been evaluated with a real urethral endoscope video dataset. Compared to other mainstream feature extraction algorithms, the method proposed in this paper is more accurate and comprehensive in identifying urethral vascular features, resulting in a 4.34% accuracy improvement, which confirms its effectiveness.

由于镜头成像范围小、操作过程中的抖动造成模糊以及不同位置观察到的尿道图像特征相似度高,医生在进行快速、全面的显微镜检查时经常面临挑战。本文结合图像处理、同步定位与映射(SLAM)和智能导航技术,构建了基于 ORB-SLAM 的辅助显微镜引导系统。它能自动处理实时显微镜视频,分析医生的检测路径,并为未检测到的区域提供方向指引,协助医生完成尿道壁检测。在该系统中,基于生成对抗网络的去模糊算法可在 SLAM 处理之前对尿道图像进行去模糊处理。我们创造性地提出了一种为尿道图像量身定制的基于血管注意力的特征提取算法。该算法结合了 F3Net 和 U-Net 网络,分别检测血管的主体和分支点,证明了该算法能够帮助 SLAM 系统更稳定地跟踪尿道。此外,我们还设计了方向引导规则,以帮助医生进行尿道内窥镜检查。我们利用真实的尿道内窥镜视频数据集对该系统进行了评估。与其他主流特征提取算法相比,本文提出的方法在识别尿道血管特征方面更加准确和全面,准确率提高了 4.34%,证实了其有效性。
{"title":"Intelligent Inspection Guidance of Urethral Endoscopy Based on SLAM with Blood Vessel Attentional Features","authors":"Jie Lin, Xiangyu Zeng, Yulong Pan, Shangqing Ren, Yige Bao","doi":"10.1007/s12559-024-10264-6","DOIUrl":"https://doi.org/10.1007/s12559-024-10264-6","url":null,"abstract":"<p>Due to small imaging range of lens, blurring by jitter in the operation process and high similarity of urethral image features observed in different positions, doctors often face challenges in conducting a quick and comprehensive microscopic examination. In this paper, we combine image processing, simultaneous localization and mapping (SLAM) and intelligent navigation technologies to build an ORB-SLAM-based auxiliary microscopy guiding system. It can automatically process real-time microscopy videos, analyze the doctor’s detection path and provide direction for areas that have not been detected, assisting the doctor in completing urethral wall detection. In this system, a generative adversarial network-based deblurring algorithm is used to deblur the urethral images before SLAM processing. We creatively propose a vascular attention-based feature extraction algorithm tailored for urethral images. This algorithm combines F3Net and U-Net networks to detect the main body and branch points of blood vessels, respectively, which demonstrates the capability to assist the SLAM system in tracking the urethra more stably. Moreover, we design the direction guidance rules to aid doctors in urethral endoscopy. The system has been evaluated with a real urethral endoscope video dataset. Compared to other mainstream feature extraction algorithms, the method proposed in this paper is more accurate and comprehensive in identifying urethral vascular features, resulting in a 4.34% accuracy improvement, which confirms its effectiveness.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140567886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diabetic Foot Ulcer Detection: Combining Deep Learning Models for Improved Localization 糖尿病足溃疡检测:结合深度学习模型改进定位
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-04-01 DOI: 10.1007/s12559-024-10267-3
Rusab Sarmun, Muhammad E. H. Chowdhury, M. Murugappan, Ahmed Aqel, Maymouna Ezzuddin, Syed Mahfuzur Rahman, Amith Khandakar, Sanzida Akter, Rashad Alfkey, Md. Anwarul Hasan

Diabetes mellitus (DM) can cause chronic foot issues and severe infections, including Diabetic Foot Ulcers (DFUs) that heal slowly due to insufficient blood flow. A recurrence of these ulcers can lead to 84% of lower limb amputations and even cause death. High-risk diabetes patients require expensive medications, regular check-ups, and proper personal hygiene to prevent DFUs, which affect 15–25% of diabetics. Accurate diagnosis, appropriate care, and prompt response can prevent amputations and fatalities through early and reliable DFU detection from image analysis. We propose a comprehensive deep learning-based system for detecting DFUs from patients’ feet images by reliably localizing ulcer points. Our method utilizes innovative model ensemble techniques—non-maximum suppression (NMS), Soft-NMS, and weighted bounding box fusion (WBF)—to combine predictions from state-of-the-art object detection models. The performances of diverse cutting-edge model architectures used in this study complement each other, leading to more generalized and improved results when combined in an ensemble. Our WBF-based approach combining YOLOv8m and FRCNN-ResNet101 achieves a mean average precision (mAP) score of 86.4% at the IoU threshold of 0.5 on the DFUC2020 dataset, significantly outperforming the former benchmark by 12.4%. We also perform external validation on the IEEE DataPort Diabetic Foot dataset which has demonstrated robust and reliable model performance on the qualitative analysis. In conclusion, our study effectively developed an innovative diabetic foot ulcer (DFU) detection system using an ensemble model of deep neural networks (DNNs). This AI-driven tool serves as an initial screening aid for medical professionals, augmenting the diagnostic process by enhancing sensitivity to potential DFU cases. While recognizing the presence of false positives, our research contributes to improving patient care through the integration of human medical expertise with AI-based solutions in DFU management.

糖尿病(DM)可导致慢性足部问题和严重感染,包括因血流不足而愈合缓慢的糖尿病足溃疡(DFU)。这些溃疡的复发可导致 84% 的下肢截肢,甚至导致死亡。高危糖尿病患者需要昂贵的药物、定期检查和适当的个人卫生来预防 DFU,15%-25% 的糖尿病患者会受到 DFU 的影响。通过图像分析进行早期、可靠的 DFU 检测,准确的诊断、适当的护理和及时的响应可以防止截肢和死亡。我们提出了一种基于深度学习的综合系统,通过可靠地定位溃疡点,从患者的足部图像中检测出 DFU。我们的方法利用创新的模型集合技术--非最大抑制(NMS)、软-NMS 和加权边界框融合(WBF)--将最先进的物体检测模型的预测结果结合起来。本研究中使用的各种尖端模型架构性能互补,在组合使用时可获得更广泛和更好的结果。我们基于 WBF 的方法结合了 YOLOv8m 和 FRCNN-ResNet101,在 DFUC2020 数据集上,当 IoU 临界值为 0.5 时,平均精度 (mAP) 得分达到 86.4%,比前一个基准高出 12.4%。我们还在 IEEE DataPort 糖尿病足数据集上进行了外部验证,该数据集在定性分析中表现出了稳健可靠的模型性能。总之,我们的研究利用深度神经网络(DNN)的集合模型有效地开发了一种创新的糖尿病足溃疡(DFU)检测系统。这种人工智能驱动的工具可作为医疗专业人员的初步筛查辅助工具,通过提高对潜在 DFU 病例的敏感性来增强诊断过程。在认识到假阳性病例存在的同时,我们的研究通过将人类医疗专业知识与基于人工智能的 DFU 管理解决方案相结合,为改善患者护理做出了贡献。
{"title":"Diabetic Foot Ulcer Detection: Combining Deep Learning Models for Improved Localization","authors":"Rusab Sarmun, Muhammad E. H. Chowdhury, M. Murugappan, Ahmed Aqel, Maymouna Ezzuddin, Syed Mahfuzur Rahman, Amith Khandakar, Sanzida Akter, Rashad Alfkey, Md. Anwarul Hasan","doi":"10.1007/s12559-024-10267-3","DOIUrl":"https://doi.org/10.1007/s12559-024-10267-3","url":null,"abstract":"<p>Diabetes mellitus (DM) can cause chronic foot issues and severe infections, including Diabetic Foot Ulcers (DFUs) that heal slowly due to insufficient blood flow. A recurrence of these ulcers can lead to 84% of lower limb amputations and even cause death. High-risk diabetes patients require expensive medications, regular check-ups, and proper personal hygiene to prevent DFUs, which affect 15–25% of diabetics. Accurate diagnosis, appropriate care, and prompt response can prevent amputations and fatalities through early and reliable DFU detection from image analysis. We propose a comprehensive deep learning-based system for detecting DFUs from patients’ feet images by reliably localizing ulcer points. Our method utilizes innovative model ensemble techniques—non-maximum suppression (NMS), Soft-NMS, and weighted bounding box fusion (WBF)—to combine predictions from state-of-the-art object detection models. The performances of diverse cutting-edge model architectures used in this study complement each other, leading to more generalized and improved results when combined in an ensemble. Our WBF-based approach combining YOLOv8m and FRCNN-ResNet101 achieves a mean average precision (mAP) score of 86.4% at the IoU threshold of 0.5 on the DFUC2020 dataset, significantly outperforming the former benchmark by 12.4%. We also perform external validation on the IEEE DataPort Diabetic Foot dataset which has demonstrated robust and reliable model performance on the qualitative analysis. In conclusion, our study effectively developed an innovative diabetic foot ulcer (DFU) detection system using an ensemble model of deep neural networks (DNNs). This AI-driven tool serves as an initial screening aid for medical professionals, augmenting the diagnostic process by enhancing sensitivity to potential DFU cases. While recognizing the presence of false positives, our research contributes to improving patient care through the integration of human medical expertise with AI-based solutions in DFU management.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140567894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Mutual Information-Based Many-Objective Optimization Method for EEG Channel Selection in the Epileptic Seizure Prediction Task 基于互信息的多目标优化方法,用于癫痫发作预测任务中的脑电图信道选择
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-03-23 DOI: 10.1007/s12559-024-10261-9
Najwa Kouka, Rahma Fourati, Asma Baghdadi, Patrick Siarry, M. Adel

Epileptic seizure prediction using multi-channel electroencephalogram (EEG) signals is very important in clinical therapy. A large number of channels lead to high computational complexity with low model performance. To improve the performance and reduce the overfitting that arises due to the use of unrelevant channels, the present paper proposed a channel selection method to study the brain region activation related to epileptic seizure. Our method is bio-inspired and cognitive since it integrates the novel binary many-objective particle swarm optimization with a ConvLSTM model. The proposed method has two advantages. First, it performed a new initialization strategy based on channel weighting with mutual information, thereby promoting the fast convergence of the optimization algorithm. Second, it captures spatio-temporal information from raw EEG segments thanks to the ConvLSTM model. The selected sub-channels are optimized as many-objective optimization problem that includes maximizing F1-score, sensitivity, specificity, and minimizing the ratio rate of selected channels. Our results have shown a performance of up to (97.94%) with only one EEG channel. Interestingly, when using all the EEG channels available, lower performance was achieved compared to the case when EEG channels were selected by our approach. This study revealed that it is possible to predict epileptic seizures using a few channels, which provides evidence for the future development of portable EEG seizure prediction devices.

利用多通道脑电图(EEG)信号预测癫痫发作在临床治疗中非常重要。大量通道导致计算复杂度高,模型性能低。为了提高模型性能,减少因使用无关通道而导致的过拟合,本文提出了一种通道选择方法,用于研究与癫痫发作相关的脑区激活。我们的方法融合了新颖的二元多目标粒子群优化和 ConvLSTM 模型,因此具有生物启发和认知的特点。所提出的方法有两个优点。首先,它采用了基于信道加权与互信息的新初始化策略,从而促进了优化算法的快速收敛。其次,由于采用了 ConvLSTM 模型,它能从原始脑电图片段中捕捉时空信息。所选子通道的优化是一个多目标优化问题,包括最大化 F1 分数、灵敏度、特异性和最小化所选通道的比率。我们的研究结果表明,仅使用一个脑电图通道,性能可达(97.94%/)。有趣的是,当使用所有可用的脑电图通道时,与通过我们的方法选择脑电图通道的情况相比,取得的性能较低。这项研究揭示了使用几个通道预测癫痫发作是可能的,这为未来开发便携式脑电图癫痫发作预测设备提供了证据。
{"title":"A Mutual Information-Based Many-Objective Optimization Method for EEG Channel Selection in the Epileptic Seizure Prediction Task","authors":"Najwa Kouka, Rahma Fourati, Asma Baghdadi, Patrick Siarry, M. Adel","doi":"10.1007/s12559-024-10261-9","DOIUrl":"https://doi.org/10.1007/s12559-024-10261-9","url":null,"abstract":"<p>Epileptic seizure prediction using multi-channel electroencephalogram (EEG) signals is very important in clinical therapy. A large number of channels lead to high computational complexity with low model performance. To improve the performance and reduce the overfitting that arises due to the use of unrelevant channels, the present paper proposed a channel selection method to study the brain region activation related to epileptic seizure. Our method is bio-inspired and cognitive since it integrates the novel binary many-objective particle swarm optimization with a ConvLSTM model. The proposed method has two advantages. First, it performed a new initialization strategy based on channel weighting with mutual information, thereby promoting the fast convergence of the optimization algorithm. Second, it captures spatio-temporal information from raw EEG segments thanks to the ConvLSTM model. The selected sub-channels are optimized as many-objective optimization problem that includes maximizing F1-score, sensitivity, specificity, and minimizing the ratio rate of selected channels. Our results have shown a performance of up to <span>(97.94%)</span> with only one EEG channel. Interestingly, when using all the EEG channels available, lower performance was achieved compared to the case when EEG channels were selected by our approach. This study revealed that it is possible to predict epileptic seizures using a few channels, which provides evidence for the future development of portable EEG seizure prediction devices.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-03-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140197170","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Shift-Equivariant Similarity-Preserving Hypervector Representations of Sequences 序列的移项-换元相似性保全超矢量表示
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-03-12 DOI: 10.1007/s12559-024-10258-4

Abstract

Hyperdimensional Computing (HDC), also known as Vector-Symbolic Architectures (VSA), is a promising framework for the development of cognitive architectures and artificial intelligence systems, as well as for technical applications and emerging neuromorphic and nanoscale hardware. HDC/VSA operate with hypervectors, i.e., neural-like distributed vector representations of large fixed dimension (usually > 1000). One of the key ingredients of HDC/VSA are the methods for encoding various data types (from numeric scalars and vectors to graphs) by hypervectors. In this paper, we propose an approach for the formation of hypervectors of sequences that provides both an equivariance with respect to the shift of sequences and preserves the similarity of sequences with identical elements at nearby positions. Our methods represent the sequence elements by compositional hypervectors and exploit permutations of hypervectors for representing the order of sequence elements. We experimentally explored the proposed representations using a diverse set of tasks with data in the form of symbolic strings. Although we did not use any features here (hypervector of a sequence was formed just from the hypervectors of its symbols at their positions), the proposed approach demonstrated the performance on a par with the methods that exploit various features, such as subsequences. The proposed techniques were designed for the HDC/VSA model known as Sparse Binary Distributed Representations. However, they can be adapted to hypervectors in formats of other HDC/VSA models, as well as for representing sequences of types other than symbolic strings. Directions for further research are discussed.

摘要 超维度计算(HDC),又称矢量-符号架构(VSA),是开发认知架构和人工智能系统以及技术应用和新兴神经形态和纳米级硬件的一个前景广阔的框架。HDC/VSA 使用超向量(即具有较大固定维度(通常为 1000)的类似神经的分布式向量表示)进行操作。HDC/VSA 的关键要素之一是用超向量对各种数据类型(从数字标量和向量到图形)进行编码的方法。在本文中,我们提出了一种形成序列超向量的方法,这种方法既能提供序列移动的等差数列,又能保持在附近位置具有相同元素的序列的相似性。我们的方法用组成超向量来表示序列元素,并利用超向量的排列来表示序列元素的顺序。我们使用一系列不同的任务和符号字符串形式的数据,对所提出的表示方法进行了实验探索。虽然我们在这里没有使用任何特征(序列的超向量只是由其符号在其位置上的超向量形成的),但所提出的方法表现出了与利用子序列等各种特征的方法相当的性能。所提出的技术是针对被称为稀疏二进制分布式表示的 HDC/VSA 模型而设计的。不过,这些技术也可适用于其他 HDC/VSA 模型格式的超向量,以及表示符号字符串以外类型的序列。本文讨论了进一步研究的方向。
{"title":"Shift-Equivariant Similarity-Preserving Hypervector Representations of Sequences","authors":"","doi":"10.1007/s12559-024-10258-4","DOIUrl":"https://doi.org/10.1007/s12559-024-10258-4","url":null,"abstract":"<h3>Abstract</h3> <p>Hyperdimensional Computing (HDC), also known as Vector-Symbolic Architectures (VSA), is a promising framework for the development of cognitive architectures and artificial intelligence systems, as well as for technical applications and emerging neuromorphic and nanoscale hardware. HDC/VSA operate with hypervectors, i.e., neural-like distributed vector representations of large fixed dimension (usually &gt; 1000). One of the key ingredients of HDC/VSA are the methods for encoding various data types (from numeric scalars and vectors to graphs) by hypervectors. In this paper, we propose an approach for the formation of hypervectors of sequences that provides both an equivariance with respect to the shift of sequences and preserves the similarity of sequences with identical elements at nearby positions. Our methods represent the sequence elements by compositional hypervectors and exploit permutations of hypervectors for representing the order of sequence elements. We experimentally explored the proposed representations using a diverse set of tasks with data in the form of symbolic strings. Although we did not use any features here (hypervector of a sequence was formed just from the hypervectors of its symbols at their positions), the proposed approach demonstrated the performance on a par with the methods that exploit various features, such as subsequences. The proposed techniques were designed for the HDC/VSA model known as Sparse Binary Distributed Representations. However, they can be adapted to hypervectors in formats of other HDC/VSA models, as well as for representing sequences of types other than symbolic strings. Directions for further research are discussed.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140127100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Normal Template Mapping: An Association-Inspired Handwritten Character Recognition Model 正常模板映射:一种受关联启发的手写字符识别模型
IF 5.4 3区 计算机科学 Q1 Computer Science Pub Date : 2024-03-12 DOI: 10.1007/s12559-024-10270-8
Jun Miao, Peng Liu, Chen Chen, Yuanhua Qiao

In identifying objects, people usually associate memory templates to guide visual attention and determine the category of an object. The initial character images that children learn are usually normal patterns. However, the variation in corresponding handwritten patterns is quite large. To learn these deformed images with large variance, current deep models must involve millions of parameters for such kind of classification tasks that seem much easier and simpler to children who learn to recognize new characters associated with their initially taught normal patterns. From the perspective of humans’ perception, when people see a new object, they first think of a template image in their memory, which is similar to the object. This mapping process makes it easier for humans to learn new objects. Inspired by this cognitive association mechanism, this study developed a cognition-inspired handwritten character recognition model using a proposed normal template mapping neural network. This model uses an encoder-decoder architecture to build a normal template mapping neural network that transforms handwritten character images of one class to normalized characters similar to a given printed template character image representing that class. Then, a simple shallow classifier recognizes these normalized images, which are easier to classify. The experimental results show that the proposed model completes handwritten character recognition with comparable or higher precision at a much lower parameter count than current representative deep models. The proposed model removes the individual styles of handwritten character images and maps them to patterns similar to normal template images. This greatly reduces the classification difficulty and enables the classifier to classify only known standard character images.

在识别物体时,人们通常会联想记忆模板来引导视觉注意力,并确定物体的类别。儿童最初学习的字符图像通常是正常的图案。然而,相应手写图案的差异却相当大。要学习这些差异较大的变形图像,当前的深度模型必须涉及数百万个参数,才能完成此类分类任务,而对于儿童来说,学习识别与他们最初学习的正常图案相关联的新字符要容易得多,也简单得多。从人类感知的角度来看,当人们看到一个新物体时,首先会想到记忆中与该物体相似的模板图像。这种映射过程使人类更容易学习新物体。受这种认知联想机制的启发,本研究利用一个拟议的正常模板映射神经网络,开发了一个受认知启发的手写字符识别模型。该模型采用编码器-解码器架构来构建法线模板映射神经网络,将一类手写字符图像转换为与代表该类的给定印刷模板字符图像相似的法线化字符。然后,一个简单的浅层分类器就能识别这些更容易分类的归一化图像。实验结果表明,与目前具有代表性的深度模型相比,所提出的模型能以更低的参数数完成手写字符识别,精度相当或更高。所提出的模型剔除了手写字符图像的个人风格,并将其映射为类似于正常模板图像的模式。这大大降低了分类难度,使分类器只能对已知的标准字符图像进行分类。
{"title":"Normal Template Mapping: An Association-Inspired Handwritten Character Recognition Model","authors":"Jun Miao, Peng Liu, Chen Chen, Yuanhua Qiao","doi":"10.1007/s12559-024-10270-8","DOIUrl":"https://doi.org/10.1007/s12559-024-10270-8","url":null,"abstract":"<p>In identifying objects, people usually associate memory templates to guide visual attention and determine the category of an object. The initial character images that children learn are usually normal patterns. However, the variation in corresponding handwritten patterns is quite large. To learn these deformed images with large variance, current deep models must involve millions of parameters for such kind of classification tasks that seem much easier and simpler to children who learn to recognize new characters associated with their initially taught normal patterns. From the perspective of humans’ perception, when people see a new object, they first think of a template image in their memory, which is similar to the object. This mapping process makes it easier for humans to learn new objects. Inspired by this cognitive association mechanism, this study developed a cognition-inspired handwritten character recognition model using a proposed normal template mapping neural network. This model uses an encoder-decoder architecture to build a normal template mapping neural network that transforms handwritten character images of one class to normalized characters similar to a given printed template character image representing that class. Then, a simple shallow classifier recognizes these normalized images, which are easier to classify. The experimental results show that the proposed model completes handwritten character recognition with comparable or higher precision at a much lower parameter count than current representative deep models. The proposed model removes the individual styles of handwritten character images and maps them to patterns similar to normal template images. This greatly reduces the classification difficulty and enables the classifier to classify only known standard character images.</p>","PeriodicalId":51243,"journal":{"name":"Cognitive Computation","volume":null,"pages":null},"PeriodicalIF":5.4,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140127066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Cognitive Computation
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1