首页 > 最新文献

International Journal of Intelligent Systems最新文献

英文 中文
Medical Image Description Based on Multimodal Auxiliary Signals and Transformer 基于多模态辅助信号和变压器的医学图像描述
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-13 DOI: 10.1155/2024/6680546
Yun Tan, Chunzhi Li, Jiaohua Qin, Youyuan Xue, Xuyu Xiang
Medical image description can be applied to clinical medical diagnosis, but the field still faces serious challenges. There is a serious problem of visual and textual data bias in medical datasets, which are the imbalanced distribution of health and disease data. This can greatly affect the learning performance of data-driven neural networks and finally lead to errors in the generated medical image descriptions. To address this problem, we propose a new medical image description network architecture named multimodal data-assisted knowledge fusion network (MDAKF), which introduces multimodal auxiliary signals to guide the Transformer network to generate more accurate medical reports. In detail, audio auxiliary signals provide clear abnormal visual regions to alleviate the visual data bias problem. However, the audio modality signals with similar pronunciation lack recognizability, which may lead to incorrect mapping of audio labels to medical image regions. Therefore, we further fuse the audio with text features as the auxiliary signal to improve the overall performance of the model. Through the experiments on two medical image description datasets, IU-X-ray and COV-CTR, it is found that the proposed model is superior to the previous models in terms of language generation evaluation indicators.
医学图像描述可应用于临床医学诊断,但该领域仍面临严峻挑战。医学数据集中存在严重的视觉和文本数据偏差问题,即健康和疾病数据分布不平衡。这会极大地影响数据驱动神经网络的学习性能,最终导致生成的医学图像描述出现错误。为解决这一问题,我们提出了一种新的医学图像描述网络架构,命名为多模态数据辅助知识融合网络(MDAKF),它引入了多模态辅助信号来引导变压器网络生成更准确的医疗报告。具体来说,音频辅助信号可提供清晰的异常视觉区域,以缓解视觉数据偏差问题。然而,发音相似的音频模态信号缺乏可识别性,可能导致音频标签与医学图像区域的映射不正确。因此,我们进一步将音频与文本特征作为辅助信号进行融合,以提高模型的整体性能。通过在 IU-X-ray 和 COV-CTR 两个医学图像描述数据集上的实验发现,所提出的模型在语言生成评价指标方面优于之前的模型。
{"title":"Medical Image Description Based on Multimodal Auxiliary Signals and Transformer","authors":"Yun Tan, Chunzhi Li, Jiaohua Qin, Youyuan Xue, Xuyu Xiang","doi":"10.1155/2024/6680546","DOIUrl":"https://doi.org/10.1155/2024/6680546","url":null,"abstract":"Medical image description can be applied to clinical medical diagnosis, but the field still faces serious challenges. There is a serious problem of visual and textual data bias in medical datasets, which are the imbalanced distribution of health and disease data. This can greatly affect the learning performance of data-driven neural networks and finally lead to errors in the generated medical image descriptions. To address this problem, we propose a new medical image description network architecture named multimodal data-assisted knowledge fusion network (MDAKF), which introduces multimodal auxiliary signals to guide the Transformer network to generate more accurate medical reports. In detail, audio auxiliary signals provide clear abnormal visual regions to alleviate the visual data bias problem. However, the audio modality signals with similar pronunciation lack recognizability, which may lead to incorrect mapping of audio labels to medical image regions. Therefore, we further fuse the audio with text features as the auxiliary signal to improve the overall performance of the model. Through the experiments on two medical image description datasets, IU-X-ray and COV-CTR, it is found that the proposed model is superior to the previous models in terms of language generation evaluation indicators.","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139839632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Medical Image Description Based on Multimodal Auxiliary Signals and Transformer 基于多模态辅助信号和变压器的医学图像描述
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-13 DOI: 10.1155/2024/6680546
Yun Tan, Chunzhi Li, Jiaohua Qin, Youyuan Xue, Xuyu Xiang

Medical image description can be applied to clinical medical diagnosis, but the field still faces serious challenges. There is a serious problem of visual and textual data bias in medical datasets, which are the imbalanced distribution of health and disease data. This can greatly affect the learning performance of data-driven neural networks and finally lead to errors in the generated medical image descriptions. To address this problem, we propose a new medical image description network architecture named multimodal data-assisted knowledge fusion network (MDAKF), which introduces multimodal auxiliary signals to guide the Transformer network to generate more accurate medical reports. In detail, audio auxiliary signals provide clear abnormal visual regions to alleviate the visual data bias problem. However, the audio modality signals with similar pronunciation lack recognizability, which may lead to incorrect mapping of audio labels to medical image regions. Therefore, we further fuse the audio with text features as the auxiliary signal to improve the overall performance of the model. Through the experiments on two medical image description datasets, IU-X-ray and COV-CTR, it is found that the proposed model is superior to the previous models in terms of language generation evaluation indicators.

医学图像描述可应用于临床医学诊断,但该领域仍面临严峻挑战。医学数据集中存在严重的视觉和文本数据偏差问题,即健康和疾病数据分布不平衡。这会极大地影响数据驱动神经网络的学习性能,最终导致生成的医学图像描述出现错误。为解决这一问题,我们提出了一种新的医学图像描述网络架构,命名为多模态数据辅助知识融合网络(MDAKF),它引入了多模态辅助信号来引导变压器网络生成更准确的医疗报告。具体来说,音频辅助信号可提供清晰的异常视觉区域,以缓解视觉数据偏差问题。然而,发音相似的音频模态信号缺乏可识别性,可能导致音频标签与医学图像区域的映射不正确。因此,我们进一步将音频与文本特征作为辅助信号进行融合,以提高模型的整体性能。通过在 IU-X-ray 和 COV-CTR 两个医学图像描述数据集上的实验发现,所提出的模型在语言生成评价指标方面优于之前的模型。
{"title":"Medical Image Description Based on Multimodal Auxiliary Signals and Transformer","authors":"Yun Tan,&nbsp;Chunzhi Li,&nbsp;Jiaohua Qin,&nbsp;Youyuan Xue,&nbsp;Xuyu Xiang","doi":"10.1155/2024/6680546","DOIUrl":"10.1155/2024/6680546","url":null,"abstract":"<p>Medical image description can be applied to clinical medical diagnosis, but the field still faces serious challenges. There is a serious problem of visual and textual data bias in medical datasets, which are the imbalanced distribution of health and disease data. This can greatly affect the learning performance of data-driven neural networks and finally lead to errors in the generated medical image descriptions. To address this problem, we propose a new medical image description network architecture named multimodal data-assisted knowledge fusion network (MDAKF), which introduces multimodal auxiliary signals to guide the Transformer network to generate more accurate medical reports. In detail, audio auxiliary signals provide clear abnormal visual regions to alleviate the visual data bias problem. However, the audio modality signals with similar pronunciation lack recognizability, which may lead to incorrect mapping of audio labels to medical image regions. Therefore, we further fuse the audio with text features as the auxiliary signal to improve the overall performance of the model. Through the experiments on two medical image description datasets, IU-X-ray and COV-CTR, it is found that the proposed model is superior to the previous models in terms of language generation evaluation indicators.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139779671","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessment of Pilots’ Cognitive Competency Using Situation Awareness Recognition Model Based on Visual Characteristics 利用基于视觉特征的态势感知识别模型评估飞行员的认知能力
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-12 DOI: 10.1155/2024/5582660
Shaoqi Jiang, Ruifang Su, Zhenzhen Ren, Weijiong Chen, Yutao Kang

Visual characteristics have the potential to assess the navigational proficiency of ship pilots. A precise assessment of ship piloting competence is imperative to mitigate human errors in piloting. An exhaustive examination of cognitive capabilities plays a pivotal role in developing an enhanced and refined system for classifying, selecting, and training ship piloting proficiency. Insufficiency in situation awareness (SA), denoting the cognitive underpinning of hazardous behaviors among pilots, may lead to subpar performance in ship pilotage when faced with adverse conditions. To address this issue, we propose an SA recognition model based on the random forest-support vector machine (RF-SVM) algorithm, which utilizes wearable eye-tracking technology to detect pilots’ at-risk cognitive state, specifically low-SA levels. We rectify the relative error (RE) and root mean square error (RMSE) and employ principal component analysis (PCA) to enhance the RF algorithm, optimizing the combination of salient features in greater depth. Through the utilization of these feature combinations, we construct a SVM algorithm using the most suitable variables for SA recognition. Our proposed RF-SVM algorithm is compared to RF or SVM alone, and it achieves the highest accuracy in recognizing at-risk cognitive states under poor visibility conditions (an improvement of 86.79% to 93.43% in accuracy). Taken collectively, the present findings offer vital technical support for developing a technique-based intelligent system for adaptively evaluating the cognitive accomplishment of pilots. Furthermore, they establish the groundwork and framework for the surveillance of cognitive processes and capabilities in marine pilotage operations within China.

视觉特征具有评估船舶驾驶员导航能力的潜力。要减少人类在驾驶中的失误,就必须对船舶驾驶能力进行精确评估。对认知能力进行详尽的检查,对开发一个用于分类、选择和培训船舶驾驶员能力的强化和完善的系统起着关键作用。态势感知(SA)是引航员危险行为的认知基础,态势感知不足可能导致引航员在面临不利条件时表现不佳。针对这一问题,我们提出了一种基于随机森林支持向量机(RF-SVM)算法的 "态势感知 "识别模型,该模型利用可穿戴眼球跟踪技术来检测飞行员的危险认知状态,特别是低态势感知水平。我们修正了相对误差(RE)和均方根误差(RMSE),并采用主成分分析(PCA)增强了 RF 算法,更深入地优化了突出特征的组合。通过利用这些特征组合,我们构建了一种 SVM 算法,使用最适合 SA 识别的变量。我们提出的 RF-SVM 算法与单独的 RF 或 SVM 算法进行了比较,结果表明,在能见度较差的条件下,RF-SVM 算法识别高危认知状态的准确率最高(准确率从 86.79% 提高到 93.43%)。综合来看,本研究成果为开发基于技术的智能系统提供了重要的技术支持,该系统可用于自适应地评估飞行员的认知素养。此外,本研究还为中国海洋引航作业中认知过程和能力的监测奠定了基础,建立了框架。
{"title":"Assessment of Pilots’ Cognitive Competency Using Situation Awareness Recognition Model Based on Visual Characteristics","authors":"Shaoqi Jiang,&nbsp;Ruifang Su,&nbsp;Zhenzhen Ren,&nbsp;Weijiong Chen,&nbsp;Yutao Kang","doi":"10.1155/2024/5582660","DOIUrl":"10.1155/2024/5582660","url":null,"abstract":"<p>Visual characteristics have the potential to assess the navigational proficiency of ship pilots. A precise assessment of ship piloting competence is imperative to mitigate human errors in piloting. An exhaustive examination of cognitive capabilities plays a pivotal role in developing an enhanced and refined system for classifying, selecting, and training ship piloting proficiency. Insufficiency in situation awareness (SA), denoting the cognitive underpinning of hazardous behaviors among pilots, may lead to subpar performance in ship pilotage when faced with adverse conditions. To address this issue, we propose an SA recognition model based on the random forest-support vector machine (RF-SVM) algorithm, which utilizes wearable eye-tracking technology to detect pilots’ at-risk cognitive state, specifically low-SA levels. We rectify the relative error (RE) and root mean square error (RMSE) and employ principal component analysis (PCA) to enhance the RF algorithm, optimizing the combination of salient features in greater depth. Through the utilization of these feature combinations, we construct a SVM algorithm using the most suitable variables for SA recognition. Our proposed RF-SVM algorithm is compared to RF or SVM alone, and it achieves the highest accuracy in recognizing at-risk cognitive states under poor visibility conditions (an improvement of 86.79% to 93.43% in accuracy). Taken collectively, the present findings offer vital technical support for developing a technique-based intelligent system for adaptively evaluating the cognitive accomplishment of pilots. Furthermore, they establish the groundwork and framework for the surveillance of cognitive processes and capabilities in marine pilotage operations within China.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139844014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Construction of Knowledge Graph for Emergency Resources 构建应急资源知识图谱
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-12 DOI: 10.1155/2024/6668559
Heng Mu, Peng Wu, Wenyi Su
Knowledge graphs can effectively organize and represent information related to emergency resources for unforeseen sudden events. In this study, we construct a model layer for the knowledge graph of emergency resources, focused on sudden events, through the classification and analysis of unforeseen disaster measures. This study defines eight interconnected entity types, each characterised by a set of attributes and engaging in one or more relationships with other entity types. Utilizing 121 incident investigation reports from the emergency management departments of various provinces and cities over the past five years, we select five entities with the highest frequency of occurrence along with their corresponding four relationships. We then design an extraction plan for these entities and relationships. Based on the completed knowledge graph data, we formulate 14 questions related to emergency resources for sudden events and construct 19 corresponding question-and-answer templates using a template-based question-answering (QA) approach. We retrieve the corresponding Cypher statement templates through template mapping and obtain the question answers through querying. Finally, we design a knowledge graph question-and-answer system using the Django web framework, which includes entity queries and knowledge QA functions, specifically for emergency resources related to sudden events.
知识图谱可以有效地组织和表示与意外突发事件应急资源相关的信息。在本研究中,我们通过对不可预见灾害措施的分类和分析,构建了一个以突发事件为重点的应急资源知识图谱模型层。本研究定义了八种相互关联的实体类型,每种实体类型都有一组属性,并与其他实体类型存在一种或多种关系。利用各省市应急管理部门过去五年的 121 份事件调查报告,我们选择了出现频率最高的五个实体及其相应的四种关系。然后,我们为这些实体和关系设计了一个提取计划。根据已完成的知识图谱数据,我们提出了 14 个与突发事件应急资源相关的问题,并使用基于模板的问答(QA)方法构建了 19 个相应的问答模板。我们通过模板映射检索相应的 Cypher 语句模板,并通过查询获得问题答案。最后,我们利用 Django 网络框架设计了一个知识图谱问答系统,其中包括实体查询和知识 QA 功能,专门用于与突发事件相关的应急资源。
{"title":"Construction of Knowledge Graph for Emergency Resources","authors":"Heng Mu, Peng Wu, Wenyi Su","doi":"10.1155/2024/6668559","DOIUrl":"https://doi.org/10.1155/2024/6668559","url":null,"abstract":"Knowledge graphs can effectively organize and represent information related to emergency resources for unforeseen sudden events. In this study, we construct a model layer for the knowledge graph of emergency resources, focused on sudden events, through the classification and analysis of unforeseen disaster measures. This study defines eight interconnected entity types, each characterised by a set of attributes and engaging in one or more relationships with other entity types. Utilizing 121 incident investigation reports from the emergency management departments of various provinces and cities over the past five years, we select five entities with the highest frequency of occurrence along with their corresponding four relationships. We then design an extraction plan for these entities and relationships. Based on the completed knowledge graph data, we formulate 14 questions related to emergency resources for sudden events and construct 19 corresponding question-and-answer templates using a template-based question-answering (QA) approach. We retrieve the corresponding Cypher statement templates through template mapping and obtain the question answers through querying. Finally, we design a knowledge graph question-and-answer system using the Django web framework, which includes entity queries and knowledge QA functions, specifically for emergency resources related to sudden events.","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139783985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Construction of Knowledge Graph for Emergency Resources 构建应急资源知识图谱
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-02-12 DOI: 10.1155/2024/6668559
Heng Mu, Peng Wu, Wenyi Su

Knowledge graphs can effectively organize and represent information related to emergency resources for unforeseen sudden events. In this study, we construct a model layer for the knowledge graph of emergency resources, focused on sudden events, through the classification and analysis of unforeseen disaster measures. This study defines eight interconnected entity types, each characterised by a set of attributes and engaging in one or more relationships with other entity types. Utilizing 121 incident investigation reports from the emergency management departments of various provinces and cities over the past five years, we select five entities with the highest frequency of occurrence along with their corresponding four relationships. We then design an extraction plan for these entities and relationships. Based on the completed knowledge graph data, we formulate 14 questions related to emergency resources for sudden events and construct 19 corresponding question-and-answer templates using a template-based question-answering (QA) approach. We retrieve the corresponding Cypher statement templates through template mapping and obtain the question answers through querying. Finally, we design a knowledge graph question-and-answer system using the Django web framework, which includes entity queries and knowledge QA functions, specifically for emergency resources related to sudden events.

知识图谱可以有效地组织和表示与意外突发事件应急资源相关的信息。在本研究中,我们通过对不可预见灾害措施的分类和分析,构建了一个以突发事件为重点的应急资源知识图谱模型层。本研究定义了八种相互关联的实体类型,每种实体类型都有一组属性,并与其他实体类型存在一种或多种关系。利用各省市应急管理部门过去五年的 121 份事件调查报告,我们选择了出现频率最高的五个实体及其相应的四种关系。然后,我们为这些实体和关系设计了一个提取计划。根据已完成的知识图谱数据,我们提出了 14 个与突发事件应急资源相关的问题,并使用基于模板的问答(QA)方法构建了 19 个相应的问答模板。我们通过模板映射检索相应的 Cypher 语句模板,并通过查询获得问题答案。最后,我们利用 Django 网络框架设计了一个知识图谱问答系统,其中包括实体查询和知识 QA 功能,专门用于与突发事件相关的应急资源。
{"title":"Construction of Knowledge Graph for Emergency Resources","authors":"Heng Mu,&nbsp;Peng Wu,&nbsp;Wenyi Su","doi":"10.1155/2024/6668559","DOIUrl":"10.1155/2024/6668559","url":null,"abstract":"<p>Knowledge graphs can effectively organize and represent information related to emergency resources for unforeseen sudden events. In this study, we construct a model layer for the knowledge graph of emergency resources, focused on sudden events, through the classification and analysis of unforeseen disaster measures. This study defines eight interconnected entity types, each characterised by a set of attributes and engaging in one or more relationships with other entity types. Utilizing 121 incident investigation reports from the emergency management departments of various provinces and cities over the past five years, we select five entities with the highest frequency of occurrence along with their corresponding four relationships. We then design an extraction plan for these entities and relationships. Based on the completed knowledge graph data, we formulate 14 questions related to emergency resources for sudden events and construct 19 corresponding question-and-answer templates using a template-based question-answering (QA) approach. We retrieve the corresponding Cypher statement templates through template mapping and obtain the question answers through querying. Finally, we design a knowledge graph question-and-answer system using the Django web framework, which includes entity queries and knowledge QA functions, specifically for emergency resources related to sudden events.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139844054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weighted Subspace Fuzzy Clustering with Adaptive Projection 带自适应投影的加权子空间模糊聚类
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-01-31 DOI: 10.1155/2024/6696775
Jie Zhou, Chucheng Huang, Can Gao, Yangbo Wang, Xinrui Shen, Xu Wu

Available subspace clustering methods often contain two stages, finding low-dimensional subspaces of data and then conducting clustering in the subspaces. Therefore, how to find the subspaces that better represent the original data becomes a research challenge. However, most of the reported methods are based on the premise that the contributions of different features are equal, which may not be ideal for real scenarios, i.e., the contributions of the important features may be overwhelmed by a large amount of redundant features. In this study, a weighted subspace fuzzy clustering (WSFC) model with a locality preservation mechanism is presented, which can adaptively capture the importance of different features, achieve an optimal lower-dimensional subspace, and perform fuzzy clustering simultaneously. Since each feature can be well quantified in terms of its importance, the proposed model exhibits the sparsity and robustness of fuzzy clustering. The intrinsic geometrical structures of data can also be preserved while enhancing the interpretability of clustering tasks. Extensive experimental results show that WSFC can allocate appropriate weights to different features according to data distributions and clustering tasks and achieve superior performance compared to other clustering models on real-world datasets.

现有的子空间聚类方法通常包含两个阶段,即寻找数据的低维子空间,然后在子空间中进行聚类。因此,如何找到更能代表原始数据的子空间就成了一个研究难题。然而,大多数已报道的方法都建立在不同特征贡献相等的前提下,这对于实际场景来说可能并不理想,即重要特征的贡献可能会被大量冗余特征所淹没。本研究提出了一种具有局部性保持机制的加权子空间模糊聚类(WSFC)模型,它可以自适应地捕捉不同特征的重要性,实现最优的低维子空间,并同时进行模糊聚类。由于每个特征的重要性都可以很好地量化,因此所提出的模型表现出了模糊聚类的稀疏性和鲁棒性。在增强聚类任务可解释性的同时,还能保留数据的内在几何结构。广泛的实验结果表明,WSFC 可以根据数据分布和聚类任务为不同特征分配适当的权重,并在实际数据集上取得优于其他聚类模型的性能。
{"title":"Weighted Subspace Fuzzy Clustering with Adaptive Projection","authors":"Jie Zhou,&nbsp;Chucheng Huang,&nbsp;Can Gao,&nbsp;Yangbo Wang,&nbsp;Xinrui Shen,&nbsp;Xu Wu","doi":"10.1155/2024/6696775","DOIUrl":"https://doi.org/10.1155/2024/6696775","url":null,"abstract":"<p>Available subspace clustering methods often contain two stages, finding low-dimensional subspaces of data and then conducting clustering in the subspaces. Therefore, how to find the subspaces that better represent the original data becomes a research challenge. However, most of the reported methods are based on the premise that the contributions of different features are equal, which may not be ideal for real scenarios, i.e., the contributions of the important features may be overwhelmed by a large amount of redundant features. In this study, a weighted subspace fuzzy clustering (WSFC) model with a locality preservation mechanism is presented, which can adaptively capture the importance of different features, achieve an optimal lower-dimensional subspace, and perform fuzzy clustering simultaneously. Since each feature can be well quantified in terms of its importance, the proposed model exhibits the sparsity and robustness of fuzzy clustering. The intrinsic geometrical structures of data can also be preserved while enhancing the interpretability of clustering tasks. Extensive experimental results show that WSFC can allocate appropriate weights to different features according to data distributions and clustering tasks and achieve superior performance compared to other clustering models on real-world datasets.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141165031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Aquila Optimizer-Based Hybrid Predictive Model for Traffic Congestion in an IoT-Enabled Smart City 基于 Aquila 优化器的物联网智能城市交通拥堵混合预测模型
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-01-31 DOI: 10.1155/2024/5577278
Ayushi Chahal, Preeti Gulia, Nasib Singh Gill, Nishat Sultana

Effective traffic congestion prediction is need of the hour in a modern smart city to save time and improve the quality of life for citizens. In this study, AB_AO (ARIMA Bi-LSTM using Aquila optimizer), a hybrid predictive model, is proposed using the most effective time-series data prediction statistical model ARIMA (Autoregressive Integrated Moving Average) and sequential predictive Deep Learning (DL) technique LSTM (Long Short-Term Memory) which helps in traffic congestion prediction with a minimum error rate. Also, the Aquila optimizer (AO) is used to elevate the adequacy of the AB_AO model. Three road traffic datasets of different cities from the “CityPulse EU FP7 project” are used to implement the proposed hybrid model. In a time-series dataset, two components need to be handled with care, i.e., linear and nonlinear. In this study, the ARIMA model has been used to manage linear components and Bi-LSTM is used to handle nonlinear components of the time-series dataset. The Aquila Optimizer (AO) is used for hyperparametric tuning to enhance the performance of Bi-LSTM. Error measurement parameters like the Mean Absolute Error (MAE), Mean Squared Error (MSE), and Mean Absolute Percentage Error (MAPE) are used to validate the results. A detailed mathematical and empirical analysis is given to justify the performance of the AB_AO model using an ablation study and comparative analysis. The AB_AO model acquires more stable and precise results with MSE as 18.78, MAE as 3.18, and MAPE as 0.21 than other models. It may further help to predict the vehicle count on the road, which may be of great help in reducing wastage of time in traffic congestion.

现代智能城市需要有效的交通拥堵预测,以节省时间并提高市民的生活质量。本研究提出了一种混合预测模型 AB_AO(ARIMA Bi-LSTM using Aquila optimizer),它采用了最有效的时间序列数据预测统计模型 ARIMA(自回归整合移动平均)和序列预测深度学习(DL)技术 LSTM(长短期记忆),有助于以最小的误差率进行交通拥堵预测。此外,Aquila 优化器(AO)也用于提高 AB_AO 模型的适当性。为实现所提出的混合模型,我们使用了 "CityPulse EU FP7 项目 "中不同城市的三个道路交通数据集。在时间序列数据集中,需要谨慎处理两个组成部分,即线性和非线性。在本研究中,ARIMA 模型用于管理线性成分,Bi-LSTM 用于处理时间序列数据集的非线性成分。Aquila 优化器 (AO) 用于超参数调整,以提高 Bi-LSTM 的性能。平均绝对误差 (MAE)、平均平方误差 (MSE) 和平均绝对百分比误差 (MAPE) 等误差测量参数用于验证结果。详细的数学和实证分析证明了 AB_AO 模型的性能,并使用了消融研究和比较分析。与其他模型相比,AB_AO 模型获得了更加稳定和精确的结果,MSE 为 18.78,MAE 为 3.18,MAPE 为 0.21。它可以进一步帮助预测道路上的车辆数量,对减少交通拥堵中的时间浪费有很大帮助。
{"title":"Aquila Optimizer-Based Hybrid Predictive Model for Traffic Congestion in an IoT-Enabled Smart City","authors":"Ayushi Chahal,&nbsp;Preeti Gulia,&nbsp;Nasib Singh Gill,&nbsp;Nishat Sultana","doi":"10.1155/2024/5577278","DOIUrl":"https://doi.org/10.1155/2024/5577278","url":null,"abstract":"<p>Effective traffic congestion prediction is need of the hour in a modern smart city to save time and improve the quality of life for citizens. In this study, AB_AO (ARIMA Bi-LSTM using Aquila optimizer), a hybrid predictive model, is proposed using the most effective time-series data prediction statistical model ARIMA (Autoregressive Integrated Moving Average) and sequential predictive Deep Learning (DL) technique LSTM (Long Short-Term Memory) which helps in traffic congestion prediction with a minimum error rate. Also, the Aquila optimizer (AO) is used to elevate the adequacy of the AB_AO model. Three road traffic datasets of different cities from the “CityPulse EU FP7 project” are used to implement the proposed hybrid model. In a time-series dataset, two components need to be handled with care, i.e., linear and nonlinear. In this study, the ARIMA model has been used to manage linear components and Bi-LSTM is used to handle nonlinear components of the time-series dataset. The Aquila Optimizer (AO) is used for hyperparametric tuning to enhance the performance of Bi-LSTM. Error measurement parameters like the Mean Absolute Error (MAE), Mean Squared Error (MSE), and Mean Absolute Percentage Error (MAPE) are used to validate the results. A detailed mathematical and empirical analysis is given to justify the performance of the AB_AO model using an ablation study and comparative analysis. The AB_AO model acquires more stable and precise results with MSE as 18.78, MAE as 3.18, and MAPE as 0.21 than other models. It may further help to predict the vehicle count on the road, which may be of great help in reducing wastage of time in traffic congestion.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141165030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Greedy-Mine: A Profitable Mining Attack Strategy in Bitcoin-NG 贪婪挖矿比特币-NG 中有利可图的挖矿攻击策略
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-01-29 DOI: 10.1155/2024/9998126
Junjie Hu, Zhe Jiang, Chunxiang Xu

Bitcoin-NG is an extensible blockchain protocol based on the same trust model as Bitcoin. It divides each epoch into one keyblock and multiple microblocks, effectively improving the transaction processing capacity. Bitcoin-NG adopts a special incentive mechanism (i.e., the transaction fees in each epoch are split to the current and next leader) to maintain its security. However, there are some limitations to the existing incentive analysis of Bitcoin-NG in recent works. First, the incentive division method of Bitcoin-NG only includes some specific mining attack strategies of the adversary, while ignoring more stubborn attack strategies. Second, once adversaries find a whale transaction, they will deviate from the honest mining strategies to obtain an extra reward. In this paper, we are committed to solving these two limitations. First, we propose a novel mining strategy named Greedy-Mine attack. Then, we formulate a Markov reward process (MRP) model to analyze the competition of honest miners and adversaries. Furthermore, we analyze the extra reward of adversaries and summarize the mining power proportion required for malicious adversaries to launch Greedy-Mine to obtain extra returns. Meanwhile, we make a backward-compatibility progressive modification to Bitcoin-NG protocol that would raise the threshold of propagation factor from 0 to 1. Finally, we get the winning condition of adversaries when adopting Greedy-Mine, compared with honest mining. Simulation and experimental results indicate that Bitcoin-NG is not incentive compatible, which is vulnerable to Greedy-Mine attack.

比特币-NG 是一种可扩展的区块链协议,与比特币基于相同的信任模型。它将每个纪元分为一个关键区块和多个微区块,有效提高了交易处理能力。Bitcoin-NG 采用一种特殊的激励机制(即每个纪元的交易费用分给当前和下一个领导者)来维护其安全性。然而,近年来对 Bitcoin-NG 的现有激励分析存在一些局限性。首先,Bitcoin-NG 的激励划分方法只包括了对手的一些特定挖矿攻击策略,而忽略了更顽固的攻击策略。其次,对手一旦发现鲸鱼交易,就会偏离诚实的挖矿策略以获取额外奖励。在本文中,我们致力于解决这两个局限性。首先,我们提出了一种名为 "贪婪挖矿"(Greedy-Mine)攻击的新型挖矿策略。然后,我们建立了一个马尔可夫奖励过程(MRP)模型来分析诚实矿工和对手的竞争。此外,我们还分析了对手的额外奖励,并总结了恶意对手发动 Greedy-Mine 获得额外回报所需的矿力比例。同时,我们对 Bitcoin-NG 协议进行了向后兼容的渐进式修改,将传播因子的阈值从 0 提高到 1。最后,与诚实挖矿相比,我们得到了对手采用贪婪挖矿的获胜条件。模拟和实验结果表明,Bitcoin-NG 不兼容激励机制,容易受到 Greedy-Mine 攻击。
{"title":"Greedy-Mine: A Profitable Mining Attack Strategy in Bitcoin-NG","authors":"Junjie Hu,&nbsp;Zhe Jiang,&nbsp;Chunxiang Xu","doi":"10.1155/2024/9998126","DOIUrl":"https://doi.org/10.1155/2024/9998126","url":null,"abstract":"<p>Bitcoin-NG is an extensible blockchain protocol based on the same trust model as Bitcoin. It divides each epoch into one keyblock and multiple microblocks, effectively improving the transaction processing capacity. Bitcoin-NG adopts a special incentive mechanism (i.e., the transaction fees in each epoch are split to the current and next leader) to maintain its security. However, there are some limitations to the existing incentive analysis of Bitcoin-NG in recent works. First, the incentive division method of Bitcoin-NG only includes some specific mining attack strategies of the adversary, while ignoring more stubborn attack strategies. Second, once adversaries find a whale transaction, they will deviate from the honest mining strategies to obtain an extra reward. In this paper, we are committed to solving these two limitations. First, we propose a novel mining strategy named Greedy-Mine attack. Then, we formulate a Markov reward process (MRP) model to analyze the competition of honest miners and adversaries. Furthermore, we analyze the extra reward of adversaries and summarize the mining power proportion required for malicious adversaries to launch Greedy-Mine to obtain extra returns. Meanwhile, we make a backward-compatibility progressive modification to Bitcoin-NG protocol that would raise the threshold of propagation factor from 0 to 1. Finally, we get the winning condition of adversaries when adopting Greedy-Mine, compared with honest mining. Simulation and experimental results indicate that Bitcoin-NG is not incentive compatible, which is vulnerable to Greedy-Mine attack.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141164933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-Driven Digital Twin Model for Reliable Lithium-Ion Battery Discharge Capacity Predictions 用于可靠预测锂离子电池放电容量的人工智能驱动数字双胞胎模型
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-01-29 DOI: 10.1155/2024/8185044
Pranav Nair, Vinay Vakharia, Milind Shah, Yogesh Kumar, Marcin Woźniak, Jana Shafi, Muhammad Fazal Ijaz

The present study proposes a novel method for predicting the discharge capabilities of lithium-ion (Li-ion) batteries using a digital twin model in practice. By combining cutting-edge machine learning techniques, such as AdaBoost and long short-term memory (LSTM) network, with a semiempirical mathematical structure, the digital twin (DT)—a virtual representation that mimics the behavior of actual batteries in real time is constructed. Various metaheuristic optimization methods, such as antlion, grey wolf optimization (GWO), and improved grey wolf optimization (IGWO), are used to adjust hyperparameters in order to optimize the models. As indicators of performance, mean absolute error (MAE) and root-mean-square error (RMSE) are applied to the models after they have undergone extensive training and ten-fold cross-validation. The models are rigorously trained and cross-validated using the NASA battery aging dataset, a widely accepted benchmark dataset for battery research. The IGWO-AdaBoost digital twin model emerges as the standout performer, achieving exceptional accuracy in predicting the discharge capacity. This model demonstrates the lowest mean absolute error (MAE) of 0.01, showcasing its superior precision in estimating discharge capabilities. Additionally, the root mean square error (RMSE) for the IGWO-AdaBoost DT model is also the lowest at 0.01. The findings of this study offer insightful information about the potential utilization of the digital twin model to accurately predict the discharge capacity of batteries.

本研究提出了一种在实践中使用数字孪生模型预测锂离子(Li-ion)电池放电能力的新方法。通过将 AdaBoost 和长短期记忆(LSTM)网络等尖端机器学习技术与半经验数学结构相结合,构建了数字孪生(DT)--一种模仿实际电池实时行为的虚拟表示。为了优化模型,使用了各种元启发式优化方法,如蚂蚁、灰狼优化(GWO)和改进的灰狼优化(IGWO)来调整超参数。作为性能指标,平均绝对误差(MAE)和均方根误差(RMSE)被应用于经过大量训练和十倍交叉验证的模型。这些模型使用 NASA 电池老化数据集进行了严格的训练和交叉验证,该数据集是电池研究领域公认的基准数据集。IGWO-AdaBoost 数字孪生模型表现突出,在预测放电容量方面达到了极高的准确度。该模型的平均绝对误差(MAE)最低,仅为 0.01,显示了其在估计放电能力方面的卓越精度。此外,IGWO-AdaBoost DT 模型的均方根误差 (RMSE) 也最小,仅为 0.01。这项研究的结果为利用数字孪生模型准确预测电池放电能力提供了深刻的信息。
{"title":"AI-Driven Digital Twin Model for Reliable Lithium-Ion Battery Discharge Capacity Predictions","authors":"Pranav Nair,&nbsp;Vinay Vakharia,&nbsp;Milind Shah,&nbsp;Yogesh Kumar,&nbsp;Marcin Woźniak,&nbsp;Jana Shafi,&nbsp;Muhammad Fazal Ijaz","doi":"10.1155/2024/8185044","DOIUrl":"10.1155/2024/8185044","url":null,"abstract":"<p>The present study proposes a novel method for predicting the discharge capabilities of lithium-ion (Li-ion) batteries using a digital twin model in practice. By combining cutting-edge machine learning techniques, such as AdaBoost and long short-term memory (LSTM) network, with a semiempirical mathematical structure, the digital twin (DT)—a virtual representation that mimics the behavior of actual batteries in real time is constructed. Various metaheuristic optimization methods, such as antlion, grey wolf optimization (GWO), and improved grey wolf optimization (IGWO), are used to adjust hyperparameters in order to optimize the models. As indicators of performance, mean absolute error (MAE) and root-mean-square error (RMSE) are applied to the models after they have undergone extensive training and ten-fold cross-validation. The models are rigorously trained and cross-validated using the NASA battery aging dataset, a widely accepted benchmark dataset for battery research. The IGWO-AdaBoost digital twin model emerges as the standout performer, achieving exceptional accuracy in predicting the discharge capacity. This model demonstrates the lowest mean absolute error (MAE) of 0.01, showcasing its superior precision in estimating discharge capabilities. Additionally, the root mean square error (RMSE) for the IGWO-AdaBoost DT model is also the lowest at 0.01. The findings of this study offer insightful information about the potential utilization of the digital twin model to accurately predict the discharge capacity of batteries.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140486992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FPT-Former: A Flexible Parallel Transformer of Recognizing Depression by Using Audiovisual Expert-Knowledge-Based Multimodal Measures FPT-Former:利用基于视听专家知识的多模态测量方法识别抑郁症的灵活并行转换器
IF 7 2区 计算机科学 Q1 Mathematics Pub Date : 2024-01-29 DOI: 10.1155/2024/1564574
Yifu Li, Xueping Yang, Meng Zhao, Zihao Wang, Yudong Yao, Wei Qian, Shouliang Qi

Background and Objective. Currently, depression is a widespread global issue that imposes a significant burden and disability on individuals, families, and society. Deep learning (DL) has emerged as a valuable approach for automatically detecting depression by extracting cues from audiovisual data and making a diagnosis. PHQ-8 is considered a validated diagnostic tool for depressive disorders in clinical studies, and the objective of this experiment is to improve the accuracy of PHQ-8 prediction. Furthermore, this paper aims to demonstrate the effectiveness of expert knowledge in depression diagnosis and discuss a novel multimodal network architecture. Methods. This research paper focuses on multimodal depression analysis, proposing a flexible parallel transformer (FPT) model capable of extracting data from three distinct modalities (i.e., one video and two audio descriptors). The FPT-Former model incorporates three paths, each using expert-knowledge-based descriptors from one modality as inputs. These descriptors are represented into 32 features by the encoder part of a transformer module, and these features are fused to realize the final regression of PHQ-8 score. The extended distress analysis interview corpus (E-DAIC) is an expansion of WOZ-DAIC which comprises semiclinical interviews intended to assist in the diagnosis of psychological distress conditions. It encompasses a sample size of 275 participants, and in this study, it was utilized to test the model in a way of 10-fold cross-validation. Results. The FPT presented herein achieved comparable performance to the state-of-the-art works, with a root mean square error (RMSE) of 4.80 and a mean absolute error (MAE) of 4.58. The ablation experiments demonstrate that the three-modality-fused model outperforms other two-modality-fused and single-modality models. While using a PHQ-8 score threshold of 10, the accuracy of the depression classification is 0.79. Conclusions. Leveraging the strength of expert-knowledge-based multimodal measures and parallel transformer structure, the FPT model exhibits promising performance in depression detection. This model improved the accuracy of depression diagnosis through audio and video, and it also proved the effectiveness of using expert-knowledge in the diagnosis of depression. The traits of flexible structure, high predictive efficiency, and secure privacy protection make our model a promotable intelligent system in mental healthcare.

背景和目的。目前,抑郁症是一个普遍的全球性问题,给个人、家庭和社会带来了巨大的负担和残疾。通过从视听数据中提取线索并做出诊断,深度学习(DL)已成为自动检测抑郁症的一种有价值的方法。在临床研究中,PHQ-8 被认为是一种有效的抑郁障碍诊断工具,本实验的目的是提高 PHQ-8 预测的准确性。此外,本文还旨在证明专家知识在抑郁症诊断中的有效性,并讨论一种新颖的多模态网络架构。研究方法本文的研究重点是多模态抑郁分析,提出了一种能从三种不同模态(即一个视频和两个音频描述符)中提取数据的灵活并行变换器(FPT)模型。FPT-Former 模型包含三条路径,每条路径都使用一种模式中基于专家知识的描述符作为输入。转换器模块的编码器部分将这些描述符表示成 32 个特征,然后将这些特征融合起来,实现 PHQ-8 分数的最终回归。扩展的心理困扰分析访谈语料库(E-DAIC)是 WOZ-DAIC 的扩展,由半临床访谈组成,旨在帮助诊断心理困扰状况。它包含 275 个参与者的样本量,在本研究中,它被用来以 10 倍交叉验证的方式测试模型。结果。本文介绍的 FPT 与最先进的作品性能相当,均方根误差 (RMSE) 为 4.80,平均绝对误差 (MAE) 为 4.58。消融实验表明,三模态融合模型优于其他双模态融合模型和单模态模型。当 PHQ-8 评分阈值为 10 时,抑郁分类的准确率为 0.79。结论利用基于专家知识的多模态测量和并行变换器结构的优势,FPT 模型在抑郁症检测方面表现出了良好的性能。该模型通过音频和视频提高了抑郁症诊断的准确性,同时也证明了利用专家知识诊断抑郁症的有效性。灵活的结构、较高的预测效率和安全的隐私保护使我们的模型成为精神医疗领域可推广的智能系统。
{"title":"FPT-Former: A Flexible Parallel Transformer of Recognizing Depression by Using Audiovisual Expert-Knowledge-Based Multimodal Measures","authors":"Yifu Li,&nbsp;Xueping Yang,&nbsp;Meng Zhao,&nbsp;Zihao Wang,&nbsp;Yudong Yao,&nbsp;Wei Qian,&nbsp;Shouliang Qi","doi":"10.1155/2024/1564574","DOIUrl":"10.1155/2024/1564574","url":null,"abstract":"<p><i>Background and Objective</i>. Currently, depression is a widespread global issue that imposes a significant burden and disability on individuals, families, and society. Deep learning (DL) has emerged as a valuable approach for automatically detecting depression by extracting cues from audiovisual data and making a diagnosis. PHQ-8 is considered a validated diagnostic tool for depressive disorders in clinical studies, and the objective of this experiment is to improve the accuracy of PHQ-8 prediction. Furthermore, this paper aims to demonstrate the effectiveness of expert knowledge in depression diagnosis and discuss a novel multimodal network architecture. <i>Methods</i>. This research paper focuses on multimodal depression analysis, proposing a flexible parallel transformer (FPT) model capable of extracting data from three distinct modalities (i.e., one video and two audio descriptors). The FPT-Former model incorporates three paths, each using expert-knowledge-based descriptors from one modality as inputs. These descriptors are represented into 32 features by the encoder part of a transformer module, and these features are fused to realize the final regression of PHQ-8 score. The extended distress analysis interview corpus (E-DAIC) is an expansion of WOZ-DAIC which comprises semiclinical interviews intended to assist in the diagnosis of psychological distress conditions. It encompasses a sample size of 275 participants, and in this study, it was utilized to test the model in a way of 10-fold cross-validation. <i>Results</i>. The FPT presented herein achieved comparable performance to the state-of-the-art works, with a root mean square error (RMSE) of 4.80 and a mean absolute error (MAE) of 4.58. The ablation experiments demonstrate that the three-modality-fused model outperforms other two-modality-fused and single-modality models. While using a PHQ-8 score threshold of 10, the accuracy of the depression classification is 0.79. <i>Conclusions</i>. Leveraging the strength of expert-knowledge-based multimodal measures and parallel transformer structure, the FPT model exhibits promising performance in depression detection. This model improved the accuracy of depression diagnosis through audio and video, and it also proved the effectiveness of using expert-knowledge in the diagnosis of depression. The traits of flexible structure, high predictive efficiency, and secure privacy protection make our model a promotable intelligent system in mental healthcare.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":null,"pages":null},"PeriodicalIF":7.0,"publicationDate":"2024-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140487911","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Intelligent Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1