[基于深度学习的电子健康记录多模态数据融合研究进展]。

Q4 Medicine 生物医学工程学杂志 Pub Date : 2024-10-25 DOI:10.7507/1001-5515.202310011

Yong Fan, Zhengbo Zhang, Jing Wang

{"title":"[基于深度学习的电子健康记录多模态数据融合研究进展]。","authors":"Yong Fan, Zhengbo Zhang, Jing Wang","doi":"10.7507/1001-5515.202310011","DOIUrl":null,"url":null,"abstract":"Currently, the development of deep learning-based multimodal learning is advancing rapidly, and is widely used in the field of artificial intelligence-generated content, such as image-text conversion and image-text generation. Electronic health records are digital information such as numbers, charts, and texts generated by medical staff using information systems in the process of medical activities. The multimodal fusion method of electronic health records based on deep learning can assist medical staff in the medical field to comprehensively analyze a large number of medical multimodal data generated in the process of diagnosis and treatment, thereby achieving accurate diagnosis and timely intervention for patients. In this article, we firstly introduce the methods and development trends of deep learning-based multimodal data fusion. Secondly, we summarize and compare the fusion of structured electronic medical records with other medical data such as images and texts, focusing on the clinical application types, sample sizes, and the fusion methods involved in the research. Through the analysis and summary of the literature, the deep learning methods for fusion of different medical modal data are as follows: first, selecting the appropriate pre-trained model according to the data modality for feature representation and post-fusion, and secondly, fusing based on the attention mechanism. Lastly, the difficulties encountered in multimodal medical data fusion and its developmental directions, including modeling methods, evaluation and application of models, are discussed. Through this review article, we expect to provide reference information for the establishment of models that can comprehensively utilize various modal medical data.","PeriodicalId":39324,"journal":{"name":"生物医学工程学杂志","volume":"41 5","pages":"1062-1071"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11527755/pdf/","citationCount":"0","resultStr":"{\"title\":\"[Research progress on electronic health records multimodal data fusion based on deep learning].\",\"authors\":\"Yong Fan, Zhengbo Zhang, Jing Wang\",\"doi\":\"10.7507/1001-5515.202310011\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Currently, the development of deep learning-based multimodal learning is advancing rapidly, and is widely used in the field of artificial intelligence-generated content, such as image-text conversion and image-text generation. Electronic health records are digital information such as numbers, charts, and texts generated by medical staff using information systems in the process of medical activities. The multimodal fusion method of electronic health records based on deep learning can assist medical staff in the medical field to comprehensively analyze a large number of medical multimodal data generated in the process of diagnosis and treatment, thereby achieving accurate diagnosis and timely intervention for patients. In this article, we firstly introduce the methods and development trends of deep learning-based multimodal data fusion. Secondly, we summarize and compare the fusion of structured electronic medical records with other medical data such as images and texts, focusing on the clinical application types, sample sizes, and the fusion methods involved in the research. Through the analysis and summary of the literature, the deep learning methods for fusion of different medical modal data are as follows: first, selecting the appropriate pre-trained model according to the data modality for feature representation and post-fusion, and secondly, fusing based on the attention mechanism. Lastly, the difficulties encountered in multimodal medical data fusion and its developmental directions, including modeling methods, evaluation and application of models, are discussed. Through this review article, we expect to provide reference information for the establishment of models that can comprehensively utilize various modal medical data.\",\"PeriodicalId\":39324,\"journal\":{\"name\":\"生物医学工程学杂志\",\"volume\":\"41 5\",\"pages\":\"1062-1071\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-10-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11527755/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"生物医学工程学杂志\",\"FirstCategoryId\":\"1087\",\"ListUrlMain\":\"https://doi.org/10.7507/1001-5515.202310011\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"生物医学工程学杂志","FirstCategoryId":"1087","ListUrlMain":"https://doi.org/10.7507/1001-5515.202310011","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 0

摘要

目前，基于深度学习的多模态学习发展迅速，并广泛应用于人工智能生成内容领域，如图像-文本转换、图像-文本生成等。电子病历是医务人员在医疗活动过程中利用信息系统生成的数字、图表、文本等数字化信息。基于深度学习的电子健康档案多模态融合方法可以帮助医疗领域的医务人员对诊疗过程中产生的大量医疗多模态数据进行综合分析，从而实现对患者的准确诊断和及时干预。本文首先介绍了基于深度学习的多模态数据融合的方法和发展趋势。其次，我们对结构化电子病历与图像、文本等其他医疗数据的融合进行了总结和比较，重点介绍了研究中涉及的临床应用类型、样本量以及融合方法。通过对文献的分析和总结，不同医疗模态数据融合的深度学习方法主要有以下几种：首先，根据数据模态选择合适的预训练模型进行特征表示和后融合；其次，基于注意力机制进行融合。最后，讨论了多模态医学数据融合中遇到的困难及其发展方向，包括建模方法、模型的评估和应用。我们希望通过这篇综述文章，为建立能综合利用各种模态医疗数据的模型提供参考信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

[Research progress on electronic health records multimodal data fusion based on deep learning].

Currently, the development of deep learning-based multimodal learning is advancing rapidly, and is widely used in the field of artificial intelligence-generated content, such as image-text conversion and image-text generation. Electronic health records are digital information such as numbers, charts, and texts generated by medical staff using information systems in the process of medical activities. The multimodal fusion method of electronic health records based on deep learning can assist medical staff in the medical field to comprehensively analyze a large number of medical multimodal data generated in the process of diagnosis and treatment, thereby achieving accurate diagnosis and timely intervention for patients. In this article, we firstly introduce the methods and development trends of deep learning-based multimodal data fusion. Secondly, we summarize and compare the fusion of structured electronic medical records with other medical data such as images and texts, focusing on the clinical application types, sample sizes, and the fusion methods involved in the research. Through the analysis and summary of the literature, the deep learning methods for fusion of different medical modal data are as follows: first, selecting the appropriate pre-trained model according to the data modality for feature representation and post-fusion, and secondly, fusing based on the attention mechanism. Lastly, the difficulties encountered in multimodal medical data fusion and its developmental directions, including modeling methods, evaluation and application of models, are discussed. Through this review article, we expect to provide reference information for the establishment of models that can comprehensively utilize various modal medical data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊