护理记录文件编码切换自动语音识别:系统开发与评估。

JMIR nursing Pub Date : 2022-12-07 DOI:10.2196/37562
Shih-Yen Hou, Ya-Lun Wu, Kai-Ching Chen, Ting-An Chang, Yi-Min Hsu, Su-Jung Chuang, Ying Chang, Kai-Cheng Hsu
{"title":"护理记录文件编码切换自动语音识别:系统开发与评估。","authors":"Shih-Yen Hou,&nbsp;Ya-Lun Wu,&nbsp;Kai-Ching Chen,&nbsp;Ting-An Chang,&nbsp;Yi-Min Hsu,&nbsp;Su-Jung Chuang,&nbsp;Ying Chang,&nbsp;Kai-Cheng Hsu","doi":"10.2196/37562","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Taiwan has insufficient nursing resources due to the high turnover rate of health care providers. Therefore, reducing the heavy workload of these employees is essential. Herein, speech transcription, which has various potential clinical applications, was employed for the documentation of nursing records. The requirement of including only one speaker per transcription facilitated data collection and system development. Moreover, authorization from patients was unnecessary.</p><p><strong>Objective: </strong>The aim of this study was to construct a speech recognition system for nursing records such that health care providers can complete nursing records without typing or with only a few edits.</p><p><strong>Methods: </strong>Nursing records in Taiwan are mainly written in Mandarin, with technical terms and abbreviations presented in both Mandarin and English. Therefore, the training set consisted of English code-switching information. Next, transfer learning (TL) and meta-TL (MTL) methods, which perform favorably in code-switching scenarios, were applied.</p><p><strong>Results: </strong>As of September 2021, the China Medical University Hospital Artificial Intelligence Speech (CMaiSpeech) data set was established by manually annotating approximately 100 hours of recordings from 525 speakers. The word error rate (WER) of the benchmark model of syllable-based TL was 29.54% in code-switching. The WER of the proposed model of syllable-based MTL was 22.20% in code-switching. The test set comprised 17,247 words. Moreover, in a clinical case, the proposed model of syllable-based MTL yielded a WER of 31.06% in code-switching. The clinical test set contained 1159 words.</p><p><strong>Conclusions: </strong>This paper has two main contributions. First, the CMaiSpeech data set-a Mandarin-English corpus-has been established. Health care providers in Taiwan are often compelled to use a mixture of Mandarin and English in nursing records. Second, an automatic speech recognition system for nursing record document conversion was proposed. The proposed system can shorten the work handover time and further reduce the workload of health care providers.</p>","PeriodicalId":73556,"journal":{"name":"JMIR nursing","volume":"5 1","pages":"e37562"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9773023/pdf/","citationCount":"0","resultStr":"{\"title\":\"Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation.\",\"authors\":\"Shih-Yen Hou,&nbsp;Ya-Lun Wu,&nbsp;Kai-Ching Chen,&nbsp;Ting-An Chang,&nbsp;Yi-Min Hsu,&nbsp;Su-Jung Chuang,&nbsp;Ying Chang,&nbsp;Kai-Cheng Hsu\",\"doi\":\"10.2196/37562\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Taiwan has insufficient nursing resources due to the high turnover rate of health care providers. Therefore, reducing the heavy workload of these employees is essential. Herein, speech transcription, which has various potential clinical applications, was employed for the documentation of nursing records. The requirement of including only one speaker per transcription facilitated data collection and system development. Moreover, authorization from patients was unnecessary.</p><p><strong>Objective: </strong>The aim of this study was to construct a speech recognition system for nursing records such that health care providers can complete nursing records without typing or with only a few edits.</p><p><strong>Methods: </strong>Nursing records in Taiwan are mainly written in Mandarin, with technical terms and abbreviations presented in both Mandarin and English. Therefore, the training set consisted of English code-switching information. Next, transfer learning (TL) and meta-TL (MTL) methods, which perform favorably in code-switching scenarios, were applied.</p><p><strong>Results: </strong>As of September 2021, the China Medical University Hospital Artificial Intelligence Speech (CMaiSpeech) data set was established by manually annotating approximately 100 hours of recordings from 525 speakers. The word error rate (WER) of the benchmark model of syllable-based TL was 29.54% in code-switching. The WER of the proposed model of syllable-based MTL was 22.20% in code-switching. The test set comprised 17,247 words. Moreover, in a clinical case, the proposed model of syllable-based MTL yielded a WER of 31.06% in code-switching. The clinical test set contained 1159 words.</p><p><strong>Conclusions: </strong>This paper has two main contributions. First, the CMaiSpeech data set-a Mandarin-English corpus-has been established. Health care providers in Taiwan are often compelled to use a mixture of Mandarin and English in nursing records. Second, an automatic speech recognition system for nursing record document conversion was proposed. The proposed system can shorten the work handover time and further reduce the workload of health care providers.</p>\",\"PeriodicalId\":73556,\"journal\":{\"name\":\"JMIR nursing\",\"volume\":\"5 1\",\"pages\":\"e37562\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9773023/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR nursing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2196/37562\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR nursing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/37562","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景:台湾护理资源不足,主要是因为医疗服务提供者的高流动率。因此,减轻这些员工繁重的工作量是必不可少的。本研究将语音转录技术应用于护理记录,具有多种潜在的临床应用价值。每次转录只包含一个说话人的要求促进了数据收集和系统开发。此外,也不需要患者的授权。目的:构建护理记录语音识别系统,使医护人员无需打字或只需少量修改即可完成护理记录。方法:台湾地区护理记录以中文书写为主,专业术语及缩略语以中文和英文同时呈现。因此,训练集由英语码换信息组成。接下来,应用迁移学习(TL)和元迁移学习(MTL)方法,它们在代码转换场景中表现良好。截至2021年9月,中国医科大学附属医院人工智能语音(cmaisspeech)数据集通过对525名说话者约100小时的录音进行手动注释而建立。基于音节的翻译基准模型在语码转换时的错误率为29.54%。所提出的基于音节的MTL模型在语码转换方面的应答率为22.20%。测试集包含17,247个单词。此外,在一个临床案例中,基于音节的MTL模型在语码转换方面的应答率为31.06%。临床测试集包含1159个单词。结论:本文有两个主要贡献。首先,建立了cmaisspeech数据集——汉语-英语语料库。台湾的医疗服务提供者经常被迫在护理记录中使用普通话和英语的混合语言。其次,提出了一种用于护理记录文件转换的语音自动识别系统。该系统可以缩短工作交接时间,进一步减少医护人员的工作量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

摘要图片

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation.

Background: Taiwan has insufficient nursing resources due to the high turnover rate of health care providers. Therefore, reducing the heavy workload of these employees is essential. Herein, speech transcription, which has various potential clinical applications, was employed for the documentation of nursing records. The requirement of including only one speaker per transcription facilitated data collection and system development. Moreover, authorization from patients was unnecessary.

Objective: The aim of this study was to construct a speech recognition system for nursing records such that health care providers can complete nursing records without typing or with only a few edits.

Methods: Nursing records in Taiwan are mainly written in Mandarin, with technical terms and abbreviations presented in both Mandarin and English. Therefore, the training set consisted of English code-switching information. Next, transfer learning (TL) and meta-TL (MTL) methods, which perform favorably in code-switching scenarios, were applied.

Results: As of September 2021, the China Medical University Hospital Artificial Intelligence Speech (CMaiSpeech) data set was established by manually annotating approximately 100 hours of recordings from 525 speakers. The word error rate (WER) of the benchmark model of syllable-based TL was 29.54% in code-switching. The WER of the proposed model of syllable-based MTL was 22.20% in code-switching. The test set comprised 17,247 words. Moreover, in a clinical case, the proposed model of syllable-based MTL yielded a WER of 31.06% in code-switching. The clinical test set contained 1159 words.

Conclusions: This paper has two main contributions. First, the CMaiSpeech data set-a Mandarin-English corpus-has been established. Health care providers in Taiwan are often compelled to use a mixture of Mandarin and English in nursing records. Second, an automatic speech recognition system for nursing record document conversion was proposed. The proposed system can shorten the work handover time and further reduce the workload of health care providers.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.20
自引率
0.00%
发文量
0
审稿时长
16 weeks
期刊最新文献
Impact of Attached File Formats on the Performance of ChatGPT-4 on the Japanese National Nursing Examination: Evaluation Study. Exploring Educators' Perceptions and Experiences of Online Teaching to Foster Caring Profession Students' Development of Virtual Caring Skills: Sequential Explanatory Mixed Methods Study. Unobtrusive Nighttime Movement Monitoring to Support Nursing Home Continence Care: Algorithm Development and Validation Study. Assessing Visitor Expectations of AI Nursing Robots in Hospital Settings: Cross-Sectional Study Using the Kano Model. Calculating Optimal Patient to Nursing Capacity: Comparative Analysis of Traditional and New Methods.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1