首页 > 最新文献

Big Data最新文献

英文 中文
Sharing Medical Big Data While Preserving Patient Confidentiality in Innovative Medicines Initiative: A Summary and Case Report from BigData@Heart. 在创新药物倡议中共享医疗大数据同时保护患者机密:来自BigData@Heart.
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-12-01 Epub Date: 2023-10-27 DOI: 10.1089/big.2022.0178
Megan Schröder, Sam H A Muller, Eleni Vradi, Johanna Mielke, Yvonne M F Lim, Fabrice Couvelard, Menno Mostert, Stefan Koudstaal, Marinus J C Eijkemans, Christoph Gerlinger

Sharing individual patient data (IPD) is a simple concept but complex to achieve due to data privacy and data security concerns, underdeveloped guidelines, and legal barriers. Sharing IPD is additionally difficult in big data-driven collaborations such as Bigdata@Heart in the Innovative Medicines Initiative, due to competing interests between diverse consortium members. One project within BigData@Heart, case study 1, needed to pool data from seven heterogeneous data sets: five randomized controlled trials from three different industry partners, and two disease registries. Sharing IPD was not considered feasible due to legal requirements and the sensitive medical nature of these data. In addition, harmonizing the data sets for a federated data analysis was difficult due to capacity constraints and the heterogeneity of the data sets. An alternative option was to share summary statistics through contingency tables. Here it is demonstrated that this method along with anonymization methods to ensure patient anonymity had minimal loss of information. Although sharing IPD should continue to be encouraged and strived for, our approach achieved a good balance between data transparency while protecting patient privacy. It also allowed a successful collaboration between industry and academia.

共享个人患者数据(IPD)是一个简单的概念,但由于数据隐私和数据安全问题、指导方针不完善以及法律障碍,实现起来很复杂。在诸如Bigdata@Heart在创新药物倡议中,由于不同联盟成员之间的利益竞争。一个项目BigData@Heart,案例研究1,需要汇集来自七个异质数据集的数据:来自三个不同行业合作伙伴的五项随机对照试验,以及两个疾病登记处。由于法律要求和这些数据的敏感医学性质,共享IPD被认为是不可行的。此外,由于容量限制和数据集的异质性,统一联邦数据分析的数据集很困难。另一种选择是通过列联表共享汇总统计数据。这里证明了这种方法以及确保患者匿名性的匿名化方法具有最小的信息损失。尽管应该继续鼓励和努力共享IPD,但我们的方法在数据透明度和保护患者隐私之间取得了良好的平衡。它还促成了工业界和学术界之间的成功合作。
{"title":"Sharing Medical Big Data While Preserving Patient Confidentiality in Innovative Medicines Initiative: A Summary and Case Report from BigData@Heart.","authors":"Megan Schröder, Sam H A Muller, Eleni Vradi, Johanna Mielke, Yvonne M F Lim, Fabrice Couvelard, Menno Mostert, Stefan Koudstaal, Marinus J C Eijkemans, Christoph Gerlinger","doi":"10.1089/big.2022.0178","DOIUrl":"10.1089/big.2022.0178","url":null,"abstract":"<p><p>Sharing individual patient data (IPD) is a simple concept but complex to achieve due to data privacy and data security concerns, underdeveloped guidelines, and legal barriers. Sharing IPD is additionally difficult in big data-driven collaborations such as Bigdata@Heart in the Innovative Medicines Initiative, due to competing interests between diverse consortium members. One project within BigData@Heart, case study 1, needed to pool data from seven heterogeneous data sets: five randomized controlled trials from three different industry partners, and two disease registries. Sharing IPD was not considered feasible due to legal requirements and the sensitive medical nature of these data. In addition, harmonizing the data sets for a federated data analysis was difficult due to capacity constraints and the heterogeneity of the data sets. An alternative option was to share summary statistics through contingency tables. Here it is demonstrated that this method along with anonymization methods to ensure patient anonymity had minimal loss of information. Although sharing IPD should continue to be encouraged and strived for, our approach achieved a good balance between data transparency while protecting patient privacy. It also allowed a successful collaboration between industry and academia.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"399-407"},"PeriodicalIF":4.6,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10733752/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"61566098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The incidence and prevalence of coeliac disease in the United Kingdom 英国乳糜泻的发病率和流行率
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-01 DOI: 10.1370/afm.22.s1.5051
Yvonne Nartey, C. Crooks, Joe West, Timothy R. Card, Laila J. Tata
{"title":"The incidence and prevalence of coeliac disease in the United Kingdom","authors":"Yvonne Nartey, C. Crooks, Joe West, Timothy R. Card, Laila J. Tata","doi":"10.1370/afm.22.s1.5051","DOIUrl":"https://doi.org/10.1370/afm.22.s1.5051","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"18 1","pages":""},"PeriodicalIF":4.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139303896","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine Learning Analysis of Serious Illness Conversations Predicts Patient Reports of Feeling Heard & Understood 重症患者对话的机器学习分析可预测患者关于被倾听和理解的报告
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-01 DOI: 10.1370/afm.22.s1.5279
Bob Gramling, Donna Rizzo, Margaret Eppstein, Bradford Demarest
{"title":"Machine Learning Analysis of Serious Illness Conversations Predicts Patient Reports of Feeling Heard & Understood","authors":"Bob Gramling, Donna Rizzo, Margaret Eppstein, Bradford Demarest","doi":"10.1370/afm.22.s1.5279","DOIUrl":"https://doi.org/10.1370/afm.22.s1.5279","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"284 1","pages":""},"PeriodicalIF":4.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139291867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Changes in Reasons for Visits to Primary Care as a Result of the COVID-19 Pandemic: by INTRePID COVID-19 大流行导致初级保健就诊原因的变化:按 INTRePID 分类
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-01 DOI: 10.1370/afm.22.s1.5425
Karen Tu, M. Lapadula
{"title":"Changes in Reasons for Visits to Primary Care as a Result of the COVID-19 Pandemic: by INTRePID","authors":"Karen Tu, M. Lapadula","doi":"10.1370/afm.22.s1.5425","DOIUrl":"https://doi.org/10.1370/afm.22.s1.5425","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"11 1","pages":""},"PeriodicalIF":4.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139301044","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Breast cancer screening during the COVID-19 Pandemic in the United States: Results from real-world health records data 美国 COVID-19 大流行期间的乳腺癌筛查:来自真实世界健康记录数据的结果
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-01 DOI: 10.1370/afm.22.s1.4885
William Curry, Wen-Jan Tuan, Qiushi Chen, Andrew Chung
{"title":"Breast cancer screening during the COVID-19 Pandemic in the United States: Results from real-world health records data","authors":"William Curry, Wen-Jan Tuan, Qiushi Chen, Andrew Chung","doi":"10.1370/afm.22.s1.4885","DOIUrl":"https://doi.org/10.1370/afm.22.s1.4885","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"48 1","pages":""},"PeriodicalIF":4.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139292120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Method for Utilizing Electronic Health Record Data in Condition-specific Research 在特定病症研究中利用电子健康记录数据的新方法
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-01 DOI: 10.1370/afm.22.s1.4955
Tarin Clay, Melissa Filippi, Elise Robertson, Cory B. Lutgen, Elisabeth F. Callen
{"title":"A Novel Method for Utilizing Electronic Health Record Data in Condition-specific Research","authors":"Tarin Clay, Melissa Filippi, Elise Robertson, Cory B. Lutgen, Elisabeth F. Callen","doi":"10.1370/afm.22.s1.4955","DOIUrl":"https://doi.org/10.1370/afm.22.s1.4955","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"12 1","pages":""},"PeriodicalIF":4.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139294842","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Harmonized Healthcare Database across Family Medicine Institutions 全科医疗机构的统一医疗保健数据库
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-01 DOI: 10.1370/afm.22.s1.5404
Chance R. Strenth, David Schneider, U. Sambamoorthi, Sravan Mattevada, Kimberly Fulda, Bhaskar Thakur, Anna Espinoza
{"title":"Harmonized Healthcare Database across Family Medicine Institutions","authors":"Chance R. Strenth, David Schneider, U. Sambamoorthi, Sravan Mattevada, Kimberly Fulda, Bhaskar Thakur, Anna Espinoza","doi":"10.1370/afm.22.s1.5404","DOIUrl":"https://doi.org/10.1370/afm.22.s1.5404","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"14 1","pages":""},"PeriodicalIF":4.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139291188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying the Factors Associated with the Accumulation of Diabetes Complications to Inform a Prediction Tool 确定糖尿病并发症累积的相关因素,为预测工具提供依据
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-01 DOI: 10.1370/afm.22.s1.5071
Winston R. Liaw, Ben King, Omolola E. Adepoju, Jiangtao Luo, Ioannis Kakadiaris, Todd Prewitt, Jessica Dobbins, Pete Womack
{"title":"Identifying the Factors Associated with the Accumulation of Diabetes Complications to Inform a Prediction Tool","authors":"Winston R. Liaw, Ben King, Omolola E. Adepoju, Jiangtao Luo, Ioannis Kakadiaris, Todd Prewitt, Jessica Dobbins, Pete Womack","doi":"10.1370/afm.22.s1.5071","DOIUrl":"https://doi.org/10.1370/afm.22.s1.5071","url":null,"abstract":"","PeriodicalId":51314,"journal":{"name":"Big Data","volume":"23 1","pages":""},"PeriodicalIF":4.6,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139291940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Service Level Agreement Monitoring as a Service: An Independent Monitoring Service for Service Level Agreements in Clouds. 服务级别协议监控即服务:云中服务级别协议的独立监控服务。
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-10-01 Epub Date: 2022-01-24 DOI: 10.1089/big.2021.0274
Afzal Badshah, Ateeqa Jalal, Umar Farooq, Ghani-Ur Rehman, Shahab S Band, Celestine Iwendi

The cloud network is rapidly growing due to a massive increase in interconnected devices and the emergence of different technologies such as the Internet of things, fog computing, and artificial intelligence. In response, cloud computing needs reliable dealings among the service providers, brokers, and consumers. The existing cloud monitoring frameworks such as Amazon Cloud Watch, Paraleap Azure Watch, and Rack Space Cloud Kick work under the control of service providers. They work fine; however, this may create dissatisfaction among customers over Service Level Agreement (SLA) violations. Customers' dissatisfaction may drastically reduce the businesses of service providers. To cope with the earlier mentioned issue and get in line with cloud philosophy, Monitoring as a Service (MaaS), completely independent in nature, is needed for observing and regulating the cloud businesses. However, the existing MaaS frameworks do not address the comprehensive SLA for customer satisfaction and penalties management. This article proposes a reliable framework for monitoring the provider's services by adopting third-party monitoring services with clearcut SLA and penalties management. Since this framework monitors SLA as a cloud monitoring service, it is named as SLA-MaaS. On violations, it penalizes those who are found in breach of terms and condition enlisted in SLA. Simulation results confirmed that the proposed framework adequately satisfies the customers (as well as service providers). This helps in developing a trustworthy relationship among cloud partners and increases customer attention and retention.

由于互联设备的大量增加以及物联网、雾计算和人工智能等不同技术的出现,云网络正在迅速发展。作为回应,云计算需要服务提供商、经纪人和消费者之间的可靠交易。现有的云监控框架,如Amazon cloud Watch、Paraleap Azure Watch和Rack Space cloud Kick,在服务提供商的控制下工作。它们工作良好;然而,这可能会引起客户对违反服务水平协议(SLA)的不满。客户的不满可能会大大减少服务提供商的业务。为了解决前面提到的问题并符合云的理念,监控即服务(MaaS)在本质上是完全独立的,需要用于观察和监管云业务。然而,现有的MaaS框架没有解决客户满意度和处罚管理的全面SLA问题。本文提出了一个可靠的框架,通过采用具有clearcut SLA和惩罚管理的第三方监控服务来监控提供商的服务。由于该框架将SLA作为云监控服务进行监控,因此将其命名为SLA-MaaS。关于违规行为,它惩罚那些被发现违反苏丹解放军招募的条款和条件的人。仿真结果证实,所提出的框架充分满足了客户(以及服务提供商)的要求。这有助于在云合作伙伴之间建立值得信赖的关系,并提高客户的关注度和忠诚度。
{"title":"Service Level Agreement Monitoring as a Service: An Independent Monitoring Service for Service Level Agreements in Clouds.","authors":"Afzal Badshah,&nbsp;Ateeqa Jalal,&nbsp;Umar Farooq,&nbsp;Ghani-Ur Rehman,&nbsp;Shahab S Band,&nbsp;Celestine Iwendi","doi":"10.1089/big.2021.0274","DOIUrl":"10.1089/big.2021.0274","url":null,"abstract":"<p><p>The cloud network is rapidly growing due to a massive increase in interconnected devices and the emergence of different technologies such as the Internet of things, fog computing, and artificial intelligence. In response, cloud computing needs reliable dealings among the service providers, brokers, and consumers. The existing cloud monitoring frameworks such as Amazon Cloud Watch, Paraleap Azure Watch, and Rack Space Cloud Kick work under the control of service providers. They work fine; however, this may create dissatisfaction among customers over Service Level Agreement (SLA) violations. Customers' dissatisfaction may drastically reduce the businesses of service providers. To cope with the earlier mentioned issue and get in line with cloud philosophy, Monitoring as a Service (MaaS), completely independent in nature, is needed for observing and regulating the cloud businesses. However, the existing MaaS frameworks do not address the comprehensive SLA for customer satisfaction and penalties management. This article proposes a reliable framework for monitoring the provider's services by adopting third-party monitoring services with clearcut SLA and penalties management. Since this framework monitors SLA as a cloud monitoring service, it is named as SLA-MaaS. On violations, it penalizes those who are found in breach of terms and condition enlisted in SLA. Simulation results confirmed that the proposed framework adequately satisfies the customers (as well as service providers). This helps in developing a trustworthy relationship among cloud partners and increases customer attention and retention.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"339-354"},"PeriodicalIF":4.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39857084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
HUTNet: An Efficient Convolutional Neural Network for Handwritten Uchen Tibetan Character Recognition. HUTNet:一种高效的卷积神经网络,用于乌琴藏文手写体识别。
IF 4.6 4区 计算机科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-10-01 Epub Date: 2023-01-19 DOI: 10.1089/big.2021.0333
Guowei Zhang, Weilan Wang, Ce Zhang, Penghai Zhao, Mingkai Zhang

Recognition of handwritten Uchen Tibetan characters input has been considered an efficient way of acquiring mass data in the digital era. However, it still faces considerable challenges due to seriously touching letters and various morphological features of identical characters. Thus, deeper neural networks are required to achieve decent recognition accuracy, making an efficient, lightweight model design important to balance the inevitable trade-off between accuracy and latency. To reduce the learnable parameters of the network as much as possible and maintain acceptable accuracy, we introduce an efficient model named HUTNet based on the internal relationship between floating-point operations per second (FLOPs) and Memory Access Cost. The proposed network achieves a ResNet-18-level accuracy of 96.86%, with only a tenth of the parameters. The subsequent pruning and knowledge distillation strategies were applied to further reduce the inference latency of the model. Experiments on the test set (Handwritten Uchen Tibetan Data set by Wang [HUTDW]) containing 562 classes of 42,068 samples show that the compressed model achieves a 96.83% accuracy while maintaining lower FLOPs and fewer parameters. To verify the effectiveness of HUTNet, we tested it on the Chinese Handwriting Data sets Handwriting Database 1.1 (HWDB1.1), in which HUTNet achieved an accuracy of 97.24%, higher than that of ResNet-18 and ResNet-34. In general, we conduct extensive experiments on resource and accuracy trade-offs and show a stronger performance compared with other famous models on HUTDW and HWDB1.1. It also unlocks the critical bottleneck for handwritten Uchen Tibetan recognition on low-power computing devices.

在数字时代,识别输入的乌陈藏文手写体被认为是获取海量数据的有效途径。然而,由于字母的感人性和相同字符的各种形态特征,它仍然面临着相当大的挑战。因此,需要更深入的神经网络来实现良好的识别精度,这使得高效、轻量级的模型设计对于平衡精度和延迟之间不可避免的权衡非常重要。为了尽可能减少网络的可学习参数并保持可接受的精度,我们引入了一个基于每秒浮点运算(FLOP)和内存访问成本之间的内部关系的高效模型HUTNet。所提出的网络实现了96.86%的ResNet-18级精度,仅具有十分之一的参数。随后的修剪和知识提取策略被应用于进一步减少模型的推理延迟。在包含562类42068个样本的测试集(王[HUTDW]的手写乌陈藏文数据集)上的实验表明,压缩模型在保持较低的FLOP和较少的参数的同时,实现了96.83%的准确率。为了验证HUTNet的有效性,我们在中文手写数据集手写数据库1.1(HWDB1.1)上对其进行了测试,其中HUTNet实现了97.24%的准确率,高于ResNet-18和ResNet-34。总的来说,我们在资源和精度权衡方面进行了广泛的实验,并在HUTDW和HWDB1.1上显示出与其他著名模型相比更强的性能。它还解开了在低功耗计算设备上手写乌琴藏文识别的关键瓶颈。
{"title":"HUTNet: An Efficient Convolutional Neural Network for Handwritten Uchen Tibetan Character Recognition.","authors":"Guowei Zhang,&nbsp;Weilan Wang,&nbsp;Ce Zhang,&nbsp;Penghai Zhao,&nbsp;Mingkai Zhang","doi":"10.1089/big.2021.0333","DOIUrl":"10.1089/big.2021.0333","url":null,"abstract":"<p><p>Recognition of handwritten Uchen Tibetan characters input has been considered an efficient way of acquiring mass data in the digital era. However, it still faces considerable challenges due to seriously touching letters and various morphological features of identical characters. Thus, deeper neural networks are required to achieve decent recognition accuracy, making an efficient, lightweight model design important to balance the inevitable trade-off between accuracy and latency. To reduce the learnable parameters of the network as much as possible and maintain acceptable accuracy, we introduce an efficient model named HUTNet based on the internal relationship between floating-point operations per second (FLOPs) and Memory Access Cost. The proposed network achieves a ResNet-18-level accuracy of 96.86%, with only a tenth of the parameters. The subsequent pruning and knowledge distillation strategies were applied to further reduce the inference latency of the model. Experiments on the test set (Handwritten Uchen Tibetan Data set by Wang [HUTDW]) containing 562 classes of 42,068 samples show that the compressed model achieves a 96.83% accuracy while maintaining lower FLOPs and fewer parameters. To verify the effectiveness of HUTNet, we tested it on the Chinese Handwriting Data sets Handwriting Database 1.1 (HWDB1.1), in which HUTNet achieved an accuracy of 97.24%, higher than that of ResNet-18 and ResNet-34. In general, we conduct extensive experiments on resource and accuracy trade-offs and show a stronger performance compared with other famous models on HUTDW and HWDB1.1. It also unlocks the critical bottleneck for handwritten Uchen Tibetan recognition on low-power computing devices.</p>","PeriodicalId":51314,"journal":{"name":"Big Data","volume":" ","pages":"387-398"},"PeriodicalIF":4.6,"publicationDate":"2023-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10543391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Big Data
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1