The Feasibility of Using Classification and Identification Techniques to Auto-Assess the Quality of Health Information on the Web

P. Chang, F. Huang, Min-Ling Lai
Journal of Korean Society of Medical Informatics 15(3), 247-254, 2009. Published 2009-09-01. DOI: 10.4258/JKSMI.2009.15.3.247 (https://doi.org/10.4258/JKSMI.2009.15.3.247)
Citations: 3

Abstract

Objective: An automatic detection tool was created for examining the quality of health-related webpages; we went further by examining its feasibility and performance. Methods: We developed an automatic detection system to auto-assess the authorship quality indicator of health-related information webpages on governmental websites in Taiwan. The system integrates the Chinese word segmentation system developed by Academia Sinica in Taiwan with SVMlight; together they serve as an SVM (Support Vector Machine) classifier and as a method of information extraction and identification. The system was coded in Visual Basic 6.0, using SQL 2000. Results: We developed the first Chinese automatic webpage classifier and information identifier for evaluating the quality of web information. The sensitivity and specificity of the classifier on the training set of webpages were both 100%, and only one health webpage in the test set was misclassified, because it contained both health and non-health content. The authorship identifier has a sensitivity of 75.3% and a specificity of 87.9%. Conclusion: The technical feasibility of auto-assessing the quality of health information on the web is acceptable. Although it is not sufficient to assure the total quality of web content, it is good enough to support the entire quality assurance program. (Journal of Korean Society of Medical Informatics 15-3, 247-254, 2009)
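For readers unfamiliar with the two reported metrics, the following is a minimal sketch of how sensitivity and specificity are computed for a binary classifier or identifier. It is not the authors' code (their system was written in Visual Basic 6.0); the function name `sensitivity_specificity` and the label lists are hypothetical and chosen only for illustration, and the numbers are not data from the study.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels.

    Labels are 1 (positive, e.g. "authorship present") and 0 (negative).
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative example")
    sensitivity = tp / (tp + fn)  # share of real positives that were detected
    specificity = tn / (tn + fp)  # share of real negatives that were rejected
    return sensitivity, specificity


if __name__ == "__main__":
    # Hypothetical labels, not taken from the paper.
    y_true = [1, 1, 1, 1, 0, 0, 0, 0]
    y_pred = [1, 1, 1, 0, 0, 0, 0, 1]
    sens, spec = sensitivity_specificity(y_true, y_pred)
    print(f"sensitivity={sens:.3f} specificity={spec:.3f}")
```

Under these definitions, a sensitivity of 75.3% means that about three in four webpages that actually carry the indicator are detected, while a specificity of 87.9% means that about seven in eight webpages without it are correctly rejected.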