The Feasibility of Using Classification and Identification Techniques to Auto-Assess the Quality of Health Information on the Web

Journal of Korean Society of Medical Informatics Pub Date : 2009-09-01 DOI:10.4258/JKSMI.2009.15.3.247

P. Chang, F. Huang, Min-Ling Lai

{"title":"The Feasibility of Using Classification and Identification Techniques to Auto-Assess the Quality of Health Information on the Web","authors":"P. Chang, F. Huang, Min-Ling Lai","doi":"10.4258/JKSMI.2009.15.3.247","DOIUrl":null,"url":null,"abstract":"Objective: An automatic detection tool was created for examining health-related webpage quality we went further by examining its feasibility and performance. Methods: We developed an automatic detection system to auto-assess the authorship quality indicator of an health-related information webpage for governmental websites in Taiwan. The system was integrated with the Chinese word segmentation system developed by the Academia Sinica in Taiwan and the SVM light , which serve as an SVM (Support Vector Machine) Classifiers and a method of information extraction and identification. The system was coded in Visual Basic 6.0, using SQL 2000. Results: We developed the first Chinese automatic webpage classification and information identifier to evaluate the quality of web information. The sensitivity and specificity of the classifier on the training set of webpages were both as high as 100% and only one health webpage in the test set was misclassified, due to the fact that it contained both health and non-health information content. The sensitivity of our authorship identifier is 75.3% ,with a specificity of 87.9%. Conclusion: The technical feasibility of auto-assessment for the quality of health information on the web is acceptable. Although it is not sufficient to assure the total quality of web contents, it is good enough to be used to support the entire quality assurance program. (Journal of Korean Society of Medical Informatics 15-3, 247-254, 2009)","PeriodicalId":255087,"journal":{"name":"Journal of Korean Society of Medical Informatics","volume":"96 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Korean Society of Medical Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4258/JKSMI.2009.15.3.247","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Objective: An automatic detection tool was created for examining health-related webpage quality we went further by examining its feasibility and performance. Methods: We developed an automatic detection system to auto-assess the authorship quality indicator of an health-related information webpage for governmental websites in Taiwan. The system was integrated with the Chinese word segmentation system developed by the Academia Sinica in Taiwan and the SVM light , which serve as an SVM (Support Vector Machine) Classifiers and a method of information extraction and identification. The system was coded in Visual Basic 6.0, using SQL 2000. Results: We developed the first Chinese automatic webpage classification and information identifier to evaluate the quality of web information. The sensitivity and specificity of the classifier on the training set of webpages were both as high as 100% and only one health webpage in the test set was misclassified, due to the fact that it contained both health and non-health information content. The sensitivity of our authorship identifier is 75.3% ,with a specificity of 87.9%. Conclusion: The technical feasibility of auto-assessment for the quality of health information on the web is acceptable. Although it is not sufficient to assure the total quality of web contents, it is good enough to be used to support the entire quality assurance program. (Journal of Korean Society of Medical Informatics 15-3, 247-254, 2009)

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用分类与识别技术自动评估网路健康资讯品质的可行性

目的:建立一种健康网页质量自动检测工具，并对其可行性和性能进行了进一步的研究。方法:开发一套自动检测系统，自动评估台湾政府网站健康资讯网页的作者质量指标。该系统与台湾中央研究院开发的中文分词系统和支持向量机集成，作为支持向量机(SVM)分类器和一种信息提取和识别方法。本系统是用Visual Basic 6.0编写的，使用SQL 2000。结果:开发了首个中文网页自动分类和信息标识符，用于评价网页信息质量。分类器在网页训练集上的灵敏度和特异性均高达100%，测试集中只有一个健康网页被误分类，这是由于该网页同时包含健康和非健康信息内容。作者标识符的敏感性为75.3%，特异性为87.9%。结论:网络健康信息质量自动评估的技术可行性是可以接受的。虽然它不足以保证网络内容的总质量，但它足以用于支持整个质量保证计划。(韩国医学信息学会杂志15- 3,247 -254,2009)

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Korean Society of Medical Informatics

自引率

0.00%

发文量