Filtering for medical news items using a machine learning approach.

Proceedings. AMIA Symposium. Pub Date: 2002-01-01

Wanhong Zheng, Evangelos Milios, Carolyn Watters


Citations: 0

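Abstract

We address the problem of filtering medical news articles for targeted audiences. The approach is based on terms, and one of the difficulties is extracting a feature set appropriate for the domain. This paper addresses the medical news-filtering problem using a machine learning approach. We describe the application of two supervised machine learning techniques, Decision Trees and Naïve Bayes, to automatically construct classifiers on the basis of a training set, in which news articles have been pre-classified by a medical expert and four other human readers. The goal is to classify the news articles into three groups: non-medical, medical intended for experts, and medical intended for other readers. While the general accuracy of the machine learning approach is around 78%, the accuracy of distinguishing non-medical articles from medical ones is shown to be 92%.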
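As a rough illustration of the term-based, three-class setup the abstract describes, the sketch below trains a multinomial Naïve Bayes classifier with add-one smoothing on a handful of labeled texts. It is not the authors' implementation: the abstract gives no details of their feature extraction or training data, so the tokenizer, the label names (`non-medical`, `medical-expert`, `medical-general`), and the six toy training sentences are all invented for illustration, and the sketch makes no attempt to reproduce the reported accuracy figures.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphabetic terms."""
    return re.findall(r"[a-z]+", text.lower())


class NaiveBayesTextClassifier:
    """Multinomial Naive Bayes over term counts, with add-one smoothing."""

    def fit(self, documents, labels):
        self.class_doc_counts = Counter(labels)   # documents per class
        self.term_counts = defaultdict(Counter)   # term frequencies per class
        self.vocabulary = set()
        for text, label in zip(documents, labels):
            terms = tokenize(text)
            self.term_counts[label].update(terms)
            self.vocabulary.update(terms)
        self.total_docs = len(labels)
        return self

    def log_scores(self, text):
        """Return log P(class) + sum of log P(term | class) for each class."""
        terms = tokenize(text)
        vocab_size = len(self.vocabulary)
        scores = {}
        for label, doc_count in self.class_doc_counts.items():
            total_terms = sum(self.term_counts[label].values())
            score = math.log(doc_count / self.total_docs)
            for term in terms:
                if term not in self.vocabulary:
                    continue  # ignore terms never seen during training
                count = self.term_counts[label][term]
                score += math.log((count + 1) / (total_terms + vocab_size))
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


# Invented toy data; the label names are hypothetical stand-ins for the
# three groups named in the abstract.
TRAINING_DATA = [
    ("The football team won the championship game last night", "non-medical"),
    ("Stock markets fell as investors worried about interest rates", "non-medical"),
    ("Randomized trial shows the inhibitor reduced myocardial infarction mortality", "medical-expert"),
    ("Pharmacokinetics and dosage of the inhibitor in renal impairment patients", "medical-expert"),
    ("Eating more vegetables can help keep your heart healthy", "medical-general"),
    ("Doctors say regular exercise helps you sleep better and stay healthy", "medical-general"),
]

texts, labels = zip(*TRAINING_DATA)
classifier = NaiveBayesTextClassifier().fit(texts, labels)

for headline in [
    "The team lost the game",
    "Inhibitor dosage in patients with renal impairment",
    "Exercise and vegetables keep you healthy",
]:
    print(headline, "->", classifier.predict(headline))
```

A two-stage variant (first medical versus non-medical, then expert versus general readers) could reuse the same class with different label sets; whether the authors structured their classifiers that way is not stated in the abstract.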


