半监督机器学习与MixMatch和等价类。

Lecture notes-monograph series Pub Date : 2020-01-01 Epub Date: 2020-10-02
Colin B Hansen, Vishwesh Nath, Riqiang Gao, Camilo Bermudez, Yuankai Huo, Kim L Sandler, Pierre P Massion, Jeffrey D Blume, Thomas A Lasko, Bennett A Landman
{"title":"半监督机器学习与MixMatch和等价类。","authors":"Colin B Hansen,&nbsp;Vishwesh Nath,&nbsp;Riqiang Gao,&nbsp;Camilo Bermudez,&nbsp;Yuankai Huo,&nbsp;Kim L Sandler,&nbsp;Pierre P Massion,&nbsp;Jeffrey D Blume,&nbsp;Thomas A Lasko,&nbsp;Bennett A Landman","doi":"","DOIUrl":null,"url":null,"abstract":"<p><p>Semi-supervised methods have an increasing impact on computer vision tasks to make use of scarce labels on large datasets, yet these approaches have not been well translated to medical imaging. Of particular interest, the MixMatch method achieves significant performance improvement over popular semi-supervised learning methods with scarce labels in the CIFAR-10 dataset. In a complementary approach, Nullspace Tuning on equivalence classes offers the potential to leverage multiple subject scans when the ground truth for the subject is unknown. This work is the first to (1) explore MixMatch with Nullspace Tuning in the context of medical imaging and (2) characterize the impacts of the methods with diminishing labels. We consider two distinct medical imaging domains: skin lesion diagnosis and lung cancer prediction. In both cases we evaluate models trained with diminishing labeled data using supervised, MixMatch, and Nullspace Tuning methods as well as MixMatch with Nullspace Tuning together. MixMatch with Nullspace Tuning together is able to achieve an AUC of 0.755 in lung cancer diagnosis with only 200 labeled subjects on the National Lung Screening Trial and a balanced multi-class accuracy of 77% with only 779 labeled examples on HAM10000. This performance is similar to that of the fully supervised methods when all labels are available. In advancing data driven methods in medical imaging, it is important to consider the use of current state-of-the-art semi-supervised learning methods from the greater machine learning community and their impact on the limitations of data acquisition and annotation.</p>","PeriodicalId":93329,"journal":{"name":"Lecture notes-monograph series","volume":"12446 ","pages":"112-121"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8388309/pdf/nihms-1647070.pdf","citationCount":"0","resultStr":"{\"title\":\"Semi-supervised Machine Learning with MixMatch and Equivalence Classes.\",\"authors\":\"Colin B Hansen,&nbsp;Vishwesh Nath,&nbsp;Riqiang Gao,&nbsp;Camilo Bermudez,&nbsp;Yuankai Huo,&nbsp;Kim L Sandler,&nbsp;Pierre P Massion,&nbsp;Jeffrey D Blume,&nbsp;Thomas A Lasko,&nbsp;Bennett A Landman\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Semi-supervised methods have an increasing impact on computer vision tasks to make use of scarce labels on large datasets, yet these approaches have not been well translated to medical imaging. Of particular interest, the MixMatch method achieves significant performance improvement over popular semi-supervised learning methods with scarce labels in the CIFAR-10 dataset. In a complementary approach, Nullspace Tuning on equivalence classes offers the potential to leverage multiple subject scans when the ground truth for the subject is unknown. This work is the first to (1) explore MixMatch with Nullspace Tuning in the context of medical imaging and (2) characterize the impacts of the methods with diminishing labels. We consider two distinct medical imaging domains: skin lesion diagnosis and lung cancer prediction. In both cases we evaluate models trained with diminishing labeled data using supervised, MixMatch, and Nullspace Tuning methods as well as MixMatch with Nullspace Tuning together. MixMatch with Nullspace Tuning together is able to achieve an AUC of 0.755 in lung cancer diagnosis with only 200 labeled subjects on the National Lung Screening Trial and a balanced multi-class accuracy of 77% with only 779 labeled examples on HAM10000. This performance is similar to that of the fully supervised methods when all labels are available. In advancing data driven methods in medical imaging, it is important to consider the use of current state-of-the-art semi-supervised learning methods from the greater machine learning community and their impact on the limitations of data acquisition and annotation.</p>\",\"PeriodicalId\":93329,\"journal\":{\"name\":\"Lecture notes-monograph series\",\"volume\":\"12446 \",\"pages\":\"112-121\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8388309/pdf/nihms-1647070.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Lecture notes-monograph series\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2020/10/2 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lecture notes-monograph series","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/10/2 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

半监督方法对计算机视觉任务的影响越来越大,以利用大型数据集上的稀缺标签,但这些方法尚未很好地转化为医学成像。特别有趣的是,MixMatch方法比流行的CIFAR-10数据集中具有稀缺标签的半监督学习方法实现了显着的性能改进。在一种补充方法中,对等价类的Nullspace调优提供了在主题的基本真相未知时利用多个主题扫描的潜力。这项工作是第一个(1)探索混合匹配与零空间调谐在医学成像的背景下,(2)表征与递减标签的方法的影响。我们考虑两个不同的医学成像领域:皮肤病变诊断和肺癌预测。在这两种情况下,我们使用监督、MixMatch和Nullspace调优方法以及混合匹配和Nullspace调优一起评估用递减标记数据训练的模型。MixMatch和Nullspace Tuning结合在一起,在国家肺筛查试验中,只有200个标记的受试者,肺癌诊断的AUC为0.755;在HAM10000上,只有779个标记的样本,平衡的多类准确率为77%。这种性能类似于所有标签可用时的完全监督方法。在医学成像中推进数据驱动方法的过程中,重要的是要考虑使用来自更大的机器学习社区的当前最先进的半监督学习方法,以及它们对数据采集和注释局限性的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Semi-supervised Machine Learning with MixMatch and Equivalence Classes.

Semi-supervised methods have an increasing impact on computer vision tasks to make use of scarce labels on large datasets, yet these approaches have not been well translated to medical imaging. Of particular interest, the MixMatch method achieves significant performance improvement over popular semi-supervised learning methods with scarce labels in the CIFAR-10 dataset. In a complementary approach, Nullspace Tuning on equivalence classes offers the potential to leverage multiple subject scans when the ground truth for the subject is unknown. This work is the first to (1) explore MixMatch with Nullspace Tuning in the context of medical imaging and (2) characterize the impacts of the methods with diminishing labels. We consider two distinct medical imaging domains: skin lesion diagnosis and lung cancer prediction. In both cases we evaluate models trained with diminishing labeled data using supervised, MixMatch, and Nullspace Tuning methods as well as MixMatch with Nullspace Tuning together. MixMatch with Nullspace Tuning together is able to achieve an AUC of 0.755 in lung cancer diagnosis with only 200 labeled subjects on the National Lung Screening Trial and a balanced multi-class accuracy of 77% with only 779 labeled examples on HAM10000. This performance is similar to that of the fully supervised methods when all labels are available. In advancing data driven methods in medical imaging, it is important to consider the use of current state-of-the-art semi-supervised learning methods from the greater machine learning community and their impact on the limitations of data acquisition and annotation.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Semi-supervised Machine Learning with MixMatch and Equivalence Classes Federated Gradient Averaging for Multi-Site Training with Momentum-Based Optimizers. Semi-supervised Machine Learning with MixMatch and Equivalence Classes. Distanced LSTM: Time-Distanced Gates in Long Short-Term Memory Models for Lung Cancer Detection A Modified Kendall Rank-Order Association Test For Evaluating The Repeatability Of Two Studies With A Large Number Of Objects.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1