半监督机器学习与MixMatch和等价类。

Lecture notes-monograph series Pub Date : 2020-01-01 Epub Date: 2020-10-02

Colin B Hansen, Vishwesh Nath, Riqiang Gao, Camilo Bermudez, Yuankai Huo, Kim L Sandler, Pierre P Massion, Jeffrey D Blume, Thomas A Lasko, Bennett A Landman

{"title":"半监督机器学习与MixMatch和等价类。","authors":"Colin B Hansen, Vishwesh Nath, Riqiang Gao, Camilo Bermudez, Yuankai Huo, Kim L Sandler, Pierre P Massion, Jeffrey D Blume, Thomas A Lasko, Bennett A Landman","doi":"","DOIUrl":null,"url":null,"abstract":"Semi-supervised methods have an increasing impact on computer vision tasks to make use of scarce labels on large datasets, yet these approaches have not been well translated to medical imaging. Of particular interest, the MixMatch method achieves significant performance improvement over popular semi-supervised learning methods with scarce labels in the CIFAR-10 dataset. In a complementary approach, Nullspace Tuning on equivalence classes offers the potential to leverage multiple subject scans when the ground truth for the subject is unknown. This work is the first to (1) explore MixMatch with Nullspace Tuning in the context of medical imaging and (2) characterize the impacts of the methods with diminishing labels. We consider two distinct medical imaging domains: skin lesion diagnosis and lung cancer prediction. In both cases we evaluate models trained with diminishing labeled data using supervised, MixMatch, and Nullspace Tuning methods as well as MixMatch with Nullspace Tuning together. MixMatch with Nullspace Tuning together is able to achieve an AUC of 0.755 in lung cancer diagnosis with only 200 labeled subjects on the National Lung Screening Trial and a balanced multi-class accuracy of 77% with only 779 labeled examples on HAM10000. This performance is similar to that of the fully supervised methods when all labels are available. In advancing data driven methods in medical imaging, it is important to consider the use of current state-of-the-art semi-supervised learning methods from the greater machine learning community and their impact on the limitations of data acquisition and annotation.","PeriodicalId":93329,"journal":{"name":"Lecture notes-monograph series","volume":"12446 ","pages":"112-121"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8388309/pdf/nihms-1647070.pdf","citationCount":"0","resultStr":"{\"title\":\"Semi-supervised Machine Learning with MixMatch and Equivalence Classes.\",\"authors\":\"Colin B Hansen, Vishwesh Nath, Riqiang Gao, Camilo Bermudez, Yuankai Huo, Kim L Sandler, Pierre P Massion, Jeffrey D Blume, Thomas A Lasko, Bennett A Landman\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Semi-supervised methods have an increasing impact on computer vision tasks to make use of scarce labels on large datasets, yet these approaches have not been well translated to medical imaging. Of particular interest, the MixMatch method achieves significant performance improvement over popular semi-supervised learning methods with scarce labels in the CIFAR-10 dataset. In a complementary approach, Nullspace Tuning on equivalence classes offers the potential to leverage multiple subject scans when the ground truth for the subject is unknown. This work is the first to (1) explore MixMatch with Nullspace Tuning in the context of medical imaging and (2) characterize the impacts of the methods with diminishing labels. We consider two distinct medical imaging domains: skin lesion diagnosis and lung cancer prediction. In both cases we evaluate models trained with diminishing labeled data using supervised, MixMatch, and Nullspace Tuning methods as well as MixMatch with Nullspace Tuning together. MixMatch with Nullspace Tuning together is able to achieve an AUC of 0.755 in lung cancer diagnosis with only 200 labeled subjects on the National Lung Screening Trial and a balanced multi-class accuracy of 77% with only 779 labeled examples on HAM10000. This performance is similar to that of the fully supervised methods when all labels are available. In advancing data driven methods in medical imaging, it is important to consider the use of current state-of-the-art semi-supervised learning methods from the greater machine learning community and their impact on the limitations of data acquisition and annotation.\",\"PeriodicalId\":93329,\"journal\":{\"name\":\"Lecture notes-monograph series\",\"volume\":\"12446 \",\"pages\":\"112-121\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8388309/pdf/nihms-1647070.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Lecture notes-monograph series\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2020/10/2 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lecture notes-monograph series","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/10/2 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

半监督方法对计算机视觉任务的影响越来越大，以利用大型数据集上的稀缺标签，但这些方法尚未很好地转化为医学成像。特别有趣的是，MixMatch方法比流行的CIFAR-10数据集中具有稀缺标签的半监督学习方法实现了显着的性能改进。在一种补充方法中，对等价类的Nullspace调优提供了在主题的基本真相未知时利用多个主题扫描的潜力。这项工作是第一个(1)探索混合匹配与零空间调谐在医学成像的背景下，(2)表征与递减标签的方法的影响。我们考虑两个不同的医学成像领域:皮肤病变诊断和肺癌预测。在这两种情况下，我们使用监督、MixMatch和Nullspace调优方法以及混合匹配和Nullspace调优一起评估用递减标记数据训练的模型。MixMatch和Nullspace Tuning结合在一起，在国家肺筛查试验中，只有200个标记的受试者，肺癌诊断的AUC为0.755;在HAM10000上，只有779个标记的样本，平衡的多类准确率为77%。这种性能类似于所有标签可用时的完全监督方法。在医学成像中推进数据驱动方法的过程中，重要的是要考虑使用来自更大的机器学习社区的当前最先进的半监督学习方法，以及它们对数据采集和注释局限性的影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Semi-supervised Machine Learning with MixMatch and Equivalence Classes.

Semi-supervised methods have an increasing impact on computer vision tasks to make use of scarce labels on large datasets, yet these approaches have not been well translated to medical imaging. Of particular interest, the MixMatch method achieves significant performance improvement over popular semi-supervised learning methods with scarce labels in the CIFAR-10 dataset. In a complementary approach, Nullspace Tuning on equivalence classes offers the potential to leverage multiple subject scans when the ground truth for the subject is unknown. This work is the first to (1) explore MixMatch with Nullspace Tuning in the context of medical imaging and (2) characterize the impacts of the methods with diminishing labels. We consider two distinct medical imaging domains: skin lesion diagnosis and lung cancer prediction. In both cases we evaluate models trained with diminishing labeled data using supervised, MixMatch, and Nullspace Tuning methods as well as MixMatch with Nullspace Tuning together. MixMatch with Nullspace Tuning together is able to achieve an AUC of 0.755 in lung cancer diagnosis with only 200 labeled subjects on the National Lung Screening Trial and a balanced multi-class accuracy of 77% with only 779 labeled examples on HAM10000. This performance is similar to that of the fully supervised methods when all labels are available. In advancing data driven methods in medical imaging, it is important to consider the use of current state-of-the-art semi-supervised learning methods from the greater machine learning community and their impact on the limitations of data acquisition and annotation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Lecture notes-monograph series

自引率

0.00%

发文量