汇总每位患者的多项生物学测量

V. Zubek, F. Khan
{"title":"汇总每位患者的多项生物学测量","authors":"V. Zubek, F. Khan","doi":"10.1109/ICMLA.2010.120","DOIUrl":null,"url":null,"abstract":"Many machine learning algorithms require a single value per feature per record for modeling. However, there are applications, in the medical domain particularly, where a single record may have multiple observations for the same feature. For example, a patient could have the same gene analyzed in multiple tissue slides of a biopsy, or could have the same genetic test performed on multiple subsequent biopsies. The challenge in these applications is how to integrate multiple observations of the same predictor feature per record. In this paper, two data aggregation methods are compared, one method is a simple median aggregation of feature values, while the other is a novel method which constructs intervals of values for each feature. The aggregated features are passed as input to a novel support vector regression method for modeling survival data in a prostate cancer setting. The performance of both methods was similar in predicting prostate cancer progression on three data cohorts.","PeriodicalId":336514,"journal":{"name":"2010 Ninth International Conference on Machine Learning and Applications","volume":"230 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Aggregating Multiple Biological Measurements Per Patient\",\"authors\":\"V. Zubek, F. Khan\",\"doi\":\"10.1109/ICMLA.2010.120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many machine learning algorithms require a single value per feature per record for modeling. However, there are applications, in the medical domain particularly, where a single record may have multiple observations for the same feature. For example, a patient could have the same gene analyzed in multiple tissue slides of a biopsy, or could have the same genetic test performed on multiple subsequent biopsies. The challenge in these applications is how to integrate multiple observations of the same predictor feature per record. In this paper, two data aggregation methods are compared, one method is a simple median aggregation of feature values, while the other is a novel method which constructs intervals of values for each feature. The aggregated features are passed as input to a novel support vector regression method for modeling survival data in a prostate cancer setting. The performance of both methods was similar in predicting prostate cancer progression on three data cohorts.\",\"PeriodicalId\":336514,\"journal\":{\"name\":\"2010 Ninth International Conference on Machine Learning and Applications\",\"volume\":\"230 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2010 Ninth International Conference on Machine Learning and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA.2010.120\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 Ninth International Conference on Machine Learning and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2010.120","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

许多机器学习算法需要每个特征每个记录的单个值进行建模。然而,在某些应用中,特别是在医学领域,单个记录可能对同一特征有多个观察结果。例如,患者可以在活检的多个组织切片中分析相同的基因,或者可以在随后的多个活检中进行相同的基因检测。这些应用程序中的挑战是如何将每个记录的相同预测器特征的多个观察结果集成在一起。本文比较了两种数据聚合方法,一种是简单的特征值中值聚合,另一种是为每个特征构造值区间的新方法。将聚合的特征作为输入传递给一种新的支持向量回归方法,用于在前列腺癌设置中建模生存数据。在三个数据队列中,两种方法在预测前列腺癌进展方面的表现相似。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Aggregating Multiple Biological Measurements Per Patient
Many machine learning algorithms require a single value per feature per record for modeling. However, there are applications, in the medical domain particularly, where a single record may have multiple observations for the same feature. For example, a patient could have the same gene analyzed in multiple tissue slides of a biopsy, or could have the same genetic test performed on multiple subsequent biopsies. The challenge in these applications is how to integrate multiple observations of the same predictor feature per record. In this paper, two data aggregation methods are compared, one method is a simple median aggregation of feature values, while the other is a novel method which constructs intervals of values for each feature. The aggregated features are passed as input to a novel support vector regression method for modeling survival data in a prostate cancer setting. The performance of both methods was similar in predicting prostate cancer progression on three data cohorts.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Comparative Analysis of DNA Microarray Data through the Use of Feature Selection Techniques Learning from Multiple Related Data Streams with Asynchronous Flowing Speeds Bayesian Inferences and Forecasting in Spatial Time Series Models A Framework for Comprehensive Electronic QA in Radiation Therapy Model-Based Co-clustering for Continuous Data
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1