DART: Distance Assisted Recursive Testing.

IF 5.2 3区计算机科学 Q1 AUTOMATION & CONTROL SYSTEMS Journal of Machine Learning Research Pub Date : 2023-01-01

Xuechan Li, Anthony D Sung, Jichun Xie

引用次数: 0

Abstract

Multiple testing is a commonly used tool in modern data science. Sometimes, the hypotheses are embedded in a space; the distances between the hypotheses reflect their co-null/co-alternative patterns. Properly incorporating the distance information in testing will boost testing power. Hence, we developed a new multiple testing framework named Distance Assisted Recursive Testing (DART). DART features in joint artificial intelligence (AI) and statistics modeling. It has two stages. The first stage uses AI models to construct an aggregation tree that reflects the distance information. The second stage uses statistical models to embed the testing on the tree and control the false discovery rate. Theoretical analysis and numerical experiments demonstrated that DART generates valid, robust, and powerful results. We applied DART to a clinical trial in the allogeneic stem cell transplantation study to identify the gut microbiota whose abundance was impacted by post-transplant care.

微信好友朋友圈 QQ好友复制链接

本刊更多论文

DART：距离辅助递归测试。

多重测试是现代数据科学常用的工具。有时，假设被嵌入一个空间；假设之间的距离反映了它们的共空/共变模式。在测试中适当纳入距离信息将提高测试能力。因此，我们开发了一种新的多重测试框架，名为 "距离辅助递归测试（DART）"。DART 的特点是联合人工智能（AI）和统计建模。它分为两个阶段。第一阶段使用人工智能模型构建反映距离信息的聚合树。第二阶段使用统计模型对聚合树进行嵌入测试并控制误发现率。理论分析和数值实验证明，DART 能生成有效、稳健和强大的结果。我们将 DART 应用于异体干细胞移植研究中的一项临床试验，以确定其丰度受移植后护理影响的肠道微生物群。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Machine Learning Research 工程技术-计算机：人工智能

CiteScore

18.80

自引率

0.00%

发文量

审稿时长

3 months

期刊介绍： The Journal of Machine Learning Research (JMLR) provides an international forum for the electronic and paper publication of high-quality scholarly articles in all areas of machine learning. All published papers are freely available online. JMLR has a commitment to rigorous yet rapid reviewing. JMLR seeks previously unpublished papers on machine learning that contain: new principled algorithms with sound empirical validation, and with justification of theoretical, psychological, or biological nature; experimental and/or theoretical studies yielding new insight into the design and behavior of learning in intelligent systems; accounts of applications of existing techniques that shed light on the strengths and weaknesses of the methods; formalization of new learning tasks (e.g., in the context of new applications) and of methods for assessing performance on those tasks; development of new analytical frameworks that advance theoretical studies of practical learning methods; computational models of data from natural learning systems at the behavioral or neural level; or extremely well-written surveys of existing work.