KDD : proceedings. International Conference on Knowledge Discovery & Data Mining最新文献_第2页

MolSearch: Search-based Multi-objective Molecular Generation and Property Optimization. MolSearch:基于搜索的多目标分子生成和性能优化。

KDD : proceedings. International Conference on Knowledge Discovery & Data Mining

Pub Date : 2022-08-01 DOI: 10.1145/3534678.3542676

Mengying Sun, Huijun Wang, Jing Xing, Bin Chen, Han Meng, Jiayu Zhou

Leveraging computational methods to generate small molecules with desired properties has been an active research area in the drug discovery field. Towards real-world applications, however, efficient generation of molecules that satisfy multiple property requirements simultaneously remains a key challenge. In this paper, we tackle this challenge using a search-based approach and propose a simple yet effective framework called MolSearch for multi-objective molecular generation (optimization). We show that given proper design and sufficient information, search-based methods can achieve performance comparable or even better than deep learning methods while being computationally efficient. Such efficiency enables massive exploration of chemical space given constrained computational resources. In particular, MolSearch starts with existing molecules and uses a two-stage search strategy to gradually modify them into new ones, based on transformation rules derived systematically and exhaustively from large compound libraries. We evaluate MolSearch in multiple benchmark generation settings and demonstrate its effectiveness and efficiency.

利用计算方法生成具有所需性质的小分子一直是药物发现领域的一个活跃研究领域。然而，在现实应用中，同时满足多种特性要求的分子的高效生成仍然是一个关键挑战。在本文中，我们使用基于搜索的方法来解决这一挑战，并提出了一个简单而有效的框架，称为MolSearch，用于多目标分子生成(优化)。我们表明，给定适当的设计和足够的信息，基于搜索的方法可以达到与深度学习方法相当甚至更好的性能，同时具有计算效率。这样的效率使得在有限的计算资源下对化学空间进行大规模的探索成为可能。特别的是，MolSearch从现有分子开始，使用两阶段的搜索策略，根据从大型化合物库中系统而详尽地推导出的转换规则，逐渐将它们修改成新的分子。我们在多个基准生成设置中评估了MolSearch，并证明了它的有效性和效率。

{"title":"MolSearch: Search-based Multi-objective Molecular Generation and Property Optimization.","authors":"Mengying Sun, Huijun Wang, Jing Xing, Bin Chen, Han Meng, Jiayu Zhou","doi":"10.1145/3534678.3542676","DOIUrl":"https://doi.org/10.1145/3534678.3542676","url":null,"abstract":"Leveraging computational methods to generate small molecules with desired properties has been an active research area in the drug discovery field. Towards real-world applications, however, efficient generation of molecules that satisfy multiple property requirements simultaneously remains a key challenge. In this paper, we tackle this challenge using a search-based approach and propose a simple yet effective framework called MolSearch for multi-objective molecular generation (optimization). We show that given proper design and sufficient information, search-based methods can achieve performance comparable or even better than deep learning methods while being computationally efficient. Such efficiency enables massive exploration of chemical space given constrained computational resources. In particular, MolSearch starts with existing molecules and uses a two-stage search strategy to gradually modify them into new ones, based on transformation rules derived systematically and exhaustively from large compound libraries. We evaluate MolSearch in multiple benchmark generation settings and demonstrate its effectiveness and efficiency.","PeriodicalId":74037,"journal":{"name":"KDD : proceedings. International Conference on Knowledge Discovery & Data Mining","volume":"2022 ","pages":"4724-4732"},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10097503/pdf/nihms-1888099.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9580527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Deconfounding Actor-Critic Network with Policy Adaptation for Dynamic Treatment Regimes. 基于动态治疗机制的政策适应解构行为者-批评者网络。

KDD : proceedings. International Conference on Knowledge Discovery & Data Mining

Pub Date : 2022-08-01 Epub Date: 2022-08-13 DOI: 10.1145/3534678.3539413

Changchang Yin, Ruoqi Liu, Jeffrey Caterino, Ping Zhang

Despite intense efforts in basic and clinical research, an individualized ventilation strategy for critically ill patients remains a major challenge. Recently, dynamic treatment regime (DTR) with reinforcement learning (RL) on electronic health records (EHR) has attracted interest from both the healthcare industry and machine learning research community. However, most learned DTR policies might be biased due to the existence of confounders. Although some treatment actions non-survivors received may be helpful, if confounders cause the mortality, the training of RL models guided by long-term outcomes (e.g., 90-day mortality) would punish those treatment actions causing the learned DTR policies to be suboptimal. In this study, we develop a new deconfounding actor-critic network (DAC) to learn optimal DTR policies for patients. To alleviate confounding issues, we incorporate a patient resampling module and a confounding balance module into our actor-critic framework. To avoid punishing the effective treatment actions non-survivors received, we design a short-term reward to capture patients' immediate health state changes. Combining short-term with long-term rewards could further improve the model performance. Moreover, we introduce a policy adaptation method to successfully transfer the learned model to new-source small-scale datasets. The experimental results on one semi-synthetic and two different real-world datasets show the proposed model outperforms the state-of-the-art models. The proposed model provides individualized treatment decisions for mechanical ventilation that could improve patient outcomes.

尽管在基础和临床研究方面做出了巨大努力，但危重患者的个性化通气策略仍然是一项重大挑战。最近，基于电子健康记录(EHR)的动态治疗方案(DTR)与强化学习(RL)引起了医疗保健行业和机器学习研究界的兴趣。然而，由于混杂因素的存在，大多数学习到的DTR策略可能存在偏差。虽然一些非幸存者接受的治疗措施可能是有帮助的，但如果混杂因素导致死亡率，以长期结果(例如，90天死亡率)为指导的RL模型的训练将惩罚那些导致学习到的DTR政策不理想的治疗措施。在这项研究中，我们开发了一个新的反建立行为者批评网络(DAC)来学习患者的最佳DTR策略。为了减轻混淆问题，我们将患者重新采样模块和混淆平衡模块合并到我们的参与者-批评框架中。为了避免惩罚非幸存者接受的有效治疗行动，我们设计了一个短期奖励来捕捉患者的即时健康状态变化。将短期奖励与长期奖励相结合，可以进一步提高模型的性能。此外，我们还引入了一种策略自适应方法，将学习到的模型成功地转移到新的源小尺度数据集上。在一个半合成数据集和两个不同的真实世界数据集上的实验结果表明，所提出的模型优于最先进的模型。提出的模型为机械通气提供了个性化的治疗决策，可以改善患者的预后。

{"title":"Deconfounding Actor-Critic Network with Policy Adaptation for Dynamic Treatment Regimes.","authors":"Changchang Yin, Ruoqi Liu, Jeffrey Caterino, Ping Zhang","doi":"10.1145/3534678.3539413","DOIUrl":"https://doi.org/10.1145/3534678.3539413","url":null,"abstract":"Despite intense efforts in basic and clinical research, an individualized ventilation strategy for critically ill patients remains a major challenge. Recently, dynamic treatment regime (DTR) with reinforcement learning (RL) on electronic health records (EHR) has attracted interest from both the healthcare industry and machine learning research community. However, most learned DTR policies might be biased due to the existence of confounders. Although some treatment actions non-survivors received may be helpful, if confounders cause the mortality, the training of RL models guided by long-term outcomes (e.g., 90-day mortality) would punish those treatment actions causing the learned DTR policies to be suboptimal. In this study, we develop a new deconfounding actor-critic network (DAC) to learn optimal DTR policies for patients. To alleviate confounding issues, we incorporate a patient resampling module and a confounding balance module into our actor-critic framework. To avoid punishing the effective treatment actions non-survivors received, we design a short-term reward to capture patients' immediate health state changes. Combining short-term with long-term rewards could further improve the model performance. Moreover, we introduce a policy adaptation method to successfully transfer the learned model to new-source small-scale datasets. The experimental results on one semi-synthetic and two different real-world datasets show the proposed model outperforms the state-of-the-art models. The proposed model provides individualized treatment decisions for mechanical ventilation that could improve patient outcomes.","PeriodicalId":74037,"journal":{"name":"KDD : proceedings. International Conference on Knowledge Discovery & Data Mining","volume":" ","pages":"2316-2326"},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9466407/pdf/nihms-1830314.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40354004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Predicting Age-Related Macular Degeneration Progression with Contrastive Attention and Time-Aware LSTM. 对比注意和时间感知LSTM预测年龄相关性黄斑变性进展。

KDD : proceedings. International Conference on Knowledge Discovery & Data Mining

Pub Date : 2022-08-01 DOI: 10.1145/3534678.3539163

Changchang Yin, Sayoko E Moroi, Ping Zhang

Age-related macular degeneration (AMD) is the leading cause of irreversible blindness in developed countries. Identifying patients at high risk of progression to late AMD, the sight-threatening stage, is critical for clinical actions, including medical interventions and timely monitoring. Recently, deep-learning-based models have been developed and achieved superior performance for late AMD prediction. However, most existing methods are limited to the color fundus photography (CFP) from the last ophthalmic visit and do not include the longitudinal CFP history and AMD progression during the previous years' visits. Patients in different AMD subphenotypes might have various speeds of progression in different stages of AMD disease. Capturing the progression information during the previous years' visits might be useful for the prediction of AMD progression. In this work, we propose a Contrastive-Attention-based Time-aware Long Short-Term Memory network (CAT-LSTM) to predict AMD progression. First, we adopt a convolutional neural network (CNN) model with a contrastive attention module (CA) to extract abnormal features from CFPs. Then we utilize a time-aware LSTM (T-LSTM) to model the patients' history and consider the AMD progression information. The combination of disease progression, genotype information, demographics, and CFP features are sent to T-LSTM. Moreover, we leverage an auto-encoder to represent temporal CFP sequences as fixed-size vectors and adopt k-means to cluster them into subphenotypes. We evaluate the proposed model based on real-world datasets, and the results show that the proposed model could achieve 0.925 on area under the receiver operating characteristic (AUROC) for 5-year late-AMD prediction and outperforms the state-of-the-art methods by more than 3%, which demonstrates the effectiveness of the proposed CAT-LSTM. After analyzing patient representation learned by an auto-encoder, we identify 3 novel subphenotypes of AMD patients with different characteristics and progression rates to late AMD, paving the way for improved personalization of AMD management. The code of CAT-LSTM can be found at GitHub.

在发达国家，年龄相关性黄斑变性(AMD)是导致不可逆失明的主要原因。识别进展到晚期AMD(视力威胁阶段)的高风险患者对临床行动至关重要，包括医疗干预和及时监测。近年来，基于深度学习的模型得到了发展，并在AMD后期预测方面取得了优异的成绩。然而，大多数现有方法仅限于最后一次眼科就诊时的彩色眼底摄影(CFP)，而不包括前几年就诊时的CFP纵向病史和AMD进展。不同AMD亚表型的患者在AMD疾病的不同阶段可能有不同的进展速度。在前几年的访问中获取进展信息可能有助于预测AMD的进展。在这项工作中，我们提出了一个基于对比注意的时间感知长短期记忆网络(CAT-LSTM)来预测AMD的进展。首先，我们采用卷积神经网络(CNN)模型和对比注意模块(CA)从cfp中提取异常特征。然后，我们利用时间感知LSTM (T-LSTM)来建模患者的病史并考虑AMD的进展信息。将疾病进展、基因型信息、人口统计学和CFP特征的组合发送到T-LSTM。此外，我们利用自编码器将时序CFP序列表示为固定大小的向量，并采用k-means将它们聚类到亚表型中。基于实际数据集对该模型进行了评估，结果表明，该模型在接收机工作特性(AUROC)下的5年晚期amd预测面积达到0.925，优于现有方法3%以上，证明了所提CAT-LSTM的有效性。在分析了通过自动编码器学习到的患者表征后，我们确定了AMD患者的3种新的亚表型，它们具有不同的特征和进展到晚期AMD的速度，为改进AMD的个性化管理铺平了道路。CAT-LSTM的代码可以在GitHub上找到。

{"title":"Predicting Age-Related Macular Degeneration Progression with Contrastive Attention and Time-Aware LSTM.","authors":"Changchang Yin, Sayoko E Moroi, Ping Zhang","doi":"10.1145/3534678.3539163","DOIUrl":"https://doi.org/10.1145/3534678.3539163","url":null,"abstract":"Age-related macular degeneration (AMD) is the leading cause of irreversible blindness in developed countries. Identifying patients at high risk of progression to late AMD, the sight-threatening stage, is critical for clinical actions, including medical interventions and timely monitoring. Recently, deep-learning-based models have been developed and achieved superior performance for late AMD prediction. However, most existing methods are limited to the color fundus photography (CFP) from the last ophthalmic visit and do not include the longitudinal CFP history and AMD progression during the previous years' visits. Patients in different AMD subphenotypes might have various speeds of progression in different stages of AMD disease. Capturing the progression information during the previous years' visits might be useful for the prediction of AMD progression. In this work, we propose a Contrastive-Attention-based Time-aware Long Short-Term Memory network (CAT-LSTM) to predict AMD progression. First, we adopt a convolutional neural network (CNN) model with a contrastive attention module (CA) to extract abnormal features from CFPs. Then we utilize a time-aware LSTM (T-LSTM) to model the patients' history and consider the AMD progression information. The combination of disease progression, genotype information, demographics, and CFP features are sent to T-LSTM. Moreover, we leverage an auto-encoder to represent temporal CFP sequences as fixed-size vectors and adopt k-means to cluster them into subphenotypes. We evaluate the proposed model based on real-world datasets, and the results show that the proposed model could achieve 0.925 on area under the receiver operating characteristic (AUROC) for 5-year late-AMD prediction and outperforms the state-of-the-art methods by more than 3%, which demonstrates the effectiveness of the proposed CAT-LSTM. After analyzing patient representation learned by an auto-encoder, we identify 3 novel subphenotypes of AMD patients with different characteristics and progression rates to late AMD, paving the way for improved personalization of AMD management. The code of CAT-LSTM can be found at GitHub.","PeriodicalId":74037,"journal":{"name":"KDD : proceedings. International Conference on Knowledge Discovery & Data Mining","volume":"2022 ","pages":"4402-4412"},"PeriodicalIF":0.0,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9505703/pdf/nihms-1830315.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9234125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

MoCL: Data-driven Molecular Fingerprint via Knowledge-aware Contrastive Learning from Molecular Graph. MoCL：通过分子图谱的知识感知对比学习实现数据驱动的分子指纹。

KDD : proceedings. International Conference on Knowledge Discovery & Data Mining

Pub Date : 2021-08-01 Epub Date: 2021-08-14 DOI: 10.1145/3447548.3467186

Mengying Sun, Jing Xing, Huijun Wang, Bin Chen, Jiayu Zhou

Recent years have seen a rapid growth of utilizing graph neural networks (GNNs) in the biomedical domain for tackling drug-related problems. However, like any other deep architectures, GNNs are data hungry. While requiring labels in real world is often expensive, pretraining GNNs in an unsupervised manner has been actively explored. Among them, graph contrastive learning, by maximizing the mutual information between paired graph augmentations, has been shown to be effective on various downstream tasks. However, the current graph contrastive learning framework has two limitations. First, the augmentations are designed for general graphs and thus may not be suitable or powerful enough for certain domains. Second, the contrastive scheme only learns representations that are invariant to local perturbations and thus does not consider the global structure of the dataset, which may also be useful for downstream tasks. In this paper, we study graph contrastive learning designed specifically for the biomedical domain, where molecular graphs are present. We propose a novel framework called MoCL, which utilizes domain knowledge at both local- and global-level to assist representation learning. The local-level domain knowledge guides the augmentation process such that variation is introduced without changing graph semantics. The global-level knowledge encodes the similarity information between graphs in the entire dataset and helps to learn representations with richer semantics. The entire model is learned through a double contrast objective. We evaluate MoCL on various molecular datasets under both linear and semi-supervised settings and results show that MoCL achieves state-of-the-art performance.

近年来，图神经网络（GNN）在生物医学领域的应用迅速发展，用于解决与药物相关的问题。然而，与其他深度架构一样，图神经网络也有数据饥渴症。虽然需要真实世界中的标签往往成本高昂，但人们一直在积极探索以无监督方式预训练 GNN。其中，图对比学习（graph contrastive learning）通过最大化配对图增强之间的互信息，已被证明在各种下游任务中非常有效。然而，目前的图对比学习框架有两个局限性。首先，扩增是为一般图设计的，因此可能不适合某些领域或不够强大。其次，对比方案只学习对局部扰动不变的表征，因此不考虑数据集的全局结构，而这对下游任务可能也很有用。在本文中，我们研究了专门为生物医学领域设计的图对比学习，该领域存在分子图。我们提出了一个名为 MoCL 的新框架，它利用局部和全局层面的领域知识来辅助表征学习。局部级领域知识可指导增强过程，从而在不改变图语义的情况下引入变化。全局级知识编码了整个数据集中图之间的相似性信息，有助于学习具有更丰富语义的表征。整个模型是通过双重对比目标学习的。我们在线性和半监督设置下对各种分子数据集进行了评估，结果表明 MoCL 达到了最先进的性能。

{"title":"MoCL: Data-driven Molecular Fingerprint via Knowledge-aware Contrastive Learning from Molecular Graph.","authors":"Mengying Sun, Jing Xing, Huijun Wang, Bin Chen, Jiayu Zhou","doi":"10.1145/3447548.3467186","DOIUrl":"10.1145/3447548.3467186","url":null,"abstract":"Recent years have seen a rapid growth of utilizing graph neural networks (GNNs) in the biomedical domain for tackling drug-related problems. However, like any other deep architectures, GNNs are data hungry. While requiring labels in real world is often expensive, pretraining GNNs in an unsupervised manner has been actively explored. Among them, graph contrastive learning, by maximizing the mutual information between paired graph augmentations, has been shown to be effective on various downstream tasks. However, the current graph contrastive learning framework has two limitations. First, the augmentations are designed for general graphs and thus may not be suitable or powerful enough for certain domains. Second, the contrastive scheme only learns representations that are invariant to local perturbations and thus does not consider the global structure of the dataset, which may also be useful for downstream tasks. In this paper, we study graph contrastive learning designed specifically for the biomedical domain, where molecular graphs are present. We propose a novel framework called MoCL, which utilizes domain knowledge at both local- and global-level to assist representation learning. The local-level domain knowledge guides the augmentation process such that variation is introduced without changing graph semantics. The global-level knowledge encodes the similarity information between graphs in the entire dataset and helps to learn representations with richer semantics. The entire model is learned through a double contrast objective. We evaluate MoCL on various molecular datasets under both linear and semi-supervised settings and results show that MoCL achieves state-of-the-art performance.","PeriodicalId":74037,"journal":{"name":"KDD : proceedings. International Conference on Knowledge Discovery & Data Mining","volume":"2021 ","pages":"3585-3594"},"PeriodicalIF":0.0,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9105980/pdf/nihms-1798075.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10249436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Federated Adversarial Debiasing for Fair and Transferable Representations. 为公平和可转移的表述而进行的联合对抗性消除。

KDD : proceedings. International Conference on Knowledge Discovery & Data Mining

Pub Date : 2021-08-01 Epub Date: 2021-08-14 DOI: 10.1145/3447548.3467281

Junyuan Hong, Zhuangdi Zhu, Shuyang Yu, Zhangyang Wang, Hiroko Dodge, Jiayu Zhou

Federated learning is a distributed learning framework that is communication efficient and provides protection over participating users' raw training data. One outstanding challenge of federate learning comes from the users' heterogeneity, and learning from such data may yield biased and unfair models for minority groups. While adversarial learning is commonly used in centralized learning for mitigating bias, there are significant barriers when extending it to the federated framework. In this work, we study these barriers and address them by proposing a novel approach Federated Adversarial DEbiasing (FADE). FADE does not require users' sensitive group information for debiasing and offers users the freedom to opt-out from the adversarial component when privacy or computational costs become a concern. We show that ideally, FADE can attain the same global optimality as the one by the centralized algorithm. We then analyze when its convergence may fail in practice and propose a simple yet effective method to address the problem. Finally, we demonstrate the effectiveness of the proposed framework through extensive empirical studies, including the problem settings of unsupervised domain adaptation and fair learning. Our codes and pre-trained models are available at: https://github.com/illidanlab/FADE.

联盟学习是一种分布式学习框架，它具有通信效率高的特点，并能保护参与用户的原始训练数据。联盟学习的一个突出挑战来自用户的异质性，从这些数据中学习可能会产生对少数群体有偏见和不公平的模型。虽然对抗学习通常用于集中学习以减少偏差，但将其扩展到联合框架时会遇到重大障碍。在这项工作中，我们对这些障碍进行了研究，并通过提出一种新方法联邦对抗消除偏差（FADE）来解决这些障碍。FADE 不需要用户的敏感群组信息进行去逆，而且当隐私或计算成本成为一个问题时，用户可以自由选择退出对抗部分。我们证明，在理想情况下，FADE 可以达到与集中式算法相同的全局最优性。然后，我们分析了在实践中收敛可能失败的情况，并提出了一种简单而有效的方法来解决这个问题。最后，我们通过广泛的实证研究，包括无监督领域适应和公平学习的问题设置，证明了所提框架的有效性。我们的代码和预训练模型可在以下网址获取：https://github.com/illidanlab/FADE.

{"title":"Federated Adversarial Debiasing for Fair and Transferable Representations.","authors":"Junyuan Hong, Zhuangdi Zhu, Shuyang Yu, Zhangyang Wang, Hiroko Dodge, Jiayu Zhou","doi":"10.1145/3447548.3467281","DOIUrl":"10.1145/3447548.3467281","url":null,"abstract":"Federated learning is a distributed learning framework that is communication efficient and provides protection over participating users' raw training data. One outstanding challenge of federate learning comes from the users' heterogeneity, and learning from such data may yield biased and unfair models for minority groups. While adversarial learning is commonly used in centralized learning for mitigating bias, there are significant barriers when extending it to the federated framework. In this work, we study these barriers and address them by proposing a novel approach Federated Adversarial DEbiasing (FADE). FADE does not require users' sensitive group information for debiasing and offers users the freedom to opt-out from the adversarial component when privacy or computational costs become a concern. We show that ideally, FADE can attain the same global optimality as the one by the centralized algorithm. We then analyze when its convergence may fail in practice and propose a simple yet effective method to address the problem. Finally, we demonstrate the effectiveness of the proposed framework through extensive empirical studies, including the problem settings of unsupervised domain adaptation and fair learning. Our codes and pre-trained models are available at: https://github.com/illidanlab/FADE.","PeriodicalId":74037,"journal":{"name":"KDD : proceedings. International Conference on Knowledge Discovery & Data Mining","volume":"2021 ","pages":"617-627"},"PeriodicalIF":0.0,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9105979/pdf/nihms-1798074.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10249439","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

LogPar: Logistic PARAFAC2 Factorization for Temporal Binary Data with Missing Values. 具有缺失值的时间二元数据的LogPar: Logistic PARAFAC2分解。

KDD : proceedings. International Conference on Knowledge Discovery & Data Mining

Pub Date : 2020-08-01 DOI: 10.1145/3394486.3403213

Kejing Yin, Ardavan Afshar, Joyce C Ho, William K Cheung, Chao Zhang, Jimeng Sun

Binary data with one-class missing values are ubiquitous in real-world applications. They can be represented by irregular tensors with varying sizes in one dimension, where value one means presence of a feature while zero means unknown (i.e., either presence or absence of a feature). Learning accurate low-rank approximations from such binary irregular tensors is a challenging task. However, none of the existing models developed for factorizing irregular tensors take the missing values into account, and they assume Gaussian distributions, resulting in a distribution mismatch when applied to binary data. In this paper, we propose Logistic PARAFAC2 (LogPar) by modeling the binary irregular tensor with Bernoulli distribution parameterized by an underlying real-valued tensor. Then we approximate the underlying tensor with a positive-unlabeled learning loss function to account for the missing values. We also incorporate uniqueness and temporal smoothness regularization to enhance the interpretability. Extensive experiments using large-scale real-world datasets show that LogPar outperforms all baselines in both irregular tensor completion and downstream predictive tasks. For the irregular tensor completion, LogPar achieves up to 26% relative improvement compared to the best baseline. Besides, LogPar obtains relative improvement of 13.2% for heart failure prediction and 14% for mortality prediction on average compared to the state-of-the-art PARAFAC2 models.

具有一类缺失值的二进制数据在实际应用程序中普遍存在。它们可以用一维中不同大小的不规则张量表示，其中值1表示存在特征，而0表示未知(即存在或不存在特征)。从这种二元不规则张量中学习精确的低秩近似是一项具有挑战性的任务。然而，现有的用于分解不规则张量的模型都没有考虑缺失值，并且它们假设高斯分布，导致在应用于二进制数据时分布不匹配。本文通过对具有伯努利分布的二元不规则张量进行建模，提出了Logistic PARAFAC2 (LogPar)。然后，我们用一个正的无标记学习损失函数来近似底层张量，以解释缺失的值。我们还结合唯一性和时间平滑正则化来增强可解释性。使用大规模真实数据集进行的大量实验表明，LogPar在不规则张量完井和下游预测任务中都优于所有基线。对于不规则张量完井，与最佳基线相比，LogPar实现了高达26%的相对改进。此外，与最先进的PARAFAC2模型相比，LogPar在心力衰竭预测方面平均提高13.2%，在死亡率预测方面平均提高14%。

{"title":"LogPar: Logistic PARAFAC2 Factorization for Temporal Binary Data with Missing Values.","authors":"Kejing Yin, Ardavan Afshar, Joyce C Ho, William K Cheung, Chao Zhang, Jimeng Sun","doi":"10.1145/3394486.3403213","DOIUrl":"https://doi.org/10.1145/3394486.3403213","url":null,"abstract":"Binary data with one-class missing values are ubiquitous in real-world applications. They can be represented by irregular tensors with varying sizes in one dimension, where value one means presence of a feature while zero means unknown (i.e., either presence or absence of a feature). Learning accurate low-rank approximations from such binary irregular tensors is a challenging task. However, none of the existing models developed for factorizing irregular tensors take the missing values into account, and they assume Gaussian distributions, resulting in a distribution mismatch when applied to binary data. In this paper, we propose Logistic PARAFAC2 (LogPar) by modeling the binary irregular tensor with Bernoulli distribution parameterized by an underlying real-valued tensor. Then we approximate the underlying tensor with a positive-unlabeled learning loss function to account for the missing values. We also incorporate uniqueness and temporal smoothness regularization to enhance the interpretability. Extensive experiments using large-scale real-world datasets show that LogPar outperforms all baselines in both irregular tensor completion and downstream predictive tasks. For the irregular tensor completion, LogPar achieves up to 26% relative improvement compared to the best baseline. Besides, LogPar obtains relative improvement of 13.2% for heart failure prediction and 14% for mortality prediction on average compared to the state-of-the-art PARAFAC2 models.","PeriodicalId":74037,"journal":{"name":"KDD : proceedings. International Conference on Knowledge Discovery & Data Mining","volume":"2020 ","pages":"1625-1635"},"PeriodicalIF":0.0,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3394486.3403213","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39079152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 16

MetaPred: Meta-Learning for Clinical Risk Prediction with Limited Patient Electronic Health Records. meta - apred:基于有限患者电子健康记录的临床风险预测元学习。

KDD : proceedings. International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-08-01 DOI: 10.1145/3292500.3330779

Xi Sheryl Zhang, Fengyi Tang, Hiroko H Dodge, Jiayu Zhou, Fei Wang

In recent years, large amounts of health data, such as patient Electronic Health Records (EHR), are becoming readily available. This provides an unprecedented opportunity for knowledge discovery and data mining algorithms to dig insights from them, which can, later on, be helpful to the improvement of the quality of care delivery. Predictive modeling of clinical risks, including in-hospital mortality, hospital readmission, chronic disease onset, condition exacerbation, etc., from patient EHR, is one of the health data analytic problems that attract lots of the interests. The reason is not only because the problem is important in clinical settings, but also is challenging when working with EHR such as sparsity, irregularity, temporality, etc. Different from applications in other domains such as computer vision and natural language processing, the data samples in medicine (patients) are relatively limited, which creates lots of troubles for building effective predictive models, especially for complicated ones such as deep learning. In this paper, we propose MetaPred, a meta-learning framework for clinical risk prediction from longitudinal patient EHR. In particular, in order to predict the target risk with limited data samples, we train a meta-learner from a set of related risk prediction tasks which learns how a good predictor is trained. The meta-learned can then be directly used in target risk prediction, and the limited available samples in the target domain can be used for further fine-tuning the model performance. The effectiveness of MetaPred is tested on a real patient EHR repository from Oregon Health & Science University. We are able to demonstrate that with Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) as base predictors, MetaPred can achieve much better performance for predicting target risk with low resources comparing with the predictor trained on the limited samples available for this risk alone.

近年来，大量的健康数据，如患者电子健康记录(EHR)，变得越来越容易获得。这为知识发现和数据挖掘算法从中挖掘见解提供了前所未有的机会，这些见解可以在以后有助于提高护理质量。基于患者电子病历的临床风险预测建模，包括住院死亡率、再入院率、慢性病发作率、病情加重率等，是健康数据分析领域备受关注的问题之一。原因不仅是因为这个问题在临床环境中很重要，而且在处理电子病历时也具有挑战性，如稀疏性、不规则性、时间性等。与计算机视觉和自然语言处理等其他领域的应用不同，医学(患者)的数据样本相对有限，这给建立有效的预测模型带来了很多麻烦，特别是对于深度学习等复杂的预测模型。在本文中，我们提出了MetaPred，这是一个用于从纵向患者电子病历中预测临床风险的元学习框架。特别是，为了用有限的数据样本预测目标风险，我们从一组相关的风险预测任务中训练一个元学习器，它学习如何训练一个好的预测器。然后，元学习可以直接用于目标风险预测，并且可以使用目标域中有限的可用样本进一步微调模型性能。MetaPred的有效性在俄勒冈健康与科学大学的真实患者电子病历库上进行了测试。我们能够证明，使用卷积神经网络(CNN)和循环神经网络(RNN)作为基本预测器，与仅针对该风险的有限样本训练的预测器相比，MetaPred可以在低资源下预测目标风险方面取得更好的性能。

{"title":"MetaPred: Meta-Learning for Clinical Risk Prediction with Limited Patient Electronic Health Records.","authors":"Xi Sheryl Zhang, Fengyi Tang, Hiroko H Dodge, Jiayu Zhou, Fei Wang","doi":"10.1145/3292500.3330779","DOIUrl":"https://doi.org/10.1145/3292500.3330779","url":null,"abstract":"In recent years, large amounts of health data, such as patient Electronic Health Records (EHR), are becoming readily available. This provides an unprecedented opportunity for knowledge discovery and data mining algorithms to dig insights from them, which can, later on, be helpful to the improvement of the quality of care delivery. Predictive modeling of clinical risks, including in-hospital mortality, hospital readmission, chronic disease onset, condition exacerbation, etc., from patient EHR, is one of the health data analytic problems that attract lots of the interests. The reason is not only because the problem is important in clinical settings, but also is challenging when working with EHR such as sparsity, irregularity, temporality, etc. Different from applications in other domains such as computer vision and natural language processing, the data samples in medicine (patients) are relatively limited, which creates lots of troubles for building effective predictive models, especially for complicated ones such as deep learning. In this paper, we propose MetaPred, a meta-learning framework for clinical risk prediction from longitudinal patient EHR. In particular, in order to predict the target risk with limited data samples, we train a meta-learner from a set of related risk prediction tasks which learns how a good predictor is trained. The meta-learned can then be directly used in target risk prediction, and the limited available samples in the target domain can be used for further fine-tuning the model performance. The effectiveness of MetaPred is tested on a real patient EHR repository from Oregon Health & Science University. We are able to demonstrate that with Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN) as base predictors, MetaPred can achieve much better performance for predicting target risk with low resources comparing with the predictor trained on the limited samples available for this risk alone.","PeriodicalId":74037,"journal":{"name":"KDD : proceedings. International Conference on Knowledge Discovery & Data Mining","volume":"2019 ","pages":"2487-2495"},"PeriodicalIF":0.0,"publicationDate":"2019-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3292500.3330779","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38879115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 87

A Free Energy Based Approach for Distance Metric Learning. 基于自由能的距离度量学习方法。

KDD : proceedings. International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-01 DOI: 10.1145/3292500.3330975

Sho Inaba, Carl T Fakhry, Rahul V Kulkarni, Kourosh Zarringhalam

We present a reformulation of the distance metric learning problem as a penalized optimization problem, with a penalty term corresponding to the von Neumann entropy of the distance metric. This formulation leads to a mapping to statistical mechanics such that the metric learning optimization problem becomes equivalent to free energy minimization. Correspondingly, our approach leads to an analytical solution of the optimization problem based on the Boltzmann distribution. The mapping established in this work suggests new approaches for dimensionality reduction and provides insights into determination of optimal parameters for the penalty term. Furthermore, we demonstrate that the metric projects the data onto direction of maximum dissimilarity with optimal and tunable separation between classes and thus the transformation can be used for high dimensional data visualization, classification, and clustering tasks. We benchmark our method against previous distance learning methods and provide an efficient implementation in an R package available to download at: https://github.com/kouroshz/fenn.

我们将距离度量学习问题重新表述为一个惩罚优化问题，其中惩罚项对应于距离度量的冯·诺伊曼熵。这个公式可以映射到统计力学，使得度量学习优化问题等同于自由能最小化。相应地，我们的方法导致了基于玻尔兹曼分布的优化问题的解析解。在这项工作中建立的映射提出了降维的新方法，并为确定惩罚项的最佳参数提供了见解。此外，我们证明了该度量将数据投影到最大不相似的方向上，并具有最佳和可调的类之间的分离，因此该转换可用于高维数据可视化，分类和聚类任务。我们将我们的方法与以前的远程学习方法进行了基准测试，并在R包中提供了一个有效的实现，可以从https://github.com/kouroshz/fenn下载。

{"title":"A Free Energy Based Approach for Distance Metric Learning.","authors":"Sho Inaba, Carl T Fakhry, Rahul V Kulkarni, Kourosh Zarringhalam","doi":"10.1145/3292500.3330975","DOIUrl":"https://doi.org/10.1145/3292500.3330975","url":null,"abstract":"We present a reformulation of the distance metric learning problem as a penalized optimization problem, with a penalty term corresponding to the von Neumann entropy of the distance metric. This formulation leads to a mapping to statistical mechanics such that the metric learning optimization problem becomes equivalent to free energy minimization. Correspondingly, our approach leads to an analytical solution of the optimization problem based on the Boltzmann distribution. The mapping established in this work suggests new approaches for dimensionality reduction and provides insights into determination of optimal parameters for the penalty term. Furthermore, we demonstrate that the metric projects the data onto direction of maximum dissimilarity with optimal and tunable separation between classes and thus the transformation can be used for high dimensional data visualization, classification, and clustering tasks. We benchmark our method against previous distance learning methods and provide an efficient implementation in an R package available to download at: https://github.com/kouroshz/fenn.","PeriodicalId":74037,"journal":{"name":"KDD : proceedings. International Conference on Knowledge Discovery & Data Mining","volume":"2019 ","pages":"5-13"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3292500.3330975","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"25467271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Retaining Privileged Information for Multi-Task Learning. 为多任务学习保留特权信息。

KDD : proceedings. International Conference on Knowledge Discovery & Data Mining

Pub Date : 2019-07-01 DOI: 10.1145/3292500.3330907

Fengyi Tang, Cao Xiao, Fei Wang, Jiayu Zhou, Li-Wei H Lehman

Knowledge transfer has been of great interest in current machine learning research, as many have speculated its importance in modeling the human ability to rapidly generalize learned models to new scenarios. Particularly in cases where training samples are limited, knowledge transfer shows improvement on both the learning speed and generalization performance of related tasks. Recently, Learning Using Privileged Information (LUPI) has presented a new direction in knowledge transfer by modeling the transfer of prior knowledge as a Teacher-Student interaction process. Under LUPI, a Teacher model uses Privileged Information (PI) that is only available at training time to improve the sample complexity required to train a Student learner for a given task. In this work, we present a LUPI formulation that allows privileged information to be retained in a multi-task learning setting. We propose a novel feature matching algorithm that projects samples from the original feature space and the privilege information space into a joint latent space in a way that informs similarity between training samples. Our experiments show that useful knowledge from PI is maintained in the latent space and greatly improves the sample efficiency of other related learning tasks. We also provide an analysis of sample complexity of the proposed LUPI method, which under some favorable assumptions can achieve a greater sample efficiency than brute force methods.

在当前的机器学习研究中，知识转移一直是人们非常感兴趣的问题，因为许多人已经推测它在模拟人类快速将所学模型推广到新场景的能力方面的重要性。特别是在训练样本有限的情况下，知识迁移对相关任务的学习速度和泛化性能都有提高。近年来，利用特权信息学习(LUPI)将先验知识的转移建模为师生互动过程，为知识转移研究提供了新的方向。在LUPI下，教师模型使用仅在训练时可用的特权信息(PI)来提高训练学生学习者完成给定任务所需的样本复杂性。在这项工作中，我们提出了一个允许在多任务学习环境中保留特权信息的LUPI公式。我们提出了一种新的特征匹配算法，该算法将原始特征空间和特权信息空间中的样本投影到联合潜在空间中，以告知训练样本之间的相似性。我们的实验表明，来自PI的有用知识被保留在潜在空间中，大大提高了其他相关学习任务的样本效率。我们还对所提出的LUPI方法的样本复杂度进行了分析，在一些有利的假设下，该方法可以比暴力破解方法获得更高的样本效率。

{"title":"Retaining Privileged Information for Multi-Task Learning.","authors":"Fengyi Tang, Cao Xiao, Fei Wang, Jiayu Zhou, Li-Wei H Lehman","doi":"10.1145/3292500.3330907","DOIUrl":"https://doi.org/10.1145/3292500.3330907","url":null,"abstract":"Knowledge transfer has been of great interest in current machine learning research, as many have speculated its importance in modeling the human ability to rapidly generalize learned models to new scenarios. Particularly in cases where training samples are limited, knowledge transfer shows improvement on both the learning speed and generalization performance of related tasks. Recently, Learning Using Privileged Information (LUPI) has presented a new direction in knowledge transfer by modeling the transfer of prior knowledge as a Teacher-Student interaction process. Under LUPI, a Teacher model uses Privileged Information (PI) that is only available at training time to improve the sample complexity required to train a Student learner for a given task. In this work, we present a LUPI formulation that allows privileged information to be retained in a multi-task learning setting. We propose a novel feature matching algorithm that projects samples from the original feature space and the privilege information space into a joint latent space in a way that informs similarity between training samples. Our experiments show that useful knowledge from PI is maintained in the latent space and greatly improves the sample efficiency of other related learning tasks. We also provide an analysis of sample complexity of the proposed LUPI method, which under some favorable assumptions can achieve a greater sample efficiency than brute force methods.","PeriodicalId":74037,"journal":{"name":"KDD : proceedings. International Conference on Knowledge Discovery & Data Mining","volume":"2019 ","pages":"1369-1377"},"PeriodicalIF":0.0,"publicationDate":"2019-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/3292500.3330907","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39890793","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12

Voxel Deconvolutional Networks for 3D Brain Image Labeling. 用于三维脑图像标注的体素去卷积网络

KDD : proceedings. International Conference on Knowledge Discovery & Data Mining

Pub Date : 2018-08-01 DOI: 10.1145/3219819.3219974

Yongjun Chen, Min Shi, Hongyang Gao, Dinggang Shen, Lei Cai, Shuiwang Ji

Deep learning methods have shown great success in pixel-wise prediction tasks. One of the most popular methods employs an encoder-decoder network in which deconvolutional layers are used for up-sampling feature maps. However, a key limitation of the deconvolutional layer is that it suffers from the checkerboard artifact problem, which harms the prediction accuracy. This is caused by the independency among adjacent pixels on the output feature maps. Previous work only solved the checkerboard artifact issue of deconvolutional layers in the 2D space. Since the number of intermediate feature maps needed to generate a deconvolutional layer grows exponentially with dimensionality, it is more challenging to solve this issue in higher dimensions. In this work, we propose the voxel deconvolutional layer (VoxelDCL) to solve the checkerboard artifact problem of deconvolutional layers in 3D space. We also provide an efficient approach to implement VoxelDCL. To demonstrate the effectiveness of VoxelDCL, we build four variations of voxel deconvolutional networks (VoxelDCN) based on the U-Net architecture with VoxelDCL. We apply our networks to address volumetric brain images labeling tasks using the ADNI and LONI LPBA40 datasets. The experimental results show that the proposed iVoxelDCNa achieves improved performance in all experiments. It reaches 83.34% in terms of dice ratio on the ADNI dataset and 79.12% on the LONI LPBA40 dataset, which increases 1.39% and 2.21% respectively compared with the baseline. In addition, all the variations of VoxelDCN we proposed outperform the baseline methods on the above datasets, which demonstrates the effectiveness of our methods.

深度学习方法在像素预测任务中取得了巨大成功。最流行的方法之一是采用编码器-解码器网络，其中的解卷积层用于上采样特征图。然而，去卷积层的一个主要局限性是存在棋盘式伪影问题，这会损害预测精度。这是由于输出特征图上相邻像素之间的独立性造成的。以前的工作只解决了二维空间中去卷积层的棋盘伪影问题。由于生成去卷积层所需的中间特征图的数量会随维度的增加而呈指数增长，因此在更高的维度上解决这一问题更具挑战性。在这项工作中，我们提出了体素去卷积层（VoxelDCL）来解决三维空间中去卷积层的棋盘伪影问题。我们还提供了实现 VoxelDCL 的有效方法。为了证明 VoxelDCL 的有效性，我们在 U-Net 架构的基础上利用 VoxelDCL 构建了四种不同的体素去卷积网络（VoxelDCN）。我们将我们的网络应用于使用 ADNI 和 LONI LPBA40 数据集进行的大脑容积图像标记任务。实验结果表明，所提出的 iVoxelDCNa 在所有实验中都取得了更好的性能。它在 ADNI 数据集上的骰子比率达到 83.34%，在 LONI LPBA40 数据集上达到 79.12%，与基线相比分别提高了 1.39% 和 2.21%。此外，我们提出的所有 VoxelDCN 变体在上述数据集上的表现都优于基线方法，这证明了我们方法的有效性。

{"title":"Voxel Deconvolutional Networks for 3D Brain Image Labeling.","authors":"Yongjun Chen, Min Shi, Hongyang Gao, Dinggang Shen, Lei Cai, Shuiwang Ji","doi":"10.1145/3219819.3219974","DOIUrl":"10.1145/3219819.3219974","url":null,"abstract":"Deep learning methods have shown great success in pixel-wise prediction tasks. One of the most popular methods employs an encoder-decoder network in which deconvolutional layers are used for up-sampling feature maps. However, a key limitation of the deconvolutional layer is that it suffers from the checkerboard artifact problem, which harms the prediction accuracy. This is caused by the independency among adjacent pixels on the output feature maps. Previous work only solved the checkerboard artifact issue of deconvolutional layers in the 2D space. Since the number of intermediate feature maps needed to generate a deconvolutional layer grows exponentially with dimensionality, it is more challenging to solve this issue in higher dimensions. In this work, we propose the voxel deconvolutional layer (VoxelDCL) to solve the checkerboard artifact problem of deconvolutional layers in 3D space. We also provide an efficient approach to implement VoxelDCL. To demonstrate the effectiveness of VoxelDCL, we build four variations of voxel deconvolutional networks (VoxelDCN) based on the U-Net architecture with VoxelDCL. We apply our networks to address volumetric brain images labeling tasks using the ADNI and LONI LPBA40 datasets. The experimental results show that the proposed iVoxelDCNa achieves improved performance in all experiments. It reaches 83.34% in terms of dice ratio on the ADNI dataset and 79.12% on the LONI LPBA40 dataset, which increases 1.39% and 2.21% respectively compared with the baseline. In addition, all the variations of VoxelDCN we proposed outperform the baseline methods on the above datasets, which demonstrates the effectiveness of our methods.","PeriodicalId":74037,"journal":{"name":"KDD : proceedings. International Conference on Knowledge Discovery & Data Mining","volume":"2018 ","pages":"1226-1234"},"PeriodicalIF":0.0,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6426146/pdf/nihms-988372.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"37087638","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0