An Automated Tool to Classify and Transform Unstructured MRI Data into BIDS Datasets.

IF 2.7 4区医学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Neuroinformatics Pub Date : 2024-07-01 Epub Date: 2024-03-26 DOI:10.1007/s12021-024-09659-5

Alexander Bartnik, Sujal Singh, Conan Sum, Mackenzie Smith, Niels Bergsland, Robert Zivadinov, Michael G Dwyer

{"title":"An Automated Tool to Classify and Transform Unstructured MRI Data into BIDS Datasets.","authors":"Alexander Bartnik, Sujal Singh, Conan Sum, Mackenzie Smith, Niels Bergsland, Robert Zivadinov, Michael G Dwyer","doi":"10.1007/s12021-024-09659-5","DOIUrl":null,"url":null,"abstract":"<p><p>The increasing use of neuroimaging in clinical research has driven the creation of many large imaging datasets. However, these datasets often rely on inconsistent naming conventions in image file headers to describe acquisition, and time-consuming manual curation is necessary. Therefore, we sought to automate the process of classifying and organizing magnetic resonance imaging (MRI) data according to acquisition types common to the clinical routine, as well as automate the transformation of raw, unstructured images into Brain Imaging Data Structure (BIDS) datasets. To do this, we trained an XGBoost model to classify MRI acquisition types using relatively few acquisition parameters that are automatically stored by the MRI scanner in image file metadata, which are then mapped to the naming conventions prescribed by BIDS to transform the input images to the BIDS structure. The model recognizes MRI types with 99.475% accuracy, as well as a micro/macro-averaged precision of 0.9995/0.994, a micro/macro-averaged recall of 0.9995/0.989, and a micro/macro-averaged F1 of 0.9995/0.991. Our approach accurately and quickly classifies MRI types and transforms unstructured data into standardized structures with little-to-no user intervention, reducing the barrier of entry for clinical scientists and increasing the accessibility of existing neuroimaging data.</p>","PeriodicalId":49761,"journal":{"name":"Neuroinformatics","volume":" ","pages":"229-238"},"PeriodicalIF":2.7000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neuroinformatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s12021-024-09659-5","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/3/26 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

The increasing use of neuroimaging in clinical research has driven the creation of many large imaging datasets. However, these datasets often rely on inconsistent naming conventions in image file headers to describe acquisition, and time-consuming manual curation is necessary. Therefore, we sought to automate the process of classifying and organizing magnetic resonance imaging (MRI) data according to acquisition types common to the clinical routine, as well as automate the transformation of raw, unstructured images into Brain Imaging Data Structure (BIDS) datasets. To do this, we trained an XGBoost model to classify MRI acquisition types using relatively few acquisition parameters that are automatically stored by the MRI scanner in image file metadata, which are then mapped to the naming conventions prescribed by BIDS to transform the input images to the BIDS structure. The model recognizes MRI types with 99.475% accuracy, as well as a micro/macro-averaged precision of 0.9995/0.994, a micro/macro-averaged recall of 0.9995/0.989, and a micro/macro-averaged F1 of 0.9995/0.991. Our approach accurately and quickly classifies MRI types and transforms unstructured data into standardized structures with little-to-no user intervention, reducing the barrier of entry for clinical scientists and increasing the accessibility of existing neuroimaging data.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

将非结构化核磁共振成像数据分类和转换为 BIDS 数据集的自动工具。

随着神经成像技术在临床研究中的应用日益广泛，许多大型成像数据集应运而生。然而，这些数据集往往依赖于图像文件头中不一致的命名约定来描述采集情况，因此必须进行耗时的人工整理。因此，我们试图将磁共振成像（MRI）数据的分类和整理过程自动化，使其符合临床常规的采集类型，并将原始、非结构化图像自动转换为脑成像数据结构（BIDS）数据集。为此，我们训练了一个 XGBoost 模型，利用核磁共振扫描仪自动存储在图像文件元数据中的相对较少的采集参数对核磁共振成像采集类型进行分类，然后将这些参数映射到 BIDS 规定的命名约定，将输入图像转换为 BIDS 结构。该模型识别磁共振成像类型的准确率为 99.475%，微观/宏观平均精确度为 0.9995/0.994，微观/宏观平均召回率为 0.9995/0.989，微观/宏观平均 F1 为 0.9995/0.991。我们的方法能准确、快速地对核磁共振成像类型进行分类，并将非结构化数据转化为标准化结构，几乎不需要用户干预，从而降低了临床科学家的入门门槛，提高了现有神经成像数据的可访问性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Neuroinformatics 医学-计算机：跨学科应用

CiteScore

6.00

自引率

6.70%

发文量

审稿时长

3 months

期刊介绍： Neuroinformatics publishes original articles and reviews with an emphasis on data structure and software tools related to analysis, modeling, integration, and sharing in all areas of neuroscience research. The editors particularly invite contributions on: (1) Theory and methodology, including discussions on ontologies, modeling approaches, database design, and meta-analyses; (2) Descriptions of developed databases and software tools, and of the methods for their distribution; (3) Relevant experimental results, such as reports accompanie by the release of massive data sets; (4) Computational simulations of models integrating and organizing complex data; and (5) Neuroengineering approaches, including hardware, robotics, and information theory studies.