Assisted neuroscience knowledge extraction via machine learning applied to neural reconstruction metadata on NeuroMorpho.Org.

Q1 Computer Science Brain Informatics Pub Date : 2022-11-07 DOI:10.1186/s40708-022-00174-4

Kayvan Bijari, Yasmeen Zoubi, Giorgio A Ascoli

{"title":"Assisted neuroscience knowledge extraction via machine learning applied to neural reconstruction metadata on NeuroMorpho.Org.","authors":"Kayvan Bijari, Yasmeen Zoubi, Giorgio A Ascoli","doi":"10.1186/s40708-022-00174-4","DOIUrl":null,"url":null,"abstract":"<p><p>The amount of unstructured text produced daily in scholarly journals is enormous. Systematically identifying, sorting, and structuring information from such a volume of data is increasingly challenging for researchers even in delimited domains. Named entity recognition is a fundamental natural language processing tool that can be trained to annotate, structure, and extract information from scientific articles. Here, we harness state-of-the-art machine learning techniques and develop a smart neuroscience metadata suggestion system accessible by both humans through a user-friendly graphical interface and machines via Application Programming Interface. We demonstrate a practical application to the public repository of neural reconstructions, NeuroMorpho.Org, thus expanding the existing web-based metadata management system currently in use. Quantitative analysis indicates that the suggestion system reduces personnel labor by at least 50%. Moreover, our results show that larger training datasets with the same software architecture are unlikely to further improve performance without ad-hoc heuristics due to intrinsic ambiguities in neuroscience nomenclature. All components of this project are released open source for community enhancement and extensions to additional applications.</p>","PeriodicalId":37465,"journal":{"name":"Brain Informatics","volume":"9 1","pages":"26"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9640520/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Brain Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s40708-022-00174-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

Abstract

The amount of unstructured text produced daily in scholarly journals is enormous. Systematically identifying, sorting, and structuring information from such a volume of data is increasingly challenging for researchers even in delimited domains. Named entity recognition is a fundamental natural language processing tool that can be trained to annotate, structure, and extract information from scientific articles. Here, we harness state-of-the-art machine learning techniques and develop a smart neuroscience metadata suggestion system accessible by both humans through a user-friendly graphical interface and machines via Application Programming Interface. We demonstrate a practical application to the public repository of neural reconstructions, NeuroMorpho.Org, thus expanding the existing web-based metadata management system currently in use. Quantitative analysis indicates that the suggestion system reduces personnel labor by at least 50%. Moreover, our results show that larger training datasets with the same software architecture are unlikely to further improve performance without ad-hoc heuristics due to intrinsic ambiguities in neuroscience nomenclature. All components of this project are released open source for community enhancement and extensions to additional applications.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

应用于 NeuroMorpho.Org 上神经重建元数据的机器学习辅助神经科学知识提取。

学术期刊每天产生的非结构化文本数量巨大。对于研究人员来说，从如此大量的数据中系统地识别、分类和结构化信息，即使是在有限制的领域也越来越具有挑战性。命名实体识别是一种基本的自然语言处理工具，可以通过训练从科学文章中注释、构建和提取信息。在这里，我们利用最先进的机器学习技术，开发了一个智能神经科学元数据建议系统，人类可以通过友好的图形界面访问，机器也可以通过应用程序接口访问。我们展示了神经重建公共资料库 NeuroMorpho.Org 的实际应用，从而扩展了目前使用的基于网络的元数据管理系统。定量分析表明，建议系统至少减少了 50%的人力。此外，我们的研究结果表明，由于神经科学术语的内在模糊性，如果不采用临时启发式方法，采用相同软件架构的大型训练数据集不太可能进一步提高性能。该项目的所有组件均已开源，供社区改进和扩展到其他应用中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Brain Informatics Computer Science-Computer Science Applications

CiteScore

9.50

自引率

0.00%

发文量

审稿时长

13 weeks

期刊介绍： Brain Informatics is an international, peer-reviewed, interdisciplinary open-access journal published under the brand SpringerOpen, which provides a unique platform for researchers and practitioners to disseminate original research on computational and informatics technologies related to brain. This journal addresses the computational, cognitive, physiological, biological, physical, ecological and social perspectives of brain informatics. It also welcomes emerging information technologies and advanced neuro-imaging technologies, such as big data analytics and interactive knowledge discovery related to various large-scale brain studies and their applications. This journal will publish high-quality original research papers, brief reports and critical reviews in all theoretical, technological, clinical and interdisciplinary studies that make up the field of brain informatics and its applications in brain-machine intelligence, brain-inspired intelligent systems, mental health and brain disorders, etc. The scope of papers includes the following five tracks: Track 1: Cognitive and Computational Foundations of Brain Science Track 2: Human Information Processing Systems Track 3: Brain Big Data Analytics, Curation and Management Track 4: Informatics Paradigms for Brain and Mental Health Research Track 5: Brain-Machine Intelligence and Brain-Inspired Computing