Fang Tan , Tongyang Zhang , Siting Yang , Xiaoyan Wu , Jian Xu
{"title":"Discovering Booming Bio-entities and Their Relationship with Funds","authors":"Fang Tan , Tongyang Zhang , Siting Yang , Xiaoyan Wu , Jian Xu","doi":"10.2478/dim-2021-0007","DOIUrl":null,"url":null,"abstract":"<div><p>With the increasing pressure on the National Institutes of Health (NIH) budget nowadays, it is such a major challenge to cut waste and improve efficiency in the research funding allocation. To meet this challenge, this paper explores research hotspots and disciplinary trends of the biomedical area, and discusses the relationship between these factors and the government funding, thereby uncovering biomedical hotspots of interest to academia and the evolution law of the U.S. federal government funding through an entitymetrics analysis. Considering that the rapid proliferation of biomedical literature provides large amounts of information resources for knowledge discovery, entities extracted from articles in PubMed and NIH-funded projects during 1988–2017 are taken as experimental data. They are divided into four categories: species, diseases, genes, and drugs. Subsequently, a comparative analysis of entity trajectories in the four domains is performed, which includes occurrence frequency calculations of disease entities to explore frequency variation trends in high-frequency entities and the situation of the distribution of research funds. Finally, we conduct an evolutionary analysis of two sides, respectively: the relationship between research popularity and the amount of funding; the relationship between research popularity and the number of funded projects. The results suggest that research on gene and disease entities is at the stage of rapid development. Diseases with high prevalence rate and mortality and diseases associated with genetic factors will be the emphasis of research trends in the future. The distribution of NIH grant appears obvious long tail effect and can influence overall trends in the heat of research topics.. We also find that there is a strong linear correlation between the research popularity of bio-entities, and the amount and number of funding grants, respectively. However, the impact of the amount and number of grant funds on the entity research popularity is decreasing. The above results indicate the extensive applicability of entitymetrics in funding research.</p></div>","PeriodicalId":72769,"journal":{"name":"Data and information management","volume":"5 3","pages":"Pages 312-328"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S254392512200002X/pdfft?md5=35aa412bbf46c3c0636990378d0b8ebf&pid=1-s2.0-S254392512200002X-main.pdf","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Data and information management","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S254392512200002X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
With the increasing pressure on the National Institutes of Health (NIH) budget nowadays, it is such a major challenge to cut waste and improve efficiency in the research funding allocation. To meet this challenge, this paper explores research hotspots and disciplinary trends of the biomedical area, and discusses the relationship between these factors and the government funding, thereby uncovering biomedical hotspots of interest to academia and the evolution law of the U.S. federal government funding through an entitymetrics analysis. Considering that the rapid proliferation of biomedical literature provides large amounts of information resources for knowledge discovery, entities extracted from articles in PubMed and NIH-funded projects during 1988–2017 are taken as experimental data. They are divided into four categories: species, diseases, genes, and drugs. Subsequently, a comparative analysis of entity trajectories in the four domains is performed, which includes occurrence frequency calculations of disease entities to explore frequency variation trends in high-frequency entities and the situation of the distribution of research funds. Finally, we conduct an evolutionary analysis of two sides, respectively: the relationship between research popularity and the amount of funding; the relationship between research popularity and the number of funded projects. The results suggest that research on gene and disease entities is at the stage of rapid development. Diseases with high prevalence rate and mortality and diseases associated with genetic factors will be the emphasis of research trends in the future. The distribution of NIH grant appears obvious long tail effect and can influence overall trends in the heat of research topics.. We also find that there is a strong linear correlation between the research popularity of bio-entities, and the amount and number of funding grants, respectively. However, the impact of the amount and number of grant funds on the entity research popularity is decreasing. The above results indicate the extensive applicability of entitymetrics in funding research.