Markus Stocker, Lauren Snyder, Matthew Anfuso, Oliver Ludwig, Freya Thießen, Kheir Eddine Farfar, Muhammad Haris, Allard Oelen, Mohamad Yaser Jaradeh
{"title":"重新思考制作和出版可通过机器重复使用的研究成果表达方式","authors":"Markus Stocker, Lauren Snyder, Matthew Anfuso, Oliver Ludwig, Freya Thießen, Kheir Eddine Farfar, Muhammad Haris, Allard Oelen, Mohamad Yaser Jaradeh","doi":"arxiv-2405.13129","DOIUrl":null,"url":null,"abstract":"Literature is the primary expression of scientific knowledge and an important\nsource of research data. However, scientific knowledge expressed in narrative\ntext documents is not inherently machine reusable. To facilitate knowledge\nreuse, e.g. for synthesis research, scientific knowledge must be extracted from\narticles and organized into databases post-publication. The high time costs and\ninaccuracies associated with completing these activities manually has driven\nthe development of techniques that automate knowledge extraction. Tackling the\nproblem with a different mindset, we propose a pre-publication approach, known\nas reborn, that ensures scientific knowledge is born reusable, i.e. produced in\na machine-reusable format during knowledge production. We implement the\napproach using the Open Research Knowledge Graph infrastructure for FAIR\nscientific knowledge organization. We test the approach with three use cases,\nand discuss the role of publishers and editors in scaling the approach. Our\nresults suggest that the proposed approach is superior compared to classical\nmanual and semi-automated post-publication extraction techniques in terms of\nknowledge richness and accuracy as well as technological simplicity.","PeriodicalId":501285,"journal":{"name":"arXiv - CS - Digital Libraries","volume":"14 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Rethinking the production and publication of machine-reusable expressions of research findings\",\"authors\":\"Markus Stocker, Lauren Snyder, Matthew Anfuso, Oliver Ludwig, Freya Thießen, Kheir Eddine Farfar, Muhammad Haris, Allard Oelen, Mohamad Yaser Jaradeh\",\"doi\":\"arxiv-2405.13129\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Literature is the primary expression of scientific knowledge and an important\\nsource of research data. However, scientific knowledge expressed in narrative\\ntext documents is not inherently machine reusable. To facilitate knowledge\\nreuse, e.g. for synthesis research, scientific knowledge must be extracted from\\narticles and organized into databases post-publication. The high time costs and\\ninaccuracies associated with completing these activities manually has driven\\nthe development of techniques that automate knowledge extraction. Tackling the\\nproblem with a different mindset, we propose a pre-publication approach, known\\nas reborn, that ensures scientific knowledge is born reusable, i.e. produced in\\na machine-reusable format during knowledge production. We implement the\\napproach using the Open Research Knowledge Graph infrastructure for FAIR\\nscientific knowledge organization. We test the approach with three use cases,\\nand discuss the role of publishers and editors in scaling the approach. Our\\nresults suggest that the proposed approach is superior compared to classical\\nmanual and semi-automated post-publication extraction techniques in terms of\\nknowledge richness and accuracy as well as technological simplicity.\",\"PeriodicalId\":501285,\"journal\":{\"name\":\"arXiv - CS - Digital Libraries\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Digital Libraries\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2405.13129\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Digital Libraries","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2405.13129","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Rethinking the production and publication of machine-reusable expressions of research findings
Literature is the primary expression of scientific knowledge and an important
source of research data. However, scientific knowledge expressed in narrative
text documents is not inherently machine reusable. To facilitate knowledge
reuse, e.g. for synthesis research, scientific knowledge must be extracted from
articles and organized into databases post-publication. The high time costs and
inaccuracies associated with completing these activities manually has driven
the development of techniques that automate knowledge extraction. Tackling the
problem with a different mindset, we propose a pre-publication approach, known
as reborn, that ensures scientific knowledge is born reusable, i.e. produced in
a machine-reusable format during knowledge production. We implement the
approach using the Open Research Knowledge Graph infrastructure for FAIR
scientific knowledge organization. We test the approach with three use cases,
and discuss the role of publishers and editors in scaling the approach. Our
results suggest that the proposed approach is superior compared to classical
manual and semi-automated post-publication extraction techniques in terms of
knowledge richness and accuracy as well as technological simplicity.