用于科学信息提取的大型语言模型:病毒学实证研究

Findings Pub Date : 2024-01-18 DOI:10.48550/arXiv.2401.10040
Mahsa Shamsabadi, Jennifer D'Souza, Sören Auer
{"title":"用于科学信息提取的大型语言模型:病毒学实证研究","authors":"Mahsa Shamsabadi, Jennifer D'Souza, Sören Auer","doi":"10.48550/arXiv.2401.10040","DOIUrl":null,"url":null,"abstract":"In this paper, we champion the use of structured and semantic content representation of discourse-based scholarly communication, inspired by tools like Wikipedia infoboxes or structured Amazon product descriptions. These representations provide users with a concise overview, aiding scientists in navigating the dense academic landscape. Our novel automated approach leverages the robust text generation capabilities of LLMs to produce structured scholarly contribution summaries, offering both a practical solution and insights into LLMs’ emergent abilities.For LLMs, the prime focus is on improving their general intelligence as conversational agents. We argue that these models can also be applied effectively in information extraction (IE), specifically in complex IE tasks within terse domains like Science. This paradigm shift replaces the traditional modular, pipelined machine learning approach with a simpler objective expressed through instructions. Our results show that finetuned FLAN-T5 with 1000x fewer parameters than the state-of-the-art GPT-davinci is competitive for the task.","PeriodicalId":508951,"journal":{"name":"Findings","volume":"24 18","pages":"374-392"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Large Language Models for Scientific Information Extraction: An Empirical Study for Virology\",\"authors\":\"Mahsa Shamsabadi, Jennifer D'Souza, Sören Auer\",\"doi\":\"10.48550/arXiv.2401.10040\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we champion the use of structured and semantic content representation of discourse-based scholarly communication, inspired by tools like Wikipedia infoboxes or structured Amazon product descriptions. These representations provide users with a concise overview, aiding scientists in navigating the dense academic landscape. Our novel automated approach leverages the robust text generation capabilities of LLMs to produce structured scholarly contribution summaries, offering both a practical solution and insights into LLMs’ emergent abilities.For LLMs, the prime focus is on improving their general intelligence as conversational agents. We argue that these models can also be applied effectively in information extraction (IE), specifically in complex IE tasks within terse domains like Science. This paradigm shift replaces the traditional modular, pipelined machine learning approach with a simpler objective expressed through instructions. Our results show that finetuned FLAN-T5 with 1000x fewer parameters than the state-of-the-art GPT-davinci is competitive for the task.\",\"PeriodicalId\":508951,\"journal\":{\"name\":\"Findings\",\"volume\":\"24 18\",\"pages\":\"374-392\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Findings\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2401.10040\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Findings","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2401.10040","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

受维基百科信息框或结构化亚马逊产品描述等工具的启发,我们在本文中倡导使用结构化和语义化的内容表示法来表示基于话语的学术交流。这些表示法为用户提供了简明的概览,有助于科学家浏览密集的学术景观。我们新颖的自动方法利用 LLM 强大的文本生成能力来生成结构化的学术贡献摘要,既提供了实用的解决方案,又让人们深入了解了 LLM 的新兴能力。我们认为,这些模型也可以有效地应用于信息提取(IE),特别是科学等简洁领域的复杂 IE 任务。这种范式的转变取代了传统的模块化、流水线式机器学习方法,而是通过指令来表达更简单的目标。我们的研究结果表明,经过微调的 FLAN-T5 比最先进的 GPT-davinci 少了 1000 倍的参数,在执行任务时具有很强的竞争力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Large Language Models for Scientific Information Extraction: An Empirical Study for Virology
In this paper, we champion the use of structured and semantic content representation of discourse-based scholarly communication, inspired by tools like Wikipedia infoboxes or structured Amazon product descriptions. These representations provide users with a concise overview, aiding scientists in navigating the dense academic landscape. Our novel automated approach leverages the robust text generation capabilities of LLMs to produce structured scholarly contribution summaries, offering both a practical solution and insights into LLMs’ emergent abilities.For LLMs, the prime focus is on improving their general intelligence as conversational agents. We argue that these models can also be applied effectively in information extraction (IE), specifically in complex IE tasks within terse domains like Science. This paradigm shift replaces the traditional modular, pipelined machine learning approach with a simpler objective expressed through instructions. Our results show that finetuned FLAN-T5 with 1000x fewer parameters than the state-of-the-art GPT-davinci is competitive for the task.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Changes in Traffic Jams and Injuries Impact on Acceptability of Automated Vehicles: A Strong Curvilinear Relation with no signs of Loss Aversion. Day-of-Week, Month, and Seasonal Demand Variations: Comparing Flow Estimates Across New Travel Data Sources Human Mobility Patterns during the 2024 Total Solar Eclipse in Canada Substituting Car Trips: Does Intermodal Mobility Decrease External Costs and How Does It Affect Travel Times? An Analysis Based on GPS Tracking Data Revealed Preferences for Utilitarian Cycling Energy Expenditure versus Travel Time
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1