用于将药学实践出版物分类到研究领域的深度神经网络模型。

IF 3.7 3区医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH Research in Social & Administrative Pharmacy Pub Date : 2025-02-01 Epub Date: 2024-11-05 DOI:10.1016/j.sapharm.2024.10.009

Samuel O Adeosun, Afua B Faibille, Aisha N Qadir, Jerotich T Mutwol, Taylor McMannen

{"title":"用于将药学实践出版物分类到研究领域的深度神经网络模型。","authors":"Samuel O Adeosun, Afua B Faibille, Aisha N Qadir, Jerotich T Mutwol, Taylor McMannen","doi":"10.1016/j.sapharm.2024.10.009","DOIUrl":null,"url":null,"abstract":"Background: Pharmacy practice faculty research profiles extend beyond the clinical and social domains, which are core elements of pharmacy practice. But as highlighted by journal editors in the Granada Statements, there is no consensus on these terms. Four domains (clinical, education, social & administrative, and basic & translational) of pharmacy practice faculty research are proposed.Objectives: To develop a classifier for categorizing pharmacy practice faculty publications into four proposed domains, and to compare the model with zero-shot performances of state-of-the-art, general purpose large language models (gpLLMs).Methods: One thousand abstracts from 2018 to 2021 documents published by pharmacy practice faculty were reviewed, labelled and used to screen and finetune several Bidirectional Encoders Representations from Transformers (BERT) models. The selected model was compared with zero-shot performances of 7 state-of-the-art gpLLMs including ChatGPT-4o, Gemini-1.5-Pro, Claude-3.5, LLAMA-3.1 and Mistral Large, using 80 randomly selected abstracts from 2023 publications labelled with ≥80% consensus by all authors. Classification metrics included F1, recall, precision and accuracy, and reproducibility was measured with Cohen's kappa. A use case was demonstrated by testing the null hypothesis that the research domain distribution of faculty publications was independent of the pandemic.Result: The model - Pharmacy Practice Research Domain Classifier (PPRDC) produced a 5-fold stratified cross-validation metrics of 89.4 ± 1.7, 90.2 ± 2.2, 89.0 ± 1.7, and 95.5 ± 0.6, for F1, recall, precision and accuracy, respectively. PPRDC produced perfectly reproducible classifications (Cohen's kappa = 1.0) and outperformed zero-shot performances of all gpLLMs. F1 scores were 96.2 ± 1.6, 92.7 ± 1.2, 85.8 ± 3.2, and 83.1 ± 9.8 for education, clinical, social, and translational domains, respectively.Conclusions: PPRDC (https://sadeosun-pprdc.streamlit.app) performed better than gpLLMs in this abstract classification task. Among several other impacts, PPRDC opens a new frontier in bibliometric studies; it will also advance the goals of the Grenada Statements by aiding authors and journal editors in journal selection and article prioritization decisions, respectively.","PeriodicalId":48126,"journal":{"name":"Research in Social & Administrative Pharmacy","volume":" ","pages":"85-93"},"PeriodicalIF":3.7000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A deep neural network model for classifying pharmacy practice publications into research domains.\",\"authors\":\"Samuel O Adeosun, Afua B Faibille, Aisha N Qadir, Jerotich T Mutwol, Taylor McMannen\",\"doi\":\"10.1016/j.sapharm.2024.10.009\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Pharmacy practice faculty research profiles extend beyond the clinical and social domains, which are core elements of pharmacy practice. But as highlighted by journal editors in the Granada Statements, there is no consensus on these terms. Four domains (clinical, education, social & administrative, and basic & translational) of pharmacy practice faculty research are proposed.Objectives: To develop a classifier for categorizing pharmacy practice faculty publications into four proposed domains, and to compare the model with zero-shot performances of state-of-the-art, general purpose large language models (gpLLMs).Methods: One thousand abstracts from 2018 to 2021 documents published by pharmacy practice faculty were reviewed, labelled and used to screen and finetune several Bidirectional Encoders Representations from Transformers (BERT) models. The selected model was compared with zero-shot performances of 7 state-of-the-art gpLLMs including ChatGPT-4o, Gemini-1.5-Pro, Claude-3.5, LLAMA-3.1 and Mistral Large, using 80 randomly selected abstracts from 2023 publications labelled with ≥80% consensus by all authors. Classification metrics included F1, recall, precision and accuracy, and reproducibility was measured with Cohen's kappa. A use case was demonstrated by testing the null hypothesis that the research domain distribution of faculty publications was independent of the pandemic.Result: The model - Pharmacy Practice Research Domain Classifier (PPRDC) produced a 5-fold stratified cross-validation metrics of 89.4 ± 1.7, 90.2 ± 2.2, 89.0 ± 1.7, and 95.5 ± 0.6, for F1, recall, precision and accuracy, respectively. PPRDC produced perfectly reproducible classifications (Cohen's kappa = 1.0) and outperformed zero-shot performances of all gpLLMs. F1 scores were 96.2 ± 1.6, 92.7 ± 1.2, 85.8 ± 3.2, and 83.1 ± 9.8 for education, clinical, social, and translational domains, respectively.Conclusions: PPRDC (https://sadeosun-pprdc.streamlit.app) performed better than gpLLMs in this abstract classification task. Among several other impacts, PPRDC opens a new frontier in bibliometric studies; it will also advance the goals of the Grenada Statements by aiding authors and journal editors in journal selection and article prioritization decisions, respectively.\",\"PeriodicalId\":48126,\"journal\":{\"name\":\"Research in Social & Administrative Pharmacy\",\"volume\":\" \",\"pages\":\"85-93\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2025-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Research in Social & Administrative Pharmacy\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.sapharm.2024.10.009\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/11/5 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Research in Social & Administrative Pharmacy","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.sapharm.2024.10.009","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/11/5 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}

引用次数: 0

摘要

背景：药学实践教师的研究范围超出了临床和社会领域，这是药学实践的核心要素。但正如期刊编辑在《格拉纳达声明》中强调的那样，这些术语尚未达成共识。提出了药学实践教师研究的四个领域（临床、教育、社会与管理、基础与转化）：开发一种分类器，用于将药学实践教师的出版物归入四个建议的领域，并将该模型与最先进的通用大型语言模型（gpLLMs）的零射性能进行比较：对 2018 年至 2021 年药学实践教师发表的一千篇文献摘要进行了审查、标注，并用于筛选和微调多个双向编码器变换器表征（BERT）模型。利用从 2023 篇出版物中随机选取的 80 篇摘要，并在所有作者达成≥80%共识的情况下进行标注，将所选模型与 7 种最先进的 gpLLM（包括 ChatGPT-4o、Gemini-1.5-Pro、Claude-3.5、LLAMA-3.1 和 Mistral Large）的零射性能进行了比较。分类指标包括 F1、召回率、精确度和准确度，可重复性用 Cohen's kappa 表示。通过测试教职员工出版物的研究领域分布与大流行无关的零假设，演示了一个使用案例：结果：药学实践研究领域分类器（PPRDC）模型的 F1、召回率、精确度和准确度的 5 倍分层交叉验证指标分别为 89.4 ± 1.7、90.2 ± 2.2、89.0 ± 1.7 和 95.5 ± 0.6。PPRDC 的分类结果具有完美的可重复性（Cohen's kappa = 1.0），其表现优于所有 gpLLM 的零点扫描结果。教育、临床、社会和转化领域的 F1 分数分别为 96.2 ± 1.6、92.7 ± 1.2、85.8 ± 3.2 和 83.1 ± 9.8：在这项抽象分类任务中，PPRDC（https://sadeosun-pprdc.streamlit.app）的表现优于 gpLLMs。除其他影响外，PPRDC 还为文献计量学研究开辟了一个新领域；它还将分别帮助作者和期刊编辑做出期刊选择和文章优先级决定，从而推进格林纳达声明的目标。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A deep neural network model for classifying pharmacy practice publications into research domains.

Background: Pharmacy practice faculty research profiles extend beyond the clinical and social domains, which are core elements of pharmacy practice. But as highlighted by journal editors in the Granada Statements, there is no consensus on these terms. Four domains (clinical, education, social & administrative, and basic & translational) of pharmacy practice faculty research are proposed.

Objectives: To develop a classifier for categorizing pharmacy practice faculty publications into four proposed domains, and to compare the model with zero-shot performances of state-of-the-art, general purpose large language models (gpLLMs).

Methods: One thousand abstracts from 2018 to 2021 documents published by pharmacy practice faculty were reviewed, labelled and used to screen and finetune several Bidirectional Encoders Representations from Transformers (BERT) models. The selected model was compared with zero-shot performances of 7 state-of-the-art gpLLMs including ChatGPT-4o, Gemini-1.5-Pro, Claude-3.5, LLAMA-3.1 and Mistral Large, using 80 randomly selected abstracts from 2023 publications labelled with ≥80% consensus by all authors. Classification metrics included F1, recall, precision and accuracy, and reproducibility was measured with Cohen's kappa. A use case was demonstrated by testing the null hypothesis that the research domain distribution of faculty publications was independent of the pandemic.

Result: The model - Pharmacy Practice Research Domain Classifier (PPRDC) produced a 5-fold stratified cross-validation metrics of 89.4 ± 1.7, 90.2 ± 2.2, 89.0 ± 1.7, and 95.5 ± 0.6, for F1, recall, precision and accuracy, respectively. PPRDC produced perfectly reproducible classifications (Cohen's kappa = 1.0) and outperformed zero-shot performances of all gpLLMs. F1 scores were 96.2 ± 1.6, 92.7 ± 1.2, 85.8 ± 3.2, and 83.1 ± 9.8 for education, clinical, social, and translational domains, respectively.

Conclusions: PPRDC (https://sadeosun-pprdc.streamlit.app) performed better than gpLLMs in this abstract classification task. Among several other impacts, PPRDC opens a new frontier in bibliometric studies; it will also advance the goals of the Grenada Statements by aiding authors and journal editors in journal selection and article prioritization decisions, respectively.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Research in Social & Administrative Pharmacy PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH-

CiteScore

7.20

自引率

10.30%

发文量

225

审稿时长

47 days

期刊介绍： Research in Social and Administrative Pharmacy (RSAP) is a quarterly publication featuring original scientific reports and comprehensive review articles in the social and administrative pharmaceutical sciences. Topics of interest include outcomes evaluation of products, programs, or services; pharmacoepidemiology; medication adherence; direct-to-consumer advertising of prescription medications; disease state management; health systems reform; drug marketing; medication distribution systems such as e-prescribing; web-based pharmaceutical/medical services; drug commerce and re-importation; and health professions workforce issues.