可信度加权整合人机判断,实现卓越决策

Felipe Yáñez, Xiaoliang Luo, Omar Valerio Minero, Bradley C. Love
{"title":"可信度加权整合人机判断,实现卓越决策","authors":"Felipe Yáñez, Xiaoliang Luo, Omar Valerio Minero, Bradley C. Love","doi":"arxiv-2408.08083","DOIUrl":null,"url":null,"abstract":"Large language models (LLMs) have emerged as powerful tools in various\ndomains. Recent studies have shown that LLMs can surpass humans in certain\ntasks, such as predicting the outcomes of neuroscience studies. What role does\nthis leave for humans in the overall decision process? One possibility is that\nhumans, despite performing worse than LLMs, can still add value when teamed\nwith them. A human and machine team can surpass each individual teammate when\nteam members' confidence is well-calibrated and team members diverge in which\ntasks they find difficult (i.e., calibration and diversity are needed). We\nsimplified and extended a Bayesian approach to combining judgments using a\nlogistic regression framework that integrates confidence-weighted judgments for\nany number of team members. Using this straightforward method, we demonstrated\nin a neuroscience forecasting task that, even when humans were inferior to\nLLMs, their combination with one or more LLMs consistently improved team\nperformance. Our hope is that this simple and effective strategy for\nintegrating the judgments of humans and machines will lead to productive\ncollaborations.","PeriodicalId":501517,"journal":{"name":"arXiv - QuanBio - Neurons and Cognition","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Confidence-weighted integration of human and machine judgments for superior decision-making\",\"authors\":\"Felipe Yáñez, Xiaoliang Luo, Omar Valerio Minero, Bradley C. Love\",\"doi\":\"arxiv-2408.08083\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Large language models (LLMs) have emerged as powerful tools in various\\ndomains. Recent studies have shown that LLMs can surpass humans in certain\\ntasks, such as predicting the outcomes of neuroscience studies. What role does\\nthis leave for humans in the overall decision process? One possibility is that\\nhumans, despite performing worse than LLMs, can still add value when teamed\\nwith them. A human and machine team can surpass each individual teammate when\\nteam members' confidence is well-calibrated and team members diverge in which\\ntasks they find difficult (i.e., calibration and diversity are needed). We\\nsimplified and extended a Bayesian approach to combining judgments using a\\nlogistic regression framework that integrates confidence-weighted judgments for\\nany number of team members. Using this straightforward method, we demonstrated\\nin a neuroscience forecasting task that, even when humans were inferior to\\nLLMs, their combination with one or more LLMs consistently improved team\\nperformance. Our hope is that this simple and effective strategy for\\nintegrating the judgments of humans and machines will lead to productive\\ncollaborations.\",\"PeriodicalId\":501517,\"journal\":{\"name\":\"arXiv - QuanBio - Neurons and Cognition\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Neurons and Cognition\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.08083\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Neurons and Cognition","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.08083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

大型语言模型(LLM)已成为各个领域的强大工具。最近的研究表明,大型语言模型在某些任务中可以超越人类,例如预测神经科学研究的结果。在整个决策过程中,人类还能扮演什么角色?一种可能性是,尽管人类的表现不如 LLM,但如果与 LLM 组成团队,仍然可以增加价值。如果团队成员的信心得到了很好的校准,并且团队成员在他们认为困难的任务上存在分歧(即需要校准和多样性),那么人类与机器的团队就可以超越每个单独的队友。我们简化并扩展了贝叶斯方法,使用逻辑回归框架来综合判断,该框架综合了任意数量团队成员的信心加权判断。利用这种简单易行的方法,我们在一项神经科学预测任务中证明,即使人类不如 LLMs,他们与一个或多个 LLMs 的组合也能持续改善团队表现。我们希望这种将人类和机器的判断结合起来的简单而有效的策略能够带来富有成效的合作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Confidence-weighted integration of human and machine judgments for superior decision-making
Large language models (LLMs) have emerged as powerful tools in various domains. Recent studies have shown that LLMs can surpass humans in certain tasks, such as predicting the outcomes of neuroscience studies. What role does this leave for humans in the overall decision process? One possibility is that humans, despite performing worse than LLMs, can still add value when teamed with them. A human and machine team can surpass each individual teammate when team members' confidence is well-calibrated and team members diverge in which tasks they find difficult (i.e., calibration and diversity are needed). We simplified and extended a Bayesian approach to combining judgments using a logistic regression framework that integrates confidence-weighted judgments for any number of team members. Using this straightforward method, we demonstrated in a neuroscience forecasting task that, even when humans were inferior to LLMs, their combination with one or more LLMs consistently improved team performance. Our hope is that this simple and effective strategy for integrating the judgments of humans and machines will lead to productive collaborations.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Early reduced dopaminergic tone mediated by D3 receptor and dopamine transporter in absence epileptogenesis Contrasformer: A Brain Network Contrastive Transformer for Neurodegenerative Condition Identification Identifying Influential nodes in Brain Networks via Self-Supervised Graph-Transformer Contrastive Learning in Memristor-based Neuromorphic Systems Self-Attention Limits Working Memory Capacity of Transformer-Based Models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1