{"title":"从 \"嗯 \"到 \"是\":人类对话中信息流的产生、预测和调节","authors":"Claire Augusta Bergey, Simon DeDeo","doi":"arxiv-2403.08890","DOIUrl":null,"url":null,"abstract":"Conversation demands attention. Speakers must call words to mind, listeners\nmust make sense of them, and both together must negotiate this flow of\ninformation, all in fractions of a second. We used large language models to\nstudy how this works in a large-scale dataset of English-language conversation,\nthe CANDOR corpus. We provide a new estimate of the information density of\nunstructured conversation, of approximately 13 bits/second, and find\nsignificant effects associated with the cognitive load of both retrieving, and\npresenting, that information. We also reveal a role for backchannels -- the\nbrief yeahs, uh-huhs, and mhmms that listeners provide -- in regulating the\nproduction of novelty: the lead-up to a backchannel is associated with\ndeclining information rate, while speech downstream rebounds to previous rates.\nOur results provide new insights into long-standing theories of how we respond\nto fluctuating demands on cognitive resources, and how we negotiate those\ndemands in partnership with others.","PeriodicalId":501433,"journal":{"name":"arXiv - CS - Information Theory","volume":"38 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"From \\\"um\\\" to \\\"yeah\\\": Producing, predicting, and regulating information flow in human conversation\",\"authors\":\"Claire Augusta Bergey, Simon DeDeo\",\"doi\":\"arxiv-2403.08890\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Conversation demands attention. Speakers must call words to mind, listeners\\nmust make sense of them, and both together must negotiate this flow of\\ninformation, all in fractions of a second. We used large language models to\\nstudy how this works in a large-scale dataset of English-language conversation,\\nthe CANDOR corpus. We provide a new estimate of the information density of\\nunstructured conversation, of approximately 13 bits/second, and find\\nsignificant effects associated with the cognitive load of both retrieving, and\\npresenting, that information. We also reveal a role for backchannels -- the\\nbrief yeahs, uh-huhs, and mhmms that listeners provide -- in regulating the\\nproduction of novelty: the lead-up to a backchannel is associated with\\ndeclining information rate, while speech downstream rebounds to previous rates.\\nOur results provide new insights into long-standing theories of how we respond\\nto fluctuating demands on cognitive resources, and how we negotiate those\\ndemands in partnership with others.\",\"PeriodicalId\":501433,\"journal\":{\"name\":\"arXiv - CS - Information Theory\",\"volume\":\"38 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Information Theory\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2403.08890\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Information Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2403.08890","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
From "um" to "yeah": Producing, predicting, and regulating information flow in human conversation
Conversation demands attention. Speakers must call words to mind, listeners
must make sense of them, and both together must negotiate this flow of
information, all in fractions of a second. We used large language models to
study how this works in a large-scale dataset of English-language conversation,
the CANDOR corpus. We provide a new estimate of the information density of
unstructured conversation, of approximately 13 bits/second, and find
significant effects associated with the cognitive load of both retrieving, and
presenting, that information. We also reveal a role for backchannels -- the
brief yeahs, uh-huhs, and mhmms that listeners provide -- in regulating the
production of novelty: the lead-up to a backchannel is associated with
declining information rate, while speech downstream rebounds to previous rates.
Our results provide new insights into long-standing theories of how we respond
to fluctuating demands on cognitive resources, and how we negotiate those
demands in partnership with others.