{"title":"From \"um\" to \"yeah\": Producing, predicting, and regulating information flow in human conversation","authors":"Claire Augusta Bergey, Simon DeDeo","doi":"arxiv-2403.08890","DOIUrl":null,"url":null,"abstract":"Conversation demands attention. Speakers must call words to mind, listeners\nmust make sense of them, and both together must negotiate this flow of\ninformation, all in fractions of a second. We used large language models to\nstudy how this works in a large-scale dataset of English-language conversation,\nthe CANDOR corpus. We provide a new estimate of the information density of\nunstructured conversation, of approximately 13 bits/second, and find\nsignificant effects associated with the cognitive load of both retrieving, and\npresenting, that information. We also reveal a role for backchannels -- the\nbrief yeahs, uh-huhs, and mhmms that listeners provide -- in regulating the\nproduction of novelty: the lead-up to a backchannel is associated with\ndeclining information rate, while speech downstream rebounds to previous rates.\nOur results provide new insights into long-standing theories of how we respond\nto fluctuating demands on cognitive resources, and how we negotiate those\ndemands in partnership with others.","PeriodicalId":501433,"journal":{"name":"arXiv - CS - Information Theory","volume":"38 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Information Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2403.08890","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Conversation demands attention. Speakers must call words to mind, listeners
must make sense of them, and both together must negotiate this flow of
information, all in fractions of a second. We used large language models to
study how this works in a large-scale dataset of English-language conversation,
the CANDOR corpus. We provide a new estimate of the information density of
unstructured conversation, of approximately 13 bits/second, and find
significant effects associated with the cognitive load of both retrieving, and
presenting, that information. We also reveal a role for backchannels -- the
brief yeahs, uh-huhs, and mhmms that listeners provide -- in regulating the
production of novelty: the lead-up to a backchannel is associated with
declining information rate, while speech downstream rebounds to previous rates.
Our results provide new insights into long-standing theories of how we respond
to fluctuating demands on cognitive resources, and how we negotiate those
demands in partnership with others.