ClioQuery:面向交互式查询的历史新闻档案综合调查文本分析

IF 3.6 4区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE ACM Transactions on Interactive Intelligent Systems Pub Date : 2022-07-26 DOI:https://dl.acm.org/doi/10.1145/3524025
Abram Handler, Narges Mahyar, Brendan O’Connor
{"title":"ClioQuery:面向交互式查询的历史新闻档案综合调查文本分析","authors":"Abram Handler, Narges Mahyar, Brendan O’Connor","doi":"https://dl.acm.org/doi/10.1145/3524025","DOIUrl":null,"url":null,"abstract":"<p>Historians and archivists often find and analyze the occurrences of query words in newspaper archives to help answer fundamental questions about society. But much work in text analytics focuses on helping people investigate other textual units, such as events, clusters, ranked documents, entity relationships, or thematic hierarchies. Informed by a study into the needs of historians and archivists, we thus propose <span>ClioQuery</span>, a text analytics system uniquely organized around the analysis of query words in context. <span>ClioQuery</span> applies text simplification techniques from natural language processing to help historians quickly and comprehensively gather and analyze all occurrences of a query word across an archive. It also pairs these new NLP methods with more traditional features like linked views and in-text highlighting to help engender trust in summarization techniques. We evaluate <span>ClioQuery</span> with two separate user studies, in which historians explain how <span>ClioQuery</span>’s novel text simplification features can help facilitate historical research. We also evaluate with a separate quantitative comparison study, which shows that <span>ClioQuery</span> helps crowdworkers find and remember historical information. Such results suggest possible new directions for text analytics in other query-oriented settings.</p>","PeriodicalId":48574,"journal":{"name":"ACM Transactions on Interactive Intelligent Systems","volume":"39 1","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ClioQuery: Interactive Query-oriented Text Analytics for Comprehensive Investigation of Historical News Archives\",\"authors\":\"Abram Handler, Narges Mahyar, Brendan O’Connor\",\"doi\":\"https://dl.acm.org/doi/10.1145/3524025\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Historians and archivists often find and analyze the occurrences of query words in newspaper archives to help answer fundamental questions about society. But much work in text analytics focuses on helping people investigate other textual units, such as events, clusters, ranked documents, entity relationships, or thematic hierarchies. Informed by a study into the needs of historians and archivists, we thus propose <span>ClioQuery</span>, a text analytics system uniquely organized around the analysis of query words in context. <span>ClioQuery</span> applies text simplification techniques from natural language processing to help historians quickly and comprehensively gather and analyze all occurrences of a query word across an archive. It also pairs these new NLP methods with more traditional features like linked views and in-text highlighting to help engender trust in summarization techniques. We evaluate <span>ClioQuery</span> with two separate user studies, in which historians explain how <span>ClioQuery</span>’s novel text simplification features can help facilitate historical research. We also evaluate with a separate quantitative comparison study, which shows that <span>ClioQuery</span> helps crowdworkers find and remember historical information. Such results suggest possible new directions for text analytics in other query-oriented settings.</p>\",\"PeriodicalId\":48574,\"journal\":{\"name\":\"ACM Transactions on Interactive Intelligent Systems\",\"volume\":\"39 1\",\"pages\":\"\"},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2022-07-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACM Transactions on Interactive Intelligent Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/https://dl.acm.org/doi/10.1145/3524025\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Interactive Intelligent Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/https://dl.acm.org/doi/10.1145/3524025","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

历史学家和档案保管员经常在报纸档案中发现并分析查询词的出现情况,以帮助回答有关社会的基本问题。但是,文本分析中的许多工作侧重于帮助人们调查其他文本单元,例如事件、集群、排序文档、实体关系或主题层次结构。通过对历史学家和档案管理员需求的研究,我们提出了ClioQuery,这是一个围绕上下文查询词分析的文本分析系统。ClioQuery应用自然语言处理中的文本简化技术,帮助历史学家快速、全面地收集和分析档案中查询词的所有出现情况。它还将这些新的NLP方法与更传统的功能(如链接视图和文本内高亮显示)相结合,以帮助生成对摘要技术的信任。我们用两个独立的用户研究来评估ClioQuery,其中历史学家解释了ClioQuery新颖的文本简化功能如何有助于促进历史研究。我们还通过一项单独的定量比较研究进行了评估,该研究表明ClioQuery可以帮助众包工作者找到并记住历史信息。这些结果为其他面向查询的设置中的文本分析提供了可能的新方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ClioQuery: Interactive Query-oriented Text Analytics for Comprehensive Investigation of Historical News Archives

Historians and archivists often find and analyze the occurrences of query words in newspaper archives to help answer fundamental questions about society. But much work in text analytics focuses on helping people investigate other textual units, such as events, clusters, ranked documents, entity relationships, or thematic hierarchies. Informed by a study into the needs of historians and archivists, we thus propose ClioQuery, a text analytics system uniquely organized around the analysis of query words in context. ClioQuery applies text simplification techniques from natural language processing to help historians quickly and comprehensively gather and analyze all occurrences of a query word across an archive. It also pairs these new NLP methods with more traditional features like linked views and in-text highlighting to help engender trust in summarization techniques. We evaluate ClioQuery with two separate user studies, in which historians explain how ClioQuery’s novel text simplification features can help facilitate historical research. We also evaluate with a separate quantitative comparison study, which shows that ClioQuery helps crowdworkers find and remember historical information. Such results suggest possible new directions for text analytics in other query-oriented settings.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
ACM Transactions on Interactive Intelligent Systems
ACM Transactions on Interactive Intelligent Systems Computer Science-Human-Computer Interaction
CiteScore
7.80
自引率
2.90%
发文量
38
期刊介绍: The ACM Transactions on Interactive Intelligent Systems (TiiS) publishes papers on research concerning the design, realization, or evaluation of interactive systems that incorporate some form of machine intelligence. TIIS articles come from a wide range of research areas and communities. An article can take any of several complementary views of interactive intelligent systems, focusing on: the intelligent technology, the interaction of users with the system, or both aspects at once.
期刊最新文献
Categorical and Continuous Features in Counterfactual Explanations of AI Systems ID.8: Co-Creating Visual Stories with Generative AI Visualization for Recommendation Explainability: A Survey and New Perspectives Unpacking Human-AI interactions: From interaction primitives to a design space AutoRL X: Automated Reinforcement Learning on the Web
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1