Soon-Gyo Jung, Joni O. Salminen, Haewoon Kwak, Jisun An, B. Jansen
We present Automatic Persona Generation (APG), a methodology and system for quantitative persona generation using large amounts of online social media data. The system is operational, beta deployed with several client organizations in multiple industry verticals and ranging from small-to-medium sized enterprises to large multi-national corporations. Using a robust web framework and stable back-end database, APG is currently processing tens of millions of user interactions with thousands of online digital products on multiple social media platforms, such as Facebook and YouTube. APG identifies both distinct and impactful user segments and then creates persona descriptions by automatically adding pertinent features, such as names, photos, and personal attributes. We present the overall methodological approach, architecture development, and main system features. APG has a potential value for organizations distributing content via online platforms and is unique in its approach to persona generation. APG can be found online at https://persona.qcri.org.
{"title":"Automatic Persona Generation (APG): A Rationale and Demonstration","authors":"Soon-Gyo Jung, Joni O. Salminen, Haewoon Kwak, Jisun An, B. Jansen","doi":"10.1145/3176349.3176893","DOIUrl":"https://doi.org/10.1145/3176349.3176893","url":null,"abstract":"We present Automatic Persona Generation (APG), a methodology and system for quantitative persona generation using large amounts of online social media data. The system is operational, beta deployed with several client organizations in multiple industry verticals and ranging from small-to-medium sized enterprises to large multi-national corporations. Using a robust web framework and stable back-end database, APG is currently processing tens of millions of user interactions with thousands of online digital products on multiple social media platforms, such as Facebook and YouTube. APG identifies both distinct and impactful user segments and then creates persona descriptions by automatically adding pertinent features, such as names, photos, and personal attributes. We present the overall methodological approach, architecture development, and main system features. APG has a potential value for organizations distributing content via online platforms and is unique in its approach to persona generation. APG can be found online at https://persona.qcri.org.","PeriodicalId":198379,"journal":{"name":"Proceedings of the 2018 Conference on Human Information Interaction & Retrieval","volume":"450 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125805077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We propose query priming to activate careful user information seeking in web searches. Query priming employs query auto-completion (QAC) and query suggestion (QS) to present search terms that stimulate critical thinking and encourages careful information seeking and decision making. We conducted an online user study using a crowdsourcing service. Analysis of search behavior logs and questionnaire responses confirmed the following. (1) With query priming, participants issued more queries and (re-)visited search engine result pages more frequently. (2) Query priming promoted webpage selection targeted at evidence-based decision making. (3) The query priming effect varied relative to participant educational background. This study contributes to search interaction design to enhance user engagement in critical thinking in web searches.
{"title":"Query Priming for Promoting Critical Thinking in Web Search","authors":"Yusuke Yamamoto, Takehiro Yamamoto","doi":"10.1145/3176349.3176377","DOIUrl":"https://doi.org/10.1145/3176349.3176377","url":null,"abstract":"We propose query priming to activate careful user information seeking in web searches. Query priming employs query auto-completion (QAC) and query suggestion (QS) to present search terms that stimulate critical thinking and encourages careful information seeking and decision making. We conducted an online user study using a crowdsourcing service. Analysis of search behavior logs and questionnaire responses confirmed the following. (1) With query priming, participants issued more queries and (re-)visited search engine result pages more frequently. (2) Query priming promoted webpage selection targeted at evidence-based decision making. (3) The query priming effect varied relative to participant educational background. This study contributes to search interaction design to enhance user engagement in critical thinking in web searches.","PeriodicalId":198379,"journal":{"name":"Proceedings of the 2018 Conference on Human Information Interaction & Retrieval","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126461930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
As search tasks move beyond targeted search and into the domain of complex search, a substantial cognitive burden is placed on the searcher to craft and refine their queries, evaluate and explore among the search results, and ultimately make use of what is found. In such cases, information visualization techniques may be leveraged to enable searchers to perceive, interpret, and make sense of the information available throughout the search process. This tutorial will establish the fundamental principles and theories of information visualization, explain how information visualization can support interactive information retrieval, and survey search interfaces from my own research that leverage information visualization techniques. The goal of this tutorial will be to encourage researchers to make informed design decisions for how to integrate information visualization into their own interactive information retrieval projects.
{"title":"Information Visualization for Interactive Information Retrieval","authors":"O. Hoeber","doi":"10.1145/3176349.3176898","DOIUrl":"https://doi.org/10.1145/3176349.3176898","url":null,"abstract":"As search tasks move beyond targeted search and into the domain of complex search, a substantial cognitive burden is placed on the searcher to craft and refine their queries, evaluate and explore among the search results, and ultimately make use of what is found. In such cases, information visualization techniques may be leveraged to enable searchers to perceive, interpret, and make sense of the information available throughout the search process. This tutorial will establish the fundamental principles and theories of information visualization, explain how information visualization can support interactive information retrieval, and survey search interfaces from my own research that leverage information visualization techniques. The goal of this tutorial will be to encourage researchers to make informed design decisions for how to integrate information visualization into their own interactive information retrieval projects.","PeriodicalId":198379,"journal":{"name":"Proceedings of the 2018 Conference on Human Information Interaction & Retrieval","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127337546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tarfah Alrashed, Ahmed Hassan Awadallah, S. Dumais
Email continues to be one of the most important means of online communication, leading to a number of challenges related to information overload and email management. To better understand email management practices in detail, we examine the distribution of visits to emails over time. During their lifetime, emails may be visited one or more times, and with each visit different actions may be taken. Emails that are revisited over time are especially interesting because they represent an opportunity to improve email management and search. In this paper, we present a large-scale log analysis of email revisitation, the activities that people perform on revisited email messages (e.g. responding to, organizing or deleting messages, and opening attachments), and the strategies they use to go back to these emails. We find that most emails have a short lifetime, with more than 33% having a lifetime of less than 5 minutes. We also find that deleting is the most common action taken on messages visited once, and that responding and organizing are more common for messages visited more than once. We complement the log analysis with a survey to understand the motivation behind revisits and the types of emails that are revisited. The survey results show that 73% of the visits are to find information (e.g. a link or document, instructions to perform a task, or answers to questions), while 20% of revisits are to respond to the email. Our findings have implications for designing email clients and intelligent agents that support both short- and long-term revisitation patterns.
{"title":"The Lifetime of Email Messages: A Large-Scale Analysis of Email Revisitation","authors":"Tarfah Alrashed, Ahmed Hassan Awadallah, S. Dumais","doi":"10.1145/3176349.3176398","DOIUrl":"https://doi.org/10.1145/3176349.3176398","url":null,"abstract":"Email continues to be one of the most important means of online communication, leading to a number of challenges related to information overload and email management. To better understand email management practices in detail, we examine the distribution of visits to emails over time. During their lifetime, emails may be visited one or more times, and with each visit different actions may be taken. Emails that are revisited over time are especially interesting because they represent an opportunity to improve email management and search. In this paper, we present a large-scale log analysis of email revisitation, the activities that people perform on revisited email messages (e.g. responding to, organizing or deleting messages, and opening attachments), and the strategies they use to go back to these emails. We find that most emails have a short lifetime, with more than 33% having a lifetime of less than 5 minutes. We also find that deleting is the most common action taken on messages visited once, and that responding and organizing are more common for messages visited more than once. We complement the log analysis with a survey to understand the motivation behind revisits and the types of emails that are revisited. The survey results show that 73% of the visits are to find information (e.g. a link or document, instructions to perform a task, or answers to questions), while 20% of revisits are to respond to the email. Our findings have implications for designing email clients and intelligent agents that support both short- and long-term revisitation patterns.","PeriodicalId":198379,"journal":{"name":"Proceedings of the 2018 Conference on Human Information Interaction & Retrieval","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134496680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The detection of echo chambers and information bubbles is becoming increasingly relevant in this era of polarized information. It may be possible to evaluate information trustworthiness by examining the behavior of individuals in belief space rather than evaluating the information itself, which is a harder problem. To explore this, I propose to research a model for information retrieval that integrates two levels of information interaction. On the individual level, I leverage Munson and Resnick»s diversity-seeker, confirmer, and avoider patterns. At a group level, I integrate individual behaviors according to Moskivici»s work on crowd polarization. These perspectives have been integrated in a simulation that employs insights from animal collective behavior to model agent groups, which enable the systematic exploration of belief navigation behaviors that can be detected algorithmically. Viewing information retrieval from the perspective of belief spaces may shed light on current practices and lay out consideration for future design work.
{"title":"With Maps and Mobs: Searching for Trustworthiness using Belief Spaces","authors":"Philip G. Feldman","doi":"10.1145/3176349.3176353","DOIUrl":"https://doi.org/10.1145/3176349.3176353","url":null,"abstract":"The detection of echo chambers and information bubbles is becoming increasingly relevant in this era of polarized information. It may be possible to evaluate information trustworthiness by examining the behavior of individuals in belief space rather than evaluating the information itself, which is a harder problem. To explore this, I propose to research a model for information retrieval that integrates two levels of information interaction. On the individual level, I leverage Munson and Resnick»s diversity-seeker, confirmer, and avoider patterns. At a group level, I integrate individual behaviors according to Moskivici»s work on crowd polarization. These perspectives have been integrated in a simulation that employs insights from animal collective behavior to model agent groups, which enable the systematic exploration of belief navigation behaviors that can be detected algorithmically. Viewing information retrieval from the perspective of belief spaces may shed light on current practices and lay out consideration for future design work.","PeriodicalId":198379,"journal":{"name":"Proceedings of the 2018 Conference on Human Information Interaction & Retrieval","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130511812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Search, especially library search, is a distinct part of the research process which can be taught and supported separately from the scholarly processes of knowledge creation. We interviewed eight early career researchers (ECRs) composed of doctoral students or recent graduates about their overall scholarly workflows including not only search but also social networking around scholarly information and production of scholarly works. Evidence suggests that search itself is less discrete and library-centric than prior models may have suggested, and that students use both social resources and non-library technologies to discover and locate scholarly works. We argue that taking a workflow-centric and collaboration-centric view, rather than a search-centric view, should inform design of tools and training for search of scholarly resources.
{"title":"A Study of Search Practices in Doctoral Student Scholarly Workflows","authors":"S. Ince, C. Hoadley, P. Kirschner","doi":"10.1145/3176349.3176877","DOIUrl":"https://doi.org/10.1145/3176349.3176877","url":null,"abstract":"Search, especially library search, is a distinct part of the research process which can be taught and supported separately from the scholarly processes of knowledge creation. We interviewed eight early career researchers (ECRs) composed of doctoral students or recent graduates about their overall scholarly workflows including not only search but also social networking around scholarly information and production of scholarly works. Evidence suggests that search itself is less discrete and library-centric than prior models may have suggested, and that students use both social resources and non-library technologies to discover and locate scholarly works. We argue that taking a workflow-centric and collaboration-centric view, rather than a search-centric view, should inform design of tools and training for search of scholarly resources.","PeriodicalId":198379,"journal":{"name":"Proceedings of the 2018 Conference on Human Information Interaction & Retrieval","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133717074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this paper, we present work-in-progress results from the stimulated recall portion of a U.S.-based lab study that investigated the influence of financial knowledge and cognitive abilities on the search performance, relevance assessments, and mental workload of adults searching the Internet for personal finance topics. Participants were asked to retrospectively think aloud while viewing screen recordings of one of their search tasks. Qualitative, inductive coding was applied to transcribed interviews. An early theme about government websites and information emerged in the data analysis and that is the topic of this paper. For all three tasks, participants prioritized and valued information from U.S. government websites over that of commercial websites, which seems to contradict recent national surveys indicating low levels of trust in government information sources and the government in general. Our findings suggest that for certain topics, especially those associated with high levels of uncertainty, people might resort to more basic search and evaluation behaviors.
{"title":"Strategies for Finding and Evaluating Information about Personal Finance Topics: The Role of Government Information","authors":"Kathy Brennan, D. Kelly","doi":"10.1145/3176349.3176883","DOIUrl":"https://doi.org/10.1145/3176349.3176883","url":null,"abstract":"In this paper, we present work-in-progress results from the stimulated recall portion of a U.S.-based lab study that investigated the influence of financial knowledge and cognitive abilities on the search performance, relevance assessments, and mental workload of adults searching the Internet for personal finance topics. Participants were asked to retrospectively think aloud while viewing screen recordings of one of their search tasks. Qualitative, inductive coding was applied to transcribed interviews. An early theme about government websites and information emerged in the data analysis and that is the topic of this paper. For all three tasks, participants prioritized and valued information from U.S. government websites over that of commercial websites, which seems to contradict recent national surveys indicating low levels of trust in government information sources and the government in general. Our findings suggest that for certain topics, especially those associated with high levels of uncertainty, people might resort to more basic search and evaluation behaviors.","PeriodicalId":198379,"journal":{"name":"Proceedings of the 2018 Conference on Human Information Interaction & Retrieval","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132288272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Web search is frequently used by people to acquire new knowledge and to satisfy learning-related objectives, but little is known about how a user»s knowledge evolves through the course of a search session. We present a study addressing the knowledge gain of users in informational search sessions. Using crowdsourcing, we recruited 500 distinct users and orchestrated real-world search sessions spanning 10 different topics and information needs. By using scientifically formulated knowledge tests we calibrated the knowledge of users before and after their search sessions, quantifying their knowledge gain. We investigated the impact of information needs on the search behavior and knowledge gain of users, revealing a significant effect of information need on user queries and navigational patterns, but no direct effect on the knowledge gain. Users on average exhibited a higher knowledge gain through search sessions pertaining to topics they were less familiar with. Our findings in this paper contribute important ground work towards advancing current research in understanding user knowledge gain through web search sessions.
{"title":"Analyzing Knowledge Gain of Users in Informational Search Sessions on the Web","authors":"U. Gadiraju, Ran Yu, S. Dietze, Peter Holtz","doi":"10.1145/3176349.3176381","DOIUrl":"https://doi.org/10.1145/3176349.3176381","url":null,"abstract":"Web search is frequently used by people to acquire new knowledge and to satisfy learning-related objectives, but little is known about how a user»s knowledge evolves through the course of a search session. We present a study addressing the knowledge gain of users in informational search sessions. Using crowdsourcing, we recruited 500 distinct users and orchestrated real-world search sessions spanning 10 different topics and information needs. By using scientifically formulated knowledge tests we calibrated the knowledge of users before and after their search sessions, quantifying their knowledge gain. We investigated the impact of information needs on the search behavior and knowledge gain of users, revealing a significant effect of information need on user queries and navigational patterns, but no direct effect on the knowledge gain. Users on average exhibited a higher knowledge gain through search sessions pertaining to topics they were less familiar with. Our findings in this paper contribute important ground work towards advancing current research in understanding user knowledge gain through web search sessions.","PeriodicalId":198379,"journal":{"name":"Proceedings of the 2018 Conference on Human Information Interaction & Retrieval","volume":"60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128473593","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this demo, we present Coagmento, a Web-based, open-source tool for information seeking projects that collects information for individuals and groups and helps facilitate collaborative information seeking. Coagmento has been used in information retrieval and human-computer interaction studies to investigate individual and group information seeking behaviors in a lab or a field setting. In this demo, we discuss what Coagmento is, its past uses in prior studies, and its present state. We also discuss current work in progress. With Coagmento recently passing its 10th anniversary, we discuss our intention to make it a tool that is easy to configure for a human information behavior researcher with little programming skill.
{"title":"Coagmento: Past, Present, and Future of an Individual and Collaborative Information Seeking Platform","authors":"M. Mitsui, Jiqun Liu, C. Shah","doi":"10.1145/3176349.3176896","DOIUrl":"https://doi.org/10.1145/3176349.3176896","url":null,"abstract":"In this demo, we present Coagmento, a Web-based, open-source tool for information seeking projects that collects information for individuals and groups and helps facilitate collaborative information seeking. Coagmento has been used in information retrieval and human-computer interaction studies to investigate individual and group information seeking behaviors in a lab or a field setting. In this demo, we discuss what Coagmento is, its past uses in prior studies, and its present state. We also discuss current work in progress. With Coagmento recently passing its 10th anniversary, we discuss our intention to make it a tool that is easy to configure for a human information behavior researcher with little programming skill.","PeriodicalId":198379,"journal":{"name":"Proceedings of the 2018 Conference on Human Information Interaction & Retrieval","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128380069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Kevin Ong, K. Järvelin, M. Sanderson, Falk Scholer
Typing is a common form of query input for search engines and other information retrieval systems; we therefore investigate the relationship between typing behavior and search interactions. The search process is interactive and typically requires entering one or more queries, and assessing both summaries from Search Engine Result Pages and the underlying documents, to ultimately satisfy some information need. Under the Search Economic Theory model of interactive information retrieval, differences in query costs will result in search behavior changes. We investigate how differences in query inputs themselves may relate to Search Economic Theory by conducting a lab-based experiment to observe how text entries influence subsequent search interactions. Our results indicate that for faster typing speeds, more queries are entered in a session, while both query lengths and assessment times are lower.
输入是搜索引擎和其他信息检索系统查询输入的常见形式;因此,我们研究键入行为和搜索交互之间的关系。搜索过程是交互式的,通常需要输入一个或多个查询,并评估来自search Engine Result Pages和底层文档的摘要,以最终满足某些信息需求。在交互式信息检索的搜索经济理论模型下,查询成本的差异会导致搜索行为的变化。我们通过进行基于实验室的实验来观察文本条目如何影响随后的搜索交互,研究查询输入本身的差异如何与搜索经济理论相关。我们的结果表明,对于更快的输入速度,在会话中输入的查询更多,而查询长度和评估时间都更短。
{"title":"QWERTY: The Effects of Typing on Web Search Behavior","authors":"Kevin Ong, K. Järvelin, M. Sanderson, Falk Scholer","doi":"10.1145/3176349.3176872","DOIUrl":"https://doi.org/10.1145/3176349.3176872","url":null,"abstract":"Typing is a common form of query input for search engines and other information retrieval systems; we therefore investigate the relationship between typing behavior and search interactions. The search process is interactive and typically requires entering one or more queries, and assessing both summaries from Search Engine Result Pages and the underlying documents, to ultimately satisfy some information need. Under the Search Economic Theory model of interactive information retrieval, differences in query costs will result in search behavior changes. We investigate how differences in query inputs themselves may relate to Search Economic Theory by conducting a lab-based experiment to observe how text entries influence subsequent search interactions. Our results indicate that for faster typing speeds, more queries are entered in a session, while both query lengths and assessment times are lower.","PeriodicalId":198379,"journal":{"name":"Proceedings of the 2018 Conference on Human Information Interaction & Retrieval","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126272068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}