首页 > 最新文献

Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval最新文献

英文 中文
Deep Chit-Chat: Deep Learning for Chatbots 深度聊天:用于聊天机器人的深度学习
Wei Wu, Rui Yan
The tutorial is based on our long-term research on open domain conversation, rich hands-on experience on development of Microsoft XiaoIce, and our previous tutorials on EMNLP 2018 and the Web Conference 2019. It starts from a summary of recent achievement made by both academia and industry on chatbots, and then performs a thorough and systematic introduction to state-of-the-art methods for open domain conversation modeling including both retrieval-based methods and generation-based methods. In addition to these, the tutorial also covers some new progress on both groups of methods, such as transition from model design to model learning, transition from knowledge agnostic conversation to knowledge aware conversation, and transition from single-modal conversation to multi-modal conversation. The tutorial is ended by some promising future directions such as how to combine non-task-oriented dialogue systems with task-oriented dialogue systems and how to enhance language learning with chatbots.
本教程基于我们对开放域会话的长期研究,微软小冰开发的丰富实践经验,以及我们之前关于EMNLP 2018和Web Conference 2019的教程。本文首先总结了学术界和工业界在聊天机器人方面取得的最新成就,然后对开放领域对话建模的最新方法进行了全面而系统的介绍,包括基于检索的方法和基于生成的方法。除此之外,本教程还涵盖了两组方法的一些新进展,例如从模型设计到模型学习的过渡,从知识不可知论对话到知识感知对话的过渡,以及从单模态对话到多模态对话的过渡。教程的最后介绍了一些有前景的未来方向,比如如何将非任务导向的对话系统与任务导向的对话系统结合起来,以及如何利用聊天机器人增强语言学习。
{"title":"Deep Chit-Chat: Deep Learning for Chatbots","authors":"Wei Wu, Rui Yan","doi":"10.1145/3331184.3331388","DOIUrl":"https://doi.org/10.1145/3331184.3331388","url":null,"abstract":"The tutorial is based on our long-term research on open domain conversation, rich hands-on experience on development of Microsoft XiaoIce, and our previous tutorials on EMNLP 2018 and the Web Conference 2019. It starts from a summary of recent achievement made by both academia and industry on chatbots, and then performs a thorough and systematic introduction to state-of-the-art methods for open domain conversation modeling including both retrieval-based methods and generation-based methods. In addition to these, the tutorial also covers some new progress on both groups of methods, such as transition from model design to model learning, transition from knowledge agnostic conversation to knowledge aware conversation, and transition from single-modal conversation to multi-modal conversation. The tutorial is ended by some promising future directions such as how to combine non-task-oriented dialogue systems with task-oriented dialogue systems and how to enhance language learning with chatbots.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"232 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72896521","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Normalized Query Commitment Revisited 重新访问规范化查询承诺
Haggai Roitman
We revisit the Normalized Query Commitment (NQC) query performance prediction (QPP) method. To this end, we suggest a scaled extension to a discriminative QPP framework and use it to analyze NQC. Using this analysis allows us to redesign NQC and suggest several options for improvement.
我们回顾了规范化查询承诺(NQC)查询性能预测(QPP)方法。为此,我们建议对判别性QPP框架进行扩展,并将其用于NQC分析。利用这种分析,我们可以重新设计NQC,并提出几个改进方案。
{"title":"Normalized Query Commitment Revisited","authors":"Haggai Roitman","doi":"10.1145/3331184.3331334","DOIUrl":"https://doi.org/10.1145/3331184.3331334","url":null,"abstract":"We revisit the Normalized Query Commitment (NQC) query performance prediction (QPP) method. To this end, we suggest a scaled extension to a discriminative QPP framework and use it to analyze NQC. Using this analysis allows us to redesign NQC and suggest several options for improvement.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"15 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74084262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Session details: Session 6A: Social Media 会议详情:6A:社交媒体
J. Mothe
{"title":"Session details: Session 6A: Social Media","authors":"J. Mothe","doi":"10.1145/3349689","DOIUrl":"https://doi.org/10.1145/3349689","url":null,"abstract":"","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"221 2","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72578140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories 从文本到声音:广播故事音效检索的初步研究
Songwei Ge, Curtis Xuan, Ruihua Song, Chao Zou, Wei Liu, Jin Zhou
Sound effects play an essential role in producing high-quality radio stories but require enormous labor cost to add. In this paper, we address the problem of automatically adding sound effects to radio stories with a retrieval-based model. However, directly implementing a tag-based retrieval model leads to high false positives due to the ambiguity of story contents. To solve this problem, we introduce a retrieval-based framework hybridized with a semantic inference model which helps to achieve robust retrieval results. Our model relies on fine-designed features extracted from the context of candidate triggers. We collect two story dubbing datasets through crowdsourcing to analyze the setting of adding sound effects and to train and test our proposed methods. We further discuss the importance of each feature and introduce several heuristic rules for the trade-off between precision and recall. Together with the text-to-speech technology, our results reveal a promising automatic pipeline on producing high-quality radio stories.
声音效果在制作高质量的广播故事中起着至关重要的作用,但需要大量的劳动力成本。在本文中,我们使用基于检索的模型解决了自动向广播故事添加声音效果的问题。然而,由于故事内容的模糊性,直接实现基于标签的检索模型会导致高误报。为了解决这个问题,我们引入了一个基于检索的框架和一个语义推理模型,以帮助实现鲁棒的检索结果。我们的模型依赖于从候选触发器的上下文中提取的精心设计的特征。我们通过众包收集了两个故事配音数据集来分析添加音效的设置,并训练和测试我们提出的方法。我们进一步讨论了每个特征的重要性,并引入了几个启发式规则来权衡精度和召回率。结合文本转语音技术,我们的研究结果揭示了一个有前途的生产高质量广播故事的自动管道。
{"title":"From Text to Sound: A Preliminary Study on Retrieving Sound Effects to Radio Stories","authors":"Songwei Ge, Curtis Xuan, Ruihua Song, Chao Zou, Wei Liu, Jin Zhou","doi":"10.1145/3331184.3331274","DOIUrl":"https://doi.org/10.1145/3331184.3331274","url":null,"abstract":"Sound effects play an essential role in producing high-quality radio stories but require enormous labor cost to add. In this paper, we address the problem of automatically adding sound effects to radio stories with a retrieval-based model. However, directly implementing a tag-based retrieval model leads to high false positives due to the ambiguity of story contents. To solve this problem, we introduce a retrieval-based framework hybridized with a semantic inference model which helps to achieve robust retrieval results. Our model relies on fine-designed features extracted from the context of candidate triggers. We collect two story dubbing datasets through crowdsourcing to analyze the setting of adding sound effects and to train and test our proposed methods. We further discuss the importance of each feature and introduce several heuristic rules for the trade-off between precision and recall. Together with the text-to-speech technology, our results reveal a promising automatic pipeline on producing high-quality radio stories.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74648903","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Legal Intelligence for E-commerce: Multi-task Learning by Leveraging Multiview Dispute Representation 电子商务的法律智能:利用多视角纠纷表示的多任务学习
Xin Zhou, Yating Zhang, Xiaozhong Liu, Changlong Sun, Luo Si
Various e-commerce platforms produce millions of transactions per day with many transaction disputes. This generates the demand for effective and efficient dispute resolutions for e-commerce transactions. This paper proposes a novel research task of Legal Dispute Judgment (LDJ) prediction for e-commerce transactions, which connects two yet isolated domains, e-commerce data mining and legal intelligence. Different from traditional legal intelligence with the focus on textual evidence of the dispute itself, the new research utilizes multiview information such as past behavior information of seller and buyer as well as textual evidence of the current transaction. The multiview dispute representation is integrated into an innovative multi-task learning framework for predicting the legal result. An extensive set of experiments with a large dispute case dataset collected from a world leading e-commerce platform shows that the proposed model can more accurately characterize a dispute case through buyer, seller, and transaction viewpoints for legal judgment prediction against several alternatives.
各种电子商务平台每天产生数百万笔交易,交易纠纷也很多。这就产生了对电子商务交易中有效和高效的争议解决方案的需求。本文提出了一个新的研究课题——电子商务交易的法律纠纷判决预测,它将电子商务数据挖掘和法律智能两个孤立的领域联系起来。与传统的法律情报侧重于争议本身的文本证据不同,新的研究利用了卖方和买方过去的行为信息以及当前交易的文本证据等多视角信息。将多视角纠纷表示集成到一个创新的多任务学习框架中,用于预测法律结果。从世界领先的电子商务平台收集的大型争议案件数据集进行的大量实验表明,所提出的模型可以更准确地通过买方,卖方和交易观点来描述争议案件,以便针对几种替代方案进行法律判决预测。
{"title":"Legal Intelligence for E-commerce: Multi-task Learning by Leveraging Multiview Dispute Representation","authors":"Xin Zhou, Yating Zhang, Xiaozhong Liu, Changlong Sun, Luo Si","doi":"10.1145/3331184.3331212","DOIUrl":"https://doi.org/10.1145/3331184.3331212","url":null,"abstract":"Various e-commerce platforms produce millions of transactions per day with many transaction disputes. This generates the demand for effective and efficient dispute resolutions for e-commerce transactions. This paper proposes a novel research task of Legal Dispute Judgment (LDJ) prediction for e-commerce transactions, which connects two yet isolated domains, e-commerce data mining and legal intelligence. Different from traditional legal intelligence with the focus on textual evidence of the dispute itself, the new research utilizes multiview information such as past behavior information of seller and buyer as well as textual evidence of the current transaction. The multiview dispute representation is integrated into an innovative multi-task learning framework for predicting the legal result. An extensive set of experiments with a large dispute case dataset collected from a world leading e-commerce platform shows that the proposed model can more accurately characterize a dispute case through buyer, seller, and transaction viewpoints for legal judgment prediction against several alternatives.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"46 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75007716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Sequence and Time Aware Neighborhood for Session-based Recommendations: STAN 基于会话的推荐的序列和时间感知邻域:STAN
Diksha Garg, Priyanka Gupta, Pankaj Malhotra, L. Vig, Gautam M. Shroff
Recent advances in sequence-aware approaches for session-based recommendation, such as those based on recurrent neural networks, highlight the importance of leveraging sequential information from a session while making recommendations. Further, a session based k-nearest-neighbors approach (SKNN) has proven to be a strong baseline for session-based recommendations. However, SKNN does not take into account the readily available sequential and temporal information from sessions. In this work, we propose Sequence and Time Aware Neighborhood (STAN), with vanilla SKNN as its special case. STAN takes into account the following factors for making recommendations: i) position of an item in the current session, ii) recency of a past session w.r.t. to the current session, and iii) position of a recommendable item in a neighboring session. The importance of above factors for a specific application can be adjusted via controllable decay factors. Despite being simple, intuitive and easy to implement, empirical evaluation on three real-world datasets shows that STAN significantly improves over SKNN, and is even comparable to the recently proposed state-of-the-art deep learning approaches. Our results suggest that STAN can be considered as a strong baseline for evaluating session-based recommendation algorithms in future.
基于会话的推荐的序列感知方法的最新进展,例如基于递归神经网络的方法,强调了在进行推荐时利用会话序列信息的重要性。此外,基于会话的k近邻方法(SKNN)已被证明是基于会话的推荐的强大基线。然而,SKNN不考虑从会话中随时可用的顺序和时间信息。在这项工作中,我们提出了序列和时间感知邻域(STAN),并以香草SKNN为特例。STAN在提出建议时会考虑以下因素:i)项目在当前会议中的位置,ii)过去的会议与当前会议的距离,以及iii)可推荐的项目在相邻会议中的位置。上述因素对于特定应用的重要性可以通过可控的衰减因子来调整。尽管简单、直观且易于实现,但对三个现实世界数据集的经验评估表明,STAN比SKNN有显著改善,甚至可以与最近提出的最先进的深度学习方法相媲美。我们的研究结果表明,STAN可以被认为是未来评估基于会话的推荐算法的一个强有力的基线。
{"title":"Sequence and Time Aware Neighborhood for Session-based Recommendations: STAN","authors":"Diksha Garg, Priyanka Gupta, Pankaj Malhotra, L. Vig, Gautam M. Shroff","doi":"10.1145/3331184.3331322","DOIUrl":"https://doi.org/10.1145/3331184.3331322","url":null,"abstract":"Recent advances in sequence-aware approaches for session-based recommendation, such as those based on recurrent neural networks, highlight the importance of leveraging sequential information from a session while making recommendations. Further, a session based k-nearest-neighbors approach (SKNN) has proven to be a strong baseline for session-based recommendations. However, SKNN does not take into account the readily available sequential and temporal information from sessions. In this work, we propose Sequence and Time Aware Neighborhood (STAN), with vanilla SKNN as its special case. STAN takes into account the following factors for making recommendations: i) position of an item in the current session, ii) recency of a past session w.r.t. to the current session, and iii) position of a recommendable item in a neighboring session. The importance of above factors for a specific application can be adjusted via controllable decay factors. Despite being simple, intuitive and easy to implement, empirical evaluation on three real-world datasets shows that STAN significantly improves over SKNN, and is even comparable to the recently proposed state-of-the-art deep learning approaches. Our results suggest that STAN can be considered as a strong baseline for evaluating session-based recommendation algorithms in future.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"67 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"72644553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 88
Family History Discovery through Search at Ancestry 通过祖先搜索发现家族史
Peng Jiang, Yingrui Yang, Gann Bierner, F. Li, Ruhan Wang, Azadeh Moghtaderi
At Ancestry, we apply learning to rank algorithms to a new area to assist our customers in better understanding their family history. The foundation of our service is an extensive and unique collection of billions of historical records that we have digitized and indexed. Currently, our content collection includes 20 billion historical records. The record data consists of birth records, death records, marriage records, adoption records, census records, obituary records, among many others types. It is important for us to return relevant records from diversified record types in order to assist our customers to better understand their family history.
在Ancestry,我们将学习算法应用到一个新的领域,以帮助我们的客户更好地了解他们的家族史。我们服务的基础是一个广泛而独特的收集数十亿的历史记录,我们已经数字化和索引。目前,我们的内容集合包括200亿条历史记录。记录数据包括出生记录、死亡记录、婚姻记录、收养记录、人口普查记录、讣告记录以及许多其他类型的记录。为了帮助我们的客户更好地了解他们的家族史,我们必须从不同的记录类型中归还相关的记录。
{"title":"Family History Discovery through Search at Ancestry","authors":"Peng Jiang, Yingrui Yang, Gann Bierner, F. Li, Ruhan Wang, Azadeh Moghtaderi","doi":"10.1145/3331184.3331430","DOIUrl":"https://doi.org/10.1145/3331184.3331430","url":null,"abstract":"At Ancestry, we apply learning to rank algorithms to a new area to assist our customers in better understanding their family history. The foundation of our service is an extensive and unique collection of billions of historical records that we have digitized and indexed. Currently, our content collection includes 20 billion historical records. The record data consists of birth records, death records, marriage records, adoption records, census records, obituary records, among many others types. It is important for us to return relevant records from diversified record types in order to assist our customers to better understand their family history.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"75 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75070138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Hot Topic-Aware Retweet Prediction with Masked Self-attentive Model 基于屏蔽自关注模型的热话题感知转发预测
Renfeng Ma, Xiangkun Hu, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang
Social media users create millions of microblog entries on various topics each day. Retweet behaviour play a crucial role in spreading topics on social media. Retweet prediction task has received considerable attention in recent years. The majority of existing retweet prediction methods are focus on modeling user preference by utilizing various information, such as user profiles, user post history, user following relationships, etc. Yet, the users exposures towards real-time posting from their followees contribute significantly to making retweet predictions, considering that the users may participate into the hot topics discussed by their followees rather than be limited to their previous interests. To make efficient use of hot topics, we propose a novel masked self-attentive model to perform the retweet prediction task by perceiving the hot topics discussed by the users' followees. We incorporate the posting histories of users with external memory and utilize a hierarchical attention mechanism to construct the users' interests. Hence, our model can be jointly hot-topic aware and user interests aware to make a final prediction. Experimental results on a dataset collected from Twitter demonstrated that the proposed method can achieve better performance than state-of-the-art methods.
社交媒体用户每天就各种话题创建数百万条微博。转发行为在社交媒体上传播话题方面起着至关重要的作用。转发预测任务近年来受到了相当大的关注。现有的转推预测方法大多侧重于利用用户资料、用户帖子历史、用户关注关系等各种信息对用户偏好进行建模。然而,用户对关注者实时发布的内容的接触对转发预测有很大的帮助,因为用户可能会参与到关注者讨论的热点话题中,而不是局限于自己以前的兴趣。为了有效地利用热点话题,我们提出了一种新的掩蔽自关注模型,通过感知用户关注者讨论的热点话题来完成转发预测任务。我们将用户的帖子历史与外部记忆相结合,并利用分层关注机制构建用户的兴趣。因此,我们的模型可以同时感知热点话题和用户兴趣,从而做出最终的预测。从Twitter收集的数据集上的实验结果表明,所提出的方法可以获得比目前最先进的方法更好的性能。
{"title":"Hot Topic-Aware Retweet Prediction with Masked Self-attentive Model","authors":"Renfeng Ma, Xiangkun Hu, Qi Zhang, Xuanjing Huang, Yu-Gang Jiang","doi":"10.1145/3331184.3331236","DOIUrl":"https://doi.org/10.1145/3331184.3331236","url":null,"abstract":"Social media users create millions of microblog entries on various topics each day. Retweet behaviour play a crucial role in spreading topics on social media. Retweet prediction task has received considerable attention in recent years. The majority of existing retweet prediction methods are focus on modeling user preference by utilizing various information, such as user profiles, user post history, user following relationships, etc. Yet, the users exposures towards real-time posting from their followees contribute significantly to making retweet predictions, considering that the users may participate into the hot topics discussed by their followees rather than be limited to their previous interests. To make efficient use of hot topics, we propose a novel masked self-attentive model to perform the retweet prediction task by perceiving the hot topics discussed by the users' followees. We incorporate the posting histories of users with external memory and utilize a hierarchical attention mechanism to construct the users' interests. Hence, our model can be jointly hot-topic aware and user interests aware to make a final prediction. Experimental results on a dataset collected from Twitter demonstrated that the proposed method can achieve better performance than state-of-the-art methods.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"77 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76013320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Coarse-to-Fine Grained Classification 粗到细粒度分类
Yuqi Huo, Yao Lu, Yulei Niu, Zhiwu Lu, Ji-Rong Wen
Fine-grained image classification and retrieval become topical in both computer vision and information retrieval. In real-life scenarios, fine-grained tasks tend to appear along with coarse-grained tasks when the observed object is coming closer. However, in previous works, the combination of fine-grained and coarse-grained tasks was often ignored. In this paper, we define a new problem called coarse-to-fine grained classification (C2FGC) which aims to recognize the classes of objects in multiple resolutions (from low to high). To solve this problem, we propose a novel Multi-linear Pooling with Hierarchy (MLPH) model. Specifically, we first design a multi-linear pooling module to include both trilinear and bilinear pooling, and then formulate the coarse-grained and fine-grained tasks within a unified framework. Experiments on two benchmark datasets show that our model achieves state-of-the-art results.
细粒度图像的分类和检索已成为计算机视觉和信息检索领域的研究热点。在现实场景中,当观察到的对象越来越近时,细粒度任务往往会与粗粒度任务一起出现。然而,在以往的工作中,细粒度和粗粒度任务的结合往往被忽略。在本文中,我们定义了一个新的问题,称为粗粒度到细粒度分类(C2FGC),其目的是在多个分辨率(从低到高)下识别对象的类别。为了解决这个问题,我们提出了一种新的多层线性池化(MLPH)模型。具体来说,我们首先设计了一个包含三线性和双线性池化的多线性池化模块,然后在统一的框架内制定粗粒度和细粒度任务。在两个基准数据集上的实验表明,我们的模型达到了最先进的结果。
{"title":"Coarse-to-Fine Grained Classification","authors":"Yuqi Huo, Yao Lu, Yulei Niu, Zhiwu Lu, Ji-Rong Wen","doi":"10.1145/3331184.3331336","DOIUrl":"https://doi.org/10.1145/3331184.3331336","url":null,"abstract":"Fine-grained image classification and retrieval become topical in both computer vision and information retrieval. In real-life scenarios, fine-grained tasks tend to appear along with coarse-grained tasks when the observed object is coming closer. However, in previous works, the combination of fine-grained and coarse-grained tasks was often ignored. In this paper, we define a new problem called coarse-to-fine grained classification (C2FGC) which aims to recognize the classes of objects in multiple resolutions (from low to high). To solve this problem, we propose a novel Multi-linear Pooling with Hierarchy (MLPH) model. Specifically, we first design a multi-linear pooling module to include both trilinear and bilinear pooling, and then formulate the coarse-grained and fine-grained tasks within a unified framework. Experiments on two benchmark datasets show that our model achieves state-of-the-art results.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"50 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82564302","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Third International Workshop on Recent Trends in News Information Retrieval (NewsIR'19) 第三届新闻信息检索最新趋势国际研讨会(NewsIR'19)
M. Albakour, Miguel Martinez, S. Tippmann, Ahmet Aker, J. Stray, Shiri Dori-Hacohen, Alberto Barrón-Cedeño
The journalism industry has undergone a revolution in the past decade, leading to new opportunities as well as challenges. News consumption, production and delivery have all been affected and transformed by technology Readers require new mechanisms to cope with the vast volume of information in order to be informed about news events. Reporters have begun to use natural language processing (NLP) and (IR) techniques for investigative work. Publishers and aggregators are seeking new business models, and new ways to reach and retain their audience. A shift in business models has led to a gradual shift in styles of journalism in attempts to increase page views; and, far more concerning, to real mis- and dis-information, alongside allegations of "fake news" threatening the journalistic freedom and integrity of legitimate news outlets. Social media platforms drive viewership, creating filter bubbles and an increasingly polarized readership. News documents have always been a part of research on information access and retrieval methods. Over the last few years, the IR community has increasingly recognized these challenges in journalism and opened a conversation about how we might begin to address them. Evidence of this recognition is the participation in the two previous editions of our NewsIR workshop, held in ECIR 2016 and 2018. One of the most important outcomes of those workshops is an increasing awareness in the community about the changing nature of journalism and the IR challenges it entails. To move yet another step forward, the goal of the third edition of our workshop will be to create a multidisciplinary venue that brings together news experts from both technology and journalism. This would take NewsIR from a European forum targeting mainly IR researchers, into a more inclusive and influential international forum. We hope that this new format will foster further understanding for both news professionals and IR researchers, as well as producing better outcomes for news consumers. We will address the possibilities and challenges that technology offers to the journalists, the challenges that new developments in journalism create for IR researchers, and the complexity of information access tasks for news readers.
过去十年,新闻业经历了一场革命,既带来了新的机遇,也带来了新的挑战。新闻的消费、生产和传递都受到技术的影响和改变,读者需要新的机制来处理大量的信息,以便了解新闻事件。记者已经开始在调查工作中使用自然语言处理(NLP)和(IR)技术。出版商和聚合商正在寻找新的商业模式,以及接触和留住受众的新方法。商业模式的转变导致了新闻风格的逐渐转变,以增加页面浏览量;更令人担忧的是,真正的错误和虚假信息,以及威胁到合法新闻媒体的新闻自由和诚信的“假新闻”指控。社交媒体平台推动了收视率,制造了过滤泡沫和日益两极分化的读者群体。新闻文献一直是信息获取与检索方法研究的一部分。过去几年来,国际关系界日益认识到新闻业面临的这些挑战,并就如何着手应对这些挑战展开了讨论。这一认可的证据是参加了2016年和2018年在ECIR举行的前两届NewsIR研讨会。这些工作坊最重要的成果之一,是提高社群对新闻性质变迁的认识,以及它所带来的国际关系挑战。为了更进一步,第三届研讨会的目标将是创建一个多学科的场所,汇集来自技术和新闻业的新闻专家。这将使NewsIR从一个主要针对IR研究人员的欧洲论坛,变成一个更具包容性和影响力的国际论坛。我们希望这种新格式能够促进新闻专业人员和IR研究人员的进一步理解,并为新闻消费者带来更好的结果。我们将讨论技术给记者带来的可能性和挑战,新闻业的新发展给IR研究人员带来的挑战,以及新闻读者获取信息任务的复杂性。
{"title":"Third International Workshop on Recent Trends in News Information Retrieval (NewsIR'19)","authors":"M. Albakour, Miguel Martinez, S. Tippmann, Ahmet Aker, J. Stray, Shiri Dori-Hacohen, Alberto Barrón-Cedeño","doi":"10.1145/3331184.3331646","DOIUrl":"https://doi.org/10.1145/3331184.3331646","url":null,"abstract":"The journalism industry has undergone a revolution in the past decade, leading to new opportunities as well as challenges. News consumption, production and delivery have all been affected and transformed by technology Readers require new mechanisms to cope with the vast volume of information in order to be informed about news events. Reporters have begun to use natural language processing (NLP) and (IR) techniques for investigative work. Publishers and aggregators are seeking new business models, and new ways to reach and retain their audience. A shift in business models has led to a gradual shift in styles of journalism in attempts to increase page views; and, far more concerning, to real mis- and dis-information, alongside allegations of \"fake news\" threatening the journalistic freedom and integrity of legitimate news outlets. Social media platforms drive viewership, creating filter bubbles and an increasingly polarized readership. News documents have always been a part of research on information access and retrieval methods. Over the last few years, the IR community has increasingly recognized these challenges in journalism and opened a conversation about how we might begin to address them. Evidence of this recognition is the participation in the two previous editions of our NewsIR workshop, held in ECIR 2016 and 2018. One of the most important outcomes of those workshops is an increasing awareness in the community about the changing nature of journalism and the IR challenges it entails. To move yet another step forward, the goal of the third edition of our workshop will be to create a multidisciplinary venue that brings together news experts from both technology and journalism. This would take NewsIR from a European forum targeting mainly IR researchers, into a more inclusive and influential international forum. We hope that this new format will foster further understanding for both news professionals and IR researchers, as well as producing better outcomes for news consumers. We will address the possibilities and challenges that technology offers to the journalists, the challenges that new developments in journalism create for IR researchers, and the complexity of information access tasks for news readers.","PeriodicalId":20700,"journal":{"name":"Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval","volume":"13 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2019-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89517465","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1