首页 > 最新文献

Proceedings of the 2017 ACM on Conference on Information and Knowledge Management最新文献

英文 中文
Semantic Rules for Machine Diagnostics: Execution and Management 机器诊断的语义规则:执行和管理
E. Kharlamov, Ognjen Savkovic, Guohui Xiao, R. Peñaloza, G. Mehdi, M. Roshchin, Ian Horrocks
Rule-based diagnostics of equipment is an important task in industry. In this paper we present how semantic technologies can enhance diagnostics. In particular, we present our semantic rule language sigRL that is inspired by the real diagnostic languages used in Siemens. SigRL allows to write compact yet powerful diagnostic programs by relying on a high level data independent vocabulary, diagnostic ontologies, and queries over these ontologies. We study computational complexity of SigRL: execution of diagnostic programs, provenance computation, as well as automatic verification of redundancy and inconsistency in diagnostic programs.
基于规则的设备诊断是工业领域的一项重要任务。在本文中,我们介绍了语义技术如何增强诊断。特别地,我们提出了语义规则语言sigRL,它受到西门子实际使用的诊断语言的启发。通过依赖于高级数据独立词汇表、诊断本体和对这些本体的查询,SigRL允许编写紧凑但功能强大的诊断程序。我们研究了SigRL的计算复杂性:诊断程序的执行,来源计算,以及诊断程序中冗余和不一致的自动验证。
{"title":"Semantic Rules for Machine Diagnostics: Execution and Management","authors":"E. Kharlamov, Ognjen Savkovic, Guohui Xiao, R. Peñaloza, G. Mehdi, M. Roshchin, Ian Horrocks","doi":"10.1145/3132847.3133159","DOIUrl":"https://doi.org/10.1145/3132847.3133159","url":null,"abstract":"Rule-based diagnostics of equipment is an important task in industry. In this paper we present how semantic technologies can enhance diagnostics. In particular, we present our semantic rule language sigRL that is inspired by the real diagnostic languages used in Siemens. SigRL allows to write compact yet powerful diagnostic programs by relying on a high level data independent vocabulary, diagnostic ontologies, and queries over these ontologies. We study computational complexity of SigRL: execution of diagnostic programs, provenance computation, as well as automatic verification of redundancy and inconsistency in diagnostic programs.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"81 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77122661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
From Fingerprint to Footprint: Revealing Physical World Privacy Leakage by Cyberspace Cookie Logs 从指纹到足迹:通过网络空间Cookie日志揭示物理世界隐私泄露
Huandong Wang, Chen Gao, Yong Li, Zhi-Li Zhang, Depeng Jin
It is well-known that online services resort to various cookies to track users through users' online service identifiers (IDs) - in other words, when users access online services, various "fingerprints" are left behind in the cyberspace. As they roam around in the physical world while accessing online services via mobile devices, users also leave a series of "footprints" -- i.e., hints about their physical locations - in the physical world. This poses a potent new threat to user privacy: one can potentially correlate the "fingerprints" left by the users in the cyberspace with "footprints" left in the physical world to infer and reveal leakage of user physical world privacy, such as frequent user locations or mobility trajectories in the physical world - we refer to this problem as user physical world privacy leakage via user cyberspace privacy leakage. In this paper we address the following fundamental question: what kind - and how much - of user physical world privacy might be leaked if we could get hold of such diverse network datasets even without any physical location information. In order to conduct an in-depth investigation of these questions, we utilize the network data collected via a DPI system at the routers within one of the largest Internet operator in Shanghai, China over a duration of one month. We decompose the fundamental question into the three problems: i) linkage of various online user IDs belonging to the same person via mobility pattern mining; ii) physical location classification via aggregate user mobility patterns over time; and iii) tracking user physical mobility. By developing novel and effective methods for solving each of these problems, we demonstrate that the question of user physical world privacy leakage via user cyberspace privacy leakage is not hypothetical, but indeed poses a real potent threat to user privacy.
众所周知,在线服务通过用户的在线服务标识符(id)使用各种cookie来跟踪用户,换句话说,当用户访问在线服务时,在网络空间中留下了各种“指纹”。当用户在现实世界中漫游,同时通过移动设备访问在线服务时,他们也会在现实世界中留下一系列“足迹”,即关于他们实际位置的提示。这对用户隐私构成了一个潜在的新威胁:人们可以将用户在网络空间中留下的“指纹”与在物理世界中留下的“足迹”联系起来,推断和揭示用户物理世界隐私的泄露,例如用户在物理世界中的频繁位置或移动轨迹——我们将这个问题称为通过用户网络空间隐私泄露来泄露用户物理世界隐私。在本文中,我们解决了以下基本问题:如果我们能够在没有任何物理位置信息的情况下获得如此多样化的网络数据集,那么什么样的用户物理世界隐私可能会泄露,以及泄露多少用户物理世界隐私。为了对这些问题进行深入调查,我们利用DPI系统在中国上海最大的互联网运营商之一的路由器上收集的网络数据,持续一个月。我们将基本问题分解为三个问题:i)通过移动模式挖掘将属于同一个人的各种在线用户id链接起来;Ii)根据用户随时间的移动模式进行物理位置分类;iii)跟踪用户的身体移动。通过开发解决这些问题的新颖有效的方法,我们证明了通过用户网络空间隐私泄露导致用户物理世界隐私泄露的问题不是假设的,而是对用户隐私构成了真正的潜在威胁。
{"title":"From Fingerprint to Footprint: Revealing Physical World Privacy Leakage by Cyberspace Cookie Logs","authors":"Huandong Wang, Chen Gao, Yong Li, Zhi-Li Zhang, Depeng Jin","doi":"10.1145/3132847.3132998","DOIUrl":"https://doi.org/10.1145/3132847.3132998","url":null,"abstract":"It is well-known that online services resort to various cookies to track users through users' online service identifiers (IDs) - in other words, when users access online services, various \"fingerprints\" are left behind in the cyberspace. As they roam around in the physical world while accessing online services via mobile devices, users also leave a series of \"footprints\" -- i.e., hints about their physical locations - in the physical world. This poses a potent new threat to user privacy: one can potentially correlate the \"fingerprints\" left by the users in the cyberspace with \"footprints\" left in the physical world to infer and reveal leakage of user physical world privacy, such as frequent user locations or mobility trajectories in the physical world - we refer to this problem as user physical world privacy leakage via user cyberspace privacy leakage. In this paper we address the following fundamental question: what kind - and how much - of user physical world privacy might be leaked if we could get hold of such diverse network datasets even without any physical location information. In order to conduct an in-depth investigation of these questions, we utilize the network data collected via a DPI system at the routers within one of the largest Internet operator in Shanghai, China over a duration of one month. We decompose the fundamental question into the three problems: i) linkage of various online user IDs belonging to the same person via mobility pattern mining; ii) physical location classification via aggregate user mobility patterns over time; and iii) tracking user physical mobility. By developing novel and effective methods for solving each of these problems, we demonstrate that the question of user physical world privacy leakage via user cyberspace privacy leakage is not hypothetical, but indeed poses a real potent threat to user privacy.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"137 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76772906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Conflict of Interest Declaration and Detection System in Heterogeneous Networks 异构网络中的利益冲突声明与检测系统
Siyuan Wu, Leong Hou U, S. Bhowmick, Wolfgang Gatterbauer
Peer review is the most critical process in evaluating an article to be accepted for publication in an academic venue. When assigning a reviewer to evaluate an article, the assignment should be aware of conflicts of interest (COIs) such that the reviews are fair to everyone. However, existing conference management systems simply ask reviewers and authors to declare their explicit COIs through a plain search user interface guided by some simple conflict rules. We argue that such declaration system is not enough to discover all latent COI cases. In this work, we study a graphical declaration system that visualizes the relationships of authors and reviewers based on a heterogeneous co-authorship network. With the help of the declarations, we attempt to detect the latent COIs automatically based on the meta-paths of a heterogeneous network.
同行评议是评估一篇文章是否被学术机构接受发表的最关键的过程。当指派审稿人评估一篇文章时,审稿人应该意识到利益冲突(COIs),这样审稿对每个人都是公平的。然而,现有的会议管理系统只是要求审稿人和作者通过由一些简单的冲突规则指导的简单搜索用户界面声明其显式的coi。我们认为,这种申报制度不足以发现所有潜在的COI病例。在这项工作中,我们研究了一个基于异构共同作者网络的可视化作者和审稿人关系的图形声明系统。在声明的帮助下,我们尝试基于异构网络的元路径自动检测潜在的coi。
{"title":"Conflict of Interest Declaration and Detection System in Heterogeneous Networks","authors":"Siyuan Wu, Leong Hou U, S. Bhowmick, Wolfgang Gatterbauer","doi":"10.1145/3132847.3133134","DOIUrl":"https://doi.org/10.1145/3132847.3133134","url":null,"abstract":"Peer review is the most critical process in evaluating an article to be accepted for publication in an academic venue. When assigning a reviewer to evaluate an article, the assignment should be aware of conflicts of interest (COIs) such that the reviews are fair to everyone. However, existing conference management systems simply ask reviewers and authors to declare their explicit COIs through a plain search user interface guided by some simple conflict rules. We argue that such declaration system is not enough to discover all latent COI cases. In this work, we study a graphical declaration system that visualizes the relationships of authors and reviewers based on a heterogeneous co-authorship network. With the help of the declarations, we attempt to detect the latent COIs automatically based on the meta-paths of a heterogeneous network.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"301 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79725385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Hierarchical Module Classification in Mixed-initiative Conversational Agent System 混合主动会话代理系统中的分层模块分类
Sia Xin Yun Suzanna, A. Li
Our operational context is a task-oriented dialog system where no single module satisfactorily addresses the range of conversational queries from humans. Such systems must be equipped with a range of technologies to address semantic, factual, task-oriented, open domain conversations using rule-based, semantic-web, traditional machine learning and deep learning. This raises two key challenges. First, the modules need to be managed and selected appropriately. Second, the complexity of troubleshooting on such systems is high. We address these challenges with a mixed-initiative model that controls conversational logic through hierarchical classification. We also developed an interface to increase interpretability for operators and to aggregate module performance.
我们的操作上下文是一个面向任务的对话系统,其中没有一个模块能令人满意地处理来自人类的一系列会话查询。这样的系统必须配备一系列技术,以解决基于规则、语义网、传统机器学习和深度学习的语义、事实、任务导向、开放领域对话。这就提出了两个关键挑战。首先,需要适当地管理和选择模块。其次,在此类系统上进行故障排除的复杂性很高。我们使用混合主动模型来解决这些挑战,该模型通过分层分类控制会话逻辑。我们还开发了一个接口来提高运算符的可解释性和聚合模块性能。
{"title":"Hierarchical Module Classification in Mixed-initiative Conversational Agent System","authors":"Sia Xin Yun Suzanna, A. Li","doi":"10.1145/3132847.3133185","DOIUrl":"https://doi.org/10.1145/3132847.3133185","url":null,"abstract":"Our operational context is a task-oriented dialog system where no single module satisfactorily addresses the range of conversational queries from humans. Such systems must be equipped with a range of technologies to address semantic, factual, task-oriented, open domain conversations using rule-based, semantic-web, traditional machine learning and deep learning. This raises two key challenges. First, the modules need to be managed and selected appropriately. Second, the complexity of troubleshooting on such systems is high. We address these challenges with a mixed-initiative model that controls conversational logic through hierarchical classification. We also developed an interface to increase interpretability for operators and to aggregate module performance.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79934227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast Algorithms for Pareto Optimal Group-based Skyline 基于Pareto最优群的Skyline快速算法
Wenhui Yu, Zheng Qin, Jinfei Liu, Li Xiong, Xu Chen, Huidi Zhang
Skyline, aiming at finding a Pareto optimal subset of points in a multi-dimensional dataset, has gained great interest due to its extensive use for multi-criteria analysis and decision making. Skyline consists of all points that are not dominated by, or not worse than other points. It is a candidate set of optimal solution, which depends on a specific evaluation criterion for optimum. However, conventional skyline queries, which return individual points, are inadequate in group querying case since optimal combinations are required. To address this gap, we study the skyline computation in group case and propose fast methods to find the group-based skyline (G-skyline), which contains Pareto optimal groups. For computing the front k skyline layers, we lay out an efficient approach that does the search concurrently on each dimension and investigates each point in subspace. After that, we present a novel structure to construct the G-skyline with a queue of combinations of the first-layer points. Experimental results show that our algorithms are several orders of magnitude faster than the previous work.
Skyline旨在寻找多维数据集中的帕累托最优点子集,由于其广泛用于多标准分析和决策而获得了极大的兴趣。天际线由所有不受其他点支配或不比其他点差的点组成。它是一个最优解的候选集,它依赖于一个特定的最优评价准则。然而,传统的天际线查询,返回单个点,不适合组查询,因为需要最优的组合。为了解决这一问题,我们研究了群情况下的天际线计算,提出了快速寻找包含Pareto最优群的基于群的天际线(g -天际线)的方法。为了计算前面的k个天际线层,我们提出了一种有效的方法,在每个维度上同时进行搜索,并研究子空间中的每个点。在此基础上,我们提出了一种用一组第一层点的组合来构造G-skyline的新结构。实验结果表明,我们的算法比以前的工作快了几个数量级。
{"title":"Fast Algorithms for Pareto Optimal Group-based Skyline","authors":"Wenhui Yu, Zheng Qin, Jinfei Liu, Li Xiong, Xu Chen, Huidi Zhang","doi":"10.1145/3132847.3132950","DOIUrl":"https://doi.org/10.1145/3132847.3132950","url":null,"abstract":"Skyline, aiming at finding a Pareto optimal subset of points in a multi-dimensional dataset, has gained great interest due to its extensive use for multi-criteria analysis and decision making. Skyline consists of all points that are not dominated by, or not worse than other points. It is a candidate set of optimal solution, which depends on a specific evaluation criterion for optimum. However, conventional skyline queries, which return individual points, are inadequate in group querying case since optimal combinations are required. To address this gap, we study the skyline computation in group case and propose fast methods to find the group-based skyline (G-skyline), which contains Pareto optimal groups. For computing the front k skyline layers, we lay out an efficient approach that does the search concurrently on each dimension and investigates each point in subspace. After that, we present a novel structure to construct the G-skyline with a queue of combinations of the first-layer points. Experimental results show that our algorithms are several orders of magnitude faster than the previous work.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"46 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86917147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Linking News across Multiple Streams for Timeliness Analysis 链接新闻跨多个流的时效性分析
I. Mele, Seyed Ali Bahrainian, F. Crestani
Linking multiple news streams based on the reported events and analyzing the streams' temporal publishing patterns are two very important tasks for information analysis, discovering newsworthy stories, studying the event evolution, and detecting untrustworthy sources of information. In this paper, we propose techniques for cross-linking news streams based on the reported events with the purpose of analyzing the temporal dependencies among streams. Our research tackles two main issues: (1) how news streams are connected as reporting an event or the evolution of the same event and (2) how timely the newswires report related events using different publishing platforms. Our approach is based on dynamic topic modeling for detecting and tracking events over the timeline and on clustering news according to the events. We leverage the event-based clustering to link news across different streams and present two scoring functions for ranking the streams based on their timeliness in publishing news about a specific event.
基于事件报道链接多个信息流,分析信息流的时间发布模式,是信息分析、发现有新闻价值的故事、研究事件演变和发现不可信信息源的重要任务。在本文中,我们提出了基于报道事件的新闻流交叉链接技术,目的是分析流之间的时间依赖性。我们的研究解决了两个主要问题:(1)如何将新闻流与报道事件或同一事件的演变联系起来;(2)新闻通讯社使用不同的发布平台报道相关事件的及时性。我们的方法是基于动态主题建模来检测和跟踪时间轴上的事件,并根据事件聚类新闻。我们利用基于事件的聚类来链接跨不同流的新闻,并提供两个评分功能,根据发布特定事件新闻的时效性对流进行排名。
{"title":"Linking News across Multiple Streams for Timeliness Analysis","authors":"I. Mele, Seyed Ali Bahrainian, F. Crestani","doi":"10.1145/3132847.3132988","DOIUrl":"https://doi.org/10.1145/3132847.3132988","url":null,"abstract":"Linking multiple news streams based on the reported events and analyzing the streams' temporal publishing patterns are two very important tasks for information analysis, discovering newsworthy stories, studying the event evolution, and detecting untrustworthy sources of information. In this paper, we propose techniques for cross-linking news streams based on the reported events with the purpose of analyzing the temporal dependencies among streams. Our research tackles two main issues: (1) how news streams are connected as reporting an event or the evolution of the same event and (2) how timely the newswires report related events using different publishing platforms. Our approach is based on dynamic topic modeling for detecting and tracking events over the timeline and on clustering news according to the events. We leverage the event-based clustering to link news across different streams and present two scoring functions for ranking the streams based on their timeliness in publishing news about a specific event.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"10 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83671755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Metacrate: Organize and Analyze Millions of Data Profiles Metacrate:组织和分析数以百万计的数据配置文件
Sebastian Kruse, David Hahn, Marius Walter, Felix Naumann
Databases are one of the great success stories in IT. However, they have been continuously increasing in complexity, hampering operation, maintenance, and upgrades. To face this complexity, sophisticated methods for schema summarization, data cleaning, information integration, and many more have been devised that usually rely on data profiles, such as data statistics, signatures, and integrity constraints. Such data profiles are often extracted by automatic algorithms, which entails various problems: The profiles can be unfiltered and huge in volume; different profile types require different complex data structures; and the various profile types are not integrated with each other. We introduce Metacrate, a system to store, organize, and analyze data profiles of relational databases, thereby following the proven design of databases. In particular, we (i) propose a logical and a physical data model to store all kinds of data profiles in a scalable fashion; (ii) describe an analytics layer to query, integrate, and analyze the profiles efficiently; and (iii) implement on top a library of established algorithms to serve use cases, such as schema discovery, database refactoring, and data cleaning.
数据库是IT界最成功的案例之一。然而,它们的复杂性不断增加,阻碍了操作、维护和升级。为了应对这种复杂性,已经设计了用于模式总结、数据清理、信息集成等的复杂方法,这些方法通常依赖于数据概要文件,例如数据统计、签名和完整性约束。这些数据配置文件通常是由自动算法提取的,这带来了各种问题:配置文件可能未经过滤且数量庞大;不同的配置文件类型需要不同的复杂数据结构;并且各种profile类型没有相互集成。我们介绍Metacrate,这是一个存储、组织和分析关系数据库的数据配置文件的系统,因此遵循了经过验证的数据库设计。特别是,我们(i)提出了一个逻辑和物理数据模型,以可扩展的方式存储各种数据配置文件;(ii)描述一个分析层,以便有效地查询、整合和分析配置文件;(iii)在已建立算法库的基础上实现以服务于用例,例如模式发现、数据库重构和数据清理。
{"title":"Metacrate: Organize and Analyze Millions of Data Profiles","authors":"Sebastian Kruse, David Hahn, Marius Walter, Felix Naumann","doi":"10.1145/3132847.3133180","DOIUrl":"https://doi.org/10.1145/3132847.3133180","url":null,"abstract":"Databases are one of the great success stories in IT. However, they have been continuously increasing in complexity, hampering operation, maintenance, and upgrades. To face this complexity, sophisticated methods for schema summarization, data cleaning, information integration, and many more have been devised that usually rely on data profiles, such as data statistics, signatures, and integrity constraints. Such data profiles are often extracted by automatic algorithms, which entails various problems: The profiles can be unfiltered and huge in volume; different profile types require different complex data structures; and the various profile types are not integrated with each other. We introduce Metacrate, a system to store, organize, and analyze data profiles of relational databases, thereby following the proven design of databases. In particular, we (i) propose a logical and a physical data model to store all kinds of data profiles in a scalable fashion; (ii) describe an analytics layer to query, integrate, and analyze the profiles efficiently; and (iii) implement on top a library of established algorithms to serve use cases, such as schema discovery, database refactoring, and data cleaning.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87698284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Computing Betweenness Centrality in B-hypergraphs 计算b超图的中间中心性
Kwang Hee Lee, Myoung-Ho Kim
The directed hypergraph (especially B-hypergraph) has hyperedges that represent relations of a set of source nodes to a single target node. Author-cited networks and cellular signaling pathways can be modeled as a B-hypergraph. In this paper every source node of a hyperedge in the shortest path p in a B-hypergraph is considered a participant of p. We propose a betweenness centrality in the B-hypergraph that measures the number of shortest paths in which a node participates. The algorithm for computing the approximated betweenness centrality scores is also proposed. Through various performance experiments such as attack robustness and reachability tests, we show that our proposed betweenness centrality is a more appropriate measure in real-world B-hypergraph applications than ordinary betweenness centrality.
有向超图(特别是b超图)具有表示一组源节点到单个目标节点的关系的超边。作者引用的网络和细胞信号通路可以建模为b超图。本文将b -超图中最短路径p上的超边的每个源节点视为p的参与者。我们提出了b -超图中的中间性中心性,用来度量一个节点参与的最短路径的数量。提出了计算近似中间性中心性分数的算法。通过各种性能实验,如攻击鲁棒性和可达性测试,我们表明我们提出的中间性中心性比普通的中间性中心性更适合真实世界的b -超图应用。
{"title":"Computing Betweenness Centrality in B-hypergraphs","authors":"Kwang Hee Lee, Myoung-Ho Kim","doi":"10.1145/3132847.3133093","DOIUrl":"https://doi.org/10.1145/3132847.3133093","url":null,"abstract":"The directed hypergraph (especially B-hypergraph) has hyperedges that represent relations of a set of source nodes to a single target node. Author-cited networks and cellular signaling pathways can be modeled as a B-hypergraph. In this paper every source node of a hyperedge in the shortest path p in a B-hypergraph is considered a participant of p. We propose a betweenness centrality in the B-hypergraph that measures the number of shortest paths in which a node participates. The algorithm for computing the approximated betweenness centrality scores is also proposed. Through various performance experiments such as attack robustness and reachability tests, we show that our proposed betweenness centrality is a more appropriate measure in real-world B-hypergraph applications than ordinary betweenness centrality.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"26 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90637452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
AliMe Assist: An Intelligent Assistant for Creating an Innovative E-commerce Experience AliMe Assist:打造创新电子商务体验的智能助手
Feng-Lin Li, Minghui Qiu, Haiqing Chen, Xiongwei Wang, Xing Gao, Jun Huang, Juwei Ren, Zhongzhou Zhao, Weipeng Zhao, Lei Wang, Guwei Jin, Wei Chu
We present AliMe Assist, an intelligent assistant designed for creating an innovative online shopping experience in E-commerce. Based on question answering (QA), AliMe Assist offers assistance service, customer service, and chatting service. It is able to take voice and text input, incorporate context to QA, and support multi-round interaction. Currently, it serves millions of customer questions per day and is able to address 85% of them. In this paper, we demonstrate the system, present the underlying techniques, and share our experience in dealing with real-world QA in the E-commerce field.
我们推出了AliMe Assist,这是一款智能助手,旨在为电子商务创造创新的在线购物体验。基于问答(QA), AliMe Assist提供协助服务、客户服务和聊天服务。它能够接受语音和文本输入,将上下文整合到QA中,并支持多轮交互。目前,它每天为数百万客户提供服务,并能够解决其中85%的问题。在本文中,我们演示了该系统,介绍了底层技术,并分享了我们在处理电子商务领域的实际QA方面的经验。
{"title":"AliMe Assist: An Intelligent Assistant for Creating an Innovative E-commerce Experience","authors":"Feng-Lin Li, Minghui Qiu, Haiqing Chen, Xiongwei Wang, Xing Gao, Jun Huang, Juwei Ren, Zhongzhou Zhao, Weipeng Zhao, Lei Wang, Guwei Jin, Wei Chu","doi":"10.1145/3132847.3133169","DOIUrl":"https://doi.org/10.1145/3132847.3133169","url":null,"abstract":"We present AliMe Assist, an intelligent assistant designed for creating an innovative online shopping experience in E-commerce. Based on question answering (QA), AliMe Assist offers assistance service, customer service, and chatting service. It is able to take voice and text input, incorporate context to QA, and support multi-round interaction. Currently, it serves millions of customer questions per day and is able to address 85% of them. In this paper, we demonstrate the system, present the underlying techniques, and share our experience in dealing with real-world QA in the E-commerce field.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"37 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86025610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 99
A Two-Stage Framework for Computing Entity Relatedness in Wikipedia 维基百科中计算实体关联的两阶段框架
Marco Ponza, P. Ferragina, Soumen Chakrabarti
Introducing a new dataset with human judgments of entity relatedness, we present a thorough study of all entity relatedness measures in recent literature based on Wikipedia as the knowledge graph. No clear dominance is seen between measures based on textual similarity and graph proximity. Some of the better measures involve expensive global graph computations. We then propose a new, space-efficient, computationally lightweight, two-stage framework for relatedness computation. In the first stage, a small weighted subgraph is dynamically grown around the two query entities; in the second stage, relatedness is derived based on computations on this subgraph. Our system shows better agreement with human judgment than existing proposals both on the new dataset and on an established one. We also plug our relatedness algorithm into a state-of-the-art entity linker and observe an increase in its accuracy and robustness.
我们引入了一个新的数据集,对最近文献中基于维基百科作为知识图的所有实体相关性度量进行了深入的研究。在基于文本相似度和图形接近度的测量之间没有明显的优势。一些更好的度量涉及昂贵的全局图计算。然后,我们提出了一种新的、节省空间的、计算轻量级的、两阶段的相关性计算框架。在第一阶段,围绕两个查询实体动态生长一个小的加权子图;在第二阶段,基于该子图的计算推导相关性。无论在新数据集上还是在已建立的数据集上,我们的系统都比现有的建议更符合人类的判断。我们还将我们的关联算法插入到最先进的实体链接器中,并观察到其准确性和鲁棒性的提高。
{"title":"A Two-Stage Framework for Computing Entity Relatedness in Wikipedia","authors":"Marco Ponza, P. Ferragina, Soumen Chakrabarti","doi":"10.1145/3132847.3132890","DOIUrl":"https://doi.org/10.1145/3132847.3132890","url":null,"abstract":"Introducing a new dataset with human judgments of entity relatedness, we present a thorough study of all entity relatedness measures in recent literature based on Wikipedia as the knowledge graph. No clear dominance is seen between measures based on textual similarity and graph proximity. Some of the better measures involve expensive global graph computations. We then propose a new, space-efficient, computationally lightweight, two-stage framework for relatedness computation. In the first stage, a small weighted subgraph is dynamically grown around the two query entities; in the second stage, relatedness is derived based on computations on this subgraph. Our system shows better agreement with human judgment than existing proposals both on the new dataset and on an established one. We also plug our relatedness algorithm into a state-of-the-art entity linker and observe an increase in its accuracy and robustness.","PeriodicalId":20449,"journal":{"name":"Proceedings of the 2017 ACM on Conference on Information and Knowledge Management","volume":"77 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2017-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73851322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
期刊
Proceedings of the 2017 ACM on Conference on Information and Knowledge Management
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1