首页 > 最新文献

HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media最新文献

英文 中文
HT '22: 33rd ACM Conference on Hypertext and Social Media, Barcelona, Spain, 28 June 2022- 1 July 2022 ht22:第33届ACM超文本和社交媒体会议,西班牙巴塞罗那,2022年6月28日至2022年7月1日
{"title":"HT '22: 33rd ACM Conference on Hypertext and Social Media, Barcelona, Spain, 28 June 2022- 1 July 2022","authors":"","doi":"10.1145/3511095","DOIUrl":"https://doi.org/10.1145/3511095","url":null,"abstract":"","PeriodicalId":91270,"journal":{"name":"HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media","volume":"31 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86256155","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HT '21: 32nd ACM Conference on Hypertext and Social Media, Virtual Event, Ireland, 30 August 2021 - 2 September 2021 ht21:第32届ACM超文本和社交媒体会议,虚拟事件,爱尔兰,2021年8月30日至2021年9月2日
{"title":"HT '21: 32nd ACM Conference on Hypertext and Social Media, Virtual Event, Ireland, 30 August 2021 - 2 September 2021","authors":"","doi":"10.1145/3465336","DOIUrl":"https://doi.org/10.1145/3465336","url":null,"abstract":"","PeriodicalId":91270,"journal":{"name":"HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media","volume":"59 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84473413","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HT '20: 31st ACM Conference on Hypertext and Social Media, Virtual Event, USA, July 13-15, 2020 ht20:第31届美国计算机学会超文本与社交媒体会议,虚拟事件,美国,2020年7月13-15日
{"title":"HT '20: 31st ACM Conference on Hypertext and Social Media, Virtual Event, USA, July 13-15, 2020","authors":"","doi":"10.1145/3372923","DOIUrl":"https://doi.org/10.1145/3372923","url":null,"abstract":"","PeriodicalId":91270,"journal":{"name":"HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media","volume":"4 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74758209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting Changes in Suicide Content Manifested in Social Media Following Celebrity Suicides. 发现名人自杀后社交媒体上自杀内容的变化。
Mrinal Kumar, Mark Dredze, Glen Coppersmith, Munmun De Choudhury

The Werther effect describes the increased rate of completed or attempted suicides following the depiction of an individual's suicide in the media, typically a celebrity. We present findings on the prevalence of this effect in an online platform: r/SuicideWatch on Reddit. We examine both the posting activity and post content after the death of ten high-profile suicides. Posting activity increases following reports of celebrity suicides, and post content exhibits considerable changes that indicate increased suicidal ideation. Specifically, we observe that post-celebrity suicide content is more likely to be inward focused, manifest decreased social concerns, and laden with greater anxiety, anger, and negative emotion. Topic model analysis further reveals content in this period to switch to a more derogatory tone that bears evidence of self-harm and suicidal tendencies. We discuss the implications of our findings in enabling better community support to psychologically vulnerable populations, and the potential of building suicide prevention interventions following high-profile suicides.

维特效应描述的是媒体(通常是名人)对个人自杀的描述后,完成或企图自杀的比率上升。我们在Reddit上的一个在线平台r/SuicideWatch上展示了这一现象的普遍性。我们研究了10个备受瞩目的自杀者死亡后的发帖活动和发帖内容。在名人自杀的报道之后,帖子活动增加,帖子内容显示出相当大的变化,表明自杀意念的增加。具体来说,我们观察到名人后的自杀内容更有可能是内向的,表现出较少的社会关注,并且充满了更多的焦虑、愤怒和负面情绪。话题模型分析进一步揭示了这一时期的内容转向更加贬损的语气,带有自残和自杀倾向的证据。我们讨论了我们的研究结果在为心理脆弱人群提供更好的社区支持方面的意义,以及在高调自杀事件发生后建立自杀预防干预措施的潜力。
{"title":"Detecting Changes in Suicide Content Manifested in Social Media Following Celebrity Suicides.","authors":"Mrinal Kumar,&nbsp;Mark Dredze,&nbsp;Glen Coppersmith,&nbsp;Munmun De Choudhury","doi":"10.1145/2700171.2791026","DOIUrl":"https://doi.org/10.1145/2700171.2791026","url":null,"abstract":"<p><p>The Werther effect describes the increased rate of completed or attempted suicides following the depiction of an individual's suicide in the media, typically a celebrity. We present findings on the prevalence of this effect in an online platform: r/SuicideWatch on Reddit. We examine both the posting activity and post content after the death of ten high-profile suicides. Posting activity increases following reports of celebrity suicides, and post content exhibits considerable changes that indicate increased suicidal ideation. Specifically, we observe that post-celebrity suicide content is more likely to be inward focused, manifest decreased social concerns, and laden with greater anxiety, anger, and negative emotion. Topic model analysis further reveals content in this period to switch to a more derogatory tone that bears evidence of self-harm and suicidal tendencies. We discuss the implications of our findings in enabling better community support to psychologically vulnerable populations, and the potential of building suicide prevention interventions following high-profile suicides.</p>","PeriodicalId":91270,"journal":{"name":"HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media","volume":"2015 ","pages":"85-94"},"PeriodicalIF":0.0,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1145/2700171.2791026","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"35175985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 136
Revisiting reverts: accurate revert detection in wikipedia 重访还原:准确的还原检测在维基百科
Fabian Flöck, Denny Vrandečić, E. Simperl
Wikipedia is commonly used as a proving ground for research in collaborative systems. This is likely due to its popularity and scale, but also to the fact that large amounts of data about its formation and evolution are freely available to inform and validate theories and models of online collaboration. As part of the development of such approaches, revert detection is often performed as an important pre-processing step in tasks as diverse as the extraction of implicit networks of editors, the analysis of edit or editor features and the removal of noise when analyzing the emergence of the content of an article. The current state of the art in revert detection is based on a rather naive approach, which identifies revision duplicates based on MD5 hash values. This is an efficient, but not very precise technique that forms the basis for the majority of research based on revert relations in Wikipedia. In this paper we prove that this method has a number of important drawbacks - it only detects a limited number of reverts, while simultaneously misclassifying too many edits as reverts, and not distinguishing between complete and partial reverts. This is very likely to hamper the accurate interpretation of the findings of revert-related research. We introduce an improved algorithm for the detection of reverts based on word tokens added or deleted to adresses these drawbacks. We report on the results of a user study and other tests demonstrating the considerable gains in accuracy and coverage by our method, and argue for a positive trade-off, in certain research scenarios, between these improvements and our algorithm's increased runtime.
维基百科通常被用作协作系统研究的试验场。这可能是由于它的受欢迎程度和规模,但也因为关于它的形成和发展的大量数据是免费提供的,可以为在线协作的理论和模型提供信息和验证。作为这些方法发展的一部分,在各种任务中,还原检测通常作为重要的预处理步骤执行,如提取编辑的隐式网络,分析编辑或编辑特征以及在分析文章内容出现时去除噪声。当前的还原检测技术是基于一种相当简单的方法,该方法根据MD5散列值识别重复的修订。这是一种有效的,但不是非常精确的技术,它构成了基于维基百科中恢复关系的大多数研究的基础。在本文中,我们证明了这种方法有一些重要的缺点——它只检测到有限数量的还原,同时将太多的编辑错误地分类为还原,并且不能区分完全还原和部分还原。这很可能会妨碍对恢复相关研究结果的准确解释。我们引入了一种改进的算法,用于基于添加或删除的单词标记来检测还原,以解决这些缺点。我们报告了用户研究和其他测试的结果,这些结果表明我们的方法在准确性和覆盖率方面取得了相当大的进步,并且在某些研究场景中,在这些改进和我们的算法增加的运行时间之间进行了积极的权衡。
{"title":"Revisiting reverts: accurate revert detection in wikipedia","authors":"Fabian Flöck, Denny Vrandečić, E. Simperl","doi":"10.1145/2309996.2310000","DOIUrl":"https://doi.org/10.1145/2309996.2310000","url":null,"abstract":"Wikipedia is commonly used as a proving ground for research in collaborative systems. This is likely due to its popularity and scale, but also to the fact that large amounts of data about its formation and evolution are freely available to inform and validate theories and models of online collaboration. As part of the development of such approaches, revert detection is often performed as an important pre-processing step in tasks as diverse as the extraction of implicit networks of editors, the analysis of edit or editor features and the removal of noise when analyzing the emergence of the content of an article. The current state of the art in revert detection is based on a rather naive approach, which identifies revision duplicates based on MD5 hash values. This is an efficient, but not very precise technique that forms the basis for the majority of research based on revert relations in Wikipedia. In this paper we prove that this method has a number of important drawbacks - it only detects a limited number of reverts, while simultaneously misclassifying too many edits as reverts, and not distinguishing between complete and partial reverts. This is very likely to hamper the accurate interpretation of the findings of revert-related research. We introduce an improved algorithm for the detection of reverts based on word tokens added or deleted to adresses these drawbacks. We report on the results of a user study and other tests demonstrating the considerable gains in accuracy and coverage by our method, and argue for a positive trade-off, in certain research scenarios, between these improvements and our algorithm's increased runtime.","PeriodicalId":91270,"journal":{"name":"HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media","volume":"45 1","pages":"3-12"},"PeriodicalIF":0.0,"publicationDate":"2012-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78716436","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Understanding factors that affect response rates in twitter 了解影响twitter回复率的因素
Giovanni V. Comarela, M. Crovella, Virgílio A. F. Almeida, Fabrício Benevenuto
In information networks where users send messages to one another, the issue of information overload naturally arises: which are the most important messages? In this paper we study the problem of understanding the importance of messages in Twitter. We approach this problem in two stages. First, we perform an extensive characterization of a very large Twitter dataset which includes all users, social relations, and messages posted from the beginning of the service up to August 2009. We show evidence that information overload is present: users sometimes have to search through hundreds of messages to find those that are interesting to reply or retweet. We then identify factors that influence user response or retweet probability: previous responses to the same tweeter, the tweeter's sending rate, the age and some basic text elements of the tweet. In our second stage, we show that some of these factors can be used to improve the presentation order of tweets to the user. First, by inspecting user activity over time, we construct a simple on-off model of user behavior that allows us to infer when a user is actively using Twitter. Then, we explore two methods from machine learning for ranking tweets: a Naive Bayes predictor and a Support Vector Machine classifier. We show that it is possible to reorder tweets to increase the fraction of replied or retweeted messages appearing in the first p positions of the list by as much as 50-60%.
在用户互相发送信息的信息网络中,自然会出现信息过载的问题:哪些是最重要的信息?本文研究了Twitter中信息重要性的理解问题。我们分两个阶段处理这个问题。首先,我们对一个非常大的Twitter数据集进行了广泛的表征,该数据集包括从服务开始到2009年8月发布的所有用户、社会关系和消息。我们展示了信息过载存在的证据:用户有时不得不在数百条消息中搜索,以找到那些有趣的回复或转发。然后,我们确定影响用户响应或转发概率的因素:先前对同一推特者的响应,推特者的发送率,年龄和推文的一些基本文本元素。在第二阶段,我们展示了这些因素中的一些可以用来改善推文对用户的呈现顺序。首先,通过检查一段时间内的用户活动,我们构建了一个简单的用户行为开-关模型,该模型允许我们推断用户何时在积极使用Twitter。然后,我们从机器学习中探索了两种推文排名方法:朴素贝叶斯预测器和支持向量机分类器。我们表明,重新排序推文可以将出现在列表前p个位置的回复或转发消息的比例增加多达50-60%。
{"title":"Understanding factors that affect response rates in twitter","authors":"Giovanni V. Comarela, M. Crovella, Virgílio A. F. Almeida, Fabrício Benevenuto","doi":"10.1145/2309996.2310017","DOIUrl":"https://doi.org/10.1145/2309996.2310017","url":null,"abstract":"In information networks where users send messages to one another, the issue of information overload naturally arises: which are the most important messages? In this paper we study the problem of understanding the importance of messages in Twitter. We approach this problem in two stages. First, we perform an extensive characterization of a very large Twitter dataset which includes all users, social relations, and messages posted from the beginning of the service up to August 2009. We show evidence that information overload is present: users sometimes have to search through hundreds of messages to find those that are interesting to reply or retweet. We then identify factors that influence user response or retweet probability: previous responses to the same tweeter, the tweeter's sending rate, the age and some basic text elements of the tweet. In our second stage, we show that some of these factors can be used to improve the presentation order of tweets to the user. First, by inspecting user activity over time, we construct a simple on-off model of user behavior that allows us to infer when a user is actively using Twitter. Then, we explore two methods from machine learning for ranking tweets: a Naive Bayes predictor and a Support Vector Machine classifier. We show that it is possible to reorder tweets to increase the fraction of replied or retweeted messages appearing in the first p positions of the list by as much as 50-60%.","PeriodicalId":91270,"journal":{"name":"HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media","volume":"4 1","pages":"123-132"},"PeriodicalIF":0.0,"publicationDate":"2012-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73310703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 72
An evaluation of tailored web materials for public administration 为公共行政量身定制的网络材料的评价
N. Colineau, Cécile Paris, Keith Vander Linden
Public Administration organizations generally write their citizen-focused, informational materials for generic audiences because they don't have the resources to produce personalized materials for everyone. The goal of this project is to replace these generic materials, which must include careful discussions of the conditions distinguishing the various constituencies within the generic audience, with tailored materials, which can be automatically personalized to focus on the information relevant to an individual reader. Two key questions must be addressed. First, are the automatically produced, tailored forms more effective than the generic forms they replace, and second, is the time the reader spends specifying the demographic information on which the tailoring is based too costly to be worth the effort. This paper describes an adaptive hypermedia application that produces tailored materials for students exploring government educational entitlement programs, and focuses in particular on the effectiveness of the generated tailored material.
公共管理组织通常为普通受众编写以公民为中心的信息材料,因为他们没有资源为每个人制作个性化的材料。该项目的目标是用量身定制的材料取代这些通用材料,这些材料必须包括对区分普通受众中各种选民的条件的仔细讨论,这些材料可以自动个性化,以关注与个人读者相关的信息。必须解决两个关键问题。首先,自动生成的、定制的表单是否比它们所取代的通用表单更有效?其次,读者花在指定定制所依据的人口统计信息上的时间过于昂贵,不值得付出努力。本文描述了一种自适应超媒体应用程序,该应用程序为探索政府教育权利计划的学生提供量身定制的材料,并特别关注生成的量身定制材料的有效性。
{"title":"An evaluation of tailored web materials for public administration","authors":"N. Colineau, Cécile Paris, Keith Vander Linden","doi":"10.1145/2309996.2310040","DOIUrl":"https://doi.org/10.1145/2309996.2310040","url":null,"abstract":"Public Administration organizations generally write their citizen-focused, informational materials for generic audiences because they don't have the resources to produce personalized materials for everyone. The goal of this project is to replace these generic materials, which must include careful discussions of the conditions distinguishing the various constituencies within the generic audience, with tailored materials, which can be automatically personalized to focus on the information relevant to an individual reader. Two key questions must be addressed. First, are the automatically produced, tailored forms more effective than the generic forms they replace, and second, is the time the reader spends specifying the demographic information on which the tailoring is based too costly to be worth the effort. This paper describes an adaptive hypermedia application that produces tailored materials for students exploring government educational entitlement programs, and focuses in particular on the effectiveness of the generated tailored material.","PeriodicalId":91270,"journal":{"name":"HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media","volume":"114 1","pages":"265-274"},"PeriodicalIF":0.0,"publicationDate":"2012-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76672680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Finding and exploring memes in social media 在社交媒体中寻找和探索模因
Hohyon Ryu, Matthew Lease, N. Woodward
Online critical literacy challenges readers to recognize and question how online textual information has been shaped by its greater context. While comparing information from multiple sources provides a foundation for such awareness, keeping pace with everything being written is a daunting proposition, especially for the casual reader. We propose a new form of technological assistance for critical literacy which automatically discovers and displays underlying memes: ideas represented by similar phrases which occur across diýerent information sources. By surfacing these memes to users, we create a rich hypertext representation in which underlying memes can be explored in context. Given the vast scale of social media, we describe a highly-scalable system architecture designed for MapReduce distributed computing. To validate our approach, we report on use of our system to discover and browse memes in a 1.5 TB collection of crawled social media. Our primary contributions include: 1) a novel technological approach and hypertext browsing design for supporting critical literacy; and 2) a highly-scalable system architecture for meme discovery, providing a solid foundation for further system extensions and refinements.
在线批判性素养挑战读者认识和质疑在线文本信息是如何被其更大的背景所塑造的。虽然比较来自多个来源的信息为这种意识提供了基础,但跟上所写的所有内容是一项艰巨的任务,特别是对临时读者来说。我们提出了一种新的批判性素养技术援助形式,它可以自动发现和显示潜在的模因:由diýerent信息源中出现的类似短语表示的想法。通过将这些模因呈现给用户,我们创建了一个丰富的超文本表示,其中可以在上下文中探索潜在的模因。考虑到社交媒体的巨大规模,我们描述了一个为MapReduce分布式计算设计的高度可扩展的系统架构。为了验证我们的方法,我们报告了使用我们的系统在1.5 TB的爬行社交媒体集合中发现和浏览模因的情况。我们的主要贡献包括:1)支持批判性读写的新颖技术方法和超文本浏览设计;2)用于模因发现的高度可扩展的系统架构,为进一步的系统扩展和改进提供坚实的基础。
{"title":"Finding and exploring memes in social media","authors":"Hohyon Ryu, Matthew Lease, N. Woodward","doi":"10.1145/2309996.2310044","DOIUrl":"https://doi.org/10.1145/2309996.2310044","url":null,"abstract":"Online critical literacy challenges readers to recognize and question how online textual information has been shaped by its greater context. While comparing information from multiple sources provides a foundation for such awareness, keeping pace with everything being written is a daunting proposition, especially for the casual reader. We propose a new form of technological assistance for critical literacy which automatically discovers and displays underlying memes: ideas represented by similar phrases which occur across diýerent information sources. By surfacing these memes to users, we create a rich hypertext representation in which underlying memes can be explored in context. Given the vast scale of social media, we describe a highly-scalable system architecture designed for MapReduce distributed computing. To validate our approach, we report on use of our system to discover and browse memes in a 1.5 TB collection of crawled social media. Our primary contributions include: 1) a novel technological approach and hypertext browsing design for supporting critical literacy; and 2) a highly-scalable system architecture for meme discovery, providing a solid foundation for further system extensions and refinements.","PeriodicalId":91270,"journal":{"name":"HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media","volume":"14 1","pages":"295-304"},"PeriodicalIF":0.0,"publicationDate":"2012-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87475880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Moving beyond SameAs with PLATO: partonomy detection for linked data 使用PLATO超越SameAs:关联数据的部分分类检测
Prateek Jain, P. Hitzler, Kunal Verma, P. Yeh, A. Sheth
The Linked Open Data (LOD) Cloud has gained significant traction over the past few years. With over 275 interlinked datasets across diverse domains such as life science, geography, politics, and more, the LOD Cloud has the potential to support a variety of applications ranging from open domain question answering to drug discovery. Despite its significant size (approx. 30 billion triples), the data is relatively sparely interlinked (approx. 400 million links). A semantically richer LOD Cloud is needed to fully realize its potential. Data in the LOD Cloud are currently interlinked mainly via the owl:sameAs property, which is inadequate for many applications. Additional properties capturing relations based on causality or partonomy are needed to enable the answering of complex questions and to support applications. In this paper, we present a solution to enrich the LOD Cloud by automatically detecting partonomic relationships, which are well-established, fundamental properties grounded in linguistics and philosophy. We empirically evaluate our solution across several domains, and show that our approach performs well on detecting partonomic properties between LOD Cloud data.
关联开放数据(LOD)云在过去几年中获得了显著的发展。LOD Cloud拥有超过275个相互关联的数据集,跨越不同的领域,如生命科学、地理、政治等,有潜力支持从开放领域问答到药物发现的各种应用。尽管它的规模相当大(大约。300亿个三元组),数据之间的相互关联相对较少(大约为1。4亿链接)。要充分发挥其潜力,需要语义更丰富的LOD Cloud。LOD Cloud中的数据目前主要通过owl:sameAs属性进行互联,这对于很多应用来说是不够的。为了能够回答复杂的问题并支持应用程序,还需要基于因果关系或局部关系捕获关系的附加属性。在本文中,我们提出了一种通过自动检测部分关系来丰富LOD云的解决方案,部分关系是建立在语言学和哲学基础上的成熟的基本属性。我们在多个领域对我们的解决方案进行了经验评估,并表明我们的方法在检测LOD Cloud数据之间的局部属性方面表现良好。
{"title":"Moving beyond SameAs with PLATO: partonomy detection for linked data","authors":"Prateek Jain, P. Hitzler, Kunal Verma, P. Yeh, A. Sheth","doi":"10.1145/2309996.2310004","DOIUrl":"https://doi.org/10.1145/2309996.2310004","url":null,"abstract":"The Linked Open Data (LOD) Cloud has gained significant traction over the past few years. With over 275 interlinked datasets across diverse domains such as life science, geography, politics, and more, the LOD Cloud has the potential to support a variety of applications ranging from open domain question answering to drug discovery.\u0000 Despite its significant size (approx. 30 billion triples), the data is relatively sparely interlinked (approx. 400 million links). A semantically richer LOD Cloud is needed to fully realize its potential. Data in the LOD Cloud are currently interlinked mainly via the owl:sameAs property, which is inadequate for many applications. Additional properties capturing relations based on causality or partonomy are needed to enable the answering of complex questions and to support applications.\u0000 In this paper, we present a solution to enrich the LOD Cloud by automatically detecting partonomic relationships, which are well-established, fundamental properties grounded in linguistics and philosophy. We empirically evaluate our solution across several domains, and show that our approach performs well on detecting partonomic properties between LOD Cloud data.","PeriodicalId":91270,"journal":{"name":"HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media","volume":"12 1","pages":"33-42"},"PeriodicalIF":0.0,"publicationDate":"2012-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82662764","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Early detection of buzzwords based on large-scale time-series analysis of blog entries 基于大规模博客时间序列分析的热词早期检测
Shinsuke Nakajima, Jianwei Zhang, Y. Inagaki, Reyn Y. Nakamoto
In this paper, we discuss a method for early detection of "gradual buzzwords" by analyzing time-series data of blog entries. We observe the process in which certain topics grow to become major buzzwords and determine the key indicators that are necessary for their early detection. From the analysis results based on 81,922,977 blog entries from 3,776,154 blog websites posted in the past two years, we find that as topics grow to become major buzzwords, the percentages of blog entries from the blogger communities closely related to the target buzzword decrease gradually, and the percentages of blog entries from the weakly related blogger communities increase gradually. We then describe a method for early detection of these buzzwords, which is dependent on identifying the blogger communities which are closely related to these buzzwords. Moreover, we verify the effectiveness of the proposed method through experimentation that compares the rankings of several buzzword candidates with a real-life idol group popularity competition.
本文通过分析博客条目的时间序列数据,探讨了一种“渐进式流行语”的早期检测方法。我们观察某些话题成长为主要流行语的过程,并确定早期发现这些话题所需的关键指标。通过对3776154个博客网站近两年发布的81922977篇博客文章的分析结果发现,随着话题成为主要流行语,与目标流行语密切相关的博客社区的博客文章所占比例逐渐下降,而与目标流行语关联度较弱的博客社区的博客文章所占比例逐渐上升。然后,我们描述了一种早期检测这些流行语的方法,该方法依赖于识别与这些流行语密切相关的博客社区。此外,我们通过实验验证了所提出方法的有效性,该实验将几个流行语候选人的排名与现实生活中的偶像团体人气竞争进行比较。
{"title":"Early detection of buzzwords based on large-scale time-series analysis of blog entries","authors":"Shinsuke Nakajima, Jianwei Zhang, Y. Inagaki, Reyn Y. Nakamoto","doi":"10.1145/2309996.2310042","DOIUrl":"https://doi.org/10.1145/2309996.2310042","url":null,"abstract":"In this paper, we discuss a method for early detection of \"gradual buzzwords\" by analyzing time-series data of blog entries. We observe the process in which certain topics grow to become major buzzwords and determine the key indicators that are necessary for their early detection. From the analysis results based on 81,922,977 blog entries from 3,776,154 blog websites posted in the past two years, we find that as topics grow to become major buzzwords, the percentages of blog entries from the blogger communities closely related to the target buzzword decrease gradually, and the percentages of blog entries from the weakly related blogger communities increase gradually. We then describe a method for early detection of these buzzwords, which is dependent on identifying the blogger communities which are closely related to these buzzwords. Moreover, we verify the effectiveness of the proposed method through experimentation that compares the rankings of several buzzword candidates with a real-life idol group popularity competition.","PeriodicalId":91270,"journal":{"name":"HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media","volume":"77 1","pages":"275-284"},"PeriodicalIF":0.0,"publicationDate":"2012-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83503812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
期刊
HT ... : the proceedings of the ... ACM Conference on Hypertext and Social Media. ACM Conference on Hypertext and Social Media
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1