首页 > 最新文献

2017 14th Web Information Systems and Applications Conference (WISA)最新文献

英文 中文
Keyword Extraction for Social Media Short Text 社交媒体短文本关键字提取
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.12
Dexin Zhao, Nana Du, Zhi Chang, Yukun Li
With the booming development of social media in recent years, researchers have begun to pay more attention to extracting personal profiles from information. Keyword extraction plays an important role in extracting personal profiles. However, most of the previous studies are only valid for ordinary text, but not ideal for social media short text. In this paper, we propose an improved method for keyword extraction based on Word2vec and Textrank to solve the unique problem of social media short text. Our approach uses the Word2vec to capture the semantic features between words in selected text, and meanwhile naturally fuses the word frequency, semantic relation and directional relation into Textrank to extract keywords. We conduct the experiments on the three datasets. The experimental results show the superior performance of our method in keyword extraction.
近年来,随着社交媒体的蓬勃发展,从信息中提取个人资料的研究开始受到越来越多的关注。关键词提取在个人资料提取中起着重要的作用。然而,以往的研究大多只对普通文本有效,对社交媒体短文本的研究并不理想。本文提出了一种基于Word2vec和Textrank的改进关键字提取方法,以解决社交媒体短文本特有的问题。我们的方法是利用Word2vec捕获所选文本中词之间的语义特征,同时将词频、语义关系和方向关系自然地融合到Textrank中提取关键词。我们在三个数据集上进行实验。实验结果表明,该方法在关键字提取方面具有优异的性能。
{"title":"Keyword Extraction for Social Media Short Text","authors":"Dexin Zhao, Nana Du, Zhi Chang, Yukun Li","doi":"10.1109/WISA.2017.12","DOIUrl":"https://doi.org/10.1109/WISA.2017.12","url":null,"abstract":"With the booming development of social media in recent years, researchers have begun to pay more attention to extracting personal profiles from information. Keyword extraction plays an important role in extracting personal profiles. However, most of the previous studies are only valid for ordinary text, but not ideal for social media short text. In this paper, we propose an improved method for keyword extraction based on Word2vec and Textrank to solve the unique problem of social media short text. Our approach uses the Word2vec to capture the semantic features between words in selected text, and meanwhile naturally fuses the word frequency, semantic relation and directional relation into Textrank to extract keywords. We conduct the experiments on the three datasets. The experimental results show the superior performance of our method in keyword extraction.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115719589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
User Identification across Social Networks Based on Global View Features 基于全局视图特征的跨社交网络用户识别
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.36
Shuo Feng, Qian Wang, Derong Shen, Yue Kou, Tiezheng Nie, Ge Yu
Nowadays, people prefer to take part in multiple social networks to enjoy different kinds of services. Consequently, a significant task is to identify users across networks. Most state-of-the-art works on this issue exploit user local structure features (e.g., friend, follow and followed). In this paper, we first proposes the notion of user global view features, which represent the location of users in the network. Then, we present an iterative two-stage algorithm (GAUI) using Global view features with user Attribute features to solve User Identification. In GAUI, we iteratively update pairwise similarity and predict new matching users. Certainly, we present a community based core anchor link filter strategy to reduce the computation cost, and present a stable matching based mapping strategy to improve the accuracy. At last, the experiments conducted on two real-world aligned networks demonstrate that our method has better performance on precision and recall.
如今,人们更喜欢参与多个社交网络来享受不同种类的服务。因此,一个重要的任务是识别跨网络的用户。在这个问题上,大多数最先进的工作都利用了用户本地结构特征(例如,好友、关注和被关注)。在本文中,我们首先提出了用户全局视图特征的概念,它表示用户在网络中的位置。然后,我们提出了一种使用全局视图特征和用户属性特征的迭代两阶段算法(GAUI)来解决用户识别问题。在gai中,我们迭代更新两两相似度并预测新的匹配用户。当然,我们提出了一种基于社区的核心锚链过滤策略来降低计算成本,并提出了一种基于稳定匹配的映射策略来提高精度。最后,在两个真实的对齐网络上进行的实验表明,我们的方法在查准率和查全率上都有更好的性能。
{"title":"User Identification across Social Networks Based on Global View Features","authors":"Shuo Feng, Qian Wang, Derong Shen, Yue Kou, Tiezheng Nie, Ge Yu","doi":"10.1109/WISA.2017.36","DOIUrl":"https://doi.org/10.1109/WISA.2017.36","url":null,"abstract":"Nowadays, people prefer to take part in multiple social networks to enjoy different kinds of services. Consequently, a significant task is to identify users across networks. Most state-of-the-art works on this issue exploit user local structure features (e.g., friend, follow and followed). In this paper, we first proposes the notion of user global view features, which represent the location of users in the network. Then, we present an iterative two-stage algorithm (GAUI) using Global view features with user Attribute features to solve User Identification. In GAUI, we iteratively update pairwise similarity and predict new matching users. Certainly, we present a community based core anchor link filter strategy to reduce the computation cost, and present a stable matching based mapping strategy to improve the accuracy. At last, the experiments conducted on two real-world aligned networks demonstrate that our method has better performance on precision and recall.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130284725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Method of the Association Statistics between the Cause of Action and the Statutes 诉因与法规关联统计方法研究
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.3
Yi Feng, Jidong Ge, Yemao Zhou, Chuanyi Li, Zhongjin Li, Xiaoyu Zhou, B. Luo
This paper presents a method of the association statistics between the cause of action and the statute. According to the close relationship between the cause of action and the statute in the written judgment, this paper puts forward the statistical analysis of the cause of action and the statute. The method mainly includes the pretreatment of semi-structured written judgments, reading information of the cause of action and the statute from structured documents, standardizing statutes, depositing in the database, generating EXCEL form of the association statistics from the cause of action to the statue and generating TXT form of the association statistics from the statue to the cause of action. In the process of reasoning and assessment, we can achieve the prediction of statutes and narrow the size of the cause of action.
本文提出了一种诉因与法规关联统计的方法。根据判决书中诉因与成文法的密切关系,本文提出了对诉因与成文法的统计分析。该方法主要包括对半结构化的书面判决书进行预处理,从结构化文件中读取案由和成文法的信息,对成文法进行标准化,存入数据库,生成从案由到案由的关联统计数据的EXCEL格式,生成从案由到案由的关联统计数据的TXT格式。在推理和评估的过程中,我们可以实现法规的预测,缩小诉因的大小。
{"title":"A Method of the Association Statistics between the Cause of Action and the Statutes","authors":"Yi Feng, Jidong Ge, Yemao Zhou, Chuanyi Li, Zhongjin Li, Xiaoyu Zhou, B. Luo","doi":"10.1109/WISA.2017.3","DOIUrl":"https://doi.org/10.1109/WISA.2017.3","url":null,"abstract":"This paper presents a method of the association statistics between the cause of action and the statute. According to the close relationship between the cause of action and the statute in the written judgment, this paper puts forward the statistical analysis of the cause of action and the statute. The method mainly includes the pretreatment of semi-structured written judgments, reading information of the cause of action and the statute from structured documents, standardizing statutes, depositing in the database, generating EXCEL form of the association statistics from the cause of action to the statue and generating TXT form of the association statistics from the statue to the cause of action. In the process of reasoning and assessment, we can achieve the prediction of statutes and narrow the size of the cause of action.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122269146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Visualization of Linked Biomedical Data Using Cluster Chart 利用聚类图可视化关联生物医学数据
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.43
Yiran Shan, Xin Wang
With the continuous increasing of biomedical data, how to effectively use these large-scale data sets has become an urgent problem. It is also an essential issue to make benefit to users by consuming these biomedical data on the Semantic Web in a reasonable way. We present a visualization approach based on a tree-like layered interactive user interface, realize the queries of the relationships between targets, compounds, and diseases, and show the width of the path between the two biological entities according to their correlations. Furthermore, we design an iterative query method, which can find not only direct results of the input entity, but also extended results with some similarities of the input entity. Thus, the potential relationships among the extended results can be further investigated by biomedical scientists. Therefore, we have developed a user-friendly visualization system that can leverage the rich sets of the linked biomedical data.
随着生物医学数据的不断增加,如何有效地利用这些大规模数据集已成为一个亟待解决的问题。如何在语义网上合理地消费这些生物医学数据,使用户受益也是一个重要的问题。我们提出了一种基于树状分层交互用户界面的可视化方法,实现了对靶点、化合物和疾病之间关系的查询,并根据它们之间的相关性显示了两个生物实体之间的路径宽度。此外,我们设计了一种迭代查询方法,不仅可以找到输入实体的直接结果,还可以找到与输入实体具有一定相似性的扩展结果。因此,生物医学科学家可以进一步研究扩展结果之间的潜在关系。因此,我们开发了一个用户友好的可视化系统,可以利用丰富的生物医学数据集。
{"title":"Visualization of Linked Biomedical Data Using Cluster Chart","authors":"Yiran Shan, Xin Wang","doi":"10.1109/WISA.2017.43","DOIUrl":"https://doi.org/10.1109/WISA.2017.43","url":null,"abstract":"With the continuous increasing of biomedical data, how to effectively use these large-scale data sets has become an urgent problem. It is also an essential issue to make benefit to users by consuming these biomedical data on the Semantic Web in a reasonable way. We present a visualization approach based on a tree-like layered interactive user interface, realize the queries of the relationships between targets, compounds, and diseases, and show the width of the path between the two biological entities according to their correlations. Furthermore, we design an iterative query method, which can find not only direct results of the input entity, but also extended results with some similarities of the input entity. Thus, the potential relationships among the extended results can be further investigated by biomedical scientists. Therefore, we have developed a user-friendly visualization system that can leverage the rich sets of the linked biomedical data.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"18 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131775000","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Asymmetric Item-Item Similarity Measure for Linked Open Data Enabled Collaborative Filtering 链接开放数据支持协同过滤的非对称项-项相似性度量
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.23
Chengwang Mao, Zhuoming Xu, Xiuli Wang
The boom in Linked Open Data (LOD) has recently stimulated the research of a new generation of recommender systems—LOD-enabled recommender systems, in which the similarity measure for LOD is one of the core issues. The partitioned information content (PIC)-based semantic similarity (PICSS) is a newly developed symmetric similarity measure for LOD. However, recent studies have shown that asymmetric similarity measures are more effective than symmetric similarity measures in solving recommendation problems. In this paper we develop an asymmetric item-item similarity measure for LOD—the asymmetric PIC-based semantic similarity measure (APICSS), which applies our proposed two notions: the proportion of common PIC between two resources in the PIC of a resource and the PIC difference between two resources, on the basis of the notion of PIC. Experimental evaluation with the item-based collaborative filtering method on the MovieLens 100k dataset, the DBpedia 2016-04 release, and the DBpedia-MovieLens 100k dataset shows that our APICSS measure outperforms the PICSS measure in terms of both Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE). The average RMSE accuracy has an increase of 1.58% and the maximum RMSE accuracy has an increase of 2.07%, compared to PICSS. The average MAE accuracy has an increase of 1.63% and the maximum MAE accuracy has an increase of 2.19%, compared to PICSS.
近年来,关联开放数据(LOD)的蓬勃发展刺激了新一代推荐系统——支持LOD的推荐系统的研究,其中LOD的相似度度量是核心问题之一。基于分区信息内容(PIC)的语义相似度(PICSS)是一种新的LOD对称相似度度量方法。然而,最近的研究表明,在解决推荐问题时,非对称相似度度量比对称相似度度量更有效。本文提出了一种基于非对称PIC的语义相似度量(APICSS),它应用了我们提出的两个概念:在PIC概念的基础上,两个资源之间共同PIC在资源PIC中的比例和两个资源之间PIC的差异。基于项目的协同过滤方法在MovieLens 100k数据集、DBpedia 2016-04版本和DBpedia-MovieLens 100k数据集上的实验评估表明,我们的APICSS测量在均方根误差(RMSE)和平均绝对误差(MAE)方面都优于PICSS测量。与PICSS相比,平均RMSE精度提高了1.58%,最大RMSE精度提高了2.07%。与PICSS相比,平均MAE准确率提高了1.63%,最大MAE准确率提高了2.19%。
{"title":"Asymmetric Item-Item Similarity Measure for Linked Open Data Enabled Collaborative Filtering","authors":"Chengwang Mao, Zhuoming Xu, Xiuli Wang","doi":"10.1109/WISA.2017.23","DOIUrl":"https://doi.org/10.1109/WISA.2017.23","url":null,"abstract":"The boom in Linked Open Data (LOD) has recently stimulated the research of a new generation of recommender systems—LOD-enabled recommender systems, in which the similarity measure for LOD is one of the core issues. The partitioned information content (PIC)-based semantic similarity (PICSS) is a newly developed symmetric similarity measure for LOD. However, recent studies have shown that asymmetric similarity measures are more effective than symmetric similarity measures in solving recommendation problems. In this paper we develop an asymmetric item-item similarity measure for LOD—the asymmetric PIC-based semantic similarity measure (APICSS), which applies our proposed two notions: the proportion of common PIC between two resources in the PIC of a resource and the PIC difference between two resources, on the basis of the notion of PIC. Experimental evaluation with the item-based collaborative filtering method on the MovieLens 100k dataset, the DBpedia 2016-04 release, and the DBpedia-MovieLens 100k dataset shows that our APICSS measure outperforms the PICSS measure in terms of both Root Mean Squared Error (RMSE) and Mean Absolute Error (MAE). The average RMSE accuracy has an increase of 1.58% and the maximum RMSE accuracy has an increase of 2.07%, compared to PICSS. The average MAE accuracy has an increase of 1.63% and the maximum MAE accuracy has an increase of 2.19%, compared to PICSS.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127401522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Extracting Log Patterns Based on Association Analysis for Power Quality Disturbance Detection 基于关联分析的对数模式提取用于电能质量干扰检测
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.15
D. Feng, Tongxun Wang, Chen Liu, Shen Su
To detect anomalies according to system log is a hot topic recently. For the harmonic monitoring system of the power grid, the common practice of anomaly detection is to conduct machine learning. The learning model is trained with the historical anomaly data, and used for online detection. The premise of this method is to predefine a set of indicators as the input features of the machine learning model. However, existing methods rely mainly on business experience to extract such indicators, which limits the scope of the indicators used for data analysis, but also limits the accuracy of power quality perturbation analysis. In this paper, we propose an algorithm for power quality disturbance detection which investigates the correlation among the harmonic monitoring indicators, and extract the frequently concurrent abnormal indicators as the features to locate power quality disturbance detection. With the verification of the historical disturbance records, we prove that our algorithm can effectively detect the power quality disturbing events.
根据系统日志进行异常检测是近年来的研究热点。对于电网谐波监测系统,异常检测的常用做法是进行机器学习。利用历史异常数据对学习模型进行训练,并用于在线检测。该方法的前提是预先定义一组指标作为机器学习模型的输入特征。然而,现有方法主要依靠业务经验提取此类指标,这限制了用于数据分析的指标范围,也限制了电能质量摄动分析的准确性。本文提出了一种电能质量干扰检测算法,该算法研究谐波监测指标之间的相关性,提取频繁并发的异常指标作为电能质量干扰检测定位的特征。通过对历史扰动记录的验证,证明了该算法能够有效地检测出电能质量扰动事件。
{"title":"Extracting Log Patterns Based on Association Analysis for Power Quality Disturbance Detection","authors":"D. Feng, Tongxun Wang, Chen Liu, Shen Su","doi":"10.1109/WISA.2017.15","DOIUrl":"https://doi.org/10.1109/WISA.2017.15","url":null,"abstract":"To detect anomalies according to system log is a hot topic recently. For the harmonic monitoring system of the power grid, the common practice of anomaly detection is to conduct machine learning. The learning model is trained with the historical anomaly data, and used for online detection. The premise of this method is to predefine a set of indicators as the input features of the machine learning model. However, existing methods rely mainly on business experience to extract such indicators, which limits the scope of the indicators used for data analysis, but also limits the accuracy of power quality perturbation analysis. In this paper, we propose an algorithm for power quality disturbance detection which investigates the correlation among the harmonic monitoring indicators, and extract the frequently concurrent abnormal indicators as the features to locate power quality disturbance detection. With the verification of the historical disturbance records, we prove that our algorithm can effectively detect the power quality disturbing events.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123181471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Building Top-k Consistent Results for Web Table Augmentation 为Web表增强构建Top-k一致的结果
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.30
Fei Qi, Xiaoyu Wu, Ning Wang
Web table augmentation enables users to augment attributes based on key column and other known information. For table augmentation, most of systems return a single result which could not meet the users' needs of selection and validation. Furthermore, previous works only consider the entity-attribute binary tables with the first column corresponding to the entity name and the second to an attribute to be extended. When a table has multiple columns to be extended, the result table consolidated by binary tables will suffer from entity inconsistency. In this paper, we present a framework called TAT to build Top-k consistent results for web table augmentation. While ensuring the consistency of entities, TAT provides as diverse results as possible. We design two algorithms, exclusive and iterative algorithm, for web table augmentation that return Top-k results based on different requirements from users. The experiments show that TAT could return Top-k consistent results without loss of precision or coverage.
Web表增强使用户能够根据键列和其他已知信息增强属性。对于表扩展,大多数系统返回一个单一的结果,不能满足用户选择和验证的需要。此外,以前的工作只考虑实体-属性二进制表,其中第一列对应实体名称,第二列对应要扩展的属性。当一个表有多个列要扩展时,由二进制表合并的结果表将遭受实体不一致的问题。在本文中,我们提出了一个名为TAT的框架,用于构建web表增强的Top-k一致结果。在确保实体一致性的同时,TAT提供尽可能多样化的结果。我们设计了排他算法和迭代算法两种web表增强算法,根据用户的不同需求返回Top-k结果。实验表明,TAT可以在不损失精度和覆盖范围的情况下返回Top-k一致的结果。
{"title":"Building Top-k Consistent Results for Web Table Augmentation","authors":"Fei Qi, Xiaoyu Wu, Ning Wang","doi":"10.1109/WISA.2017.30","DOIUrl":"https://doi.org/10.1109/WISA.2017.30","url":null,"abstract":"Web table augmentation enables users to augment attributes based on key column and other known information. For table augmentation, most of systems return a single result which could not meet the users' needs of selection and validation. Furthermore, previous works only consider the entity-attribute binary tables with the first column corresponding to the entity name and the second to an attribute to be extended. When a table has multiple columns to be extended, the result table consolidated by binary tables will suffer from entity inconsistency. In this paper, we present a framework called TAT to build Top-k consistent results for web table augmentation. While ensuring the consistency of entities, TAT provides as diverse results as possible. We design two algorithms, exclusive and iterative algorithm, for web table augmentation that return Top-k results based on different requirements from users. The experiments show that TAT could return Top-k consistent results without loss of precision or coverage.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"2016 27","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120968046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Collaborative Filtering Algorithm Based on User Similarity and Trust 一种基于用户相似度和信任度的协同过滤算法
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.21
Qingzhou Wu, Mengxing Huang, Yangzi Mu
Collaborative filtering algorithm is one of the most widely used algorithms in recommender systems and has demonstrated promising results. But it relies too much on similarity to find the nearest neighbors. Whatever, the trust between users is also an import factor needed to be considered. This paper proposed a collaborative filtering algorithm that combined the user similarity and trust to obtain a more appropriate nearest neighbors set. Users not only have same interests as their nearest neighbors, but also have higher level of acceptance in the items recom-mended by their nearest neighbors. Extensive experiments based on Film Trust and MovieLens datasets have shown that the approach has major potential in improving the accuracy of recommended item.
协同过滤算法是推荐系统中应用最广泛的算法之一,并取得了良好的效果。但它过于依赖相似性来找到最近的邻居。无论如何,用户之间的信任也是一个需要考虑的重要因素。本文提出了一种结合用户相似度和信任度的协同过滤算法,以获得更合适的最近邻集。用户不仅与最近的邻居有相同的兴趣,而且对最近的邻居推荐的物品也有更高的接受程度。基于Film Trust和MovieLens数据集的大量实验表明,该方法在提高推荐项目的准确性方面具有很大的潜力。
{"title":"A Collaborative Filtering Algorithm Based on User Similarity and Trust","authors":"Qingzhou Wu, Mengxing Huang, Yangzi Mu","doi":"10.1109/WISA.2017.21","DOIUrl":"https://doi.org/10.1109/WISA.2017.21","url":null,"abstract":"Collaborative filtering algorithm is one of the most widely used algorithms in recommender systems and has demonstrated promising results. But it relies too much on similarity to find the nearest neighbors. Whatever, the trust between users is also an import factor needed to be considered. This paper proposed a collaborative filtering algorithm that combined the user similarity and trust to obtain a more appropriate nearest neighbors set. Users not only have same interests as their nearest neighbors, but also have higher level of acceptance in the items recom-mended by their nearest neighbors. Extensive experiments based on Film Trust and MovieLens datasets have shown that the approach has major potential in improving the accuracy of recommended item.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116511230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A Big Data Framework for Electric Power Data Quality Assessment 电力数据质量评估的大数据框架
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.29
He Liu, Fupeng Huang, Han Li, Weiwei Liu, Tongxun Wang
Since a low-quality data may influence the effectiveness and reliability of applications, data quality is required to be guaranteed. Data quality assessment is considered as the foundation of the promotion of data quality, so it is essential to access the data quality before any other data related activities. In the electric power industry, more and more electric power data is continuously accumulated, and many electric power applications have been developed based on these data. In China, the power grid has many special characteristic, traditional big data assessment frameworks cannot be directly applied. Therefore, a big data framework for electric power data quality assessment is proposed. Based on big data techniques, the framework can accumulate both the real-time data and the history data, provide an integrated computation environment for electric power big data assessment, and support the storage of different types of data.
由于低质量的数据可能会影响应用的有效性和可靠性,因此需要保证数据质量。数据质量评估被认为是提升数据质量的基础,因此,在开展任何与数据相关的活动之前,首先要了解数据质量。在电力工业中,不断积累的电力数据越来越多,基于这些数据开发了许多电力应用。在中国,电网有许多特殊的特点,传统的大数据评估框架不能直接应用。为此,提出了电力数据质量评估的大数据框架。该框架基于大数据技术,能够积累实时数据和历史数据,为电力大数据评估提供一体化计算环境,并支持不同类型数据的存储。
{"title":"A Big Data Framework for Electric Power Data Quality Assessment","authors":"He Liu, Fupeng Huang, Han Li, Weiwei Liu, Tongxun Wang","doi":"10.1109/WISA.2017.29","DOIUrl":"https://doi.org/10.1109/WISA.2017.29","url":null,"abstract":"Since a low-quality data may influence the effectiveness and reliability of applications, data quality is required to be guaranteed. Data quality assessment is considered as the foundation of the promotion of data quality, so it is essential to access the data quality before any other data related activities. In the electric power industry, more and more electric power data is continuously accumulated, and many electric power applications have been developed based on these data. In China, the power grid has many special characteristic, traditional big data assessment frameworks cannot be directly applied. Therefore, a big data framework for electric power data quality assessment is proposed. Based on big data techniques, the framework can accumulate both the real-time data and the history data, provide an integrated computation environment for electric power big data assessment, and support the storage of different types of data.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128072081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
A Trusted Service Selection Method Based on User's Personality Feature and Service Recommendation 基于用户个性特征和服务推荐的可信服务选择方法
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.55
Weijin Jiang, Jiahui Chen, Qijie Feng
In view of the fact that the QoS-based service selection method in the current service selection method is less concerned with the personality attribute characteristics of the service requesters and the service selection method based on collaborative filtering, the service providers', Based on the characteristics of the personality of the service requester, this paper describes the user correlation by defining the similarity and domain relevance of the user, and the calculation method of the recommended credibility is given by using the credible measurement theory. Using the analytic hierarchy process (AHP) to determine the weight of each correlation factor, this paper proposes a credible service selection model based on collaborative filtering service selection trust model (SSTM). The simulation results show that the model can effectively improve the efficiency of service selection and resist the attack of malicious feedback. There are two major innovations as following: Firstly, to make a introduction of user relevance to reflect the degree of close between two users (service requester) under the network environment; to apply the user’s personality attribute characteristics to the service provider reputation value during the prediction, to improve the accuracy of service selection by reducing the size of service providers. Secondly, combining user relevance and recommendation credibility organically, using AHP to determine the weight of relevant factors in the service selection index system so that we can make the reputation of the predicted service provider more reliable and effectively resist the malicious user feedback.
针对当前服务选择方法中基于qos的服务选择方法对服务请求者的个性属性特征关注较少,而基于协同过滤的服务选择方法对服务提供者的个性特征关注较少的问题,本文根据服务请求者的个性特征,通过定义用户的相似性和领域相关性来描述用户相关性。并利用可信度测量理论给出了推荐信度的计算方法。利用层次分析法(AHP)确定各相关因素的权重,提出了一种基于协同过滤的可信服务选择信任模型(SSTM)。仿真结果表明,该模型能有效提高服务选择效率,抵御恶意反馈攻击。主要创新有两点:一是引入用户相关性,反映网络环境下两个用户(服务请求者)之间的密切程度;在预测过程中将用户的人格属性特征应用到服务提供商声誉值中,通过减小服务提供商的规模来提高服务选择的准确性。其次,将用户相关性和推荐可信度有机地结合起来,利用层次分析法确定服务选择指标体系中相关因素的权重,使预测的服务提供者的信誉更加可靠,有效抵御恶意用户反馈。
{"title":"A Trusted Service Selection Method Based on User's Personality Feature and Service Recommendation","authors":"Weijin Jiang, Jiahui Chen, Qijie Feng","doi":"10.1109/WISA.2017.55","DOIUrl":"https://doi.org/10.1109/WISA.2017.55","url":null,"abstract":"In view of the fact that the QoS-based service selection method in the current service selection method is less concerned with the personality attribute characteristics of the service requesters and the service selection method based on collaborative filtering, the service providers', Based on the characteristics of the personality of the service requester, this paper describes the user correlation by defining the similarity and domain relevance of the user, and the calculation method of the recommended credibility is given by using the credible measurement theory. Using the analytic hierarchy process (AHP) to determine the weight of each correlation factor, this paper proposes a credible service selection model based on collaborative filtering service selection trust model (SSTM). The simulation results show that the model can effectively improve the efficiency of service selection and resist the attack of malicious feedback. There are two major innovations as following: Firstly, to make a introduction of user relevance to reflect the degree of close between two users (service requester) under the network environment; to apply the user’s personality attribute characteristics to the service provider reputation value during the prediction, to improve the accuracy of service selection by reducing the size of service providers. Secondly, combining user relevance and recommendation credibility organically, using AHP to determine the weight of relevant factors in the service selection index system so that we can make the reputation of the predicted service provider more reliable and effectively resist the malicious user feedback.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133151514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2017 14th Web Information Systems and Applications Conference (WISA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1