2017 14th Web Information Systems and Applications Conference (WISA): Latest Papers

Checking the Statutes in Chinese Judgment Document Based on Editing Distance Algorithm
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.1
Yufan Yang, Yi Feng, Jidong Ge, Yemao Zhou, Jin Zeng, Chuanyi Li, B. Luo
As the informatization of the Chinese People's Courts advances, the courts' extraction and application of information has extended from structured data to semi-structured and unstructured data. In in-depth studies of judgment documents, many cases require the document result to be collected as an important data dimension. Because the statute is the core of that result, the completeness and correctness of the extracted statutes are key to judgment document processing. However, when a judgment document is written, the same statute can appear in different string forms because of variation in writing, which leads directly to erroneous source data. Comparing the edit distance between strings indicates, to a certain extent, how similar they are. An automatic method based on the edit distance algorithm is therefore devised; it constructs a disparity model between different statute strings to obtain a standardized form for data of the same type. The method removes non-standard statute forms and ultimately yields a standard statute collection. It is more efficient than enumerating all written variants, which requires manual participation as well as additional data storage and updates.
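
The edit-distance comparison described in this abstract can be illustrated with a minimal sketch. It is not the authors' disparity model: the function names, the length-normalized threshold `max_ratio`, and the sample statute strings are all assumptions made for illustration. It computes the standard Levenshtein distance and maps a raw statute string to the closest entry in a list of canonical forms.

```python
from typing import List, Optional


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a rolling one-row table."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def normalize(raw: str, canonical: List[str],
              max_ratio: float = 0.3) -> Optional[str]:
    """Return the closest canonical form, or None if nothing is close enough.

    max_ratio is a hypothetical cut-off on distance / longer length.
    """
    if not canonical:
        return None
    best = min(canonical, key=lambda c: edit_distance(raw, c))
    ratio = edit_distance(raw, best) / max(len(raw), len(best), 1)
    return best if ratio <= max_ratio else None
```

With canonical forms such as `["Criminal Law Article 264", "Contract Law Article 52"]`, the variant `"Criminal Law Art. 264"` is four edits away from the first entry and would be mapped to it, while an unrelated string falls outside the threshold and returns `None`.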
Citations: 0
Information Extraction from Chinese Judgment Documents
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.67
C. Zhuang, Yemao Zhou, Jidong Ge, Zhongjin Li, Chuanyi Li, Xiaoyu Zhou, B. Luo
Judgment documents contain a wealth of valuable information. The original judgment documents are written as plain text, so information cannot be obtained from them directly, which hinders their study. To solve this problem, we propose an approach that parses Chinese judgment documents into structured documents. The approach divides a judgment document into logical segments, then extracts and labels information items from these segments. The information items are used to build an analytic document information model, which is output as a structured XML document.
Citations: 4
Research on Influence Evaluation Based on RkNN and Its Application in Location Problem
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.40
Jingke Xu, Xuefa Xia, Huanliang Sun, Shoujing Wang, Ge Yu
The influence of a spatial position describes how strongly it affects spatial objects and can be measured by the number of affected objects. Evaluating spatial position influence, which is widely used in architectural planning and facility location, is a typical problem in spatial database research. In previous studies, a spatial object was assumed to affect only one spatial position, and influence was calculated from the number of spatial objects in the area. However, a spatial object can affect many spatial positions, and the effects are multiple. In this study, we provide a new evaluation model based on RkNN. A new measurement method is proposed that weights each contribution by the distance between the spatial object and the spatial position, which makes the model better suited to practical application. In addition, a location algorithm is proposed based on the RkNN influence evaluation model. The algorithm addresses problems such as enabling facilities to provide the best service to customers and using each facility effectively. It calculates the influence of each facility and evaluates the rationality of a location scheme by an equilibrium coefficient: the smaller the coefficient, the more reasonable the scheme. The location algorithm based on the new model performs better in practical application and contributes to more reasonable and effective facility location.
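
A distance-weighted RkNN (reverse k-nearest-neighbor) influence score of the kind this abstract describes might be sketched as follows. The abstract does not give the weighting formula, so the weight `1 / (1 + d)`, the function name, and the brute-force search are assumptions for illustration only. Each object contributes to every facility that is among its k nearest facilities, and nearer objects contribute more.

```python
import math
from typing import Dict, List, Tuple

Point = Tuple[float, float]


def rknn_influence(facilities: Dict[str, Point],
                   objects: List[Point],
                   k: int = 1) -> Dict[str, float]:
    """Sum distance-weighted contributions of objects to their k nearest facilities."""
    influence = {name: 0.0 for name in facilities}
    for obj in objects:
        # Rank facilities by Euclidean distance from this object (brute force).
        ranked = sorted(facilities, key=lambda n: math.dist(facilities[n], obj))
        for name in ranked[:k]:
            d = math.dist(facilities[name], obj)
            influence[name] += 1.0 / (1.0 + d)  # hypothetical weight function
    return influence
```

For facilities `A` at (0, 0) and `B` at (10, 0) with objects at (0, 0), (1, 0) and (9, 0), `k=1` gives `A` a score of 1.5 and `B` a score of 0.5. A spatial index would replace the per-object sort on realistic data sizes.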
Citations: 1
USLD: A New Approach for Preserving Location Privacy in LBS
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.27
M. Ma, Yuejin Du
The development of LBS brings great convenience to our lives but also presents new challenges for privacy protection. Many existing methods are inadequate because their schemes assume that all users can be trusted, which is not practical. As a result, they cannot resist query sampling attacks or self-betrayal attacks. They also do not take location semantics into account, so they are vulnerable to location homogeneity attacks. To solve these problems, we introduce the concept of USLD (user similar location diversity). We consider the scenario in which some users are not trusted and the users chosen as candidates may be in locations that share the same semantics. Treating some users as untrustworthy, we propose that users whose privacy settings are similar to the real user's are more plausible than others. We select users who are similar to the real user using Adjusted Cosine Similarity, and use the Earth Mover's Distance to calculate location semantics. Our method resists query sampling attacks, self-betrayal attacks, and location homogeneity attacks well. Experiments show that our method is very practical.
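
The user-selection step mentions Adjusted Cosine Similarity. The abstract does not say how privacy settings are encoded, so the sketch below assumes each user is a numeric vector of settings and that the adjustment subtracts each setting's mean across all users before taking the cosine. The encoding, the function names, and the sample data are hypothetical, and the Earth Mover's Distance step is not shown.

```python
import math
from typing import List, Sequence


def column_means(rows: Sequence[Sequence[float]]) -> List[float]:
    """Mean of each setting (column) across all users (rows)."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def adjusted_cosine(u: Sequence[float], v: Sequence[float],
                    means: Sequence[float]) -> float:
    """Cosine similarity of two users after centering each setting on its mean."""
    du = [a - m for a, m in zip(u, means)]
    dv = [b - m for b, m in zip(v, means)]
    num = sum(x * y for x, y in zip(du, dv))
    den = math.sqrt(sum(x * x for x in du)) * math.sqrt(sum(y * y for y in dv))
    return num / den if den else 0.0  # treat a zero vector as "no similarity"
```

Given a real user and a pool of candidates, one could rank the candidates by this score and keep the highest-scoring ones; centering means that a setting on which everyone agrees carries no weight in the comparison.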
Citations: 2
A System for Recognizing Entities and Extracting Relations from Electronic Medical Records
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.54
Chi Chen, Hongxia Liu, Chunxiao Xing
Mining rich knowledge from clinical texts has become a popular topic. Knowledge graphs are widely used to integrate and manage abundant knowledge, and entity recognition and relation extraction play important roles in constructing them. In this paper, we develop a system that recognizes entities and extracts their relations from clinical texts in Electronic Medical Records. The system implements four major functions: manual entity annotation, automatic entity recognition, manual relation annotation, and automatic relation extraction. The entity and relation annotation tools help professionals manually annotate objects in the original clinical texts. Moreover, automatic entity recognition and relation extraction, which apply CRF and CNN, can be run before manual annotation to increase efficiency. The system has been used in several applications, such as medical knowledge graph construction and a health QA system.
Citations: 0
A Progressive Method for Detecting Duplication Entities Based on Bloom Filters
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.11
Yebing Luo, Tiezheng Nie, Derong Shen, Yue Kou, Ge Yu
As the volume of data grows rapidly, the cost of detecting duplicate entities in data cleaning has increased significantly. However, some real-time applications only need to identify as many duplicate entities as possible within a limited time, rather than all of them. Existing work uses sorting to divide similar records into blocks and arranges the processing order of the blocks to detect duplicate entities progressively. However, this approach works well only when the attributes of the records are suitable for sorting. This paper therefore proposes a novel progressive de-duplication method for records that cannot be sorted by their attributes. The method distributes records into blocks based on their features and generates a modified Bloom filter index for each block. It then uses the Bloom filter to predict the probability of duplicate entities in each block, which determines the processing order of the blocks so that duplicates are detected more quickly. Comprehensive experiments show that, within a finite time, this algorithm detects far more duplicates than the other algorithms considered in related work.
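
The block-ordering idea in this abstract can be illustrated with a plain Bloom filter. The paper's modified index is not described here, so this sketch makes its own assumptions: a standard Bloom filter built on SHA-256, and a hypothetical `duplicate_score` that counts how many record keys in a block were "probably seen" before they were inserted. Blocks with higher scores would be processed first.

```python
import hashlib
from typing import Iterator, List


class BloomFilter:
    """Standard Bloom filter: no false negatives, occasional false positives."""

    def __init__(self, num_bits: int = 1024, num_hashes: int = 3) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for readability

    def _positions(self, item: str) -> Iterator[int]:
        # Derive several hash positions by salting the item with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos] = 1

    def __contains__(self, item: str) -> bool:
        return all(self.bits[pos] for pos in self._positions(item))


def duplicate_score(keys: List[str]) -> float:
    """Fraction of keys already (probably) present when they arrive."""
    seen = BloomFilter()
    hits = 0
    for key in keys:
        if key in seen:
            hits += 1
        seen.add(key)
    return hits / len(keys) if keys else 0.0
```

Sorting blocks by `duplicate_score` in descending order gives one possible processing order. Because a Bloom filter can report false positives, the score is an estimate and a full comparison is still needed inside each block.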
Citations: 0
Research on Short-Time Prediction of Dynamical Local Replanning Route Guidance Method Based on HMM
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.32
Yongmei Zhao, Hongmei Zhang
Insufficient real-time response, accuracy, and intelligence have become key issues in the practical application of traffic guidance information services. This paper addresses these issues by proposing a new dynamic route guidance method. First, it establishes a concurrent global route search method, which searches multiple relatively static shortest routes and then obtains the shortest globally optimized route for the current traffic flow. Second, using a sliding window model, the method extracts the real-time traffic data stream reflected by spatial and temporal changes in vehicle locations. Combined with a hidden Markov model, the method can also forecast short-term traffic states and decide whether local replanning is necessary.
Citations: 2
A Collaborative Filtering Recommendation Algorithm for Social Interaction
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.26
Jinglong Zhang, Mengxing Huang, Yu Zhang
When a traditional collaborative filtering algorithm faces highly sparse data, the precision and quality of its recommendations become unsatisfactory. With the development of social networks, it is possible to selectively fill missing values in the user-item matrix using friendship or trust relationships from social networks. This paper considers the two steps of the memory-based collaborative filtering algorithm: similarity calculation and user rating prediction. It fills the missing values appropriately and improves memory-based collaborative filtering recommendation algorithms to integrate social relations. Experiments on the Epinions dataset show that the improved algorithm effectively alleviates the sparsity of user rating data and performs better than other classic algorithms on the RMSE and MAP evaluation metrics.
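
Filling missing user-item values from trust relationships could look roughly like the sketch below. The abstract does not specify the filling rule, so averaging the ratings of directly trusted users is an assumption, as are the function name and the data layout (nested dictionaries keyed by user, then item). Existing ratings are never overwritten and the input is left unmodified.

```python
from typing import Dict, Iterable, List

Ratings = Dict[str, Dict[str, float]]


def fill_from_trust(ratings: Ratings,
                    trust: Dict[str, Iterable[str]]) -> Ratings:
    """Return a copy of ratings with gaps filled from trusted users' ratings."""
    filled: Ratings = {user: dict(items) for user, items in ratings.items()}
    for user, friends in trust.items():
        own = ratings.get(user, {})
        borrowed: Dict[str, List[float]] = {}
        for friend in friends:
            for item, score in ratings.get(friend, {}).items():
                if item not in own:  # only fill items the user has not rated
                    borrowed.setdefault(item, []).append(score)
        for item, scores in borrowed.items():
            # Hypothetical rule: use the mean rating among trusted users.
            filled.setdefault(user, {})[item] = sum(scores) / len(scores)
    return filled
```

The denser matrix returned here would then feed the usual memory-based steps, similarity calculation followed by rating prediction, in place of the original sparse one.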
Citations: 3
Smart Business Cloud Based on Hadoop
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.20
Ouyang Hao, Wang Zhi Wen, H. Jin, H. Ping
Traditional business cannot provide suitable decisions for managers or personalized service for customers. With the development of computing and network technology, cloud computing and big data technology are now widely used in many fields. This paper presents a smart business cloud system based on Hadoop. It first uses Hadoop to build a cloud computing system that provides powerful storage and computing capability, then uses big data mining to analyze the large volume of data and derive rules or knowledge. A smart business system needs not only to compute and analyze, but also to collect data and push messages to customers. The system therefore provides a module that collects data through multiple sensors and the network, and uses Android to push personalized, valuable messages to different customers.
Citations: 0
New Influence Maximization Algorithm Research in Big Graph
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.50
Guigang Zhang, Sujie Li, Jian Wang, Ping Liu, Yibing Chen, Yunchuan Luo
Influence maximization is a very active research topic in social networks. However, it is difficult to find an algorithm that balances time complexity against the accuracy of the computed result. To address this problem, we propose two new algorithms. First, we present a heuristic algorithm based on the greedy algorithm, which greatly reduces time complexity while still producing good results. Second, we present an algorithm that applies the k-means idea to the IM problem, using it to find s seed nodes. We also provide proofs for both algorithms.
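
For context, the classic greedy baseline that heuristics of this kind build on can be sketched as follows. This is not either of the paper's two algorithms. It assumes the independent cascade diffusion model with a single propagation probability `p` and estimates spread by Monte Carlo simulation; the function names and default parameters are illustrative.

```python
import random
from typing import Dict, List


def simulate_ic(graph: Dict[str, List[str]], seeds: List[str],
                p: float, rng: random.Random) -> int:
    """Run one independent-cascade simulation and return the number of active nodes."""
    active = set(seeds)
    frontier = list(seeds)
    while frontier:
        next_frontier = []
        for u in frontier:
            for v in graph.get(u, ()):
                # Each newly active node gets one chance to activate a neighbor.
                if v not in active and rng.random() < p:
                    active.add(v)
                    next_frontier.append(v)
        frontier = next_frontier
    return len(active)


def greedy_seeds(graph: Dict[str, List[str]], k: int, p: float = 0.1,
                 runs: int = 200, seed: int = 0) -> List[str]:
    """Pick k seeds, each time adding the node with the largest estimated spread."""
    rng = random.Random(seed)
    nodes = set(graph) | {v for targets in graph.values() for v in targets}
    chosen: List[str] = []
    for _ in range(min(k, len(nodes))):
        best, best_spread = None, -1.0
        for cand in sorted(nodes - set(chosen)):
            total = sum(simulate_ic(graph, chosen + [cand], p, rng)
                        for _ in range(runs))
            if total / runs > best_spread:
                best, best_spread = cand, total / runs
        chosen.append(best)
    return chosen
```

Every greedy round re-simulates the cascade for each remaining candidate, which is the cost that motivates faster heuristics. Setting `p=1.0` makes the simulation deterministic (pure reachability), which is convenient for checking the selection logic on a small graph.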
Citations: 2