首页 > 最新文献

2017 14th Web Information Systems and Applications Conference (WISA)最新文献

英文 中文
Research on Portfolio Optimization Based on Affinity Propagation and Genetic Algorithm 基于亲和传播和遗传算法的投资组合优化研究
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.9
Chong Liu, Wenyan Gan, Yutian Chen
Portfolio optimization refers to the reasonable allocation of assets to achieve the investment objectives. At present, the investment environment depression was due to sluggish economic conditions. For investors, they expect to find a balance between return and risk in a complex investment environment. In order to solve this problem, this paper proposes a portfolio optimization algorithm named Portfolio Optimization based on Affinity propagation and Genetic algorithm, as also called POGA, which based on affinity propagation and genetic algorithm. Firstly, the affinity propagation algorithm is used to construct a candidate set of portfolio based on the correlation analysis of the stock time series. Secondly, using the Sharpe-ratio as the Optimization objective function, the genetic algorithm is used to solve an optimal portfolio strategy with higher-return and lower-risk. Finally, the experimental result of real-world stock data show that a portfolio with higher return and lower risk will be selected.
投资组合优化是指合理配置资产以实现投资目标。目前,投资环境的低迷是由于经济状况低迷。对于投资者来说,他们希望在复杂的投资环境中找到回报与风险之间的平衡。为了解决这一问题,本文提出了一种基于亲和传播和遗传算法的投资组合优化算法POGA,即基于亲和传播和遗传算法的组合优化算法。首先,基于股票时间序列的相关性分析,利用亲和传播算法构建候选投资组合集;其次,以夏普比率为优化目标函数,采用遗传算法求解高收益、低风险的最优投资组合策略;最后,实际股票数据的实验结果表明,将选择一个高收益、低风险的投资组合。
{"title":"Research on Portfolio Optimization Based on Affinity Propagation and Genetic Algorithm","authors":"Chong Liu, Wenyan Gan, Yutian Chen","doi":"10.1109/WISA.2017.9","DOIUrl":"https://doi.org/10.1109/WISA.2017.9","url":null,"abstract":"Portfolio optimization refers to the reasonable allocation of assets to achieve the investment objectives. At present, the investment environment depression was due to sluggish economic conditions. For investors, they expect to find a balance between return and risk in a complex investment environment. In order to solve this problem, this paper proposes a portfolio optimization algorithm named Portfolio Optimization based on Affinity propagation and Genetic algorithm, as also called POGA, which based on affinity propagation and genetic algorithm. Firstly, the affinity propagation algorithm is used to construct a candidate set of portfolio based on the correlation analysis of the stock time series. Secondly, using the Sharpe-ratio as the Optimization objective function, the genetic algorithm is used to solve an optimal portfolio strategy with higher-return and lower-risk. Finally, the experimental result of real-world stock data show that a portfolio with higher return and lower risk will be selected.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116004379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Cluster Correction on Polysemy and Synonymy 一词多义和同义词的聚类校正
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.45
Zemin Qin, Hao Lian, Tieke He, B. Luo
Document clustering (or text clustering) is the application of cluster analysis to textual documents. It has applications in automatic document organization, topic extraction and fast information retrieval or filtering. At the same time, there are still many challenges, for example the accuracy of clustering needs to be improved. In this regard, the process of cluster correction becomes the object of analysis. In this paper, we focus on the polysemy and synonymy issue in clustering process. Polysemy represents the ambiguity of an individual word or phrase that can be used (in different contexts) to express two or more different meanings. However, synonymy is the semantic relation that holds between two or more words that can (in a given context) express the same meaning. These two conditions will affect our results of clustering. In order that, we use bag of words model to distinguish contexts of the same words and word2vec to re-cluster word with the similar meaning. Cosine similarity is also use to measure of similarity between two nonzero vectors in these two model.
文档聚类(或文本聚类)是聚类分析在文本文档中的应用。它在自动文档组织、主题提取和快速信息检索或过滤等方面具有广泛的应用。同时,还存在许多挑战,如聚类的准确性有待提高。在这方面,聚类校正过程成为分析的对象。本文主要研究聚类过程中的一词多义问题。一词多义是指一个词或短语的歧义性,它可以(在不同的语境中)用来表达两种或更多不同的意思。然而,同义词是两个或多个单词之间的语义关系,这些单词(在给定的上下文中)可以表达相同的意思。这两个条件都会影响聚类的结果。为此,我们使用词袋模型来区分相同词的上下文,并使用word2vec对具有相似意思的词进行重新聚类。余弦相似度也用于度量这两个模型中两个非零向量之间的相似度。
{"title":"Cluster Correction on Polysemy and Synonymy","authors":"Zemin Qin, Hao Lian, Tieke He, B. Luo","doi":"10.1109/WISA.2017.45","DOIUrl":"https://doi.org/10.1109/WISA.2017.45","url":null,"abstract":"Document clustering (or text clustering) is the application of cluster analysis to textual documents. It has applications in automatic document organization, topic extraction and fast information retrieval or filtering. At the same time, there are still many challenges, for example the accuracy of clustering needs to be improved. In this regard, the process of cluster correction becomes the object of analysis. In this paper, we focus on the polysemy and synonymy issue in clustering process. Polysemy represents the ambiguity of an individual word or phrase that can be used (in different contexts) to express two or more different meanings. However, synonymy is the semantic relation that holds between two or more words that can (in a given context) express the same meaning. These two conditions will affect our results of clustering. In order that, we use bag of words model to distinguish contexts of the same words and word2vec to re-cluster word with the similar meaning. Cosine similarity is also use to measure of similarity between two nonzero vectors in these two model.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116669890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Topic Analysis and Influential Paper Discovery on Scientific Publications 科学出版物的主题分析与有影响力的论文发现
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.69
Ye Li, Jun He, Hongyan Liu
With the development of scientific research, scientific publications are valuable resources for new-comers in the research field. But massive scientific publications make it a challenge for researchers diving into a new research field. As a good practice to this problem, topics are put forward to organize publications. In this paper, we propose two modified LDA topic models as solutions to topic analysis and influential paper discovery on scientific publications, cc-LDA and cp-LDA. Compared to state-of-the-art researches on LDA, we incorporate citation information including its occurrence times and occurrence position into our models. Model cc-LDA integrates paper content and citation occurrence into LDA model, while cp-LDA considers both occurrence and position of citations. Both models can not only find topics in the form of citation distribution, but also help discover influential papers under certain topics. Furthermore, both models can extract more representative vectors for papers, which achieve good performance in subsequent clustering.
随着科学研究的发展,科学出版物是研究领域新人的宝贵资源。但是,大量的科学出版物使研究人员进入一个新的研究领域成为一个挑战。作为对这一问题的良好实践,提出了专题组织出版物。本文提出了两个改进的LDA主题模型cc-LDA和cp-LDA,以解决科学出版物的主题分析和有影响力的论文发现问题。与现有的LDA研究相比,我们将引文的出现次数和出现位置等信息纳入到模型中。cc-LDA模型将论文内容和被引频次集成到LDA模型中,而cp-LDA模型同时考虑被引频次和被引位置。这两种模型不仅可以以引文分布的形式找到主题,而且可以帮助发现特定主题下有影响力的论文。此外,这两种模型都可以为论文提取更多具有代表性的向量,从而在后续聚类中获得良好的性能。
{"title":"Topic Analysis and Influential Paper Discovery on Scientific Publications","authors":"Ye Li, Jun He, Hongyan Liu","doi":"10.1109/WISA.2017.69","DOIUrl":"https://doi.org/10.1109/WISA.2017.69","url":null,"abstract":"With the development of scientific research, scientific publications are valuable resources for new-comers in the research field. But massive scientific publications make it a challenge for researchers diving into a new research field. As a good practice to this problem, topics are put forward to organize publications. In this paper, we propose two modified LDA topic models as solutions to topic analysis and influential paper discovery on scientific publications, cc-LDA and cp-LDA. Compared to state-of-the-art researches on LDA, we incorporate citation information including its occurrence times and occurrence position into our models. Model cc-LDA integrates paper content and citation occurrence into LDA model, while cp-LDA considers both occurrence and position of citations. Both models can not only find topics in the form of citation distribution, but also help discover influential papers under certain topics. Furthermore, both models can extract more representative vectors for papers, which achieve good performance in subsequent clustering.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"114 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124792298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Proactive Data Service Model to Encapsulating Stream Sensor Data into Service 一种将流传感器数据封装到服务中的主动数据服务模型
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.5
Shouli Zhang, Chen Liu, Shen Su, Yanbo Han, Dandan Feng
Abnormality Detection in power plant is a typical IoT application which aims to identify anomalies in these routinely collected monitoring sensor data; intend to help detect possible faults in the equipment. However, on the development of abnormality detection, we find that there are three challenges. The first one is the lack of cooperation between sensors. It means that the physical sensors cannot share and interact with each other. Secondly, the rapid increase in volume of sensor data and dynamic situation of production result in challenges to predefine all possible associations between sensors. Thirdly, it is difficult to build IoT application for developers who have little or no professional knowledge about production process. In this paper, we proposed a proactive data service model to encapsulate stream sensor data into services. We spread events among the proactive data services. By analysis of event correlations, we have realized service hyperlinks which help to offer the proactive real-time interaction with services. Real application and experiments verified that our proactive data service based method is more effective compare with traditional rule-based methods to detect abnormalities in power plant.
电厂异常检测是典型的物联网应用,旨在识别这些常规采集的监测传感器数据中的异常;帮助发现设备中可能存在的故障。然而,在异常检测的发展过程中,我们发现存在着三个挑战。首先是传感器之间缺乏合作。这意味着物理传感器不能相互共享和交互。其次,传感器数据量的快速增长和生产的动态情况给预先定义传感器之间所有可能的关联带来了挑战。第三,对于没有或很少有生产流程专业知识的开发人员来说,很难构建物联网应用程序。本文提出了一种主动数据服务模型,将流传感器数据封装到服务中。我们在主动数据服务之间传播事件。通过对事件相关性的分析,实现了服务超链接,实现了与服务的主动实时交互。实际应用和实验证明,与传统的基于规则的电厂异常检测方法相比,基于主动数据服务的电厂异常检测方法更加有效。
{"title":"A Proactive Data Service Model to Encapsulating Stream Sensor Data into Service","authors":"Shouli Zhang, Chen Liu, Shen Su, Yanbo Han, Dandan Feng","doi":"10.1109/WISA.2017.5","DOIUrl":"https://doi.org/10.1109/WISA.2017.5","url":null,"abstract":"Abnormality Detection in power plant is a typical IoT application which aims to identify anomalies in these routinely collected monitoring sensor data; intend to help detect possible faults in the equipment. However, on the development of abnormality detection, we find that there are three challenges. The first one is the lack of cooperation between sensors. It means that the physical sensors cannot share and interact with each other. Secondly, the rapid increase in volume of sensor data and dynamic situation of production result in challenges to predefine all possible associations between sensors. Thirdly, it is difficult to build IoT application for developers who have little or no professional knowledge about production process. In this paper, we proposed a proactive data service model to encapsulate stream sensor data into services. We spread events among the proactive data services. By analysis of event correlations, we have realized service hyperlinks which help to offer the proactive real-time interaction with services. Real application and experiments verified that our proactive data service based method is more effective compare with traditional rule-based methods to detect abnormalities in power plant.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128978694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Method to Identify Personal Desktop Activities 识别个人桌面活动的方法
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.39
Ruolan Li, Huan Liao, Huili Su, Yukun Li, Yongxuan Lai
As people acquire much more personal information as a result of personal and work activities, the management of these information becomes a serious problem and an important research issue. Modeling personal desktop activities and identifying them are two basic problems for supporting activity-based operations. To the best of our knowledge there is no literature on formalizing and identifying desktop activity from personal information management perspective. There are a number of challenges to this work, including the fact that people exhibit personalized behaviors, have individual interests, needs and resources, no available experimental data set, etc. In this paper, we perform a user experiment to learn about user desktop activities in a personal information management context. We collected information access activities in a naturalistic setting and propose a conceptual activity model by analyzing features of user behaviors at their desktop computers. We present an effective and efficient method of automatically identifying desktop activities. To evaluate performance of our method, we develop a prototype system to collect real users activities, and evaluate our methods for identifying activities. The results verify the effectiveness and efficiency of our methods.
随着人们在生活和工作活动中获取越来越多的个人信息,这些信息的管理成为一个严重的问题,也是一个重要的研究课题。为个人桌面活动建模和识别它们是支持基于活动的操作的两个基本问题。据我们所知,目前还没有从个人信息管理的角度形式化和识别桌面活动的文献。这项工作面临着许多挑战,包括人们表现出个性化的行为,有个人的兴趣、需求和资源,没有可用的实验数据集等。在本文中,我们执行了一个用户实验,以了解个人信息管理上下文中的用户桌面活动。我们收集了一个自然环境下的信息访问活动,并通过分析用户在台式电脑上的行为特征,提出了一个概念活动模型。我们提出了一种有效的自动识别桌面活动的方法。为了评估我们方法的性能,我们开发了一个原型系统来收集真实的用户活动,并评估我们识别活动的方法。结果验证了该方法的有效性和高效性。
{"title":"A Method to Identify Personal Desktop Activities","authors":"Ruolan Li, Huan Liao, Huili Su, Yukun Li, Yongxuan Lai","doi":"10.1109/WISA.2017.39","DOIUrl":"https://doi.org/10.1109/WISA.2017.39","url":null,"abstract":"As people acquire much more personal information as a result of personal and work activities, the management of these information becomes a serious problem and an important research issue. Modeling personal desktop activities and identifying them are two basic problems for supporting activity-based operations. To the best of our knowledge there is no literature on formalizing and identifying desktop activity from personal information management perspective. There are a number of challenges to this work, including the fact that people exhibit personalized behaviors, have individual interests, needs and resources, no available experimental data set, etc. In this paper, we perform a user experiment to learn about user desktop activities in a personal information management context. We collected information access activities in a naturalistic setting and propose a conceptual activity model by analyzing features of user behaviors at their desktop computers. We present an effective and efficient method of automatically identifying desktop activities. To evaluate performance of our method, we develop a prototype system to collect real users activities, and evaluate our methods for identifying activities. The results verify the effectiveness and efficiency of our methods.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134390971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Personalized Book Recommender System Based on Chinese Library Classification 基于中图分类法的个性化图书推荐系统
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.42
H. Zhang, Yingyuan Xiao, Zhongjing Bu
with the continuous construction and development of university library, how to find interesting books from the massive books is becoming a concerned problem. In this paper, we develop a personalized book recommender system based on Chinese Library Classification Method named CLCM. CLCM uses Upper and Lower Level Relations Model (ULLRM) to describe the characteristic words and fuses the Dominant and Recessive Feedback Model (DRFM) to update the users' preferences. And visualization of book inquiry improves the efficiency of inquiring. The experimental results show that CLCM performs much better than the state-of-the art approaches in the university library.
随着高校图书馆的不断建设和发展,如何从海量的图书中发掘出有趣的图书已成为人们关注的问题。本文开发了一个基于中图分类法的个性化图书推荐系统CLCM。CLCM使用上下关系模型(ULLRM)来描述特色词,并融合显性和隐性反馈模型(DRFM)来更新用户的偏好。图书查询的可视化提高了查询的效率。实验结果表明,在高校图书馆中,CLCM的性能明显优于目前最先进的方法。
{"title":"Personalized Book Recommender System Based on Chinese Library Classification","authors":"H. Zhang, Yingyuan Xiao, Zhongjing Bu","doi":"10.1109/WISA.2017.42","DOIUrl":"https://doi.org/10.1109/WISA.2017.42","url":null,"abstract":"with the continuous construction and development of university library, how to find interesting books from the massive books is becoming a concerned problem. In this paper, we develop a personalized book recommender system based on Chinese Library Classification Method named CLCM. CLCM uses Upper and Lower Level Relations Model (ULLRM) to describe the characteristic words and fuses the Dominant and Recessive Feedback Model (DRFM) to update the users' preferences. And visualization of book inquiry improves the efficiency of inquiring. The experimental results show that CLCM performs much better than the state-of-the art approaches in the university library.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"2012 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132137906","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Collaborative Filtering Algorithm of Calculating Similarity Based on Item Rating and Attributes 一种基于物品等级和属性的相似度计算协同过滤算法
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.35
Zelong Li, Mengxing Huang, Yu Zhang
Nowadays, the collaborative filtering techniques have demonstrated an excellent performance in the top-N recommendation. However conventional methods in similarity measurement are insufficient when the condition of data sparsity and cold start occur, which leads to a poor accuracy in prediction. In order to concur the limitation, a collaborative filtering algorithm of calculating similarity based on item rating and attributes is proposed. Firstly, we calculate the similarity of item attributes, then calculate the similarity of the project according to the user rating of the project. Meanwhile, a weighted control coefficient is proposed to combine the similarity between item attributes and rating of items, which contribute to obtain nearest neighbors. Experiments have shown that our algorithm has major potential in solving the problem of cold start, therefore improving the precision of the recommendation system.
目前,协同过滤技术在top-N推荐中表现出了优异的性能。然而,传统的相似性度量方法在数据稀疏性和冷启动条件下存在不足,导致预测精度较差。为了克服这一局限性,提出了一种基于物品等级和属性计算相似度的协同过滤算法。首先计算项目属性的相似度,然后根据用户对项目的评价计算项目的相似度。同时,提出了一种加权控制系数,将物品属性之间的相似度与物品等级相结合,有助于获得最近邻。实验表明,我们的算法在解决冷启动问题上具有很大的潜力,从而提高了推荐系统的精度。
{"title":"A Collaborative Filtering Algorithm of Calculating Similarity Based on Item Rating and Attributes","authors":"Zelong Li, Mengxing Huang, Yu Zhang","doi":"10.1109/WISA.2017.35","DOIUrl":"https://doi.org/10.1109/WISA.2017.35","url":null,"abstract":"Nowadays, the collaborative filtering techniques have demonstrated an excellent performance in the top-N recommendation. However conventional methods in similarity measurement are insufficient when the condition of data sparsity and cold start occur, which leads to a poor accuracy in prediction. In order to concur the limitation, a collaborative filtering algorithm of calculating similarity based on item rating and attributes is proposed. Firstly, we calculate the similarity of item attributes, then calculate the similarity of the project according to the user rating of the project. Meanwhile, a weighted control coefficient is proposed to combine the similarity between item attributes and rating of items, which contribute to obtain nearest neighbors. Experiments have shown that our algorithm has major potential in solving the problem of cold start, therefore improving the precision of the recommendation system.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"235 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132628653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Named Entity Recognition in Chinese Electronic Medical Records Based on CRF 基于CRF的中文电子病历命名实体识别
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.8
Kaixin Liu, Qingcheng Hu, Jianwei Liu, Chunxiao Xing
Massive Electronic Medical Records (EMRs) contain a lot of knowledge and Named Entity Recognition (NER) in Chinese EMR is a very important task. However, due to the lack of Chinese medical dictionary, there are few studies on NER in Chinese EMR. In this paper, we first build a medical dictionary. We then investigated the effects of different types of features in Chinese clinical NER tasks based on Condition Random Fields (CRF) algorithm, the most popular algorithm for NER, including bag-of-characters, part of speech, dictionary feature, and word clustering features. In the experimental section, we randomly selected 220 clinical texts from Peking Anzhen Hospital. The experimental results showed that these features were beneficial in varying degrees to Chinese named entity recognition. Finally, after analyzing the experimental results, we get some rules of thumb.
海量电子病历包含了大量的知识,中文电子病历的命名实体识别是一项非常重要的任务。然而,由于缺乏中文医学词典,对中文电子病历中NER的研究很少。本文首先构建了一个医学词典。基于条件随机场(CRF)算法,包括字符袋特征、词性特征、字典特征和聚类特征,研究了不同类型特征对汉语临床NER任务的影响。在实验部分,我们随机抽取北京安贞医院的220篇临床文献。实验结果表明,这些特征对中文命名实体识别都有不同程度的帮助。最后,通过对实验结果的分析,得出了一些经验法则。
{"title":"Named Entity Recognition in Chinese Electronic Medical Records Based on CRF","authors":"Kaixin Liu, Qingcheng Hu, Jianwei Liu, Chunxiao Xing","doi":"10.1109/WISA.2017.8","DOIUrl":"https://doi.org/10.1109/WISA.2017.8","url":null,"abstract":"Massive Electronic Medical Records (EMRs) contain a lot of knowledge and Named Entity Recognition (NER) in Chinese EMR is a very important task. However, due to the lack of Chinese medical dictionary, there are few studies on NER in Chinese EMR. In this paper, we first build a medical dictionary. We then investigated the effects of different types of features in Chinese clinical NER tasks based on Condition Random Fields (CRF) algorithm, the most popular algorithm for NER, including bag-of-characters, part of speech, dictionary feature, and word clustering features. In the experimental section, we randomly selected 220 clinical texts from Peking Anzhen Hospital. The experimental results showed that these features were beneficial in varying degrees to Chinese named entity recognition. Finally, after analyzing the experimental results, we get some rules of thumb.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131121891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
The Influence of the Attention Decay in an Information Spreading Model 信息传播模型中注意力衰减的影响
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.13
Zili Xiong, Zaobin Gan, Haifeng Xiang, Hongwei Lu
Some work in information spreading show that the attention plays an important role in explaining human behaviors and the attention decay exists in the process of information spreading. However, few researchers take the attention decay into consideration when studying the spreading dynamics of information. In this paper, we propose a susceptible-received-accepted-immune (SRAI) information spreading model to explore the attention decay's effect on the spread dynamics of information, integrating the memory, the social reinforcement and the attention decay. We simulate the model in different complex networks and verify the impacts of the attention decay on the information spreading process. Particularly, simulation results show that in some situations, the effect of the attention decay will decrease with the increasement of the network's randomness. Our work can provide insights to the understanding of the role of the attention decay in information spreading.
一些关于信息传播的研究表明,注意在解释人类行为中起着重要的作用,在信息传播过程中存在着注意衰减。然而,很少有研究者在研究信息的传播动态时考虑到注意力衰减。本文提出一个易感-接受-接受-免疫(SRAI)信息传播模型,整合记忆、社会强化和注意衰减,探讨注意衰减对信息传播动态的影响。我们在不同的复杂网络中对模型进行了仿真,验证了注意力衰减对信息传播过程的影响。仿真结果表明,在某些情况下,注意力衰减的影响会随着网络随机性的增加而减弱。我们的工作可以为理解注意力衰减在信息传播中的作用提供见解。
{"title":"The Influence of the Attention Decay in an Information Spreading Model","authors":"Zili Xiong, Zaobin Gan, Haifeng Xiang, Hongwei Lu","doi":"10.1109/WISA.2017.13","DOIUrl":"https://doi.org/10.1109/WISA.2017.13","url":null,"abstract":"Some work in information spreading show that the attention plays an important role in explaining human behaviors and the attention decay exists in the process of information spreading. However, few researchers take the attention decay into consideration when studying the spreading dynamics of information. In this paper, we propose a susceptible-received-accepted-immune (SRAI) information spreading model to explore the attention decay's effect on the spread dynamics of information, integrating the memory, the social reinforcement and the attention decay. We simulate the model in different complex networks and verify the impacts of the attention decay on the information spreading process. Particularly, simulation results show that in some situations, the effect of the attention decay will decrease with the increasement of the network's randomness. Our work can provide insights to the understanding of the role of the attention decay in information spreading.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116192250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Multi-step Ahead Time Series Forecasting for Different Data Patterns Based on LSTM Recurrent Neural Network 基于LSTM递归神经网络的不同数据模式多步提前时间序列预测
Pub Date : 2017-11-01 DOI: 10.1109/WISA.2017.25
L. Yunpeng, Hou Di, Bao Junpeng, Qi Yong
Time series prediction problems can play an important role in many areas, and multi-step ahead time series forecast, like river flow forecast, stock price forecast, could help people to make right decisions. Many predictive models do not work very well in multi-step ahead predictions. LSTM (Long Short-Term Memory) is an iterative structure in the hidden layer of the recurrent neural network which could capture the long-term dependency in time series. In this paper, we try to model different types of data patterns, use LSTM RNN for multi-step ahead prediction, and compare the prediction result with other traditional models.
时间序列预测问题可以在许多领域发挥重要作用,多步超前时间序列预测,如河流流量预测、股票价格预测,可以帮助人们做出正确的决策。许多预测模型在多步预测中并不能很好地工作。LSTM (Long - Short-Term Memory)是递归神经网络隐层中的一种迭代结构,能够捕捉时间序列中的长期依赖关系。在本文中,我们尝试对不同类型的数据模式进行建模,使用LSTM RNN进行多步预测,并将预测结果与其他传统模型进行比较。
{"title":"Multi-step Ahead Time Series Forecasting for Different Data Patterns Based on LSTM Recurrent Neural Network","authors":"L. Yunpeng, Hou Di, Bao Junpeng, Qi Yong","doi":"10.1109/WISA.2017.25","DOIUrl":"https://doi.org/10.1109/WISA.2017.25","url":null,"abstract":"Time series prediction problems can play an important role in many areas, and multi-step ahead time series forecast, like river flow forecast, stock price forecast, could help people to make right decisions. Many predictive models do not work very well in multi-step ahead predictions. LSTM (Long Short-Term Memory) is an iterative structure in the hidden layer of the recurrent neural network which could capture the long-term dependency in time series. In this paper, we try to model different types of data patterns, use LSTM RNN for multi-step ahead prediction, and compare the prediction result with other traditional models.","PeriodicalId":204706,"journal":{"name":"2017 14th Web Information Systems and Applications Conference (WISA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122370533","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
期刊
2017 14th Web Information Systems and Applications Conference (WISA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1