首页 > 最新文献

J. Digit. Inf. Manag.最新文献

英文 中文
A Graph-Based Approach for Aspect Extraction from Online Customer Reviews 基于图的在线客户评论方面提取方法
Pub Date : 2020-06-01 DOI: 10.6025/jdim/2020/18/3/99-108
Rakesh Kumar, Aditi Sharan
E-commerce websites have become main market players in the 21st century due to advancement in the internet technology. Apart from buying products online, customers are also providing reviews on the products purchased by them. These reviews help new customers to buy various products according to their needs, liking, and preferences. However, millions of reviews are added by the customer on a daily basis. To extract meaningful information manually from these huge amounts of reviews is a tough task. So, it is required to develop an automatic analytics tool for the review sentences. Aspect extraction is one of the vital tasks in the process of meaningful information extraction from the products having various entities. In this work, a novel product aspect extraction approach has been proposed which utilize a graphbased technique with the integration of statistical and semantic information. The analysis of experimental results shows that the proposed approach is efficient and effective in comparison to the state of art methods. Subject Categories and Descriptors [H.2.8 Database Applications]: Data mining; [K.4.4 Electronic Commerce] General Terms: E-Commerce, Customer Reviews, Opinion mining
由于互联网技术的进步,电子商务网站已经成为21世纪的主要市场参与者。除了在网上购买产品,顾客也会对他们购买的产品进行评论。这些评论帮助新客户根据他们的需求、喜好和偏好购买各种产品。然而,用户每天都会添加数百万条评论。从这些大量的评论中手动提取有意义的信息是一项艰巨的任务。因此,需要开发一种针对复习句子的自动分析工具。在从具有多种实体的产品中提取有意义信息的过程中,方面提取是关键任务之一。在这项工作中,提出了一种新的产品方面提取方法,该方法利用基于图的技术,将统计信息和语义信息相结合。实验结果分析表明,与现有方法相比,该方法是有效的。主题分类和描述符[H.2.8数据库应用]:数据挖掘;[K.4.4电子商务]一般术语:电子商务、客户评论、意见挖掘
{"title":"A Graph-Based Approach for Aspect Extraction from Online Customer Reviews","authors":"Rakesh Kumar, Aditi Sharan","doi":"10.6025/jdim/2020/18/3/99-108","DOIUrl":"https://doi.org/10.6025/jdim/2020/18/3/99-108","url":null,"abstract":"E-commerce websites have become main market players in the 21st century due to advancement in the internet technology. Apart from buying products online, customers are also providing reviews on the products purchased by them. These reviews help new customers to buy various products according to their needs, liking, and preferences. However, millions of reviews are added by the customer on a daily basis. To extract meaningful information manually from these huge amounts of reviews is a tough task. So, it is required to develop an automatic analytics tool for the review sentences. Aspect extraction is one of the vital tasks in the process of meaningful information extraction from the products having various entities. In this work, a novel product aspect extraction approach has been proposed which utilize a graphbased technique with the integration of statistical and semantic information. The analysis of experimental results shows that the proposed approach is efficient and effective in comparison to the state of art methods. Subject Categories and Descriptors [H.2.8 Database Applications]: Data mining; [K.4.4 Electronic Commerce] General Terms: E-Commerce, Customer Reviews, Opinion mining","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128547259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge-Intensive Decision Support System for Manufacturing Equipment Maintenance 面向制造设备维修的知识密集型决策支持系统
Pub Date : 2020-06-01 DOI: 10.6025/jdim/2020/18/3/85-98
Djamila Bouhalouan, Bakhta Nachet, A. Adla
To ensure continuous production in industrial plants, the high valued manufacturing eqipments should be kept in good working conditions. This brings plants to search for means to control and reduce equipment failures. When faults emerge in plants, appropriate actions for fault diagnosis and reparation must be executed promptly and effectively to prevent large costs due to breakdowns. To provide reliable and effective maintenance support, the aid of advanced decision support technology utilizing previous repair experience is of crucial importance for the expert operators as it provides them valuable troubleshooting clues for new faults. Artificial intelligence (AI) technology, particularly, knowledge-based approach is promising for this domain. It captures efficiency of problem solving expertise from the domain experts; guides the expert operators in rapid fault detection and maintenance. This paper focuses on the design and development of a Knowledge-Intensive Decision Support System (KI-DSS) for Manufacturing Equipment Maintenance in industrial plants to support better maintenance decision and improve maintenance efficiency. With integration of casebased Reasoning and ontology, the KiDSS not only carries out data matching retrieval, but also performs semantic associated data access which is important for intelligent knowledge retrieval in decision support system. A case is executed to illustrate the use of the proposed KI-DSS to show the feasibility of our ap proach and the benefit of the ontology support.
为了保证工业工厂的连续生产,高价值的制造设备应保持良好的工作状态。这促使工厂寻找控制和减少设备故障的方法。当工厂出现故障时,必须及时有效地采取适当的故障诊断和修复措施,以防止因故障造成的巨大损失。为了提供可靠和有效的维修支持,利用以往维修经验的先进决策支持技术对专家操作员来说至关重要,因为它为他们提供了宝贵的故障排除线索。人工智能(AI)技术,特别是基于知识的方法在这一领域很有前景。它捕获了领域专家解决问题的专业知识的效率;指导专业操作人员快速检测和维护故障。本文研究了工业工厂制造设备维修知识密集型决策支持系统(KI-DSS)的设计与开发,以支持更好的维修决策,提高维修效率。通过案例推理和本体的结合,KiDSS不仅可以实现数据匹配检索,还可以实现语义关联数据访问,这对决策支持系统中的智能知识检索具有重要意义。通过一个案例来说明所提出的KI-DSS的使用,以显示我们的方法的可行性和本体支持的好处。
{"title":"Knowledge-Intensive Decision Support System for Manufacturing Equipment Maintenance","authors":"Djamila Bouhalouan, Bakhta Nachet, A. Adla","doi":"10.6025/jdim/2020/18/3/85-98","DOIUrl":"https://doi.org/10.6025/jdim/2020/18/3/85-98","url":null,"abstract":"To ensure continuous production in industrial plants, the high valued manufacturing eqipments should be kept in good working conditions. This brings plants to search for means to control and reduce equipment failures. When faults emerge in plants, appropriate actions for fault diagnosis and reparation must be executed promptly and effectively to prevent large costs due to breakdowns. To provide reliable and effective maintenance support, the aid of advanced decision support technology utilizing previous repair experience is of crucial importance for the expert operators as it provides them valuable troubleshooting clues for new faults. Artificial intelligence (AI) technology, particularly, knowledge-based approach is promising for this domain. It captures efficiency of problem solving expertise from the domain experts; guides the expert operators in rapid fault detection and maintenance. This paper focuses on the design and development of a Knowledge-Intensive Decision Support System (KI-DSS) for Manufacturing Equipment Maintenance in industrial plants to support better maintenance decision and improve maintenance efficiency. With integration of casebased Reasoning and ontology, the KiDSS not only carries out data matching retrieval, but also performs semantic associated data access which is important for intelligent knowledge retrieval in decision support system. A case is executed to illustrate the use of the proposed KI-DSS to show the feasibility of our ap proach and the benefit of the ontology support.","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131394938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Study of Data Requirements for Data Mining Applications in Banking 银行业数据挖掘应用的数据需求研究
Pub Date : 2020-06-01 DOI: 10.6025/jdim/2020/18/3/109-117
M. Ranjbarfard, Shahideh Ahmadi
There are many studies that have applied data mining to banking. However, the lack of proper data mounts a serious obstacle to the employment of data mining techniques by banks. This paper examines previous data mining research in the field of banking to extract all served entities and attributes required for analytical purposes, categorize these attributes and ultimately present a data model for analysis. After analyzing a wide range of data mining applications in banking, 28 entities with 423 attributes were identified and the final proposed entity-relationship model was drawn. Also, a checklist was provided based on the model for auditing data gap in banks and applied to a real case. The results of this paper can be seen as a supportive tool for improving bank‘s business intelligence maturity from the data perspective and enabling managers for analyzing data requirement of information systems. Subject Categories and Descriptors [H.2.8 Database Applications]; Data mining: [D.3.3 Language Constructs and Features]; Data types and structures General Terms: Data Mining, Banking Data, Data Analysis
有许多研究将数据挖掘应用于银行业。然而,缺乏适当的数据给银行使用数据挖掘技术带来了严重的障碍。本文考察了以前在银行领域的数据挖掘研究,以提取分析目的所需的所有服务实体和属性,对这些属性进行分类,并最终提出用于分析的数据模型。在分析了银行业中广泛的数据挖掘应用后,确定了28个具有423个属性的实体,并绘制了最终提出的实体关系模型。在此基础上,提出了银行数据缺口审计模型清单,并将其应用于实际案例。本文的研究结果可以从数据的角度作为提高银行商业智能成熟度的辅助工具,使管理者能够分析信息系统的数据需求。主题分类和描述符[H.2.8数据库应用];数据挖掘:[D.3.3语言结构和特征];一般术语:数据挖掘,银行数据,数据分析
{"title":"A Study of Data Requirements for Data Mining Applications in Banking","authors":"M. Ranjbarfard, Shahideh Ahmadi","doi":"10.6025/jdim/2020/18/3/109-117","DOIUrl":"https://doi.org/10.6025/jdim/2020/18/3/109-117","url":null,"abstract":"There are many studies that have applied data mining to banking. However, the lack of proper data mounts a serious obstacle to the employment of data mining techniques by banks. This paper examines previous data mining research in the field of banking to extract all served entities and attributes required for analytical purposes, categorize these attributes and ultimately present a data model for analysis. After analyzing a wide range of data mining applications in banking, 28 entities with 423 attributes were identified and the final proposed entity-relationship model was drawn. Also, a checklist was provided based on the model for auditing data gap in banks and applied to a real case. The results of this paper can be seen as a supportive tool for improving bank‘s business intelligence maturity from the data perspective and enabling managers for analyzing data requirement of information systems. Subject Categories and Descriptors [H.2.8 Database Applications]; Data mining: [D.3.3 Language Constructs and Features]; Data types and structures General Terms: Data Mining, Banking Data, Data Analysis","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130652737","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Is the Binary Search Faster when Two Variables are Added in the Middle of the Data? 当在数据中间添加两个变量时,二分查找更快吗?
Pub Date : 2020-04-01 DOI: 10.6025/jdim/2020/18/2/57-64
Djasen Tjendry, Wirawan Istiono
{"title":"Is the Binary Search Faster when Two Variables are Added in the Middle of the Data?","authors":"Djasen Tjendry, Wirawan Istiono","doi":"10.6025/jdim/2020/18/2/57-64","DOIUrl":"https://doi.org/10.6025/jdim/2020/18/2/57-64","url":null,"abstract":"","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125669336","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Real Estate Loan Knowledge-Based Recommender System 基于知识的房地产贷款推荐系统
Pub Date : 2020-04-01 DOI: 10.6025/jdim/2020/18/2/65-77
A. Adla
In decision making, the decision-makers frequently employ and perform routine tasks. These processes normally are time-intensive, complex, and in most cases occur regularly. To address this challenge decision makers reuse the already successful decisions. During difficult times, such actions may lead to save time, energy and man-hours, and also result in effective decision making. Memory building depends on how we successfully store earlier knowledge. We through this work introduce a recommender system which is names as BLKBRS which utilized the earlier successful models. In this work we use a case of bank loan and experimented using a semi-structured multiple attribute recommendation environment, and equate the RL-KBRS with a conventional case based reasoning system. RL-KBRS will compensate for lack of experience of young bank consultants, which permits the spread of knowledge distribution to other banks. Subject Categories and Descriptors [H.3] Information Storage and Retrieval; [I.2] Artificial Intelligence General Terms: Memory-based Approach, Information Search, and retrieval, Recommending systems, Case-Based Reasoning
在决策过程中,决策者经常使用和执行常规任务。这些过程通常是耗时的,复杂的,并且在大多数情况下有规律地发生。为了应对这一挑战,决策者重用已经成功的决策。在困难时期,这样的行动可能会节省时间、精力和人力,也会产生有效的决策。记忆的建立取决于我们如何成功地存储早期的知识。通过这项工作,我们引入了一个名为BLKBRS的推荐系统,该系统利用了早期成功的模型。在这项工作中,我们以银行贷款为例,使用半结构化的多属性推荐环境进行实验,并将RL-KBRS与传统的基于案例的推理系统等同起来。RL-KBRS将弥补年轻银行顾问的经验不足,这使得知识传播到其他银行。主题分类和描述符[j]。[3]信息存储与检索;[我。[2]人工智能术语:基于记忆的方法,信息搜索与检索,推荐系统,基于案例的推理
{"title":"Real Estate Loan Knowledge-Based Recommender System","authors":"A. Adla","doi":"10.6025/jdim/2020/18/2/65-77","DOIUrl":"https://doi.org/10.6025/jdim/2020/18/2/65-77","url":null,"abstract":"In decision making, the decision-makers frequently employ and perform routine tasks. These processes normally are time-intensive, complex, and in most cases occur regularly. To address this challenge decision makers reuse the already successful decisions. During difficult times, such actions may lead to save time, energy and man-hours, and also result in effective decision making. Memory building depends on how we successfully store earlier knowledge. We through this work introduce a recommender system which is names as BLKBRS which utilized the earlier successful models. In this work we use a case of bank loan and experimented using a semi-structured multiple attribute recommendation environment, and equate the RL-KBRS with a conventional case based reasoning system. RL-KBRS will compensate for lack of experience of young bank consultants, which permits the spread of knowledge distribution to other banks. Subject Categories and Descriptors [H.3] Information Storage and Retrieval; [I.2] Artificial Intelligence General Terms: Memory-based Approach, Information Search, and retrieval, Recommending systems, Case-Based Reasoning","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117103718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of the Effects Stemmer Porter and Nazief-Adriani on the Performance of Winnowing Algorithms for Measuring Plagiarism Stemmer Porter和Nazief-Adriani对衡量抄袭的筛选算法性能的影响比较
Pub Date : 2020-04-01 DOI: 10.6025/jdim/2020/18/2/49-56
A. Rahmatulloh, Neng Ika Kurniati, I. Darmawan, Adi Zaenal Asyikin, Deden Witarsyah
Current technological developments change physical paper patterns into digital, which has a very high impact. Positive impact because paper waste is reduced, on the other hand, the rampant copying of digital data raises the amount of plagiarism that is increasing. At present, there are many efforts made by experts to overcome the problem of plagiarism, one of which is by utilizing the winnowing algorithm as a tool to detect plagiarism data. In its development, many optimizing winnowing algorithms used stemming techniques. The most widely used stemmer algorithms include stemmer porter and nazief-adriani. However, there has not been a discussion on the comparison of the effect of performance using stemmer on the winnowing algorithm in measuring the value of plagiarism. So it is necessary to do research on the effect of stemmer algorithms on winnowing algorithms so that the results of plagiarism detection are more optimal. The results of this study indicate that the effect of nazief-adriani stemmer on the winnowing algorithm is superior to the stemmer porter, only decreasing the detection performance of the 0.28% similarity value while the porter stemmer is superior in increasing the processing time to 69% faster. Subject Categories and Descriptors [I.1.2 Algorithms]; [H.3.3 Information Search and Retrieval] General Terms: Plagiarism Detection, Winnowing algorithms, Stemmers
当前的技术发展将物理纸张模式转变为数字模式,这具有非常高的影响。积极的影响一方面是因为纸张的浪费减少了,另一方面,数字数据的猖獗复制增加了剽窃的数量。目前,为了克服抄袭问题,专家们做了很多努力,其中之一就是利用筛选算法作为检测抄袭数据的工具。在其发展过程中,许多优化筛选算法都使用了词干提取技术。使用最广泛的stemmer算法包括stemmer porter和nazief-adriani。然而,在衡量抄袭价值时,使用stemmer的性能与筛选算法的效果比较,尚未有讨论。因此,有必要研究stemmer算法对筛选算法的影响,使抄袭检测结果更加优化。本研究结果表明,nazief-adriani茎秆对筛选算法的影响优于茎秆搬运工,仅降低了0.28%相似值的检测性能,而茎秆搬运工则将处理时间提高了69%。主题类别和描述符[I.1.2算法];[H.3.3信息检索]一般术语:抄袭检测、筛选算法、Stemmers
{"title":"Comparison of the Effects Stemmer Porter and Nazief-Adriani on the Performance of Winnowing Algorithms for Measuring Plagiarism","authors":"A. Rahmatulloh, Neng Ika Kurniati, I. Darmawan, Adi Zaenal Asyikin, Deden Witarsyah","doi":"10.6025/jdim/2020/18/2/49-56","DOIUrl":"https://doi.org/10.6025/jdim/2020/18/2/49-56","url":null,"abstract":"Current technological developments change physical paper patterns into digital, which has a very high impact. Positive impact because paper waste is reduced, on the other hand, the rampant copying of digital data raises the amount of plagiarism that is increasing. At present, there are many efforts made by experts to overcome the problem of plagiarism, one of which is by utilizing the winnowing algorithm as a tool to detect plagiarism data. In its development, many optimizing winnowing algorithms used stemming techniques. The most widely used stemmer algorithms include stemmer porter and nazief-adriani. However, there has not been a discussion on the comparison of the effect of performance using stemmer on the winnowing algorithm in measuring the value of plagiarism. So it is necessary to do research on the effect of stemmer algorithms on winnowing algorithms so that the results of plagiarism detection are more optimal. The results of this study indicate that the effect of nazief-adriani stemmer on the winnowing algorithm is superior to the stemmer porter, only decreasing the detection performance of the 0.28% similarity value while the porter stemmer is superior in increasing the processing time to 69% faster. Subject Categories and Descriptors [I.1.2 Algorithms]; [H.3.3 Information Search and Retrieval] General Terms: Plagiarism Detection, Winnowing algorithms, Stemmers","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121305333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fault Prediction in Fuzzy Discrete Event Systems: A Diagnoser Approach 模糊离散事件系统的故障预测:一种诊断方法
Pub Date : 2019-12-01 DOI: 10.6025/jdim/2019/17/6/337-345
Bilal Benmessahel, Farid Nouioua, Mohamed Touahria, A. Chariete
In this workm we study the fault prediction in fuzzy discrete event systems. Fuzzy discrete event systems are proposed to deal with vagueness, impreciseness, and subjectivity in real-world problems. The verification is divided into two steps. In the first step, we give a method to construct a Diagnoser. And in the second step, based on the structure of diagnoser we give the necessary and sufficient conditions to verify the future occurrence of the fault. The newly proposed approach allows us to deal with the problem of fault prediction for both crisp DESs and FDESs. Finally, an example is provided to illustrate the efficiency of the proposed approach. Subject Categories and Descriptors: [I.2.3 Deduction and Theorem Proving]; Uncertainty, “fuzzy,” and probabilistic reasoning: [B.1.3 Control Structure Reliability, Testing, and Fault-Tolerance] General Terms: Fault Prediction, Fuzzy Models, Discrete Events
本文主要研究模糊离散事件系统的故障预测问题。提出模糊离散事件系统是为了解决现实问题中的模糊性、不精确性和主观性问题。验证分为两个步骤。在第一步中,我们给出了一个构造诊断器的方法。第二步,根据诊断器的结构给出了验证故障未来发生的充分必要条件。新提出的方法使我们能够同时处理crisp DESs和FDESs的故障预测问题。最后,通过一个算例说明了所提方法的有效性。主题范畴与描述符:[I.2.3演绎与定理证明];不确定性、“模糊”和概率推理:[B.1.3控制结构可靠性、测试和容错]一般术语:故障预测、模糊模型、离散事件
{"title":"Fault Prediction in Fuzzy Discrete Event Systems: A Diagnoser Approach","authors":"Bilal Benmessahel, Farid Nouioua, Mohamed Touahria, A. Chariete","doi":"10.6025/jdim/2019/17/6/337-345","DOIUrl":"https://doi.org/10.6025/jdim/2019/17/6/337-345","url":null,"abstract":"In this workm we study the fault prediction in fuzzy discrete event systems. Fuzzy discrete event systems are proposed to deal with vagueness, impreciseness, and subjectivity in real-world problems. The verification is divided into two steps. In the first step, we give a method to construct a Diagnoser. And in the second step, based on the structure of diagnoser we give the necessary and sufficient conditions to verify the future occurrence of the fault. The newly proposed approach allows us to deal with the problem of fault prediction for both crisp DESs and FDESs. Finally, an example is provided to illustrate the efficiency of the proposed approach. Subject Categories and Descriptors: [I.2.3 Deduction and Theorem Proving]; Uncertainty, “fuzzy,” and probabilistic reasoning: [B.1.3 Control Structure Reliability, Testing, and Fault-Tolerance] General Terms: Fault Prediction, Fuzzy Models, Discrete Events","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116463188","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Query Expansion in Text Information Retrieval with Local Context and Distributional Model 基于局部上下文和分布模型的文本信息检索中的查询扩展
Pub Date : 2019-12-01 DOI: 10.6025/jdim/2019/17/6/313-320
Fabiano Tavares da Silva, J. Maia
The Semantic Distributional Model is based on the frequency of contexts of use of language terms in large open corpus such as the web, to establish similarity or the relationship between words. These relationships or similarities can be used to add terms when expanding queries. The idea explored in this paper is that, for queries in closed collections of text documents, a posterior filter based on the restricted vocabulary of the collection can improve the effectiveness of automatic query expansion. This idea is developed and evaluated in publicly available benchmarks presenting promising results. Subject Categories and Descriptors: [H.3.3 Information Search and Retrieval]; Query formulation: [I.2.7 Natural Language Processing] Text analysis [F.4.2 Grammars and Other Rewriting Systems]; Grammar types General Terms: Distributional Semantic Model, Information Retrieval, Local Context Analysis.
语义分布模型是基于大型开放语料库(如网络)中语言术语使用上下文的频率,来建立词之间的相似度或关系。这些关系或相似性可用于在展开查询时添加术语。本文探索的思想是,对于封闭文本文档集合中的查询,基于集合的受限词汇的后验过滤器可以提高自动查询扩展的有效性。这个想法是在公开可用的基准中开发和评估的,呈现出有希望的结果。主题类别和描述符:[H.3.3信息检索];查询公式:[I.2.7自然语言处理]文本分析[F.4.2语法和其他重写系统];语法类型一般术语:分布语义模型,信息检索,局部上下文分析。
{"title":"Query Expansion in Text Information Retrieval with Local Context and Distributional Model","authors":"Fabiano Tavares da Silva, J. Maia","doi":"10.6025/jdim/2019/17/6/313-320","DOIUrl":"https://doi.org/10.6025/jdim/2019/17/6/313-320","url":null,"abstract":"The Semantic Distributional Model is based on the frequency of contexts of use of language terms in large open corpus such as the web, to establish similarity or the relationship between words. These relationships or similarities can be used to add terms when expanding queries. The idea explored in this paper is that, for queries in closed collections of text documents, a posterior filter based on the restricted vocabulary of the collection can improve the effectiveness of automatic query expansion. This idea is developed and evaluated in publicly available benchmarks presenting promising results. Subject Categories and Descriptors: [H.3.3 Information Search and Retrieval]; Query formulation: [I.2.7 Natural Language Processing] Text analysis [F.4.2 Grammars and Other Rewriting Systems]; Grammar types General Terms: Distributional Semantic Model, Information Retrieval, Local Context Analysis.","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129530611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
An Agent-Based Approach for Extracting Business Association Rules from Centralized Databases Systems 基于agent的集中式数据库系统业务关联规则提取方法
Pub Date : 2019-10-01 DOI: 10.6025/jdim/2019/17/5/270-288
Nadjib Mesbahi, Merouane Zoubeidi, Abdelhak Merizig, O. Kazar
{"title":"An Agent-Based Approach for Extracting Business Association Rules from Centralized Databases Systems","authors":"Nadjib Mesbahi, Merouane Zoubeidi, Abdelhak Merizig, O. Kazar","doi":"10.6025/jdim/2019/17/5/270-288","DOIUrl":"https://doi.org/10.6025/jdim/2019/17/5/270-288","url":null,"abstract":"","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121939714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Subjective Sentiment Analysis for Arabic Newswire Comments 阿拉伯通讯社评论的主观情绪分析
Pub Date : 2019-10-01 DOI: 10.6025/jdim/2019/17/5/289-295
Sadik Bessou, Rania Aberkane
This paper presents an approach based on supervised machine learning methods to discriminate between positive, negative and neutral Arabic reviews in online newswire. The corpus is labeled for subjectivity and sentiment analysis (SSA) at the sentence-level. The model uses both count and TF-IDF representations and apply six machine learning algorithms; Multinomial Naive Bayes, Support Vector Machines (SVM), Random Forest, Logistic Regression, Multi-layer perceptron and k-nearest neighbors using uni-grams, bi-grams features. With the goal of extracting users sentiment from written text. Experimental results showed that n-gram features could substantially improve performance; and showed that the Multinomial Naive Bayes approach is the most accurate in predicting topic polarity. Best results were achieved using count vectors trained by combination of word-based uni-grams and bi-grams with an overall accuracy of 85.57% over two classes and 65.64% over three classes.
本文提出了一种基于监督机器学习的方法来区分在线新闻中积极、消极和中立的阿拉伯语评论。语料库被标记为句子级的主观性和情感分析(SSA)。该模型同时使用计数和TF-IDF表示,并应用六种机器学习算法;多项朴素贝叶斯,支持向量机(SVM),随机森林,逻辑回归,多层感知器和k近邻使用单位图,双图特征。目的是从书面文本中提取用户情感。实验结果表明,n-gram特征可以显著提高性能;结果表明,多项朴素贝叶斯方法在预测主题极性方面是最准确的。使用基于单词的单格和双格组合训练的计数向量获得了最好的结果,两类的总体准确率为85.57%,三类的总体准确率为65.64%。
{"title":"Subjective Sentiment Analysis for Arabic Newswire Comments","authors":"Sadik Bessou, Rania Aberkane","doi":"10.6025/jdim/2019/17/5/289-295","DOIUrl":"https://doi.org/10.6025/jdim/2019/17/5/289-295","url":null,"abstract":"This paper presents an approach based on supervised machine learning methods to discriminate between positive, negative and neutral Arabic reviews in online newswire. The corpus is labeled for subjectivity and sentiment analysis (SSA) at the sentence-level. The model uses both count and TF-IDF representations and apply six machine learning algorithms; Multinomial Naive Bayes, Support Vector Machines (SVM), Random Forest, Logistic Regression, Multi-layer perceptron and k-nearest neighbors using uni-grams, bi-grams features. With the goal of extracting users sentiment from written text. Experimental results showed that n-gram features could substantially improve performance; and showed that the Multinomial Naive Bayes approach is the most accurate in predicting topic polarity. Best results were achieved using count vectors trained by combination of word-based uni-grams and bi-grams with an overall accuracy of 85.57% over two classes and 65.64% over three classes.","PeriodicalId":303976,"journal":{"name":"J. Digit. Inf. Manag.","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131867349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
J. Digit. Inf. Manag.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1