首页 > 最新文献

2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)最新文献

英文 中文
Improving Collaborative Filtering’s Rating Prediction Coverage in Sparse Datasets through the Introduction of Virtual Near Neighbors 通过引入虚拟近邻提高稀疏数据集协同过滤的评级预测覆盖率
Dionisis Margaris, Dionysios Vasilopoulos, C. Vassilakis, D. Spiliotopoulos
Collaborative filtering creates personalized recommendations by considering ratings entered by users. Collaborative filtering algorithms initially detect users whose likings are alike, by exploring the similarity between ratings that have insofar been submitted. Users having a high degree of similarity regarding their ratings are termed near neighbors, and in order to formulate a recommendation for a user, her near neighbors’ ratings are extracted and form the basis for the recommendation. Collaborative filtering algorithms however exhibit the problem commonly referred to as “gray sheep this pertains to the case where for some users no near neighbors can be identified, and hence no personalized recommendations can be computed. The “gray sheep” problem is more severe in sparse datasets, i.e. datasets where the number of ratings is small, compared to the number of items and users. In this paper, we address the “gray sheep” problem by introducing the concept of virtual near neighbors and a related algorithm for their creation on the basis of the existing ones. We evaluate the proposed algorithm, which is termed as CFVNN, using eight widely used datasets and considering two correlation metrics which are widely used in Collaborative Filtering research, namely the Pearson Correlation Coefficient and the Cosine Similarity. The results show that the proposed algorithm considerably leverages the capability of a Collaborative Filtering system to compute personalized recommendations in the context of sparse datasets, tackling thus efficiently the “gray sheep” problem. In parallel, the CFVNN algorithm achieves improvements in rating prediction quality, as this is expressed through the Mean Absolute Error and the Root Mean Square Error metrics.
协同过滤通过考虑用户输入的评分创建个性化推荐。协同过滤算法最初通过探索迄今为止提交的评分之间的相似性来检测喜欢相似的用户。在评分方面具有高度相似性的用户被称为近邻用户,为了制定对用户的推荐,提取其近邻的评分并形成推荐的基础。然而,协同过滤算法显示了通常被称为“灰羊”的问题,这涉及到某些用户无法识别近邻,因此无法计算个性化推荐的情况。“灰羊”问题在稀疏数据集中更为严重,即与项目和用户数量相比,评级数量较少的数据集。在本文中,我们通过引入虚拟近邻的概念以及在现有近邻的基础上创建虚拟近邻的相关算法来解决“灰羊”问题。我们使用8个广泛使用的数据集,并考虑在协同过滤研究中广泛使用的两个相关度量,即Pearson相关系数和余弦相似度,来评估所提出的CFVNN算法。结果表明,该算法充分利用了协同过滤系统在稀疏数据集背景下计算个性化推荐的能力,从而有效地解决了“灰羊”问题。与此同时,CFVNN算法在评级预测质量上取得了进步,因为这是通过Mean Absolute Error和Root Mean Square Error度量来表达的。
{"title":"Improving Collaborative Filtering’s Rating Prediction Coverage in Sparse Datasets through the Introduction of Virtual Near Neighbors","authors":"Dionisis Margaris, Dionysios Vasilopoulos, C. Vassilakis, D. Spiliotopoulos","doi":"10.1109/IISA.2019.8900678","DOIUrl":"https://doi.org/10.1109/IISA.2019.8900678","url":null,"abstract":"Collaborative filtering creates personalized recommendations by considering ratings entered by users. Collaborative filtering algorithms initially detect users whose likings are alike, by exploring the similarity between ratings that have insofar been submitted. Users having a high degree of similarity regarding their ratings are termed near neighbors, and in order to formulate a recommendation for a user, her near neighbors’ ratings are extracted and form the basis for the recommendation. Collaborative filtering algorithms however exhibit the problem commonly referred to as “gray sheep this pertains to the case where for some users no near neighbors can be identified, and hence no personalized recommendations can be computed. The “gray sheep” problem is more severe in sparse datasets, i.e. datasets where the number of ratings is small, compared to the number of items and users. In this paper, we address the “gray sheep” problem by introducing the concept of virtual near neighbors and a related algorithm for their creation on the basis of the existing ones. We evaluate the proposed algorithm, which is termed as CFVNN, using eight widely used datasets and considering two correlation metrics which are widely used in Collaborative Filtering research, namely the Pearson Correlation Coefficient and the Cosine Similarity. The results show that the proposed algorithm considerably leverages the capability of a Collaborative Filtering system to compute personalized recommendations in the context of sparse datasets, tackling thus efficiently the “gray sheep” problem. In parallel, the CFVNN algorithm achieves improvements in rating prediction quality, as this is expressed through the Mean Absolute Error and the Root Mean Square Error metrics.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129960067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Enhancing Automatic Reasoning of human errors in an operating system using fuzzy logic 利用模糊逻辑增强操作系统中人为错误的自动推理
K. Chrysafiadi, M. Virvou
In this paper a novel fuzzy mechanism for automatic reasoning of human errors in operating systems is presented. The presented mechanism combines Human Plausible Reasoning (HPR) theory with fuzzy logic. HPR is used for inferring the commands the user of an operating system should have type and fuzzy logic is used to handle the uncertainty that characterizes the complex reasoning process by modelling the errors’ types in a more realistic way. Particularly, the output of HPR theory guesses all the possible command that the user may wants to type. These guesses can include many types of errors varying from typographic to wrong use of a legal command. In the presented mechanism, these guesses are input in a fuzzy reasoner, which takes into account the needs, characteristics and misconceptions of each individual user and decides about the most appropriate explanation of user’s error and gives personalized advice that fits better in each context and situation. The mechanism has been applied on the sub-domain of file manipulation of UNIX. The potential of the presented mechanism to reason about operating system’s users’ slips and misconceptions are discussed.
本文提出了一种用于操作系统人为错误自动推理的模糊机制。该机制将人类似是而非的推理理论与模糊逻辑相结合。HPR用于推断操作系统用户应该具有的命令类型,模糊逻辑用于处理复杂推理过程的不确定性,通过以更现实的方式建模错误类型。特别是,HPR理论的输出猜测了用户可能想要输入的所有可能的命令。这些猜测可能包括多种类型的错误,从排版到错误地使用合法命令。在本文提出的机制中,这些猜测被输入到一个模糊推理器中,该推理器考虑每个用户的需求、特征和误解,决定对用户错误的最合适的解释,并给出更适合每种上下文和情况的个性化建议。该机制已应用于UNIX的文件操作子域。讨论了所提出的机制对操作系统用户的错误和误解进行推理的潜力。
{"title":"Enhancing Automatic Reasoning of human errors in an operating system using fuzzy logic","authors":"K. Chrysafiadi, M. Virvou","doi":"10.1109/IISA.2019.8900775","DOIUrl":"https://doi.org/10.1109/IISA.2019.8900775","url":null,"abstract":"In this paper a novel fuzzy mechanism for automatic reasoning of human errors in operating systems is presented. The presented mechanism combines Human Plausible Reasoning (HPR) theory with fuzzy logic. HPR is used for inferring the commands the user of an operating system should have type and fuzzy logic is used to handle the uncertainty that characterizes the complex reasoning process by modelling the errors’ types in a more realistic way. Particularly, the output of HPR theory guesses all the possible command that the user may wants to type. These guesses can include many types of errors varying from typographic to wrong use of a legal command. In the presented mechanism, these guesses are input in a fuzzy reasoner, which takes into account the needs, characteristics and misconceptions of each individual user and decides about the most appropriate explanation of user’s error and gives personalized advice that fits better in each context and situation. The mechanism has been applied on the sub-domain of file manipulation of UNIX. The potential of the presented mechanism to reason about operating system’s users’ slips and misconceptions are discussed.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"22 8","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132090948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
NLP-based error analysis and dynamic motivation techniques in mobile learning 移动学习中基于nlp的误差分析与动态激励技术
C. Troussas, Akrivi Krouska, M. Virvou
Mobile learning uncovers new dimensions of learning and personal growth. Mobile phones have completely dominated our lives from communication and entertainment to socializing and learning. In view of providing more individualized learning through mobile phones, several intelligent techniques should be incorporated in mobile-assisted learning systems. As such, this paper presents an effective analysis of students’ errors during the assessment process in mobile learning using Natural Language Processing (NLP) techniques. The error analysis can reason between grammatical, syntax and careless errors using the Levenshtein distance. Moreover, it describes dynamic methods for motivating students in order to improve their learning experience. As such, students can receive motivation in case of making errors, cognitive inconsistencies, etc. Dynamic motivation is enriched with the delivery of badges as a means to further enhance knowledge acquisition. As a testbed for our research, a mobile language learning application for tutoring the English language has been designed, fully developed and evaluated. Concluding, this paper presents real examples of operation of the presented system and the evaluation results show the acceptance of the NLP-based error analysis and the dynamic motivation techniques by students.
移动学习揭示了学习和个人成长的新维度。手机已经完全支配了我们的生活,从沟通和娱乐到社交和学习。考虑到通过移动电话提供更加个性化的学习,在移动辅助学习系统中应纳入几种智能技术。因此,本文采用自然语言处理(NLP)技术对移动学习中学生在评估过程中的错误进行了有效的分析。错误分析可以利用Levenshtein距离对语法错误、句法错误和粗心错误进行推理。此外,它还描述了激励学生的动态方法,以改善他们的学习体验。这样,学生在犯错、认知不一致等情况下可以获得动力。作为进一步加强知识获取的手段,徽章的发放丰富了动态动机。作为我们研究的试验台,我们设计、开发并评估了一款用于英语辅导的移动语言学习应用程序。最后,本文给出了系统运行的实例,评价结果表明学生对基于nlp的误差分析和动态激励技术的接受程度。
{"title":"NLP-based error analysis and dynamic motivation techniques in mobile learning","authors":"C. Troussas, Akrivi Krouska, M. Virvou","doi":"10.1109/IISA.2019.8900729","DOIUrl":"https://doi.org/10.1109/IISA.2019.8900729","url":null,"abstract":"Mobile learning uncovers new dimensions of learning and personal growth. Mobile phones have completely dominated our lives from communication and entertainment to socializing and learning. In view of providing more individualized learning through mobile phones, several intelligent techniques should be incorporated in mobile-assisted learning systems. As such, this paper presents an effective analysis of students’ errors during the assessment process in mobile learning using Natural Language Processing (NLP) techniques. The error analysis can reason between grammatical, syntax and careless errors using the Levenshtein distance. Moreover, it describes dynamic methods for motivating students in order to improve their learning experience. As such, students can receive motivation in case of making errors, cognitive inconsistencies, etc. Dynamic motivation is enriched with the delivery of badges as a means to further enhance knowledge acquisition. As a testbed for our research, a mobile language learning application for tutoring the English language has been designed, fully developed and evaluated. Concluding, this paper presents real examples of operation of the presented system and the evaluation results show the acceptance of the NLP-based error analysis and the dynamic motivation techniques by students.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122177517","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clinical profile prediction by multiple instance learning from multi-sensorial data 基于多感官数据的多实例学习的临床特征预测
Argyro Tsirtsi, E. Zacharaki, Spyridon Kalogiannis, V. Megalooikonomou
The last years there is a great interest in developing unobtrusive health monitoring systems with a predictive component, aiming to recognize signs of illness in an attempt to assist clinicians in delivering early interventions. The objective of this work is to investigate whether the physiological and kinetic functioning and human activity of daily living monitored by multiple sensors can be used as surrogate of the standard clinical assessment. We focus on the older population and propose to utilize Multiple Instance Learning (MIL) to predict their clinical profile from the multi-sensorial data. ReliefF-MI is applied to achieve dimensionality reduction and to discover the most important features that are associated with each clinical metric, while the BagSMOTE algorithm is utilized to mitigate the class imbalance problem. The proposed methodology was evaluated on a multi-parametric dataset of 86 older adults containing clinical parameters from various domains (cognitive, physical, medical, psychological, social and showed high prognostic capacity for the person’s functionality (Katz index) and social interaction (phone calls).
近年来,人们对开发具有预测成分的不显眼的健康监测系统非常感兴趣,旨在识别疾病迹象,以协助临床医生提供早期干预措施。本研究的目的是探讨由多个传感器监测的生理和动力学功能以及人类日常生活活动是否可以作为标准临床评估的替代品。我们将重点放在老年人群上,并建议利用多实例学习(MIL)从多感官数据中预测他们的临床特征。relief - mi用于实现降维并发现与每个临床指标相关的最重要特征,而BagSMOTE算法用于缓解类别不平衡问题。所提出的方法在86名老年人的多参数数据集上进行了评估,该数据集包含来自各个领域(认知、身体、医学、心理、社会)的临床参数,并显示出对人的功能(Katz指数)和社会互动(电话)的高预后能力。
{"title":"Clinical profile prediction by multiple instance learning from multi-sensorial data","authors":"Argyro Tsirtsi, E. Zacharaki, Spyridon Kalogiannis, V. Megalooikonomou","doi":"10.1109/IISA.2019.8900761","DOIUrl":"https://doi.org/10.1109/IISA.2019.8900761","url":null,"abstract":"The last years there is a great interest in developing unobtrusive health monitoring systems with a predictive component, aiming to recognize signs of illness in an attempt to assist clinicians in delivering early interventions. The objective of this work is to investigate whether the physiological and kinetic functioning and human activity of daily living monitored by multiple sensors can be used as surrogate of the standard clinical assessment. We focus on the older population and propose to utilize Multiple Instance Learning (MIL) to predict their clinical profile from the multi-sensorial data. ReliefF-MI is applied to achieve dimensionality reduction and to discover the most important features that are associated with each clinical metric, while the BagSMOTE algorithm is utilized to mitigate the class imbalance problem. The proposed methodology was evaluated on a multi-parametric dataset of 86 older adults containing clinical parameters from various domains (cognitive, physical, medical, psychological, social and showed high prognostic capacity for the person’s functionality (Katz index) and social interaction (phone calls).","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126103709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Hyperparameter Optimization of LSTM Network Models through Genetic Algorithm 基于遗传算法的LSTM网络模型超参数优化
N. Gorgolis, I. Hatzilygeroudis, Z. Istenes, Lazlo-Grad Gyenne
Next word prediction is an important problem in the domain of NLP, hence in modern artificial intelligence. It draws both scientific and industrial interest, as it consists the core of many processes, like autocorrection, text generation, review prediction etc. Currently, the most efficient and common approach used is classification, using artificial neural networks (ANNs). One of the main drawbacks of ANNs is fine – tuning their hyperparameters, a procedure which is essential to the performance of the model. On the other hand, the approaches usually used for fine – tuning are either computationally unaffordable (e.g. grid search) or of uncertain efficiency (e.g. trial & error). As a response to the above, through the current paper is presented a simple genetic algorithm approach, which is used for the hyperparameter tuning of a common language model and it achieves tuning efficiency without following an exhaustive search.
下一个词预测是自然语言处理领域的一个重要问题,因此也是现代人工智能中的一个重要问题。它吸引了科学界和工业界的兴趣,因为它包括许多过程的核心,如自动纠错、文本生成、评论预测等。目前,最有效和最常用的方法是使用人工神经网络(ann)进行分类。人工神经网络的主要缺点之一是对其超参数进行微调,这一过程对模型的性能至关重要。另一方面,通常用于微调的方法要么在计算上负担不起(例如网格搜索),要么效率不确定(例如试错)。作为对上述问题的回应,本文提出了一种简单的遗传算法方法,用于公共语言模型的超参数调优,该方法无需穷举搜索即可达到调优效率。
{"title":"Hyperparameter Optimization of LSTM Network Models through Genetic Algorithm","authors":"N. Gorgolis, I. Hatzilygeroudis, Z. Istenes, Lazlo-Grad Gyenne","doi":"10.1109/IISA.2019.8900675","DOIUrl":"https://doi.org/10.1109/IISA.2019.8900675","url":null,"abstract":"Next word prediction is an important problem in the domain of NLP, hence in modern artificial intelligence. It draws both scientific and industrial interest, as it consists the core of many processes, like autocorrection, text generation, review prediction etc. Currently, the most efficient and common approach used is classification, using artificial neural networks (ANNs). One of the main drawbacks of ANNs is fine – tuning their hyperparameters, a procedure which is essential to the performance of the model. On the other hand, the approaches usually used for fine – tuning are either computationally unaffordable (e.g. grid search) or of uncertain efficiency (e.g. trial & error). As a response to the above, through the current paper is presented a simple genetic algorithm approach, which is used for the hyperparameter tuning of a common language model and it achieves tuning efficiency without following an exhaustive search.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125084531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Combining Active Learning with Self-train algorithm for classification of multimodal problems 结合主动学习与自训练算法的多模态问题分类
Stamatis Karlos, V. G. Kanas, Christos K. Aridas, Nikos Fazakis, S. Kotsiantis
In real-world cases, handling of both labeled and unlabeled data has raised the interest of several data scientists and Machine Learning engineers, leading to several demonstrations that apply data augmenting approaches to achieve an effective learning behavior. Although the majority of them propose either the exploitation of Semi-supervised or Active Learning approaches, individually, their combination has not been widely used. The ambition of this strategy is the efficient utilization of the available human knowledge relying along with the decisions driven by automated methods under a common framework. Thus, we conduct an empirical evaluation of such a combinatory approach over three problems, related to multimodal data operating under the pool-based scenario: Gender Identification, Recognition of Offensive Language and Emotion Detection. Into the proposed learning framework, which exploits initially labeled instances with small cardinality, our results prove the benefits of adopting such kind of semi-automated approaches regarding both the achieved predictive correctness and the reduced consumption of time and cost resources, as well as the smoothness of the learning convergence, mainly using ensemble classifiers.
在现实世界中,处理标记和未标记数据引起了一些数据科学家和机器学习工程师的兴趣,导致了一些应用数据增强方法来实现有效学习行为的演示。尽管他们中的大多数人单独提出了半监督或主动学习方法的利用,但他们的组合并没有被广泛使用。该策略的目标是在一个共同的框架下,依靠自动化方法驱动的决策,有效地利用可用的人类知识。因此,我们对这种组合方法进行了实证评估,涉及到在基于池的场景下运行的多模态数据的三个问题:性别识别、攻击性语言识别和情绪检测。在我们提出的学习框架中,我们的结果证明了采用这种半自动方法在预测正确性和减少时间和资源消耗方面的好处,以及学习收敛的平滑性,主要使用集成分类器。
{"title":"Combining Active Learning with Self-train algorithm for classification of multimodal problems","authors":"Stamatis Karlos, V. G. Kanas, Christos K. Aridas, Nikos Fazakis, S. Kotsiantis","doi":"10.1109/IISA.2019.8900724","DOIUrl":"https://doi.org/10.1109/IISA.2019.8900724","url":null,"abstract":"In real-world cases, handling of both labeled and unlabeled data has raised the interest of several data scientists and Machine Learning engineers, leading to several demonstrations that apply data augmenting approaches to achieve an effective learning behavior. Although the majority of them propose either the exploitation of Semi-supervised or Active Learning approaches, individually, their combination has not been widely used. The ambition of this strategy is the efficient utilization of the available human knowledge relying along with the decisions driven by automated methods under a common framework. Thus, we conduct an empirical evaluation of such a combinatory approach over three problems, related to multimodal data operating under the pool-based scenario: Gender Identification, Recognition of Offensive Language and Emotion Detection. Into the proposed learning framework, which exploits initially labeled instances with small cardinality, our results prove the benefits of adopting such kind of semi-automated approaches regarding both the achieved predictive correctness and the reduced consumption of time and cost resources, as well as the smoothness of the learning convergence, mainly using ensemble classifiers.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131299518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Examining the Impact of Discretization Technique on Sentiment Analysis for the Greek Language 考察离散化技术对希腊语情感分析的影响
Nikolaos Spatiotis, I. Perikos, I. Mporas, M. Paraskevas
Nowadays, information, communication and interaction between people worldwide have been facilitated by the rapid development of technology and they are mainly achieved through the internet. Internet users are now new creators of information data and express their ideas, their opinions, their feelings and their attitudes about products and services rather than passive information recipients. Given the evolution of modern technological advances, such as the proliferation of mobile devices social networks and services is extending. User-generated content in social media constitutes a very meaningful information source and consists of opinions towards various events and services. In this paper, we present a methodology that aims to analyze Greek text and extract indicative info towards users’ opinions and attitudes. Specifically, we describe a supervised approach adopted that analyzes and classifies comments and reviews into the appropriate polarity category. Discretization techniques are also applied to improve the performance and the accuracy of classification procedures. Finally, we present an experimental evaluation that was designed and conducted and which revealed quite interesting findings.
如今,科技的飞速发展促进了世界各地人们之间的信息、交流和互动,这些主要是通过互联网实现的。互联网用户现在是信息数据的新创造者,他们表达自己的想法、观点、感受和对产品和服务的态度,而不是被动的信息接受者。鉴于现代技术的进步,如移动设备的扩散,社交网络和服务正在扩展。社交媒体中的用户生成内容是一个非常有意义的信息源,它包含了对各种事件和服务的意见。在本文中,我们提出了一种方法,旨在分析希腊文本并提取用户意见和态度的指示性信息。具体来说,我们描述了一种被采用的监督方法,该方法分析并将评论和评论分类到适当的极性类别中。离散化技术也被用于提高分类程序的性能和准确性。最后,我们提出了一个设计和实施的实验评估,并揭示了相当有趣的发现。
{"title":"Examining the Impact of Discretization Technique on Sentiment Analysis for the Greek Language","authors":"Nikolaos Spatiotis, I. Perikos, I. Mporas, M. Paraskevas","doi":"10.1109/IISA.2019.8900699","DOIUrl":"https://doi.org/10.1109/IISA.2019.8900699","url":null,"abstract":"Nowadays, information, communication and interaction between people worldwide have been facilitated by the rapid development of technology and they are mainly achieved through the internet. Internet users are now new creators of information data and express their ideas, their opinions, their feelings and their attitudes about products and services rather than passive information recipients. Given the evolution of modern technological advances, such as the proliferation of mobile devices social networks and services is extending. User-generated content in social media constitutes a very meaningful information source and consists of opinions towards various events and services. In this paper, we present a methodology that aims to analyze Greek text and extract indicative info towards users’ opinions and attitudes. Specifically, we describe a supervised approach adopted that analyzes and classifies comments and reviews into the appropriate polarity category. Discretization techniques are also applied to improve the performance and the accuracy of classification procedures. Finally, we present an experimental evaluation that was designed and conducted and which revealed quite interesting findings.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128457406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Tangible Programming Language for the Educational Robot Thymio 面向教育机器人Thymio的有形编程语言
Andrea Mussati, Christian Giang, Alberto Piatti, F. Mondada
In the past, the use of tangible programming languages has shown several advantages compared to screenbased graphical programming languages. Especially when presented to novices, such interfaces may represent a more intuitive and straightforward alternative to teach basic computer science and programming concepts. Previous studies have reported increased interest and improved collaboration when tangible programming languages were used. However, additional financial expenses have often hindered the use of such interfaces in formal education settings. This work therefore presents a low-cost and customizable solution of a tangible programming language for Thymio, an educational robot widely used in primary and secondary schools. Using a computer vision algorithm, graphical icons printed on paper are captured by a camera, and subsequently interpreted and sent to the robot for execution. Two user studies with in total 77 university students showed promising results, indicating that the devised interface can elicit more interest and a higher level of collaboration within groups.
在过去,与基于屏幕的图形化编程语言相比,有形编程语言的使用显示出了一些优势。特别是当呈现给新手时,这样的接口可能是教授基本计算机科学和编程概念的更直观和直接的选择。以前的研究报告说,当使用有形的编程语言时,会增加兴趣并改善协作。然而,额外的财政开支往往阻碍了在正规教育环境中使用这种接口。因此,这项工作为Thymio提供了一种低成本和可定制的有形编程语言解决方案,Thymio是一种广泛应用于中小学的教育机器人。使用计算机视觉算法,打印在纸上的图形图标被相机捕获,随后被解释并发送给机器人执行。两项共有77名大学生参与的用户研究显示出令人鼓舞的结果,表明设计的界面可以引起更多的兴趣,并在小组内提高协作水平。
{"title":"A Tangible Programming Language for the Educational Robot Thymio","authors":"Andrea Mussati, Christian Giang, Alberto Piatti, F. Mondada","doi":"10.1109/IISA.2019.8900743","DOIUrl":"https://doi.org/10.1109/IISA.2019.8900743","url":null,"abstract":"In the past, the use of tangible programming languages has shown several advantages compared to screenbased graphical programming languages. Especially when presented to novices, such interfaces may represent a more intuitive and straightforward alternative to teach basic computer science and programming concepts. Previous studies have reported increased interest and improved collaboration when tangible programming languages were used. However, additional financial expenses have often hindered the use of such interfaces in formal education settings. This work therefore presents a low-cost and customizable solution of a tangible programming language for Thymio, an educational robot widely used in primary and secondary schools. Using a computer vision algorithm, graphical icons printed on paper are captured by a camera, and subsequently interpreted and sent to the robot for execution. Two user studies with in total 77 university students showed promising results, indicating that the devised interface can elicit more interest and a higher level of collaboration within groups.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126491551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Improving Collaborative Filtering’s Rating Prediction Accuracy by Introducing the Common Item Rating Past Criterion 引入公共项目评定过去标准提高协同过滤评定预测精度
Dionisis Margaris, Dionysios Vasilopoulos, C. Vassilakis, D. Spiliotopoulos
Collaborative filtering formulates personalized recommendations by considering ratings submitted by users. Collaborative filtering algorithms initially find people having similar likings, by inspecting the similarity of ratings already present in the ratings database. Users exhibiting high similarity regarding their likings are classified as “near neighbors” (NNs) and the ratings entered by each user’s near neighbors drive the formulation of recommendations for that user. To quantify the similarity between users, in order to determine a user’s NNs, a similarity metric is used. Insofar, similarity metrics proposed in the literature either consider all user ratings equally or take into account temporal variations within the users’ or items’ ratings history. However users’ ratings are co-shaped according to the experiences that they had in the past; therefore if two users enter similar (or dissimilar) ratings for an item while having experienced to a large extent the same items in the past, this constitutes stronger evidence about user similarity (or dissimilarity). Insofar however, no similarity metric takes into account this aspect. In this work, we propose and evaluate an algorithm that considers the common item rating past when computing rating predictions, in order to increase rating prediction accuracy.
协同过滤通过考虑用户提交的评分来制定个性化推荐。协同过滤算法首先通过检查评分数据库中已经存在的评分的相似性来找到具有相似喜好的人。在喜好方面表现出高相似性的用户被归类为“近邻”(nn),每个用户的近邻输入的评级驱动了对该用户的推荐的制定。为了量化用户之间的相似性,为了确定用户的神经网络,使用了相似性度量。到目前为止,文献中提出的相似性度量要么平等地考虑所有用户评级,要么考虑用户或项目评级历史中的时间变化。然而,用户的评分是根据他们过去的体验共同塑造的;因此,如果两个用户输入相似(或不同)的评分,而他们在很大程度上经历过相同的物品,这就构成了关于用户相似性(或不相似性)的更有力的证据。然而,到目前为止,没有相似度度量考虑到这方面。在这项工作中,我们提出并评估了一种在计算评级预测时考虑过去常见项目评级的算法,以提高评级预测的准确性。
{"title":"Improving Collaborative Filtering’s Rating Prediction Accuracy by Introducing the Common Item Rating Past Criterion","authors":"Dionisis Margaris, Dionysios Vasilopoulos, C. Vassilakis, D. Spiliotopoulos","doi":"10.1109/IISA.2019.8900758","DOIUrl":"https://doi.org/10.1109/IISA.2019.8900758","url":null,"abstract":"Collaborative filtering formulates personalized recommendations by considering ratings submitted by users. Collaborative filtering algorithms initially find people having similar likings, by inspecting the similarity of ratings already present in the ratings database. Users exhibiting high similarity regarding their likings are classified as “near neighbors” (NNs) and the ratings entered by each user’s near neighbors drive the formulation of recommendations for that user. To quantify the similarity between users, in order to determine a user’s NNs, a similarity metric is used. Insofar, similarity metrics proposed in the literature either consider all user ratings equally or take into account temporal variations within the users’ or items’ ratings history. However users’ ratings are co-shaped according to the experiences that they had in the past; therefore if two users enter similar (or dissimilar) ratings for an item while having experienced to a large extent the same items in the past, this constitutes stronger evidence about user similarity (or dissimilarity). Insofar however, no similarity metric takes into account this aspect. In this work, we propose and evaluate an algorithm that considers the common item rating past when computing rating predictions, in order to increase rating prediction accuracy.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133890117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Self-trained eXtreme Gradient Boosting Trees 自我训练的极端梯度增强树
Nikos Fazakis, Georgios Kostopoulos, Stamatis Karlos, S. Kotsiantis, K. Sgarbas
Semi-Supervised Learning (SSL) is an ever-growing research area offering a powerful set of methods, either single or multi-view, for exploiting both labeled and unlabeled instances in the most effective manner. Self-training is a representative SSL algorithm which has been efficiently implemented for solving several classification problems in a wide range of scientific fields. Moreover, self-training has served as the base for the development of several self-labeled methods. In addition, gradient boosting is an advanced machine learning technique, a boosting algorithm for both classification and regression problems, which produces a predictive model in the form of decision trees. In this context, the principal objective of this paper is to put forward an improved self-training algorithm for classification tasks utilizing the efficacy of eXtreme Gradient Boosting (XGBoost) trees in a self-labeled scheme in order to build a highly accurate and robust classification model. A number of experiments on benchmark datasets were executed demonstrating the superiority of the proposed method over representative semi-supervised methods, as statistically verified by the Friedman non-parametric test.
半监督学习(SSL)是一个不断发展的研究领域,提供了一组强大的方法,可以是单视图或多视图,以最有效的方式利用标记和未标记的实例。自训练算法是一种代表性的SSL算法,已在广泛的科学领域中有效地解决了许多分类问题。此外,自我训练已成为几种自我标记方法发展的基础。此外,梯度增强是一种先进的机器学习技术,是一种用于分类和回归问题的增强算法,它以决策树的形式产生预测模型。在此背景下,本文的主要目标是利用自标记方案中极端梯度增强(XGBoost)树的有效性,提出一种改进的分类任务自训练算法,以构建高度准确和鲁棒的分类模型。在基准数据集上进行的大量实验表明,所提出的方法优于代表性的半监督方法,并通过Friedman非参数检验进行了统计验证。
{"title":"Self-trained eXtreme Gradient Boosting Trees","authors":"Nikos Fazakis, Georgios Kostopoulos, Stamatis Karlos, S. Kotsiantis, K. Sgarbas","doi":"10.1109/IISA.2019.8900737","DOIUrl":"https://doi.org/10.1109/IISA.2019.8900737","url":null,"abstract":"Semi-Supervised Learning (SSL) is an ever-growing research area offering a powerful set of methods, either single or multi-view, for exploiting both labeled and unlabeled instances in the most effective manner. Self-training is a representative SSL algorithm which has been efficiently implemented for solving several classification problems in a wide range of scientific fields. Moreover, self-training has served as the base for the development of several self-labeled methods. In addition, gradient boosting is an advanced machine learning technique, a boosting algorithm for both classification and regression problems, which produces a predictive model in the form of decision trees. In this context, the principal objective of this paper is to put forward an improved self-training algorithm for classification tasks utilizing the efficacy of eXtreme Gradient Boosting (XGBoost) trees in a self-labeled scheme in order to build a highly accurate and robust classification model. A number of experiments on benchmark datasets were executed demonstrating the superiority of the proposed method over representative semi-supervised methods, as statistically verified by the Friedman non-parametric test.","PeriodicalId":371385,"journal":{"name":"2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125643292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
期刊
2019 10th International Conference on Information, Intelligence, Systems and Applications (IISA)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1