
Computer science & information technology: latest articles

Quantum Clustering Analysis: Minima of the Potential Energy Function
Pub Date : 2020-12-19 DOI: 10.5121/csit.2020.101914
A. Maignan, Tony C. Scott
Quantum clustering (QC) is a data clustering algorithm based on quantum mechanics, accomplished by substituting each point in a given dataset with a Gaussian. The width of the Gaussian is a 𝜎 value, a hyper-parameter which can be manually defined and manipulated to suit the application. Numerical methods are used to find all the minima of the quantum potential, as they correspond to cluster centers. Herein, we investigate the mathematical task of expressing and finding all the roots of the exponential polynomial corresponding to the minima of a two-dimensional quantum potential. This is an outstanding task because normally such expressions are impossible to solve analytically. However, we prove that if the points are all included in a square region of size 𝜎, there is only one minimum. This bound is not only useful for the number of solutions to look for by numerical means; it also allows us to propose a new numerical approach “per block”. This technique decreases the number of particles (or samples) by approximating some groups of particles with weighted particles. These findings are useful not only for the quantum clustering problem but also for the exponential polynomials encountered in quantum chemistry, solid-state physics and other applications.
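As an illustration of the quantity whose minima are sought, the sketch below evaluates a quantum potential at a point. It assumes the form commonly used in the quantum clustering literature, V(x) = E - d/2 + (1 / (2𝜎²ψ(x))) Σ_i |x - x_i|² exp(-|x - x_i|² / (2𝜎²)), where ψ is the sum of Gaussians; the abstract itself does not state the formula, and the function name `quantum_potential` is hypothetical. The constant E - d/2 is dropped because it does not move the minima.

```python
import math


def quantum_potential(x, points, sigma):
    """Quantum potential at x, up to an additive constant (assumed form).

    x      -- coordinates of the evaluation point
    points -- data points, each with the same dimension as x
    sigma  -- Gaussian width (the hyper-parameter from the abstract)
    """
    numerator = 0.0
    denominator = 0.0
    for p in points:
        # Squared Euclidean distance between x and the data point.
        r2 = sum((xi - pi) ** 2 for xi, pi in zip(x, p))
        weight = math.exp(-r2 / (2.0 * sigma ** 2))
        numerator += r2 * weight
        denominator += weight
    # Note: far from all data points the weights underflow to zero;
    # a production version would need to guard against that.
    return numerator / (2.0 * sigma ** 2 * denominator)


# Two points 0.5*sigma apart, so both fit in a square of side sigma.
pts = [(-0.25, 0.0), (0.25, 0.0)]
v_mid = quantum_potential((0.0, 0.0), pts, 1.0)
v_at_point = quantum_potential((0.25, 0.0), pts, 1.0)
```

In this small case the potential is lower at the midpoint than at either data point, which is in line with the single-minimum statement for points confined to a square of size 𝜎; it is a numerical spot check, not a proof.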
Citations: 0
On Some Desired Properties of Data Augmentation by Illumination Simulation for Color Constancy
Pub Date : 2020-12-19 DOI: 10.5121/csit.2020.101903
Nikola Banić, Karlo Koščević, M. Subašić, S. Lončarić
Computational color constancy is used in almost all digital cameras to reduce the influence of scene illumination on object colors. Many of the highly accurate published illumination estimation methods use deep learning, which relies on large numbers of images with known ground-truth illuminations. Since the appropriate publicly available training datasets are relatively small, data augmentation is often used as well, by simulating the appearance of a given image under another illumination. Still, there are practically no reports on any desired properties of such simulated images or on the limits of their usability. In this paper, several experiments for determining some of these properties are proposed and conducted by comparing the behavior of the simplest illumination estimation methods on images of the same scenes obtained under real illuminations and images obtained through data augmentation. The experimental results are presented and discussed.
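A minimal sketch of the two ingredients mentioned above, under stated assumptions: illumination simulation is modelled here with a per-channel (diagonal, von Kries-style) scaling, and Gray-world stands in for "the simplest illumination estimation methods". The abstract does not say which augmentation model or estimators the paper uses, and the names `relight` and `gray_world` are hypothetical.

```python
import math


def relight(pixels, src_illum, dst_illum):
    """Simulate another illumination by per-channel scaling (diagonal model)."""
    gains = [d / s for s, d in zip(src_illum, dst_illum)]
    return [tuple(c * g for c, g in zip(px, gains)) for px in pixels]


def gray_world(pixels):
    """Estimate the illumination as the unit-norm per-channel mean."""
    n = len(pixels)
    means = [sum(px[ch] for px in pixels) / n for ch in range(3)]
    norm = math.sqrt(sum(m * m for m in means))
    return tuple(m / norm for m in means)


# A tiny linear-RGB "image" whose channel means are equal under white light.
image = [(0.2, 0.4, 0.6), (0.6, 0.4, 0.2)]
augmented = relight(image, (1.0, 1.0, 1.0), (2.0, 1.0, 1.0))
estimate = gray_world(augmented)
```

Here the estimate recovered from the augmented image points in the direction of the simulated illuminant (red twice as strong as green and blue), which is the kind of agreement one can then compare against images captured under real illuminations.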
Citations: 0
Predicting Disease Activity for Biologic Selection in Rheumatoid Arthritis
Pub Date : 2020-12-19 DOI: 10.5121/csit.2020.101913
M. Yamauchi, K. Nakano, Yoshiya Tanaka, K. Horio
In this article, we implemented a regression model and conducted experiments on predicting disease activity using data from 1,929 rheumatoid arthritis patients, to assist in the selection of biologics for rheumatoid arthritis. For modelling, the missing values in the data were imputed by three different methods: mean value, self-organizing map and random value. Experimental results showed that the prediction error of the regression model was large regardless of the imputation method, making it difficult to predict the prognosis of rheumatoid arthritis patients.
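Two of the three completion methods can be sketched in a few lines. This is an illustration only: missing entries are represented as `None`, "random value" is assumed to mean a draw from the observed values of the same variable (the abstract does not specify the distribution), and the function names are hypothetical. The self-organizing map variant is omitted.

```python
import random


def impute_mean(values):
    """Replace missing entries (None) with the mean of the observed ones."""
    observed = [v for v in values if v is not None]
    mean = sum(observed) / len(observed)
    return [mean if v is None else v for v in values]


def impute_random(values, rng):
    """Replace missing entries with a random observed value (assumed scheme)."""
    observed = [v for v in values if v is not None]
    return [rng.choice(observed) if v is None else v for v in values]


column = [1.0, None, 3.0]
by_mean = impute_mean(column)
by_random = impute_random(column, random.Random(0))
```

Passing a seeded `random.Random` keeps the random completion reproducible between runs.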
Citations: 0
A Multilevel Technology Acceptance Management Model
Pub Date : 2020-12-18 DOI: 10.5121/csit.2020.101802
Gilbert Busolo, L. Nderu, Kennedy Ogada
Knowledge is a strategic resource for successful data-driven decision making in any organization. To harness this knowledge, successful adoption of a technological intervention is key. Institutions leverage technology to drive knowledge management (KM) initiatives for quality service delivery and prudent data management. These initiatives provide the overall strategy for managing data resources. They make knowledge organization tools and techniques available while enabling regular updates. Benefits derived from positive deployment of a technological intervention are competency enhancement through gained knowledge, raised quality of service and promotion of healthy development of e-commerce. Successful and timely adoption of the technological interventions through which knowledge management initiatives are deployed remains a key challenge for many organizations. This paper proposes a holistic multilevel technology acceptance management model. The proposed model takes into account the human, technological and organizational variables that exist in a deployment environment. This model will be vital in driving early technology acceptance prediction and timely deployment of mitigation measures to deploy technological interventions successfully.
Citations: 0
An Intellectual Approach to Design Personal Study Plan via Machine Learning
Pub Date : 2020-12-18 DOI: 10.5121/csit.2020.101804
Shiyuan Zhang, Evan Gunnell, Marisabel Chang, Yu Sun
As more students are required to have standardized test scores to enter higher education, developing vocabulary becomes essential for achieving ideal scores. Each individual has his or her own study style that maximizes efficiency, and there are various approaches to memorization. However, it is difficult to find the specific learning method that best fits a given person. This paper designs a tool to customize personal study plans based on clients’ different habits, including difficulty distribution, difficulty order of learning words, and the types of vocabulary. We applied our application to educational software and conducted a quantitative evaluation of the approach via three types of machine learning models. By calculating cross-validation scores, we evaluated the accuracy of each model and identified the model that returns the most accurate predictions. The results reveal that linear regression has the highest cross-validation score, and it can provide the most efficient personal study plans.
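The cross-validation step can be sketched as follows. This is a plain k-fold procedure with a one-variable least-squares line and mean squared error as the score; the abstract does not state the fold count, the features, or the scoring metric, so all of these and the function names are assumptions made for illustration.

```python
def k_fold_indices(n, k):
    """Split range(n) into k contiguous folds of near-equal size."""
    folds, start = [], 0
    for i in range(k):
        size = n // k + (1 if i < n % k else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx else 0.0
    return slope, my - slope * mx


def cross_val_mse(xs, ys, k):
    """Average held-out mean squared error over k folds."""
    scores = []
    for fold in k_fold_indices(len(xs), k):
        held_out = set(fold)
        train_x = [x for i, x in enumerate(xs) if i not in held_out]
        train_y = [y for i, y in enumerate(ys) if i not in held_out]
        slope, intercept = fit_line(train_x, train_y)
        errors = [(ys[i] - (slope * xs[i] + intercept)) ** 2 for i in fold]
        scores.append(sum(errors) / len(errors))
    return sum(scores) / len(scores)


xs = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0 * x + 1.0 for x in xs]
score = cross_val_mse(xs, ys, 3)
```

Comparing such held-out scores across candidate models is what allows one model to be ranked above the others without testing on the data it was trained on.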
Citations: 0
PNFUZZ: A Stateful Network Protocol Fuzzing Approach Based on Packet Clustering
Pub Date : 2020-12-18 DOI: 10.5121/csit.2020.101805
Huihui He, Yongjun Wang
Because stateful network protocols are interactive, network protocol fuzzing suffers from higher blindness and lower test-case validity. Existing black-box fuzzing has the disadvantages of high randomness and blindness. Manually describing a protocol specification requires expert knowledge, is tedious, and does not support protocols without public documentation, which limits the effectiveness of current network protocol fuzzers. In this paper, we present PNFUZZ, a fuzzer that adopts state inference based on a packet clustering algorithm and a coverage-oriented mutation strategy. We train a clustering model on packets of the target protocol and use the model to identify the server’s protocol state, thereby optimizing the process of test-case generation. The experimental results show that the proposed approach yields some improvement in fuzzing effectiveness.
Citations: 2
Importance of the Single-Span Task Formulation to Extractive Question-answering
Pub Date : 2020-12-18 DOI: 10.5121/csit.2020.101809
Marie-Anne Xu, Rahul Khanna
Recent progress in machine reading comprehension and question-answering has allowed machines to reach and even surpass human question-answering. However, the majority of these questions have only one answer, and more substantial testing on questions with multiple answers, or multi-span questions, has not yet been applied. Thus, we introduce a newly compiled dataset consisting of questions with multiple answers that originate from previously existing datasets. In addition, we run BERT-based models pre-trained for question-answering on our constructed dataset to evaluate their reading comprehension abilities. Among the three BERT-based models we ran, RoBERTa exhibits the highest consistent performance, regardless of size. We find that all our models perform similarly on this new, multi-span dataset (21.492% F1) compared to the single-span source datasets (~33.36% F1). While the models tested on the source datasets were slightly fine-tuned, performance is similar enough to judge that task formulation does not drastically affect question-answering abilities. Our evaluations indicate that these models are indeed capable of adjusting to answer questions that require multiple answers. We hope that our findings will assist future development in question-answering and improve existing question-answering products and methods.
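For readers unfamiliar with the F1 figures quoted above, the sketch below shows a token-overlap F1 of the kind widely used for extractive question-answering. The abstract does not say how F1 was computed for multi-span answers; pooling the tokens of all spans on each side, as `multi_span_f1` does here, is one simple assumption, and both function names are hypothetical.

```python
from collections import Counter


def token_f1(prediction, gold):
    """Harmonic mean of token precision and recall between two strings."""
    pred_tokens = prediction.lower().split()
    gold_tokens = gold.lower().split()
    # Multiset intersection counts each shared token at most as often
    # as it appears on both sides.
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def multi_span_f1(predicted_spans, gold_spans):
    """Pool the tokens of all spans on each side and score them together."""
    return token_f1(" ".join(predicted_spans), " ".join(gold_spans))


single = token_f1("the cat", "the cat sat")
multi = multi_span_f1(["red", "blue"], ["red", "green"])
```

Under this pooling, a model that returns only one of several required spans is penalized through recall, which is one way a single-span formulation can lower scores on multi-span questions.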
Citations: 0
Automatic Generation of Text for Match Recaps using Esport Caster Commentaries
Pub Date : 2020-12-18 DOI: 10.5121/csit.2020.101810
Oluseyi Olarewaju, A. Kokkinakis, Simon Demediuk, Justus Roberstson, Isabelle Nölle, Sagarika Patra, Daniel Slawson, A. Chitayat, Alistair Coates, B. Kirman, Anders Drachen, M. Ursu, Florian Block, Jonathan Hook
Unlike traditional physical sports, Esport games are played using wholly digital platforms. As a consequence, there exists rich data (in-game, audio and video) about the events that take place in matches. These data offer viable linguistic resources for generating comprehensible text descriptions of matches, which could be used as the basis of novel text-based spectator experiences. We present a study that investigates whether users perceive text generated by the NLG system as an accurate recap of highlight moments. We also explore how the generated text supported viewer understanding of highlight moments in two scenarios: i) text as an alternative way to spectate a match, instead of viewing the main broadcast; and ii) text as an additional information resource to be consumed while viewing the main broadcast. Our study provided insights into the implications of presentation strategies for the use of text in recapping highlight moments to Dota 2 spectators.
Citations: 1
SMARTAJWEED Automatic Recognition of Arabic Quranic Recitation Rules
Pub Date : 2020-12-18 DOI: 10.5121/csit.2020.101812
Ali M. Alagrami, Maged M. Eljazzar
Tajweed is a set of rules for reading the Quran with the correct pronunciation of the letters and all their qualities during recitation. This means giving every letter in the Quran its due characteristics and applying them to that particular letter in its specific context while reading, which may differ in other contexts. These characteristics include melodic rules, such as where to stop and for how long, when to merge two letters in pronunciation, when to stretch some, and even when to put more strength on some letters than on others. Most papers focus mainly on the main recitation rules and the pronunciation but not on the rules of Tajweed (Ahkam AL Tajweed), which give a different rhythm and a different melody to the pronunciation with every Tajweed rule. These rules are also considered essential in reading the Quran, as they can give different meanings to the words. In this paper we discuss in detail a full system for automatic recognition of Quran recitation rules (Tajweed) using a support vector machine and a threshold scoring system.
Citations: 7
A Fuzzy BI-Objective Model for SVM with an Interactive Procedure to Identify the Best Compromise Solution
Pub Date : 2020-12-18 DOI: 10.5121/csit.2020.101808
Hager Ali Yahia, Mohammed Zakaria Moustafa, Mohammed Rizk Mohammed, H. Khater
A support vector machine (SVM) learns the decision surface from two different classes of input points. In many applications, some of the input points are misclassified and are not fully assigned to one of these two classes. In this paper a bi-objective quadratic programming model with fuzzy parameters is utilized, and different feature quality measures are optimized simultaneously. An α-cut is defined to transform the fuzzy model into a family of classical bi-objective quadratic programming problems. The weighting method is used to optimize each of these problems. An important contribution of the proposed fuzzy bi-objective quadratic programming model is that changing the weighting values yields different efficient support vectors. The experimental results show the effectiveness of the α-cut with the weighting parameters in reducing the misclassification between the two classes of input points. An interactive procedure will be added to identify the best compromise solution from the generated efficient solutions.
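The two transformations named above can be illustrated in isolation. The sketch assumes triangular fuzzy numbers given as (low, peak, high), which the abstract does not specify, and shows the weighting method as a plain weighted sum of two objective functions; the names `alpha_cut` and `weighted_objective` are hypothetical, and the SVM quadratic program itself is not reproduced.

```python
def alpha_cut(tri, alpha):
    """Interval of a triangular fuzzy number (low, peak, high) at level alpha."""
    low, peak, high = tri
    return (low + alpha * (peak - low), high - alpha * (high - peak))


def weighted_objective(f1, f2, w1):
    """Scalarize two objectives with weights w1 and 1 - w1."""
    return lambda x: w1 * f1(x) + (1.0 - w1) * f2(x)


# At alpha = 0.5 the fuzzy parameter (1, 2, 4) becomes a crisp interval.
interval = alpha_cut((1.0, 2.0, 4.0), 0.5)

# Two toy objectives pulling x towards 0 and towards 2 respectively.
combined = weighted_objective(lambda x: x * x, lambda x: (x - 2.0) ** 2, 0.5)
value = combined(1.0)
```

Sweeping `w1` between 0 and 1 traces out different trade-offs between the two objectives, which is how changing the weighting values can produce different efficient solutions for each α level.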
Citations: 1
Journal
Computer science & information technology