首页 > 最新文献

Proceedings of the 6th International Conference on Information System and Data Mining最新文献

英文 中文
Clustering Faster and Better with Projected Data 使用投影数据更快更好地聚类
Alibek Zhakubayev, Greg Hamerly
The K-means clustering algorithm can take a lot of time to converge, especially for large datasets in high dimension and a large number of clusters. By applying several enhancements it is possible to improve the performance without significantly changing the quality of the clustering. In this paper we first find a good clustering in a reduced-dimension version of the dataset, before fine-tuning the clustering in the original dimension. This saves time because accelerated K-means algorithms are fastest in low dimension, and the initial low-dimensional clustering bring us close to a good solution for the original data. We use random projection to reduce the dimension, as it is fast and maintains the cluster properties we want to preserve. In our experiments, we see that this approach significantly reduces the time needed for clustering a dataset and in most cases produces better results.
K-means聚类算法收敛时间较长,特别是对于高维的大型数据集和大量的聚类。通过应用一些增强功能,可以在不显著改变集群质量的情况下提高性能。在本文中,我们首先在数据集的降维版本中找到一个好的聚类,然后在原始维度上微调聚类。这节省了时间,因为加速K-means算法在低维上是最快的,并且初始的低维聚类使我们接近于原始数据的一个很好的解。我们使用随机投影来降低维数,因为它既快速又保持了我们想要保留的簇属性。在我们的实验中,我们看到这种方法大大减少了聚类数据集所需的时间,并且在大多数情况下产生了更好的结果。
{"title":"Clustering Faster and Better with Projected Data","authors":"Alibek Zhakubayev, Greg Hamerly","doi":"10.1145/3546157.3546158","DOIUrl":"https://doi.org/10.1145/3546157.3546158","url":null,"abstract":"The K-means clustering algorithm can take a lot of time to converge, especially for large datasets in high dimension and a large number of clusters. By applying several enhancements it is possible to improve the performance without significantly changing the quality of the clustering. In this paper we first find a good clustering in a reduced-dimension version of the dataset, before fine-tuning the clustering in the original dimension. This saves time because accelerated K-means algorithms are fastest in low dimension, and the initial low-dimensional clustering bring us close to a good solution for the original data. We use random projection to reduce the dimension, as it is fast and maintains the cluster properties we want to preserve. In our experiments, we see that this approach significantly reduces the time needed for clustering a dataset and in most cases produces better results.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122768167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Kaiser-Meyer-Olkin Factor Analysis: A Quantitative Approach on Mobile Gaming Addiction using Random Forest Classifier Kaiser-Meyer-Olkin因子分析:基于随机森林分类器的手机游戏成瘾定量分析方法
Jefferson A. Costales, J. J. J. Catulay, Jeffrey Costales, Noel Bermudez
Technology allows us to progress and innovate in today's world, which advances at a faster pace. We innovate from traditional to digital life with the use of technology. There have been several technological advances, such as Mobile Gaming, that have occurred as a result of the evolution of technology. Gaming is a recreational pastime that has become available through technology in the form of apps for mobile devices. Technology is beneficial, but it also has a negative side effect: addiction. The goal of the study is to see if there is any link between a student's time spent playing mobile games and their social interactions. The study also attempts to discover different features that are important in mobile gaming addiction since they can be used to detect early signs of addiction. For the feature of importance, the researchers applied Random Forest Classifier. To ensure that the data is adequate, the study will employ factor analysis and Kaiser-Meyer-Olkin. This information can be utilized to improve the future study. The researchers gathered and used data from 513 college students from several Philippine universities.
在当今世界,技术使我们进步和创新,发展速度更快。我们利用科技从传统生活向数字化生活创新。有一些技术进步,比如手机游戏,是技术发展的结果。游戏是一种娱乐消遣,通过移动设备上的应用程序的形式变得可行。科技是有益的,但它也有负面影响:上瘾。这项研究的目的是了解学生玩手机游戏的时间和他们的社交活动之间是否存在联系。该研究还试图发现在手游成瘾中重要的不同特征,因为它们可以用来检测成瘾的早期迹象。对于重要性特征,研究人员采用了随机森林分类器。为了保证数据的充足性,本研究将采用因子分析和Kaiser-Meyer-Olkin方法。这些信息可以用来改进未来的研究。研究人员收集并使用了来自菲律宾几所大学的513名大学生的数据。
{"title":"Kaiser-Meyer-Olkin Factor Analysis: A Quantitative Approach on Mobile Gaming Addiction using Random Forest Classifier","authors":"Jefferson A. Costales, J. J. J. Catulay, Jeffrey Costales, Noel Bermudez","doi":"10.1145/3546157.3546161","DOIUrl":"https://doi.org/10.1145/3546157.3546161","url":null,"abstract":"Technology allows us to progress and innovate in today's world, which advances at a faster pace. We innovate from traditional to digital life with the use of technology. There have been several technological advances, such as Mobile Gaming, that have occurred as a result of the evolution of technology. Gaming is a recreational pastime that has become available through technology in the form of apps for mobile devices. Technology is beneficial, but it also has a negative side effect: addiction. The goal of the study is to see if there is any link between a student's time spent playing mobile games and their social interactions. The study also attempts to discover different features that are important in mobile gaming addiction since they can be used to detect early signs of addiction. For the feature of importance, the researchers applied Random Forest Classifier. To ensure that the data is adequate, the study will employ factor analysis and Kaiser-Meyer-Olkin. This information can be utilized to improve the future study. The researchers gathered and used data from 513 college students from several Philippine universities.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131995396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Towards Simplifying and Formalizing UML Class Diagram Generalization/Specialization Relationship with Mathematical Set Theory 用数学集合论简化和形式化UML类图泛化/专门化关系
Kruti Shah, Emanuel S. Grant
The Unified Modeling Language (UML) is considered the de facto standard for object-oriented software model development. This makes it appropriate to be used in academia courses at both the graduate and undergraduate levels of education. Some challenges to using the UML is academia are its large number of model concepts and the imprecise semantic of some of these concepts. These challenges are daunting for students who are being introduced to the UML. One approach that can be taken in teaching UML towards addressing these concerns is to limit the number of UML concepts taught and recognize that students may not be able to develop correct UML system models. This approach leads to research work that develop a limited set of UML model concepts that are fewer in number and have more precise semantics. In this paper, we present a new approach to resolve an aspect of this problem by simplifying the generalization/specialization semantics of the class diagram through the application of mathematical formality to usage of these class diagram concepts. This research work derives a core set of concepts suitable for graduate and undergraduate comprehension of UML modeling and defines more precise semantics for those modeling concepts. The applicable mathematical principles applied in this work are from the domains of set theory and predicate logic. This approach is particularly relevant for the pedagogy of software engineering and the development of software systems that require a high level of reliability.
统一建模语言(UML)被认为是面向对象软件模型开发的事实标准。这使得它适合用于研究生和本科教育水平的学术课程。使用UML学术界的一些挑战是其大量的模型概念和其中一些概念的不精确语义。这些挑战对于刚接触UML的学生来说是令人畏惧的。在教授UML以解决这些问题时,可以采取的一种方法是限制所教授的UML概念的数量,并认识到学生可能无法开发正确的UML系统模型。这种方法导致研究工作开发一组有限的UML模型概念,这些概念在数量上更少,并且具有更精确的语义。在本文中,我们提出了一种新的方法来解决这一问题的一个方面,即通过将数学形式应用于类图概念的使用,简化类图的泛化/专门化语义。这项研究工作衍生了一组核心概念,适合研究生和本科生对UML建模的理解,并为这些建模概念定义了更精确的语义。在这项工作中应用的数学原理来自集合论和谓词逻辑的领域。这种方法特别适用于软件工程的教学方法和需要高可靠性的软件系统的开发。
{"title":"Towards Simplifying and Formalizing UML Class Diagram Generalization/Specialization Relationship with Mathematical Set Theory","authors":"Kruti Shah, Emanuel S. Grant","doi":"10.1145/3546157.3546171","DOIUrl":"https://doi.org/10.1145/3546157.3546171","url":null,"abstract":"The Unified Modeling Language (UML) is considered the de facto standard for object-oriented software model development. This makes it appropriate to be used in academia courses at both the graduate and undergraduate levels of education. Some challenges to using the UML is academia are its large number of model concepts and the imprecise semantic of some of these concepts. These challenges are daunting for students who are being introduced to the UML. One approach that can be taken in teaching UML towards addressing these concerns is to limit the number of UML concepts taught and recognize that students may not be able to develop correct UML system models. This approach leads to research work that develop a limited set of UML model concepts that are fewer in number and have more precise semantics. In this paper, we present a new approach to resolve an aspect of this problem by simplifying the generalization/specialization semantics of the class diagram through the application of mathematical formality to usage of these class diagram concepts. This research work derives a core set of concepts suitable for graduate and undergraduate comprehension of UML modeling and defines more precise semantics for those modeling concepts. The applicable mathematical principles applied in this work are from the domains of set theory and predicate logic. This approach is particularly relevant for the pedagogy of software engineering and the development of software systems that require a high level of reliability.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"94 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114040708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Examining User Acceptance and Adoption of the Internet of Things in Indonesia 考察印尼用户对物联网的接受程度和采用情况
Meiryani Meiryani, Cindy Cornelia, Satami Doi Kikkawa, H. Ulinnuha, Lidiyawati Lidiyawati
As we've seen in the history of IoT devices, connecting traditionally unconnected objects like the refrigerator at Carnegie Mellon has been possible since the early 1980s, but the costs are significant. This requires the processing power of the DEC PDP11 mainframe computer. Moore's Law demonstrates an increase in the number and density of transistors in silicon chipsets, while Dennard scaling improves the computer's power profile. Given these two trends, we are now producing devices that use more powerful CPUs and increased memory capacity and run operating systems capable of running the full network stack. Only with these requirements met, IoT has become an industry into itself.
正如我们在物联网设备的历史中所看到的,自20世纪80年代初以来,连接卡内基梅隆大学冰箱等传统上未连接的物体已经成为可能,但成本很高。这需要DEC PDP11大型计算机的处理能力。摩尔定律证明了硅芯片组中晶体管的数量和密度的增加,而登纳德缩放提高了计算机的功率配置。考虑到这两个趋势,我们现在正在生产使用更强大的cpu和更大的内存容量的设备,并运行能够运行完整网络堆栈的操作系统。只有满足了这些要求,物联网才会成为一个行业。
{"title":"Examining User Acceptance and Adoption of the Internet of Things in Indonesia","authors":"Meiryani Meiryani, Cindy Cornelia, Satami Doi Kikkawa, H. Ulinnuha, Lidiyawati Lidiyawati","doi":"10.1145/3546157.3546176","DOIUrl":"https://doi.org/10.1145/3546157.3546176","url":null,"abstract":"As we've seen in the history of IoT devices, connecting traditionally unconnected objects like the refrigerator at Carnegie Mellon has been possible since the early 1980s, but the costs are significant. This requires the processing power of the DEC PDP11 mainframe computer. Moore's Law demonstrates an increase in the number and density of transistors in silicon chipsets, while Dennard scaling improves the computer's power profile. Given these two trends, we are now producing devices that use more powerful CPUs and increased memory capacity and run operating systems capable of running the full network stack. Only with these requirements met, IoT has become an industry into itself.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121315728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Engaging Undergraduate Students in an Introductory A.I. Course through a Knowledge-Based Chatbot Workshop 通过基于知识的聊天机器人研讨会吸引本科生参加人工智能入门课程
T. Menkhoff, Ying Qian Lydia Teo
In this paper we share interim results of an ongoing mixed method evaluative study of 43 students enrolled in an elective course “Doing Business with A.I.” at the Lee Kong Chian School of Business (LKCSB), Singapore Management University. A key component of the course design is an experiential chatbot workshop that provides non-STEM students with an opportunity to acquire basic skills to build a chatbot prototype using the ‘Dialogflow’ program. The workshop and the experiential learning activity were designed to impart students with relevant knowledge and skills such as conversation and user-centric design know how and know why that are transferrable to other situational contexts beyond the course. Based on ongoing class surveys and qualitative interviews with students, we are trying to corroborate a conceptual model developed from learning theories and models related to technology mediated learning (TML) aimed at measuring the effects of a hands-on knowledge-based chatbot workshop designed by the authors on students’ engagement and motivation as drivers of acquiring AI-related competencies such as natural language processing skills (NLP). One important didactical aspect during the design and roll-out of the chatbot workshop is that novice learners with no or very little knowledge about A.I. recognize and create the important linkage between knowledge inputs and outputs of NLP-powered conversational agents (chatbots) so that user queries are effectively addressed. The knowledge-based chatbot workshop design as described in the paper provides useful practical information for instructors interested in designing educational chatbot prototypes for effective digital teaching and learning in a business school (higher education) context that can be transferred to other organizational units beyond the university (e.g. quality customer service) in order to make learners future-ready.
在本文中,我们分享了对新加坡管理大学李光前商学院(LKCSB)选修课程“与人工智能做生意”的43名学生进行的混合方法评估研究的中期结果。课程设计的一个关键组成部分是一个体验式聊天机器人研讨会,为非stem学生提供一个获得使用“Dialogflow”程序构建聊天机器人原型的基本技能的机会。工作坊和体验式学习活动旨在向学生传授相关知识和技能,如对话和以用户为中心的设计,这些知识和技能可以转移到课程以外的其他情境中。基于正在进行的课堂调查和对学生的定性访谈,我们试图证实一个概念模型,该模型是从与技术中介学习(TML)相关的学习理论和模型中发展出来的,旨在衡量作者设计的基于知识的实际聊天机器人研讨会对学生的参与和动机的影响,这些学生是获得人工智能相关能力(如自然语言处理技能(NLP))的驱动因素。在聊天机器人研讨会的设计和推出过程中,一个重要的教学方面是,对人工智能没有或很少了解的新手学习者能够识别并创建基于nlp的会话代理(聊天机器人)的知识输入和输出之间的重要联系,从而有效地解决用户查询。本文中描述的基于知识的聊天机器人研讨会设计为有兴趣设计教育聊天机器人原型的教师提供了有用的实用信息,这些原型可以在商学院(高等教育)环境中进行有效的数字化教学和学习,可以转移到大学以外的其他组织单位(例如优质客户服务),以使学习者为未来做好准备。
{"title":"Engaging Undergraduate Students in an Introductory A.I. Course through a Knowledge-Based Chatbot Workshop","authors":"T. Menkhoff, Ying Qian Lydia Teo","doi":"10.1145/3546157.3546175","DOIUrl":"https://doi.org/10.1145/3546157.3546175","url":null,"abstract":"In this paper we share interim results of an ongoing mixed method evaluative study of 43 students enrolled in an elective course “Doing Business with A.I.” at the Lee Kong Chian School of Business (LKCSB), Singapore Management University. A key component of the course design is an experiential chatbot workshop that provides non-STEM students with an opportunity to acquire basic skills to build a chatbot prototype using the ‘Dialogflow’ program. The workshop and the experiential learning activity were designed to impart students with relevant knowledge and skills such as conversation and user-centric design know how and know why that are transferrable to other situational contexts beyond the course. Based on ongoing class surveys and qualitative interviews with students, we are trying to corroborate a conceptual model developed from learning theories and models related to technology mediated learning (TML) aimed at measuring the effects of a hands-on knowledge-based chatbot workshop designed by the authors on students’ engagement and motivation as drivers of acquiring AI-related competencies such as natural language processing skills (NLP). One important didactical aspect during the design and roll-out of the chatbot workshop is that novice learners with no or very little knowledge about A.I. recognize and create the important linkage between knowledge inputs and outputs of NLP-powered conversational agents (chatbots) so that user queries are effectively addressed. The knowledge-based chatbot workshop design as described in the paper provides useful practical information for instructors interested in designing educational chatbot prototypes for effective digital teaching and learning in a business school (higher education) context that can be transferred to other organizational units beyond the university (e.g. quality customer service) in order to make learners future-ready.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"108 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116439659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recipe Recommendation Method by Similarity Measure with Food Image Recognition 基于食物图像识别的相似度度量的食谱推荐方法
Shota Tamaru, Hyuga Taki, Rune Usuki, T. Nakanishi
This paper presents a recipe recommendation method by similarity measure with food image recognition. In general, it is difficult for users with little cooking experience to find out what kind of dishes they can make from the ingredients they currently have. Therefore, we propose a system that recommends recipes based on the ingredients in the user's current inventory, thereby increasing the number of dishes in the user's cooking repertoire. This system uses camera images of foodstuffs as input, recognizes the foodstuffs, and searches for recipes. In the experiment, we conducted a questionnaire survey of the recognized food ingredients and a questionnaire survey of recipe suggestions, and the results showed that more than 3/4 of the respondents answered that the recognition results and recipe contents were correct for some of the images. In this way, possible for users to search for recipes with fewer steps.
提出了一种基于食品图像识别的相似度度量的食谱推荐方法。一般来说,对于没有烹饪经验的用户来说,很难知道用现有的食材可以做出什么样的菜。因此,我们提出了一个系统,根据用户当前库存中的食材推荐食谱,从而增加用户烹饪曲目中的菜肴数量。该系统使用食物的相机图像作为输入,识别食物,并搜索食谱。在实验中,我们对识别的食品成分进行了问卷调查,并对食谱建议进行了问卷调查,结果显示,超过3/4的受访者回答识别结果和食谱内容对部分图像是正确的。这样,用户可以用更少的步骤搜索食谱。
{"title":"Recipe Recommendation Method by Similarity Measure with Food Image Recognition","authors":"Shota Tamaru, Hyuga Taki, Rune Usuki, T. Nakanishi","doi":"10.1145/3546157.3546170","DOIUrl":"https://doi.org/10.1145/3546157.3546170","url":null,"abstract":"This paper presents a recipe recommendation method by similarity measure with food image recognition. In general, it is difficult for users with little cooking experience to find out what kind of dishes they can make from the ingredients they currently have. Therefore, we propose a system that recommends recipes based on the ingredients in the user's current inventory, thereby increasing the number of dishes in the user's cooking repertoire. This system uses camera images of foodstuffs as input, recognizes the foodstuffs, and searches for recipes. In the experiment, we conducted a questionnaire survey of the recognized food ingredients and a questionnaire survey of recipe suggestions, and the results showed that more than 3/4 of the respondents answered that the recognition results and recipe contents were correct for some of the images. In this way, possible for users to search for recipes with fewer steps.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122189039","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Framework for Exploring Computational Models of Novelty in Unstructured Text 探索非结构化文本新颖性计算模型的框架
M. Mohseni, M. Maher
Novelty modeling in unstructured text data is a research topic within the Natural Language Processing (NLP) Community. Effective novelty models can play a key role in providing relevant and interesting content to the users which is the central goal in many applications including education and recommender systems. This paper presents a framework for comparing different approaches and applications of computational models of novelty in unstructured text data. We focus on computational models that apply methods such as natural language processing and information theory. The framework provides an ontology for computational novelty with respect to the source of text data, methods for representing the data, and models for measuring novelty. We explore the value of the framework by applying it to research on computational novelty in news articles, research publications, books, and recipes. This framework is independent of the type of data in the items and can be used as a tool for researchers to study, compare, and extend existing computational novelty models and applications.
非结构化文本数据的新颖性建模是自然语言处理(NLP)领域的一个研究课题。有效的新颖性模型可以在向用户提供相关和有趣的内容方面发挥关键作用,这是包括教育和推荐系统在内的许多应用程序的中心目标。本文提出了一个框架,用于比较非结构化文本数据中新颖性计算模型的不同方法和应用。我们专注于应用自然语言处理和信息理论等方法的计算模型。该框架提供了关于文本数据源的计算新颖性的本体、表示数据的方法和测量新颖性的模型。我们通过将其应用于新闻文章、研究出版物、书籍和食谱中计算新颖性的研究来探索该框架的价值。该框架独立于项目中的数据类型,可以作为研究人员研究、比较和扩展现有计算新颖性模型和应用程序的工具。
{"title":"A Framework for Exploring Computational Models of Novelty in Unstructured Text","authors":"M. Mohseni, M. Maher","doi":"10.1145/3546157.3546164","DOIUrl":"https://doi.org/10.1145/3546157.3546164","url":null,"abstract":"Novelty modeling in unstructured text data is a research topic within the Natural Language Processing (NLP) Community. Effective novelty models can play a key role in providing relevant and interesting content to the users which is the central goal in many applications including education and recommender systems. This paper presents a framework for comparing different approaches and applications of computational models of novelty in unstructured text data. We focus on computational models that apply methods such as natural language processing and information theory. The framework provides an ontology for computational novelty with respect to the source of text data, methods for representing the data, and models for measuring novelty. We explore the value of the framework by applying it to research on computational novelty in news articles, research publications, books, and recipes. This framework is independent of the type of data in the items and can be used as a tool for researchers to study, compare, and extend existing computational novelty models and applications.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128820694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Visualized Knowledge Map of Enterprise Social Capital in the Context of Incubation Networks 孵化网络背景下企业社会资本的可视化知识图谱
Ke Ding, Hongxia Li
Purpose – This study aims to provide a systematic knowledge map for researchers who work in the field of enterprise social capital. Additionally, the aim is to help them quickly understand the trending research topics, hot spots and evolutionary trends from the context of incubation networks. Design/methodology/approach – The authors searched for 874 articles on corporate social capital from the Web of Science database source journals from 1986 to 2021. Then, CITESPACE and VOSviewer software are used to extract the fields of the literature and perform data visualization. Findings – The results show that the number of corporate social capital research papers under the incubation network increases year by year. At present, the research hotspots include social capital, corporate social responsibility, corporate performance, innovation, corporate management and governance, network and strategy. Research will be paid more attention to corporate social responsibility, social capital promotion of new enterprises under incubator network, and social capital accumulation of family enterprises. Incubation network provides an integrated platform for new enterprises to obtain external social capital, and strengthens the social capital of enterprises through cooperative relations. Originality/value – This paper adopts bibliometrics method to mine data and draws knowledge map to systematically reveal research progress and trend.
目的:本研究旨在为企业社会资本领域的研究者提供一个系统的知识图谱。此外,旨在帮助他们从孵化网络的背景下快速了解趋势研究主题,热点和进化趋势。设计/方法/方法-作者从1986年至2021年的Web of Science数据库源期刊中检索了874篇关于企业社会资本的文章。然后利用CITESPACE和VOSviewer软件提取文献字段并进行数据可视化。研究结果表明:孵化网络下的企业社会资本研究论文数量逐年增加。目前的研究热点包括社会资本、企业社会责任、企业绩效、创新、企业管理与治理、网络与战略。企业社会责任、孵化器网络下新企业的社会资本提升、家族企业的社会资本积累等方面的研究将更加关注。孵化网络为新企业获取外部社会资本提供了一个整合平台,通过合作关系强化企业的社会资本。原创性/价值——本文采用文献计量学方法挖掘数据,绘制知识图谱,系统揭示研究进展和趋势。
{"title":"A Visualized Knowledge Map of Enterprise Social Capital in the Context of Incubation Networks","authors":"Ke Ding, Hongxia Li","doi":"10.1145/3546157.3546174","DOIUrl":"https://doi.org/10.1145/3546157.3546174","url":null,"abstract":"Purpose – This study aims to provide a systematic knowledge map for researchers who work in the field of enterprise social capital. Additionally, the aim is to help them quickly understand the trending research topics, hot spots and evolutionary trends from the context of incubation networks. Design/methodology/approach – The authors searched for 874 articles on corporate social capital from the Web of Science database source journals from 1986 to 2021. Then, CITESPACE and VOSviewer software are used to extract the fields of the literature and perform data visualization. Findings – The results show that the number of corporate social capital research papers under the incubation network increases year by year. At present, the research hotspots include social capital, corporate social responsibility, corporate performance, innovation, corporate management and governance, network and strategy. Research will be paid more attention to corporate social responsibility, social capital promotion of new enterprises under incubator network, and social capital accumulation of family enterprises. Incubation network provides an integrated platform for new enterprises to obtain external social capital, and strengthens the social capital of enterprises through cooperative relations. Originality/value – This paper adopts bibliometrics method to mine data and draws knowledge map to systematically reveal research progress and trend.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125379664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Customer Lifetime Value Analysis Based on Machine Learning 基于机器学习的客户终身价值分析
Xinqian Dai
Customer lifetime value (CLV) is a powerful tool to determine the value of customers and filter customers most likely to attrite or most likely to make their first purchase, especially for e-commerce companies. This article reviewed machine learning models in analyzing CLV and prospected some potential directions for future research. Data of 8099 samples were collected and analyzed through four kinds of machine learning methods: Linear Regression, Support Vector Machine, Random Forest, Neural Network. The correlations between features showed that CLV are generally affected by monthly premium auto, total claim amount, and coverage. Analysis through machine learning models has high precision and Random Forest performs best. CLV prediction and customer segmentation are vital in business field today. Marketers could take advantage of the huge amount of data and machine learning models to portrait customer behaviors. Collecting browsing and purchase histories is also beneficial for providing best offers to individual customers.
客户终身价值(CLV)是一个强大的工具,可以确定客户的价值,并过滤最有可能流失或最有可能进行首次购买的客户,尤其是对电子商务公司而言。本文综述了机器学习模型在CLV分析中的应用,并对未来的研究方向进行了展望。通过线性回归、支持向量机、随机森林、神经网络四种机器学习方法,收集8099个样本的数据并进行分析。特征间的相关性表明,CLV一般受月保费、总理赔金额和保额的影响。通过机器学习模型进行分析精度高,随机森林表现最好。CLV预测和客户细分在当今的商业领域至关重要。营销人员可以利用大量数据和机器学习模型来描绘客户行为。收集浏览和购买历史记录也有利于为个人客户提供最佳优惠。
{"title":"Customer Lifetime Value Analysis Based on Machine Learning","authors":"Xinqian Dai","doi":"10.1145/3546157.3546160","DOIUrl":"https://doi.org/10.1145/3546157.3546160","url":null,"abstract":"Customer lifetime value (CLV) is a powerful tool to determine the value of customers and filter customers most likely to attrite or most likely to make their first purchase, especially for e-commerce companies. This article reviewed machine learning models in analyzing CLV and prospected some potential directions for future research. Data of 8099 samples were collected and analyzed through four kinds of machine learning methods: Linear Regression, Support Vector Machine, Random Forest, Neural Network. The correlations between features showed that CLV are generally affected by monthly premium auto, total claim amount, and coverage. Analysis through machine learning models has high precision and Random Forest performs best. CLV prediction and customer segmentation are vital in business field today. Marketers could take advantage of the huge amount of data and machine learning models to portrait customer behaviors. Collecting browsing and purchase histories is also beneficial for providing best offers to individual customers.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124199332","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Traffic Sign Recognition Based on Deep Learning Technique 基于深度学习技术的交通标志识别
Yihan Lai
Traffic sign recognition plays a significant role in intelligent transportation system. Therefore, in this paper, I propose a traffic sign recognition algorithm based on Convolutional Neural Network (CNN). The dataset collected to train and test in experiments is the “German Traffic Sign Recognition Benchmark” (GTSRB). In addition, the CNN model is evaluated by comparing with a Deep Neural Network (DNN) model based on the accuracy rate and loss rate. Finally, the result shows the proposed CNN model yields high accuracy rate on both training and test images.
交通标志识别在智能交通系统中起着重要的作用。因此,在本文中,我提出了一种基于卷积神经网络(CNN)的交通标志识别算法。收集的数据集用于训练和实验测试是“德国交通标志识别基准”(GTSRB)。此外,通过与深度神经网络(Deep Neural Network, DNN)模型的准确率和损失率对比,对CNN模型进行了评价。最后,实验结果表明,本文提出的CNN模型在训练图像和测试图像上都有较高的准确率。
{"title":"Traffic Sign Recognition Based on Deep Learning Technique","authors":"Yihan Lai","doi":"10.1145/3546157.3546167","DOIUrl":"https://doi.org/10.1145/3546157.3546167","url":null,"abstract":"Traffic sign recognition plays a significant role in intelligent transportation system. Therefore, in this paper, I propose a traffic sign recognition algorithm based on Convolutional Neural Network (CNN). The dataset collected to train and test in experiments is the “German Traffic Sign Recognition Benchmark” (GTSRB). In addition, the CNN model is evaluated by comparing with a Deep Neural Network (DNN) model based on the accuracy rate and loss rate. Finally, the result shows the proposed CNN model yields high accuracy rate on both training and test images.","PeriodicalId":422215,"journal":{"name":"Proceedings of the 6th International Conference on Information System and Data Mining","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126144675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 6th International Conference on Information System and Data Mining
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1