首页 > 最新文献

International journal of artificial intelligence & applications最新文献

英文 中文
Subverting Characters Stereotypes: Exploring the Role of AI in Stereotype Subversion 颠覆人物刻板印象:探索AI在颠覆刻板印象中的作用
Pub Date : 2023-09-28 DOI: 10.5121/ijaia.2023.14502
Xiaohan Feng, Makoto Murakami
The Aim of this paper is to explore different ways of using AI to subvert stereotypes more efficiently and effectively. It will also enumerate the advantages and disadvantages of each approach, helping creators select the most appropriate method for their specific situations. AI opens up new possibilities, enabling anyone to effortlessly generate visually stunning images without the need for artistic skills. However, it also leads to the creation of more stereotypes when using large amounts of data. Consequently, stereotypes are becoming more prevalent and serious than ever before. Our belief is that we can use this situation in reverse, aiming to summarize stereotypes with AI and then subvert them through elemental exchange. In this study, we have attempted to develop a less time-consuming method to challenge character stereotypes while embracing the concept of "exchange." We selected two character archetypes, namely the "tyrant" and the "mad scientist," and summarized their stereotypes by generating AI images or asking ChatGPT questions. Additionally, we conducted a survey of real historical tyrants to gain insights into their behavior and characteristics. This step helped us comprehend the reasons behind stereotyping in artwork depicting tyrants. Based on this understanding, we made choices about which stereotypes to retain. The intention was to empower the audience to better evaluate the identity of the character. Finally, the two remaining character stereotypes were exchanged, and the design was completed. This paper documents the last and most time-consuming method. By examining a large number of sources and examining what stereotypical influences were used, we were able to achieve a greater effect of subverting stereotypes. The other method is much less time-consuming but somewhat more random. Whether one chooses by subjective experience or by the most frequent choices, there is no guarantee of the best outcome. In other words, it is the one that best guarantees that the audience will be able to quickly identify the original character and at the same time move the two characters the furthest away from the original stereotypical image of the original. In conclusion, if the designer has sufficient time, ai portrait + research or chatGPT + research can be chosen. If there is not enough time, the remaining methods can be chosen. The remaining methods take less time and the designer can try them all to get the desired result.
本文的目的是探索使用人工智能更有效地颠覆刻板印象的不同方法。它还将列举每种方法的优点和缺点,帮助创建者根据他们的具体情况选择最合适的方法。人工智能开辟了新的可能性,使任何人都可以毫不费力地生成视觉上令人惊叹的图像,而无需艺术技能。然而,当使用大量数据时,它也会导致创建更多的构造型。因此,陈规定型观念比以往任何时候都更加普遍和严重。我们的信念是,我们可以反过来利用这种情况,旨在用人工智能总结刻板印象,然后通过元素交换颠覆它们。在这项研究中,我们试图开发一种更省时的方法,在接受“交换”概念的同时挑战角色刻板印象。我们选择了两个角色原型,即“暴君”和“疯狂科学家”,并通过生成AI图像或向ChatGPT提问来总结他们的刻板印象。此外,我们对历史上真实的暴君进行了调查,以深入了解他们的行为和特征。这一步帮助我们理解了在描绘暴君的艺术作品中刻板印象背后的原因。基于这种理解,我们选择保留哪些刻板印象。这样做的目的是为了让观众更好地评价角色的身份。最后将剩下的两种人物定型进行交换,完成设计。本文记录了最后一种也是最耗时的方法。通过检查大量的资源和检查使用了哪些刻板印象的影响,我们能够实现颠覆刻板印象的更大效果。另一种方法耗时少得多,但在某种程度上更具随机性。无论一个人是根据主观经验还是根据最频繁的选择来选择,都不能保证最好的结果。换句话说,它是最能保证观众能够快速识别出原始角色,同时使两个角色离原始的刻板印象最远的角色。综上所述,如果设计师有足够的时间,可以选择ai portrait + research或者chatGPT + research。如果时间不够,可以选择其余的方法。剩下的方法花费的时间更少,设计师可以尝试所有方法来获得想要的结果。
{"title":"Subverting Characters Stereotypes: Exploring the Role of AI in Stereotype Subversion","authors":"Xiaohan Feng, Makoto Murakami","doi":"10.5121/ijaia.2023.14502","DOIUrl":"https://doi.org/10.5121/ijaia.2023.14502","url":null,"abstract":"The Aim of this paper is to explore different ways of using AI to subvert stereotypes more efficiently and effectively. It will also enumerate the advantages and disadvantages of each approach, helping creators select the most appropriate method for their specific situations. AI opens up new possibilities, enabling anyone to effortlessly generate visually stunning images without the need for artistic skills. However, it also leads to the creation of more stereotypes when using large amounts of data. Consequently, stereotypes are becoming more prevalent and serious than ever before. Our belief is that we can use this situation in reverse, aiming to summarize stereotypes with AI and then subvert them through elemental exchange. In this study, we have attempted to develop a less time-consuming method to challenge character stereotypes while embracing the concept of \"exchange.\" We selected two character archetypes, namely the \"tyrant\" and the \"mad scientist,\" and summarized their stereotypes by generating AI images or asking ChatGPT questions. Additionally, we conducted a survey of real historical tyrants to gain insights into their behavior and characteristics. This step helped us comprehend the reasons behind stereotyping in artwork depicting tyrants. Based on this understanding, we made choices about which stereotypes to retain. The intention was to empower the audience to better evaluate the identity of the character. Finally, the two remaining character stereotypes were exchanged, and the design was completed. This paper documents the last and most time-consuming method. By examining a large number of sources and examining what stereotypical influences were used, we were able to achieve a greater effect of subverting stereotypes. The other method is much less time-consuming but somewhat more random. Whether one chooses by subjective experience or by the most frequent choices, there is no guarantee of the best outcome. In other words, it is the one that best guarantees that the audience will be able to quickly identify the original character and at the same time move the two characters the furthest away from the original stereotypical image of the original. In conclusion, if the designer has sufficient time, ai portrait + research or chatGPT + research can be chosen. If there is not enough time, the remaining methods can be chosen. The remaining methods take less time and the designer can try them all to get the desired result.","PeriodicalId":93188,"journal":{"name":"International journal of artificial intelligence & applications","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135470169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance Evaluation of Block-Sized Algorithms for Majority Vote in Facial Recognition 人脸识别中多数投票的块大小算法性能评价
Pub Date : 2023-09-28 DOI: 10.5121/ijaia.2023.14501
Andrea Ruiz-Hernandez, Jennifer Lee, Nawal Rehman, Jayanthi Raghavan, Majid Ahmadi
Facial recognition (FR) is a pattern recognition problem, in which images can be considered as a matrix of pixels.There are manychallenges that affect the performance of face recognitionincluding illumination variation, occlusion, and blurring. In this paper,a few preprocessing techniques are suggested to handle the illumination variationsproblem. Also, other phases of face recognition problems like feature extraction and classification are discussed. Preprocessing techniques like Histogram Equalization (HE), Gamma Intensity Correction (GIC), and Regional Histogram Equalization (RHE) are tested inthe AT&T database. For feature extraction, methods such as Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Independent Component Analysis (ICA), and Local Binary Pattern (LBP) are applied. Support Vector Machine (SVM) is used as the classifier. Both holistic and block-based methods are tested using the AT&T database. For twelve different combinations of preprocessing, feature extraction, and classification methods, experiments involving various block sizes are conducted to assess the computation performance and recognition accuracy for the AT&T dataset.Using the block-based method, 100% accuracy is achieved with the combination of GIC preprocessing, LDA feature extraction,and SVM classification using 2x2 block-sizingwhile the holistic method yields the maximum accuracy of 93.5%. The block-sized algorithm performs better than the holistic approach under poor lighting conditions.SVM Radial Basis Function performs extremely well on theAT&Tdataset for both holistic and block-based approaches.
人脸识别(FR)是一个模式识别问题,其中图像可以被认为是一个像素矩阵。影响人脸识别性能的挑战有很多,包括光照变化、遮挡和模糊。本文提出了几种处理光照变化问题的预处理技术。此外,还讨论了人脸识别的其他阶段问题,如特征提取和分类。预处理技术,如直方图均衡化(HE),伽马强度校正(GIC)和区域直方图均衡化(RHE)在AT&T数据库中进行了测试。在特征提取方面,采用了主成分分析(PCA)、线性判别分析(LDA)、独立成分分析(ICA)和局部二值模式(LBP)等方法。使用支持向量机(SVM)作为分类器。使用AT&T数据库对整体方法和基于块的方法进行了测试。针对预处理、特征提取和分类方法的12种不同组合,进行了涉及不同块大小的实验,以评估AT&T数据集的计算性能和识别精度。采用基于块的方法,采用GIC预处理、LDA特征提取和2x2块大小的SVM分类相结合的方法,准确率达到100%,而整体方法的准确率最高为93.5%。在较差的光照条件下,块大小的算法比整体方法性能更好。支持向量机径向基函数在整体和基于块的方法上都表现得非常好。
{"title":"Performance Evaluation of Block-Sized Algorithms for Majority Vote in Facial Recognition","authors":"Andrea Ruiz-Hernandez, Jennifer Lee, Nawal Rehman, Jayanthi Raghavan, Majid Ahmadi","doi":"10.5121/ijaia.2023.14501","DOIUrl":"https://doi.org/10.5121/ijaia.2023.14501","url":null,"abstract":"Facial recognition (FR) is a pattern recognition problem, in which images can be considered as a matrix of pixels.There are manychallenges that affect the performance of face recognitionincluding illumination variation, occlusion, and blurring. In this paper,a few preprocessing techniques are suggested to handle the illumination variationsproblem. Also, other phases of face recognition problems like feature extraction and classification are discussed. Preprocessing techniques like Histogram Equalization (HE), Gamma Intensity Correction (GIC), and Regional Histogram Equalization (RHE) are tested inthe AT&T database. For feature extraction, methods such as Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA), Independent Component Analysis (ICA), and Local Binary Pattern (LBP) are applied. Support Vector Machine (SVM) is used as the classifier. Both holistic and block-based methods are tested using the AT&T database. For twelve different combinations of preprocessing, feature extraction, and classification methods, experiments involving various block sizes are conducted to assess the computation performance and recognition accuracy for the AT&T dataset.Using the block-based method, 100% accuracy is achieved with the combination of GIC preprocessing, LDA feature extraction,and SVM classification using 2x2 block-sizingwhile the holistic method yields the maximum accuracy of 93.5%. The block-sized algorithm performs better than the holistic approach under poor lighting conditions.SVM Radial Basis Function performs extremely well on theAT&Tdataset for both holistic and block-based approaches.","PeriodicalId":93188,"journal":{"name":"International journal of artificial intelligence & applications","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135470325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Characteristics of Networks Generated by Kernel Growing Neural Gas 核生长神经气体生成网络的特性
Pub Date : 2023-09-28 DOI: 10.5121/ijaia.2023.14503
Kazuhisa Fujita
This research aims to develop kernel GNG, a kernelized version of the growing neural gas (GNG) algorithm, and to investigate the features of the networks generated by the kernel GNG. The GNG is an unsupervised artificial neural network that can transform a dataset into an undirected graph, thereby extracting the features of the dataset as a graph. The GNG is widely used in vector quantization, clustering, and 3D graphics. Kernel methods are often used to map a dataset to feature space, with support vector machines being the most prominent application. This paper introduces the kernel GNG approach and explores the characteristics of the networks generated by kernel GNG. Five kernels, including Gaussian, Laplacian, Cauchy, inverse multiquadric, and log kernels, are used in this study. The results of this study show that the average degree and the average clustering coefficient decrease as the kernel parameter increases for Gaussian, Laplacian, Cauchy, and IMQ kernels. If we avoid more edges and a higher clustering coefficient (or more triangles), the kernel GNG with a larger value of the parameter will be more appropriate.
本研究旨在开发生长神经气体(GNG)算法的kernel版本GNG,并研究kernel GNG生成的网络的特征。GNG是一种无监督人工神经网络,它可以将数据集转换为无向图,从而将数据集的特征提取为图。GNG被广泛应用于矢量量化、聚类和三维图形。核方法通常用于将数据集映射到特征空间,其中支持向量机是最突出的应用。本文介绍了核GNG方法,探讨了核GNG生成的网络的特点。本文采用了高斯核、拉普拉斯核、柯西核、逆二次核和对数核等五种核函数。研究结果表明,高斯核、拉普拉斯核、柯西核和IMQ核的平均度和平均聚类系数随着核参数的增大而减小。如果我们避免更多的边和更高的聚类系数(或更多的三角形),则参数值较大的内核GNG将更合适。
{"title":"Characteristics of Networks Generated by Kernel Growing Neural Gas","authors":"Kazuhisa Fujita","doi":"10.5121/ijaia.2023.14503","DOIUrl":"https://doi.org/10.5121/ijaia.2023.14503","url":null,"abstract":"This research aims to develop kernel GNG, a kernelized version of the growing neural gas (GNG) algorithm, and to investigate the features of the networks generated by the kernel GNG. The GNG is an unsupervised artificial neural network that can transform a dataset into an undirected graph, thereby extracting the features of the dataset as a graph. The GNG is widely used in vector quantization, clustering, and 3D graphics. Kernel methods are often used to map a dataset to feature space, with support vector machines being the most prominent application. This paper introduces the kernel GNG approach and explores the characteristics of the networks generated by kernel GNG. Five kernels, including Gaussian, Laplacian, Cauchy, inverse multiquadric, and log kernels, are used in this study. The results of this study show that the average degree and the average clustering coefficient decrease as the kernel parameter increases for Gaussian, Laplacian, Cauchy, and IMQ kernels. If we avoid more edges and a higher clustering coefficient (or more triangles), the kernel GNG with a larger value of the parameter will be more appropriate.","PeriodicalId":93188,"journal":{"name":"International journal of artificial intelligence & applications","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135470159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sentiment Analysis in Indian Elections: Unraveling Public Perception of the Karnataka Elections With Transformers 印度选举中的情绪分析:用变形金刚揭示卡纳塔克邦选举的公众认知
Pub Date : 2023-09-28 DOI: 10.5121/ijaia.2023.14504
Pranav Gunhal
This study explores the utility of sentiment classification in political decision-making through an analysis of Twitter sentiment surrounding the 2023 Karnataka elections. Utilizing transformer-based models for sentiment analysis in Indic languages, the research employs innovative data collection methodologies, including novel data augmentation techniques. The primary focus is on sentiment classification, discerning positive, negative, and neutral posts, particularly regarding the defeat of the Bharatiya JanataParty (BJP) or the victory of the Indian National Congress (INC). Leveraging high-performing transformer architectures like IndicBERT, coupled with precise hyper parameter tuning, the AI models used in this study exhibit exceptional predictive accuracy, notably predicting the INC's electoral success. These findings underscore the potential of state-of-the-art transformer-based models in capturing and understanding sentiment dynamics within Indian politics. Implications are far-reaching, providing invaluable insights for political stakeholders preparing for the 2024 Lok Sabha elections. This study stands as a testament to the potential of sentiment analysis as a pivotal tool in political decision-making, specifically in non-Western nations.
本研究通过分析围绕2023年卡纳塔克邦选举的Twitter情绪,探讨了情绪分类在政治决策中的效用。该研究利用基于转换器的模型进行印度语情感分析,采用创新的数据收集方法,包括新的数据增强技术。主要关注的是情绪分类,辨别积极、消极和中立的帖子,特别是关于印度人民党(BJP)的失败或印度国民大会党(INC)的胜利。利用IndicBERT等高性能变压器架构,加上精确的超参数调整,本研究中使用的人工智能模型表现出卓越的预测准确性,特别是预测INC的选举成功。这些发现强调了最先进的基于变压器的模型在捕捉和理解印度政治中的情绪动态方面的潜力。影响深远,为准备2024年人民院选举的政治利益攸关方提供了宝贵的见解。这项研究证明了情绪分析作为政治决策的关键工具的潜力,特别是在非西方国家。
{"title":"Sentiment Analysis in Indian Elections: Unraveling Public Perception of the Karnataka Elections With Transformers","authors":"Pranav Gunhal","doi":"10.5121/ijaia.2023.14504","DOIUrl":"https://doi.org/10.5121/ijaia.2023.14504","url":null,"abstract":"This study explores the utility of sentiment classification in political decision-making through an analysis of Twitter sentiment surrounding the 2023 Karnataka elections. Utilizing transformer-based models for sentiment analysis in Indic languages, the research employs innovative data collection methodologies, including novel data augmentation techniques. The primary focus is on sentiment classification, discerning positive, negative, and neutral posts, particularly regarding the defeat of the Bharatiya JanataParty (BJP) or the victory of the Indian National Congress (INC). Leveraging high-performing transformer architectures like IndicBERT, coupled with precise hyper parameter tuning, the AI models used in this study exhibit exceptional predictive accuracy, notably predicting the INC's electoral success. These findings underscore the potential of state-of-the-art transformer-based models in capturing and understanding sentiment dynamics within Indian politics. Implications are far-reaching, providing invaluable insights for political stakeholders preparing for the 2024 Lok Sabha elections. This study stands as a testament to the potential of sentiment analysis as a pivotal tool in political decision-making, specifically in non-Western nations.","PeriodicalId":93188,"journal":{"name":"International journal of artificial intelligence & applications","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135470326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying Text Classification Failures in Multilingual AI-Generated Content 识别多语言人工智能生成内容中的文本分类失败
Pub Date : 2023-09-28 DOI: 10.5121/ijaia.2023.14505
Raghav Subramaniam
With the rising popularity of generative AI tools, the nature of apparent classification failures by AI content detection softwares, especially between different languages, must be further observed. This paper aims to do this through testing OpenAI’s “AI Text Classifier” on a set of human and AI-generated texts inEnglish, German, Arabic, Hindi, Chinese, and Swahili. Given the unreliability of existing tools for detection of AIgenerated text, it is notable that specific types of classification failures often persist in slightly different ways when various languages are observed: misclassification of human-written content as “AI-generated” and vice versa may occur more frequently in specific language content than others. Our findings indicate that false negative labelings are more likely to occur in English, whereas false positives are more likely to occur in Hindi and Arabic. There was an observed tendency for other languages to not be confidently labeled at all.
随着生成式人工智能工具的日益普及,必须进一步观察人工智能内容检测软件明显分类失败的性质,特别是不同语言之间的分类失败。本文旨在通过测试OpenAI的“人工智能文本分类器”来实现这一目标,该分类器使用英语、德语、阿拉伯语、印地语、中文和斯瓦希里语对一组人类和人工智能生成的文本进行测试。鉴于现有检测人工生成文本的工具的不可靠性,值得注意的是,当观察到不同的语言时,特定类型的分类失败通常以略有不同的方式持续存在:在特定语言内容中,将人类编写的内容错误分类为“人工生成的”,反之亦然,可能比其他语言内容更频繁地发生。我们的研究结果表明,假阴性标签更有可能发生在英语中,而假阳性标签更有可能发生在印地语和阿拉伯语中。我们观察到的一种趋势是,其他语言根本没有被自信地贴上标签。
{"title":"Identifying Text Classification Failures in Multilingual AI-Generated Content","authors":"Raghav Subramaniam","doi":"10.5121/ijaia.2023.14505","DOIUrl":"https://doi.org/10.5121/ijaia.2023.14505","url":null,"abstract":"With the rising popularity of generative AI tools, the nature of apparent classification failures by AI content detection softwares, especially between different languages, must be further observed. This paper aims to do this through testing OpenAI’s “AI Text Classifier” on a set of human and AI-generated texts inEnglish, German, Arabic, Hindi, Chinese, and Swahili. Given the unreliability of existing tools for detection of AIgenerated text, it is notable that specific types of classification failures often persist in slightly different ways when various languages are observed: misclassification of human-written content as “AI-generated” and vice versa may occur more frequently in specific language content than others. Our findings indicate that false negative labelings are more likely to occur in English, whereas false positives are more likely to occur in Hindi and Arabic. There was an observed tendency for other languages to not be confidently labeled at all.","PeriodicalId":93188,"journal":{"name":"International journal of artificial intelligence & applications","volume":"19 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135470168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PROCEDURAL GENERATION IN 2D METROIDVANIA GAME WITH ANSWER SET PROGRAMMING AND PERLIN NOISE 基于答案集编程和柏林噪声的二维银河恶魔城游戏程序生成
Pub Date : 2023-05-28 DOI: 10.5121/ijaia.2023.14302
John Xu, John Morris
Designing metroidvania games often poses a unique challenge to video game developers, namely the difficulty of consistently preventing soft-locking, which hinders or blocks the player’s ability to traverse through levels effectively [1]. As a result, many turn to hand-making all levels to ensure the level’s traversability, but in the process often forsaking the ability to rely on procedural generation to lessen the time and burden on human game developers [2]. On the other hand, when developers rely on popular ways of procedural generation such as using perlin noise, they find themselves unable to control those procedural algorithms to guarantee certain characteristics of the outputs such as traversability [3]. Our paper aims to present a procedural solution that can also effectively guarantee the traversability of the generated level. Our method uses Answer Set Programming (ASP) to verify generation based on restrictions we place, guaranteeing the outcome to be what we want [4]. The generation of a level is divided into rooms, which are first mapped out in a graph to ensure traversability from a starting room to an ending boss area. The rooms’ geometry is then generated accordingly to create the full level. Using perlin noise, we were also able to create a demonstration of how traversability works in another form of procedural generation, and compare it with our methodology to identify strengths and weaknesses. To demonstrate our method, we applied our solution as well as the perlin noise algorithm to a 2D metroidvania game made in the Unity game engine and conducted quantitative tests on the ASP method to assess how well our method works as a level generator [5].
设计《银河恶魔城》通常会给电子游戏开发者带来一个独特的挑战,即持续防止软锁定的难度,这会阻碍或阻碍玩家有效穿越关卡的能力[1]。因此,许多人转而手工制作所有关卡,以确保关卡的可穿越性,但在这个过程中,他们往往放弃了依赖程序生成的能力,以减少人类游戏开发者的时间和负担[2]。另一方面,当开发人员依赖流行的程序生成方法(如使用柏林噪声)时,他们发现自己无法控制这些程序算法来保证输出的某些特征,如可遍历性[3]。我们的论文旨在提出一个程序解决方案,也可以有效地保证生成的关卡的可遍历性。我们的方法使用答案集编程(ASP)根据我们设置的限制来验证生成,保证结果是我们想要的[4]。关卡的生成被划分为多个房间,这些房间首先被绘制成图表,以确保从起始房间到结束boss区域的可穿越性。然后,房间的几何形状相应地生成,以创建完整的关卡。使用柏林噪声,我们还能够创建可遍历性如何在另一种形式的程序生成中工作的演示,并将其与我们的方法进行比较,以确定优势和劣势。为了演示我们的方法,我们将我们的解决方案和柏林噪声算法应用于一款使用Unity游戏引擎制作的2D《恶魔城》游戏,并对ASP方法进行了定量测试,以评估我们的方法作为关卡生成器的效果[5]。
{"title":"PROCEDURAL GENERATION IN 2D METROIDVANIA GAME WITH ANSWER SET PROGRAMMING AND PERLIN NOISE","authors":"John Xu, John Morris","doi":"10.5121/ijaia.2023.14302","DOIUrl":"https://doi.org/10.5121/ijaia.2023.14302","url":null,"abstract":"Designing metroidvania games often poses a unique challenge to video game developers, namely the difficulty of consistently preventing soft-locking, which hinders or blocks the player’s ability to traverse through levels effectively [1]. As a result, many turn to hand-making all levels to ensure the level’s traversability, but in the process often forsaking the ability to rely on procedural generation to lessen the time and burden on human game developers [2]. On the other hand, when developers rely on popular ways of procedural generation such as using perlin noise, they find themselves unable to control those procedural algorithms to guarantee certain characteristics of the outputs such as traversability [3]. Our paper aims to present a procedural solution that can also effectively guarantee the traversability of the generated level. Our method uses Answer Set Programming (ASP) to verify generation based on restrictions we place, guaranteeing the outcome to be what we want [4]. The generation of a level is divided into rooms, which are first mapped out in a graph to ensure traversability from a starting room to an ending boss area. The rooms’ geometry is then generated accordingly to create the full level. Using perlin noise, we were also able to create a demonstration of how traversability works in another form of procedural generation, and compare it with our methodology to identify strengths and weaknesses. To demonstrate our method, we applied our solution as well as the perlin noise algorithm to a 2D metroidvania game made in the Unity game engine and conducted quantitative tests on the ASP method to assess how well our method works as a level generator [5].","PeriodicalId":93188,"journal":{"name":"International journal of artificial intelligence & applications","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135894857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comparison of Document Similarity Algorithms 文档相似度算法的比较
Pub Date : 2023-03-30 DOI: 10.5121/ijaia.2023.14204
Nicholas Gahman, Vinayak Elangovan
Document similarity is an important part of Natural Language Processing and is most commonly used forplagiarism-detection and text summarization. Thus, finding the overall most effective document similarity algorithm could have a major positive impact on the field of Natural Language Processing. This report setsout to examine the numerous document similarity algorithms, and determine which ones are the mostuseful. It addresses the most effective document similarity algorithm by categorizing them into 3 types ofdocument similarity algorithms: statistical algorithms, neural networks, and corpus/knowledge-basedalgorithms. The most effective algorithms in each category are also compared in our work using a series of benchmark datasets and evaluations that test every possible area that each algorithm could be used in.
文档相似度是自然语言处理的重要组成部分,最常用于剽窃检测和文本摘要。因此,找到总体上最有效的文档相似度算法可能对自然语言处理领域产生重大的积极影响。本报告旨在检查众多文档相似度算法,并确定哪些是最有用的。它解决了最有效的文档相似度算法,将它们分为三种类型的文档相似度算法:统计算法、神经网络和基于语料库/知识的算法。在我们的工作中,还使用一系列基准数据集和评估来比较每个类别中最有效的算法,这些基准数据集和评估测试了每个算法可以使用的每个可能领域。
{"title":"A Comparison of Document Similarity Algorithms","authors":"Nicholas Gahman, Vinayak Elangovan","doi":"10.5121/ijaia.2023.14204","DOIUrl":"https://doi.org/10.5121/ijaia.2023.14204","url":null,"abstract":"Document similarity is an important part of Natural Language Processing and is most commonly used forplagiarism-detection and text summarization. Thus, finding the overall most effective document similarity algorithm could have a major positive impact on the field of Natural Language Processing. This report setsout to examine the numerous document similarity algorithms, and determine which ones are the mostuseful. It addresses the most effective document similarity algorithm by categorizing them into 3 types ofdocument similarity algorithms: statistical algorithms, neural networks, and corpus/knowledge-basedalgorithms. The most effective algorithms in each category are also compared in our work using a series of benchmark datasets and evaluations that test every possible area that each algorithm could be used in.","PeriodicalId":93188,"journal":{"name":"International journal of artificial intelligence & applications","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135374875","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spot-the-Camel: Computer Vision for Safer Roads 发现骆驼:计算机视觉安全道路
Pub Date : 2023-03-30 DOI: 10.5121/ijaia.2023.14201
Khalid AlNujaidi, Ghadah AlHabib, Abdulaziz AlOdhieb
As the population grows and more land is being used for urbanization, ecosystems are disrupted by our roads and cars. This expansion of infrastructure cuts through wildlife territories, leading to many instances of Wildlife-Vehicle Collision (WVC). These instances of WVC are a global issue that is having a global socio-economic impact, resulting in billions of dollars in property damage and, at times, fatalitiesfor vehicle occupants. In Saudi Arabia, this issue is similar, with instances of Camel-Vehicle Collision (CVC) being particularly deadly due to the large size of camels, which results in a 25% fatality rate [1].The focus of this work is to test different object detection models on the task of detecting camels on theroad. The Deep Learning (DL) object detection models used in the experiments are: CenterNet, Efficient Det, Faster R-CNN, SSD, and YOLOv8. Results of the experiments show that YOLOv8 performed the best in terms of accuracy and was the most efficient in training. In the future, the plan is to expand on this work by developing a system to make countryside roads safer.
随着人口的增长和越来越多的土地被用于城市化,我们的道路和汽车破坏了生态系统。这种基础设施的扩张穿过了野生动物的领地,导致了许多野生动物与车辆碰撞(WVC)的事件。这些WVC事件是一个全球性问题,对全球社会经济产生影响,造成数十亿美元的财产损失,有时还造成车辆乘员死亡。在沙特阿拉伯,这一问题也类似,由于骆驼体型庞大,骆驼与车辆碰撞(CVC)的情况尤为严重,死亡率高达25%[1]。本工作的重点是测试不同的目标检测模型对道路上骆驼的检测任务。实验中使用的深度学习(DL)对象检测模型有:CenterNet、Efficient Det、Faster R-CNN、SSD和YOLOv8。实验结果表明,YOLOv8在准确率方面表现最好,在训练中效率最高。未来,该计划将通过开发一个系统来扩大这项工作,使农村道路更安全。
{"title":"Spot-the-Camel: Computer Vision for Safer Roads","authors":"Khalid AlNujaidi, Ghadah AlHabib, Abdulaziz AlOdhieb","doi":"10.5121/ijaia.2023.14201","DOIUrl":"https://doi.org/10.5121/ijaia.2023.14201","url":null,"abstract":"As the population grows and more land is being used for urbanization, ecosystems are disrupted by our roads and cars. This expansion of infrastructure cuts through wildlife territories, leading to many instances of Wildlife-Vehicle Collision (WVC). These instances of WVC are a global issue that is having a global socio-economic impact, resulting in billions of dollars in property damage and, at times, fatalitiesfor vehicle occupants. In Saudi Arabia, this issue is similar, with instances of Camel-Vehicle Collision (CVC) being particularly deadly due to the large size of camels, which results in a 25% fatality rate [1].The focus of this work is to test different object detection models on the task of detecting camels on theroad. The Deep Learning (DL) object detection models used in the experiments are: CenterNet, Efficient Det, Faster R-CNN, SSD, and YOLOv8. Results of the experiments show that YOLOv8 performed the best in terms of accuracy and was the most efficient in training. In the future, the plan is to expand on this work by developing a system to make countryside roads safer.","PeriodicalId":93188,"journal":{"name":"International journal of artificial intelligence & applications","volume":"148 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135374876","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stock Broad-Index Trend Patterns Learning via Domain Knowledge Informed Generative Network 基于领域知识生成网络的股票指数趋势模式学习
Pub Date : 2023-03-30 DOI: 10.5121/ijaia.2023.14202
Jingyi Gu, Fadi P. Deek, Guiling Wang
Predicting the Stock movement attracts much attention from both industry and academia. Despite such significant efforts, the results remain unsatisfactory due to the inherently complicated nature of the stock market driven by factors including supply and demand, the state of the economy, the political climate, and even irrational human behavior. Recently, Generative Adversarial Networks (GAN) have been extended for time series data; however, robust methods are primarily for synthetic series generation, which fall short for appropriate stock prediction. This is because existing GANs for stock applications suffer from mode collapse and only consider one-step prediction, thus underutilizing the potential of GAN. Furthermore, merging news and market volatility are neglected in current GANs. To address these issues, we exploit expert domain knowledge in finance and, for the first time, attempt to formulate stock movement prediction into a Wasserstein GAN framework for multi-step prediction. We propose Index GAN, which includes deliberate designs for the inherent characteristics of the stock market, leverages news context learning to thoroughly investigate textual information and develop an attentive seq2seq learning network that captures the temporal dependency among stock prices, news, and market sentiment. We also utilize the critic to approximate the Wasserstein distance between actual and predicted sequences and develop a rolling strategy for deployment that mitigates noise from the financial market. Extensive experiments are conducted on real-world broad-based indices, demonstrating the superior performance of our architecture over other state-of-the-art baselines, also validating all its contributing components.
预测股市走势受到业界和学术界的广泛关注。尽管做出了如此巨大的努力,但由于股票市场的内在复杂性,包括供求关系、经济状况、政治气候,甚至非理性的人类行为,结果仍然令人不满意。最近,生成对抗网络(GAN)在时间序列数据上得到了扩展;然而,鲁棒方法主要用于合成序列的生成,无法进行适当的库存预测。这是因为现有用于库存应用的GAN存在模式崩溃并且只考虑一步预测,因此未充分利用GAN的潜力。此外,目前的gan忽略了合并新闻和市场波动。为了解决这些问题,我们利用金融领域的专家知识,并首次尝试将股票运动预测制定为多步骤预测的Wasserstein GAN框架。我们提出指数GAN,其中包括对股票市场固有特征的刻意设计,利用新闻上下文学习来彻底调查文本信息,并开发一个细心的seq2seq学习网络,以捕获股票价格,新闻和市场情绪之间的时间依赖性。我们还利用批评来近似实际序列和预测序列之间的Wasserstein距离,并开发了一种滚动部署策略,以减轻来自金融市场的噪音。在现实世界的基础指数上进行了广泛的实验,证明了我们的架构优于其他最先进的基线,也验证了它的所有贡献组件。
{"title":"Stock Broad-Index Trend Patterns Learning via Domain Knowledge Informed Generative Network","authors":"Jingyi Gu, Fadi P. Deek, Guiling Wang","doi":"10.5121/ijaia.2023.14202","DOIUrl":"https://doi.org/10.5121/ijaia.2023.14202","url":null,"abstract":"Predicting the Stock movement attracts much attention from both industry and academia. Despite such significant efforts, the results remain unsatisfactory due to the inherently complicated nature of the stock market driven by factors including supply and demand, the state of the economy, the political climate, and even irrational human behavior. Recently, Generative Adversarial Networks (GAN) have been extended for time series data; however, robust methods are primarily for synthetic series generation, which fall short for appropriate stock prediction. This is because existing GANs for stock applications suffer from mode collapse and only consider one-step prediction, thus underutilizing the potential of GAN. Furthermore, merging news and market volatility are neglected in current GANs. To address these issues, we exploit expert domain knowledge in finance and, for the first time, attempt to formulate stock movement prediction into a Wasserstein GAN framework for multi-step prediction. We propose Index GAN, which includes deliberate designs for the inherent characteristics of the stock market, leverages news context learning to thoroughly investigate textual information and develop an attentive seq2seq learning network that captures the temporal dependency among stock prices, news, and market sentiment. We also utilize the critic to approximate the Wasserstein distance between actual and predicted sequences and develop a rolling strategy for deployment that mitigates noise from the financial market. Extensive experiments are conducted on real-world broad-based indices, demonstrating the superior performance of our architecture over other state-of-the-art baselines, also validating all its contributing components.","PeriodicalId":93188,"journal":{"name":"International journal of artificial intelligence & applications","volume":"170 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136002534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep-Learning-based Human Intention Prediction with Data Augmentation 基于深度学习的数据增强人类意向预测
Pub Date : 2022-01-31 DOI: 10.5121/ijaia.2022.13101
Shengchao Li, Lin Zhang, Xiumin Diao
Data augmentation has been broadly applied in training deep-learning models to increase the diversity of data. This study ingestigates the effectiveness of different data augmentation methods for deep-learningbased human intention prediction when only limited training data is available. A human participant pitches a ball to nine potential targets in our experiment. We expect to predict which target the participant pitches the ball to. Firstly, the effectiveness of 10 data augmentation groups is evaluated on a single-participant data set using RGB images. Secondly, the best data augmentation method (i.e., random cropping) on the single-participant data set is further evaluated on a multi-participant data set to assess its generalization ability. Finally, the effectiveness of random cropping on fusion data of RGB images and optical flow is evaluated on both single- and multi-participant data sets. Experiment results show that: 1) Data augmentation methods that crop or deform images can improve the prediction performance; 2) Random cropping can be generalized to the multi-participant data set (prediction accuracy is improved from 50% to 57.4%); and 3) Random cropping with fusion data of RGB images and optical flow can further improve the prediction accuracy from 57.4% to 63.9% on the multi-participant data set.
数据增强已广泛应用于深度学习模型的训练,以增加数据的多样性。本文研究了在训练数据有限的情况下,不同的数据增强方法对基于深度学习的人类意图预测的有效性。在我们的实验中,一个人类参与者向九个潜在目标投一个球。我们期望预测参与者将球投向哪个目标。首先,在使用RGB图像的单参与者数据集上评估了10个数据增强组的有效性。其次,在多参与者数据集上进一步评估单参与者数据集上的最佳数据增强方法(即随机裁剪),以评估其泛化能力。最后,在单参与者和多参与者数据集上评估了随机裁剪对RGB图像和光流融合数据的有效性。实验结果表明:1)对图像进行裁剪或变形的数据增强方法可以提高预测性能;2)随机裁剪可以推广到多参与者数据集(预测精度从50%提高到57.4%);3) RGB图像与光流融合数据的随机裁剪可以进一步将多参与者数据集的预测精度从57.4%提高到63.9%。
{"title":"Deep-Learning-based Human Intention Prediction with Data Augmentation","authors":"Shengchao Li, Lin Zhang, Xiumin Diao","doi":"10.5121/ijaia.2022.13101","DOIUrl":"https://doi.org/10.5121/ijaia.2022.13101","url":null,"abstract":"Data augmentation has been broadly applied in training deep-learning models to increase the diversity of data. This study ingestigates the effectiveness of different data augmentation methods for deep-learningbased human intention prediction when only limited training data is available. A human participant pitches a ball to nine potential targets in our experiment. We expect to predict which target the participant pitches the ball to. Firstly, the effectiveness of 10 data augmentation groups is evaluated on a single-participant data set using RGB images. Secondly, the best data augmentation method (i.e., random cropping) on the single-participant data set is further evaluated on a multi-participant data set to assess its generalization ability. Finally, the effectiveness of random cropping on fusion data of RGB images and optical flow is evaluated on both single- and multi-participant data sets. Experiment results show that: 1) Data augmentation methods that crop or deform images can improve the prediction performance; 2) Random cropping can be generalized to the multi-participant data set (prediction accuracy is improved from 50% to 57.4%); and 3) Random cropping with fusion data of RGB images and optical flow can further improve the prediction accuracy from 57.4% to 63.9% on the multi-participant data set.","PeriodicalId":93188,"journal":{"name":"International journal of artificial intelligence & applications","volume":" ","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48568232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International journal of artificial intelligence & applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1