首页 > 最新文献

Information (Switzerland)最新文献

英文 中文
Thematic Analysis of Big Data in Financial Institutions Using NLP Techniques with a Cloud Computing Perspective: A Systematic Literature Review 基于云计算视角的NLP技术金融机构大数据专题分析:系统文献综述
Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-20 DOI: 10.3390/info14100577
Ratnesh Kumar Sharma, Gnana Bharathy, Faezeh Karimi, Anil V. Mishra, Mukesh Prasad
This literature review explores the existing work and practices in applying thematic analysis natural language processing techniques to financial data in cloud environments. This work aims to improve two of the five Vs of the big data system. We used the PRISMA approach (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) for the review. We analyzed the research papers published over the last 10 years about the topic in question using a keyword-based search and bibliometric analysis. The systematic literature review was conducted in multiple phases, and filters were applied to exclude papers based on the title and abstract initially, then based on the methodology/conclusion, and, finally, after reading the full text. The remaining papers were then considered and are discussed here. We found that automated data discovery methods can be augmented by applying an NLP-based thematic analysis on the financial data in cloud environments. This can help identify the correct classification/categorization and measure data quality for a sentiment analysis.
本文献综述探讨了将主题分析自然语言处理技术应用于云环境中的金融数据的现有工作和实践。这项工作旨在改善大数据系统的五个v中的两个。我们使用PRISMA方法(系统评价和荟萃分析的首选报告项目)进行评价。我们使用基于关键字的搜索和文献计量分析分析了过去10年发表的关于该主题的研究论文。系统文献综述分多个阶段进行,首先根据标题和摘要进行筛选,然后根据方法/结论进行筛选,最后在阅读全文后进行筛选。然后审议了其余的文件,并在此讨论。我们发现,通过对云环境中的金融数据应用基于nlp的主题分析,可以增强自动化数据发现方法。这可以帮助识别正确的分类/分类,并衡量情感分析的数据质量。
{"title":"Thematic Analysis of Big Data in Financial Institutions Using NLP Techniques with a Cloud Computing Perspective: A Systematic Literature Review","authors":"Ratnesh Kumar Sharma, Gnana Bharathy, Faezeh Karimi, Anil V. Mishra, Mukesh Prasad","doi":"10.3390/info14100577","DOIUrl":"https://doi.org/10.3390/info14100577","url":null,"abstract":"This literature review explores the existing work and practices in applying thematic analysis natural language processing techniques to financial data in cloud environments. This work aims to improve two of the five Vs of the big data system. We used the PRISMA approach (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) for the review. We analyzed the research papers published over the last 10 years about the topic in question using a keyword-based search and bibliometric analysis. The systematic literature review was conducted in multiple phases, and filters were applied to exclude papers based on the title and abstract initially, then based on the methodology/conclusion, and, finally, after reading the full text. The remaining papers were then considered and are discussed here. We found that automated data discovery methods can be augmented by applying an NLP-based thematic analysis on the financial data in cloud environments. This can help identify the correct classification/categorization and measure data quality for a sentiment analysis.","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135617113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prototype Selection for Multilabel Instance-Based Learning 多标签基于实例学习的原型选择
Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-19 DOI: 10.3390/info14100572
Panagiotis Filippakis, Stefanos Ougiaroglou, Georgios Evangelidis
Reducing the size of the training set, which involves replacing it with a condensed set, is a widely adopted practice to enhance the efficiency of instance-based classifiers while trying to maintain high classification accuracy. This objective can be achieved through the use of data reduction techniques, also known as prototype selection or generation algorithms. Although there are numerous algorithms available in the literature that effectively address single-label classification problems, most of them are not applicable to multilabel data, where an instance can belong to multiple classes. Well-known transformation methods cannot be combined with a data reduction technique due to different reasons. The Condensed Nearest Neighbor rule is a popular parameter-free single-label prototype selection algorithm. The IB2 algorithm is the one-pass variation of the Condensed Nearest Neighbor rule. This paper proposes variations of these algorithms for multilabel data. Through an experimental study conducted on nine distinct datasets as well as statistical tests, we demonstrate that the eight proposed approaches (four for each algorithm) offer significant reduction rates without compromising the classification accuracy.
为了提高基于实例的分类器的效率,同时保持较高的分类精度,一种被广泛采用的做法是减少训练集的大小,即用压缩集替换训练集。这个目标可以通过使用数据简化技术来实现,也称为原型选择或生成算法。虽然文献中有许多算法可以有效地解决单标签分类问题,但大多数算法不适用于多标签数据,因为一个实例可以属于多个类。由于不同的原因,众所周知的转换方法不能与数据约简技术相结合。压缩最近邻规则是一种流行的无参数单标签原型选择算法。IB2算法是精简最近邻规则的单遍变体。本文针对多标签数据提出了这些算法的变体。通过对9个不同的数据集进行的实验研究以及统计测试,我们证明了8种提出的方法(每种算法4种)在不影响分类准确性的情况下提供了显着的减少率。
{"title":"Prototype Selection for Multilabel Instance-Based Learning","authors":"Panagiotis Filippakis, Stefanos Ougiaroglou, Georgios Evangelidis","doi":"10.3390/info14100572","DOIUrl":"https://doi.org/10.3390/info14100572","url":null,"abstract":"Reducing the size of the training set, which involves replacing it with a condensed set, is a widely adopted practice to enhance the efficiency of instance-based classifiers while trying to maintain high classification accuracy. This objective can be achieved through the use of data reduction techniques, also known as prototype selection or generation algorithms. Although there are numerous algorithms available in the literature that effectively address single-label classification problems, most of them are not applicable to multilabel data, where an instance can belong to multiple classes. Well-known transformation methods cannot be combined with a data reduction technique due to different reasons. The Condensed Nearest Neighbor rule is a popular parameter-free single-label prototype selection algorithm. The IB2 algorithm is the one-pass variation of the Condensed Nearest Neighbor rule. This paper proposes variations of these algorithms for multilabel data. Through an experimental study conducted on nine distinct datasets as well as statistical tests, we demonstrate that the eight proposed approaches (four for each algorithm) offer significant reduction rates without compromising the classification accuracy.","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135730228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Translation Performance from the User’s Perspective of Large Language Models and Neural Machine Translation Systems 基于用户视角的大语言模型和神经机器翻译系统的翻译性能
Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-19 DOI: 10.3390/info14100574
Jungha Son, Boyoung Kim
The rapid global expansion of ChatGPT, which plays a crucial role in interactive knowledge sharing and translation, underscores the importance of comparative performance assessments in artificial intelligence (AI) technology. This study concentrated on this crucial issue by exploring and contrasting the translation performances of large language models (LLMs) and neural machine translation (NMT) systems. For this aim, the APIs of Google Translate, Microsoft Translator, and OpenAI’s ChatGPT were utilized, leveraging parallel corpora from the Workshop on Machine Translation (WMT) 2018 and 2020 benchmarks. By applying recognized evaluation metrics such as BLEU, chrF, and TER, a comprehensive performance analysis across a variety of language pairs, translation directions, and reference token sizes was conducted. The findings reveal that while Google Translate and Microsoft Translator generally surpass ChatGPT in terms of their BLEU, chrF, and TER scores, ChatGPT exhibits superior performance in specific language pairs. Translations from non-English to English consistently yielded better results across all three systems compared with translations from English to non-English. Significantly, an improvement in translation system performance was observed as the token size increased, hinting at the potential benefits of training models on larger token sizes.
ChatGPT在交互式知识共享和翻译中发挥着至关重要的作用,它在全球的迅速扩张凸显了人工智能(AI)技术中比较绩效评估的重要性。本研究通过探索和对比大型语言模型(llm)和神经机器翻译(NMT)系统的翻译性能来关注这一关键问题。为此,我们利用了谷歌翻译、微软翻译和OpenAI的ChatGPT的api,并利用了2018年和2020年机器翻译研讨会(WMT)基准的平行语料库。通过应用BLEU、chrF和TER等公认的评估指标,对各种语言对、翻译方向和参考标记大小进行了全面的性能分析。研究结果显示,虽然谷歌翻译和微软翻译在BLEU、chrF和TER得分方面普遍超过ChatGPT,但ChatGPT在特定语言对上表现优异。与从英语到非英语的翻译相比,从非英语到英语的翻译在所有三个系统中始终产生更好的结果。值得注意的是,随着令牌大小的增加,可以观察到翻译系统性能的改善,这暗示了在更大的令牌大小上训练模型的潜在好处。
{"title":"Translation Performance from the User’s Perspective of Large Language Models and Neural Machine Translation Systems","authors":"Jungha Son, Boyoung Kim","doi":"10.3390/info14100574","DOIUrl":"https://doi.org/10.3390/info14100574","url":null,"abstract":"The rapid global expansion of ChatGPT, which plays a crucial role in interactive knowledge sharing and translation, underscores the importance of comparative performance assessments in artificial intelligence (AI) technology. This study concentrated on this crucial issue by exploring and contrasting the translation performances of large language models (LLMs) and neural machine translation (NMT) systems. For this aim, the APIs of Google Translate, Microsoft Translator, and OpenAI’s ChatGPT were utilized, leveraging parallel corpora from the Workshop on Machine Translation (WMT) 2018 and 2020 benchmarks. By applying recognized evaluation metrics such as BLEU, chrF, and TER, a comprehensive performance analysis across a variety of language pairs, translation directions, and reference token sizes was conducted. The findings reveal that while Google Translate and Microsoft Translator generally surpass ChatGPT in terms of their BLEU, chrF, and TER scores, ChatGPT exhibits superior performance in specific language pairs. Translations from non-English to English consistently yielded better results across all three systems compared with translations from English to non-English. Significantly, an improvement in translation system performance was observed as the token size increased, hinting at the potential benefits of training models on larger token sizes.","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135779169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Impact of Data Science Solutions on the Company Turnover 数据科学解决方案对公司营业额的影响
Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-19 DOI: 10.3390/info14100573
Marian Pompiliu Cristescu, Dumitru Alexandru Mara, Lia Cornelia Culda, Raluca Andreea Nerișanu, Adela Bâra, Simona-Vasilica Oprea
This study explores the potential of data science software solutions like Customer Relationship Management Software (CRM) for increasing the revenue generation of businesses. We focused on those businesses in the accommodation and food service sector across the European Union (EU). The investigation is contextualized within the rising trend of data-driven decision-making, examining the potential correlation between data science applications and business revenues. By employing a comprehensive evaluation of Eurostat datasets from 2014 to 2021, we used both univariate and multivariate analyses, assessing the percentage of companies that have e-commerce sales across the EU countries, focusing on the usage of big data analytics from any source and the use of CRM tools for marketing purposes or other activities. Big data utilization showed a clear, positive relationship with enhanced e-commerce sales. However, CRM tools exhibited a dualistic impact: while their use in marketing showed no significant effect on sales, their application in non-marketing functions had negative effects on sales. These findings underscore the potential role of CRM and data science solutions in enhancing business performance in the EU’s accommodation and food service industry.
本研究探讨了客户关系管理软件(CRM)等数据科学软件解决方案在增加企业收入方面的潜力。我们关注的是整个欧盟(EU)的住宿和餐饮服务行业的企业。该调查是在数据驱动决策的上升趋势背景下进行的,研究了数据科学应用与业务收入之间的潜在相关性。通过对2014年至2021年欧盟统计局数据集的全面评估,我们使用了单变量和多变量分析,评估了欧盟国家中拥有电子商务销售的公司的百分比,重点关注来自任何来源的大数据分析的使用情况,以及用于营销目的或其他活动的CRM工具的使用情况。大数据的利用与电子商务销售的提升有着明显的正相关关系。然而,客户关系管理工具表现出双重影响:虽然它们在营销方面的使用对销售没有显著影响,但它们在非营销职能方面的应用对销售有负面影响。这些发现强调了CRM和数据科学解决方案在提高欧盟住宿和餐饮服务行业业务绩效方面的潜在作用。
{"title":"The Impact of Data Science Solutions on the Company Turnover","authors":"Marian Pompiliu Cristescu, Dumitru Alexandru Mara, Lia Cornelia Culda, Raluca Andreea Nerișanu, Adela Bâra, Simona-Vasilica Oprea","doi":"10.3390/info14100573","DOIUrl":"https://doi.org/10.3390/info14100573","url":null,"abstract":"This study explores the potential of data science software solutions like Customer Relationship Management Software (CRM) for increasing the revenue generation of businesses. We focused on those businesses in the accommodation and food service sector across the European Union (EU). The investigation is contextualized within the rising trend of data-driven decision-making, examining the potential correlation between data science applications and business revenues. By employing a comprehensive evaluation of Eurostat datasets from 2014 to 2021, we used both univariate and multivariate analyses, assessing the percentage of companies that have e-commerce sales across the EU countries, focusing on the usage of big data analytics from any source and the use of CRM tools for marketing purposes or other activities. Big data utilization showed a clear, positive relationship with enhanced e-commerce sales. However, CRM tools exhibited a dualistic impact: while their use in marketing showed no significant effect on sales, their application in non-marketing functions had negative effects on sales. These findings underscore the potential role of CRM and data science solutions in enhancing business performance in the EU’s accommodation and food service industry.","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135778715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative Adversarial Networks (GANs) for Audio-Visual Speech Recognition in Artificial Intelligence IoT 人工智能物联网中用于视听语音识别的生成对抗网络(GANs
Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-19 DOI: 10.3390/info14100575
Yibo He, Kah Phooi Seng, Li Minn Ang
This paper proposes a novel multimodal generative adversarial network AVSR (multimodal AVSR GAN) architecture, to improve both the energy efficiency and the AVSR classification accuracy of artificial intelligence Internet of things (IoT) applications. The audio-visual speech recognition (AVSR) modality is a classical multimodal modality, which is commonly used in IoT and embedded systems. Examples of suitable IoT applications include in-cabin speech recognition systems for driving systems, AVSR in augmented reality environments, and interactive applications such as virtual aquariums. The application of multimodal sensor data for IoT applications requires efficient information processing, to meet the hardware constraints of IoT devices. The proposed multimodal AVSR GAN architecture is composed of a discriminator and a generator, each of which is a two-stream network, corresponding to the audio stream information and the visual stream information, respectively. To validate this approach, we used augmented data from well-known datasets (LRS2-Lip Reading Sentences 2 and LRS3) in the training process, and testing was performed using the original data. The research and experimental results showed that the proposed multimodal AVSR GAN architecture improved the AVSR classification accuracy. Furthermore, in this study, we discuss the domain of GANs and provide a concise summary of the proposed GANs.
为了提高人工智能物联网(IoT)应用的能源效率和AVSR分类精度,提出了一种新的多模态生成对抗网络AVSR (multimodal AVSR GAN)架构。视听语音识别(AVSR)模态是一种经典的多模态模态,常用于物联网和嵌入式系统。合适的物联网应用示例包括用于驾驶系统的车内语音识别系统、增强现实环境中的AVSR以及虚拟水族馆等交互式应用。多模态传感器数据在物联网应用中的应用需要高效的信息处理,以满足物联网设备的硬件限制。提出的多模态AVSR GAN结构由鉴别器和生成器组成,每个鉴别器是一个两流网络,分别对应音频流信息和视觉流信息。为了验证这一方法,我们在训练过程中使用了来自知名数据集(lrs2 -唇读句子2和LRS3)的增强数据,并使用原始数据进行了测试。研究和实验结果表明,提出的多模态AVSR GAN结构提高了AVSR分类精度。此外,在本研究中,我们讨论了gan的领域,并对提出的gan进行了简要的总结。
{"title":"Generative Adversarial Networks (GANs) for Audio-Visual Speech Recognition in Artificial Intelligence IoT","authors":"Yibo He, Kah Phooi Seng, Li Minn Ang","doi":"10.3390/info14100575","DOIUrl":"https://doi.org/10.3390/info14100575","url":null,"abstract":"This paper proposes a novel multimodal generative adversarial network AVSR (multimodal AVSR GAN) architecture, to improve both the energy efficiency and the AVSR classification accuracy of artificial intelligence Internet of things (IoT) applications. The audio-visual speech recognition (AVSR) modality is a classical multimodal modality, which is commonly used in IoT and embedded systems. Examples of suitable IoT applications include in-cabin speech recognition systems for driving systems, AVSR in augmented reality environments, and interactive applications such as virtual aquariums. The application of multimodal sensor data for IoT applications requires efficient information processing, to meet the hardware constraints of IoT devices. The proposed multimodal AVSR GAN architecture is composed of a discriminator and a generator, each of which is a two-stream network, corresponding to the audio stream information and the visual stream information, respectively. To validate this approach, we used augmented data from well-known datasets (LRS2-Lip Reading Sentences 2 and LRS3) in the training process, and testing was performed using the original data. The research and experimental results showed that the proposed multimodal AVSR GAN architecture improved the AVSR classification accuracy. Furthermore, in this study, we discuss the domain of GANs and provide a concise summary of the proposed GANs.","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135729614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Use of Kullback–Leibler Divergence for Kernel Selection and Interpretation in Variational Autoencoders for Feature Creation 在变分自编码器特征创建中利用Kullback-Leibler散度进行核选择和解释
Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-18 DOI: 10.3390/info14100571
Fábio Mendonça, Sheikh Shanawaz Mostafa, Fernando Morgado-Dias, Antonio G. Ravelo-García
This study presents a novel approach for kernel selection based on Kullback–Leibler divergence in variational autoencoders using features generated by the convolutional encoder. The proposed methodology focuses on identifying the most relevant subset of latent variables to reduce the model’s parameters. Each latent variable is sampled from the distribution associated with a single kernel of the last encoder’s convolutional layer, resulting in an individual distribution for each kernel. Relevant features are selected from the sampled latent variables to perform kernel selection, which filters out uninformative features and, consequently, unnecessary kernels. Both the proposed filter method and the sequential feature selection (standard wrapper method) were examined for feature selection. Particularly, the filter method evaluates the Kullback–Leibler divergence between all kernels’ distributions and hypothesizes that similar kernels can be discarded as they do not convey relevant information. This hypothesis was confirmed through the experiments performed on four standard datasets, where it was observed that the number of kernels can be reduced without meaningfully affecting the performance. This analysis was based on the accuracy of the model when the selected kernels fed a probabilistic classifier and the feature-based similarity index to appraise the quality of the reconstructed images when the variational autoencoder only uses the selected kernels. Therefore, the proposed methodology guides the reduction of the number of parameters of the model, making it suitable for developing applications for resource-constrained devices.
本文提出了一种基于Kullback-Leibler散度的变分自编码器核选择方法,该方法利用卷积编码器产生的特征进行核选择。提出的方法侧重于识别最相关的潜在变量子集,以减少模型的参数。每个潜在变量从与最后一个编码器的卷积层的单个核相关的分布中采样,从而得到每个核的单独分布。从采样的潜在变量中选择相关特征进行核选择,从而过滤掉无信息的特征,从而过滤掉不必要的核。对所提出的滤波方法和顺序特征选择(标准包装方法)进行了特征选择试验。特别是,过滤器方法评估所有核分布之间的Kullback-Leibler散度,并假设相似的核可以被丢弃,因为它们不传递相关信息。通过在四个标准数据集上进行的实验证实了这一假设,其中观察到可以减少核数而不会对性能产生有意义的影响。该分析基于所选核输入概率分类器时模型的准确性,以及变分自编码器仅使用所选核时基于特征的相似度指标来评价重建图像的质量。因此,所提出的方法指导减少模型参数的数量,使其适合开发资源受限设备的应用程序。
{"title":"On the Use of Kullback–Leibler Divergence for Kernel Selection and Interpretation in Variational Autoencoders for Feature Creation","authors":"Fábio Mendonça, Sheikh Shanawaz Mostafa, Fernando Morgado-Dias, Antonio G. Ravelo-García","doi":"10.3390/info14100571","DOIUrl":"https://doi.org/10.3390/info14100571","url":null,"abstract":"This study presents a novel approach for kernel selection based on Kullback–Leibler divergence in variational autoencoders using features generated by the convolutional encoder. The proposed methodology focuses on identifying the most relevant subset of latent variables to reduce the model’s parameters. Each latent variable is sampled from the distribution associated with a single kernel of the last encoder’s convolutional layer, resulting in an individual distribution for each kernel. Relevant features are selected from the sampled latent variables to perform kernel selection, which filters out uninformative features and, consequently, unnecessary kernels. Both the proposed filter method and the sequential feature selection (standard wrapper method) were examined for feature selection. Particularly, the filter method evaluates the Kullback–Leibler divergence between all kernels’ distributions and hypothesizes that similar kernels can be discarded as they do not convey relevant information. This hypothesis was confirmed through the experiments performed on four standard datasets, where it was observed that the number of kernels can be reduced without meaningfully affecting the performance. This analysis was based on the accuracy of the model when the selected kernels fed a probabilistic classifier and the feature-based similarity index to appraise the quality of the reconstructed images when the variational autoencoder only uses the selected kernels. Therefore, the proposed methodology guides the reduction of the number of parameters of the model, making it suitable for developing applications for resource-constrained devices.","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"125 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135888870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DAEM: A Data- and Application-Aware Error Analysis Methodology for Approximate Adders 近似加法器的数据和应用感知误差分析方法
Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-17 DOI: 10.3390/info14100570
Muhammad Abdullah Hanif, Rehan Hafiz, Muhammad Shafique
Approximate adders are some of the fundamental arithmetic operators that are being employed in error-resilient applications, to achieve performance/energy/area gains. This improvement usually comes at the cost of some accuracy and, therefore, requires prior error analysis, to select an approximate adder variant that provides acceptable accuracy. Most of the state-of-the-art error analysis techniques for approximate adders assume input bits and operands to be independent of one another, while some also assume the operands to be uniformly distributed. In this paper, we analyze the impact of these assumptions on the accuracy of error estimation techniques, and we highlight the need to address these assumptions, to achieve better and more realistic quality estimates. Based on our analysis, we propose DAEM, a data- and application-aware error analysis methodology for approximate adders. Unlike existing error analysis models, we neither assume the adder operands to be uniformly distributed nor assume them to be independent. Specifically, we use 2D joint input probability mass functions (PMFs), populated using sample data, in order to incorporate the data and application knowledge in the analysis. These 2D joint input PMFs, along with 2D error maps of approximate adders, are used to estimate the error PMF of an adder network. The error PMF is then utilized to compute different error measures, such as the mean squared error (MSE) and mean error distance (MED). We evaluate the proposed error analysis methodology on audio and video processing applications, and we demonstrate that our methodology provides error estimates having a better correlation with the simulation results, as compared to the state-of-the-art techniques.
近似加法器是一些基本的算术运算符,用于抗错误应用中,以实现性能/能量/面积增益。这种改进通常以某些精度为代价,因此需要事先进行误差分析,以选择提供可接受精度的近似加法器变体。大多数最先进的近似加法器误差分析技术假设输入位和操作数彼此独立,而有些还假设操作数均匀分布。在本文中,我们分析了这些假设对误差估计技术准确性的影响,并强调了解决这些假设的必要性,以实现更好和更现实的质量估计。基于我们的分析,我们提出了DAEM,一种数据和应用感知的近似加法器误差分析方法。与现有的误差分析模型不同,我们既不假设加法器操作数均匀分布,也不假设它们是独立的。具体来说,我们使用二维联合输入概率质量函数(pmf),使用样本数据填充,以便将数据和应用知识纳入分析。这些二维联合输入PMF与近似加法器的二维误差映射一起用于估计加法器网络的误差PMF。然后利用误差PMF计算不同的误差度量,如均方误差(MSE)和平均误差距离(MED)。我们评估了音频和视频处理应用中提出的误差分析方法,并证明与最先进的技术相比,我们的方法提供的误差估计与仿真结果具有更好的相关性。
{"title":"DAEM: A Data- and Application-Aware Error Analysis Methodology for Approximate Adders","authors":"Muhammad Abdullah Hanif, Rehan Hafiz, Muhammad Shafique","doi":"10.3390/info14100570","DOIUrl":"https://doi.org/10.3390/info14100570","url":null,"abstract":"Approximate adders are some of the fundamental arithmetic operators that are being employed in error-resilient applications, to achieve performance/energy/area gains. This improvement usually comes at the cost of some accuracy and, therefore, requires prior error analysis, to select an approximate adder variant that provides acceptable accuracy. Most of the state-of-the-art error analysis techniques for approximate adders assume input bits and operands to be independent of one another, while some also assume the operands to be uniformly distributed. In this paper, we analyze the impact of these assumptions on the accuracy of error estimation techniques, and we highlight the need to address these assumptions, to achieve better and more realistic quality estimates. Based on our analysis, we propose DAEM, a data- and application-aware error analysis methodology for approximate adders. Unlike existing error analysis models, we neither assume the adder operands to be uniformly distributed nor assume them to be independent. Specifically, we use 2D joint input probability mass functions (PMFs), populated using sample data, in order to incorporate the data and application knowledge in the analysis. These 2D joint input PMFs, along with 2D error maps of approximate adders, are used to estimate the error PMF of an adder network. The error PMF is then utilized to compute different error measures, such as the mean squared error (MSE) and mean error distance (MED). We evaluate the proposed error analysis methodology on audio and video processing applications, and we demonstrate that our methodology provides error estimates having a better correlation with the simulation results, as compared to the state-of-the-art techniques.","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"34 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135994561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An AI-Based Framework for Translating American Sign Language to English and Vice Versa 基于人工智能的美英手语翻译框架
Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-15 DOI: 10.3390/info14100569
Vijayendra D. Avina, Md Amiruzzaman, Stefanie Amiruzzaman, Linh B. Ngo, M. Ali Akber Dewan
In this paper, we propose a framework to convert American Sign Language (ASL) to English and English to ASL. Within this framework, we use a deep learning model along with the rolling average prediction that captures image frames from videos and classifies the signs from the image frames. The classified frames are then used to construct ASL words and sentences to support people with hearing impairments. We also use the same deep learning model to capture signs from the people with deaf symptoms and convert them into ASL words and English sentences. Based on this framework, we developed a web-based tool to use in real-life application and we also present the tool as a proof of concept. With the evaluation, we found that the deep learning model converts the image signs into ASL words and sentences with high accuracy. The tool was also found to be very useful for people with hearing impairment and deaf symptoms. The main contribution of this work is the design of a system to convert ASL to English and vice versa.
本文提出了一个将美国手语转换为英语和英语转换为美国手语的框架。在此框架内,我们使用深度学习模型以及滚动平均预测,从视频中捕获图像帧并对图像帧中的符号进行分类。然后使用分类框架来构建美国手语单词和句子,以帮助有听力障碍的人。我们还使用相同的深度学习模型来捕捉有失聪症状的人的信号,并将它们转换成美国手语单词和英语句子。在此框架的基础上,我们开发了一个基于web的工具,用于实际应用,并将该工具作为概念验证。通过评估,我们发现深度学习模型将图像符号转换为美国手语单词和句子的准确率很高。该工具还被发现对有听力障碍和失聪症状的人非常有用。本工作的主要贡献是设计了一个将美国手语转换为英语的系统。
{"title":"An AI-Based Framework for Translating American Sign Language to English and Vice Versa","authors":"Vijayendra D. Avina, Md Amiruzzaman, Stefanie Amiruzzaman, Linh B. Ngo, M. Ali Akber Dewan","doi":"10.3390/info14100569","DOIUrl":"https://doi.org/10.3390/info14100569","url":null,"abstract":"In this paper, we propose a framework to convert American Sign Language (ASL) to English and English to ASL. Within this framework, we use a deep learning model along with the rolling average prediction that captures image frames from videos and classifies the signs from the image frames. The classified frames are then used to construct ASL words and sentences to support people with hearing impairments. We also use the same deep learning model to capture signs from the people with deaf symptoms and convert them into ASL words and English sentences. Based on this framework, we developed a web-based tool to use in real-life application and we also present the tool as a proof of concept. With the evaluation, we found that the deep learning model converts the image signs into ASL words and sentences with high accuracy. The tool was also found to be very useful for people with hearing impairment and deaf symptoms. The main contribution of this work is the design of a system to convert ASL to English and vice versa.","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136184855","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Investigating the Relationship of User Acceptance to the Characteristics and Performance of an Educational Software in Byzantine Music 用户接受度与拜占庭音乐教育软件特性和性能的关系研究
Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-15 DOI: 10.3390/info14100568
Konstantinos-Hercules Kokkinidis, Georgios Patronas, Sotirios K. Goudos, Theodoros Maikantis, Nikolaos Nikolaidis
The purpose of this study is to examine the impact of educational software characteristics on software performance through the mediating role of user acceptance. Our approach allows for a deeper understanding of the factors that contribute to the effectiveness of educational software by bridging the fields of educational technology, psychology, and human–computer interaction, offering a holistic perspective on software adoption and performance. This study is based on a sample collected from public and private education institutes in Northern Greece and on data obtained from 236 users. The statistical method employed is structural equation models (SEMs), via SPSS—AMOS estimation. The findings of this study suggest that user acceptance and performance appraisal are exceptionally interrelated in regard to educational applications. The study argues that user acceptance is positively related to the performance of educational software and constitutes the nested epicenter mediating construct in the educational software characteristics. Additional findings, such as computer-familiar users and users from the field of choral music, are positively related to the performance of the educational software. Our conclusions help in understanding the psychological and behavioral aspects of technology adoption in the educational setting. Findings are discussed in terms of their practical usefulness in education and further research.
本研究的目的是通过用户接受度的中介作用来检验教育软件特性对软件性能的影响。我们的方法通过连接教育技术、心理学和人机交互领域,对影响教育软件有效性的因素有了更深的理解,提供了软件采用和性能的整体视角。这项研究基于从希腊北部的公立和私立教育机构收集的样本,以及从236名用户那里获得的数据。采用的统计方法是结构方程模型(sem),通过SPSS-AMOS估计。这项研究的结果表明,在教育应用方面,用户接受度和业绩评价是异常相关的。研究认为,用户接受度与教育软件的性能呈正相关,构成了教育软件特性中的嵌套震中中介结构。其他发现,如熟悉电脑的用户和合唱音乐领域的用户,与教育软件的表现呈正相关。我们的结论有助于理解教育环境中技术采用的心理和行为方面。研究结果在教育和进一步研究中的实际用途进行了讨论。
{"title":"Investigating the Relationship of User Acceptance to the Characteristics and Performance of an Educational Software in Byzantine Music","authors":"Konstantinos-Hercules Kokkinidis, Georgios Patronas, Sotirios K. Goudos, Theodoros Maikantis, Nikolaos Nikolaidis","doi":"10.3390/info14100568","DOIUrl":"https://doi.org/10.3390/info14100568","url":null,"abstract":"The purpose of this study is to examine the impact of educational software characteristics on software performance through the mediating role of user acceptance. Our approach allows for a deeper understanding of the factors that contribute to the effectiveness of educational software by bridging the fields of educational technology, psychology, and human–computer interaction, offering a holistic perspective on software adoption and performance. This study is based on a sample collected from public and private education institutes in Northern Greece and on data obtained from 236 users. The statistical method employed is structural equation models (SEMs), via SPSS—AMOS estimation. The findings of this study suggest that user acceptance and performance appraisal are exceptionally interrelated in regard to educational applications. The study argues that user acceptance is positively related to the performance of educational software and constitutes the nested epicenter mediating construct in the educational software characteristics. Additional findings, such as computer-familiar users and users from the field of choral music, are positively related to the performance of the educational software. Our conclusions help in understanding the psychological and behavioral aspects of technology adoption in the educational setting. Findings are discussed in terms of their practical usefulness in education and further research.","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"87 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136184506","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated Assessment of Comprehension Strategies from Self-Explanations Using LLMs 基于llm的自我解释理解策略的自动评估
Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2023-10-14 DOI: 10.3390/info14100567
Bogdan Nicula, Mihai Dascalu, Tracy Arner, Renu Balyan, Danielle S. McNamara
Text comprehension is an essential skill in today’s information-rich world, and self-explanation practice helps students improve their understanding of complex texts. This study was centered on leveraging open-source Large Language Models (LLMs), specifically FLAN-T5, to automatically assess the comprehension strategies employed by readers while understanding Science, Technology, Engineering, and Mathematics (STEM) texts. The experiments relied on a corpus of three datasets (N = 11,833) with self-explanations annotated on 4 dimensions: 3 comprehension strategies (i.e., bridging, elaboration, and paraphrasing) and overall quality. Besides FLAN-T5, we also considered GPT3.5-turbo to establish a stronger baseline. Our experiments indicated that the performance improved with fine-tuning, having a larger LLM model, and providing examples via the prompt. Our best model considered a pretrained FLAN-T5 XXL model and obtained a weighted F1-score of 0.721, surpassing the 0.699 F1-score previously obtained using smaller models (i.e., RoBERTa).
在当今信息丰富的世界中,文本理解是一项必不可少的技能,自我解释练习可以帮助学生提高对复杂文本的理解。本研究集中于利用开源大型语言模型(llm),特别是FLAN-T5,来自动评估读者在理解科学、技术、工程和数学(STEM)文本时采用的理解策略。实验依赖于三个数据集(N = 11,833)的语料库,这些数据集在4个维度上标注了自我解释:3种理解策略(即桥接、阐述和释义)和整体质量。除了FLAN-T5,我们还考虑了gpt3.5 turbo,以建立更强的基线。我们的实验表明,通过微调,拥有更大的LLM模型,并通过提示符提供示例,性能得到了提高。我们的最佳模型考虑了预训练的FLAN-T5 XXL模型,并获得了0.721的加权f1分数,超过了之前使用较小模型(即RoBERTa)获得的0.699 f1分数。
{"title":"Automated Assessment of Comprehension Strategies from Self-Explanations Using LLMs","authors":"Bogdan Nicula, Mihai Dascalu, Tracy Arner, Renu Balyan, Danielle S. McNamara","doi":"10.3390/info14100567","DOIUrl":"https://doi.org/10.3390/info14100567","url":null,"abstract":"Text comprehension is an essential skill in today’s information-rich world, and self-explanation practice helps students improve their understanding of complex texts. This study was centered on leveraging open-source Large Language Models (LLMs), specifically FLAN-T5, to automatically assess the comprehension strategies employed by readers while understanding Science, Technology, Engineering, and Mathematics (STEM) texts. The experiments relied on a corpus of three datasets (N = 11,833) with self-explanations annotated on 4 dimensions: 3 comprehension strategies (i.e., bridging, elaboration, and paraphrasing) and overall quality. Besides FLAN-T5, we also considered GPT3.5-turbo to establish a stronger baseline. Our experiments indicated that the performance improved with fine-tuning, having a larger LLM model, and providing examples via the prompt. Our best model considered a pretrained FLAN-T5 XXL model and obtained a weighted F1-score of 0.721, surpassing the 0.699 F1-score previously obtained using smaller models (i.e., RoBERTa).","PeriodicalId":38479,"journal":{"name":"Information (Switzerland)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135766283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Information (Switzerland)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1