首页 > 最新文献

Applied and Computational Engineering最新文献

英文 中文
Comparative analysis of transformer and GoogleNet models in image classification based on the CIFAR dataset 基于 CIFAR 数据集的变换器和 GoogleNet 模型在图像分类中的对比分析
Pub Date : 2024-07-25 DOI: 10.54254/2755-2721/79/20241537
Xinran Xie, Qinwen Yan, Haoye Li, Sujie Yan, Zirong Jiang
Image classification plays a pivotal role in numerous applications, with substantial implications for daily life, including diagnosing disease from medical images and management of images in autonomous vehicles. However, such sort of research in this field continuously challenges scientists in terms of choosing datasets, testing accuracy, and improvement of models, etc. In this paper, we focus on the performance of two prominent models GoogleNet and residual attention network. We construct two models on the Python platform according to available online resources. To assess their capabilities, we employ the CIFAR-100 dataset, a widely used benchmark dataset. Despite the simplicity of our implementations, GoogleNet comprises approximately 75 convolutional layers and inception modules, and the Residual Attention Network incorporates multiple attention modules within its architecture. These characteristics demonstrate the models' potential for achieving exceptional classification results. Through comprehensive testing and visualization, we aim to provide insights into the efficacy of these models in the context of image classification. Our study contributes to a broader and profounder understanding of their suitability for real-world applications. According to our diagrams and analysis, we conclude that although attention56 is suitable to be adopted in image classification concerning its structure since the model is unstable and invalid in a wide range of training image data on dataset SIFAR100 it might not be exploited in practice. However, as to the model GoogleNet, with an increasing number of training, it obviously is prone to robustness and solid capability of noise resistance. Therefore, GoogleNet is a suitable one to be employed in image classification.
图像分类在众多应用中发挥着举足轻重的作用,对日常生活有着重大影响,包括从医学图像中诊断疾病和自动驾驶汽车中的图像管理。然而,该领域的此类研究在数据集选择、准确性测试和模型改进等方面不断向科学家提出挑战。在本文中,我们重点研究了两个著名模型 GoogleNet 和剩余注意力网络的性能。我们根据可用的在线资源,在 Python 平台上构建了两个模型。为了评估它们的能力,我们使用了 CIFAR-100 数据集,这是一个广泛使用的基准数据集。尽管我们的实现比较简单,但 GoogleNet 包含约 75 个卷积层和入门模块,而残差注意网络的架构中包含多个注意模块。这些特点表明,这些模型具有取得优异分类结果的潜力。通过综合测试和可视化,我们希望深入了解这些模型在图像分类方面的功效。我们的研究有助于更广泛、更深入地了解这些模型在实际应用中的适用性。根据我们的图表和分析,我们得出结论:尽管 attention56 适合用于图像分类,但由于该模型在数据集 SIFAR100 上的大量训练图像数据中不稳定且无效,因此其结构在实践中可能无法利用。然而,对于 GoogleNet 模型来说,随着训练次数的增加,它的鲁棒性和抗噪声能力明显增强。因此,GoogleNet 是一个适合用于图像分类的模型。
{"title":"Comparative analysis of transformer and GoogleNet models in image classification based on the CIFAR dataset","authors":"Xinran Xie, Qinwen Yan, Haoye Li, Sujie Yan, Zirong Jiang","doi":"10.54254/2755-2721/79/20241537","DOIUrl":"https://doi.org/10.54254/2755-2721/79/20241537","url":null,"abstract":"Image classification plays a pivotal role in numerous applications, with substantial implications for daily life, including diagnosing disease from medical images and management of images in autonomous vehicles. However, such sort of research in this field continuously challenges scientists in terms of choosing datasets, testing accuracy, and improvement of models, etc. In this paper, we focus on the performance of two prominent models GoogleNet and residual attention network. We construct two models on the Python platform according to available online resources. To assess their capabilities, we employ the CIFAR-100 dataset, a widely used benchmark dataset. Despite the simplicity of our implementations, GoogleNet comprises approximately 75 convolutional layers and inception modules, and the Residual Attention Network incorporates multiple attention modules within its architecture. These characteristics demonstrate the models' potential for achieving exceptional classification results. Through comprehensive testing and visualization, we aim to provide insights into the efficacy of these models in the context of image classification. Our study contributes to a broader and profounder understanding of their suitability for real-world applications. According to our diagrams and analysis, we conclude that although attention56 is suitable to be adopted in image classification concerning its structure since the model is unstable and invalid in a wide range of training image data on dataset SIFAR100 it might not be exploited in practice. However, as to the model GoogleNet, with an increasing number of training, it obviously is prone to robustness and solid capability of noise resistance. Therefore, GoogleNet is a suitable one to be employed in image classification.","PeriodicalId":502253,"journal":{"name":"Applied and Computational Engineering","volume":"30 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141803546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing customer behavior prediction in e-commerce: A comparative analysis of machine learning and deep learning models 加强电子商务中的客户行为预测:机器学习和深度学习模型的比较分析
Pub Date : 2024-07-25 DOI: 10.54254/2755-2721/55/20241475
Deming Liu, Hansheng Huang, Haimei Zhang, Xinyu Luo, Zhongyang Fan
The digital era has transformed the way businesses interact with their customers, with online platforms serving as crucial touchpoints for user engagement. Understanding customer behavior in this context is paramount for enhancing user experience, optimizing marketing strategies, and driving business growth. This study aims to explore the likelihood of customers making purchases based on their clickstream data by employing both machine learning and deep learning techniques. This research uses a machine learning model Random Forest (RF), Gradient Boosting Decision Trees (GBDT), Extreme Gradient Boosting (XGBOOST) and deep learning model Recurrent Neural Networks (RNN), Long Short-Term Memory (LSTM) to predict whether customers will purchase the items using 33,040,175 records in the file of the click and 1,177,769 records in the buys file from real e-commerce customers. The results show that both machine learning and deep learning can accurately forecast the purchasing behavior of customers with an accuracy of around 72 to 75 percent. For the machine learning model, attains the highest prediction accuracy when using a sliding window of 6 days. For the deep learning model, the LSTM model with 50 layers shows the highest prediction of customers willingness to purchase an item. Compared with previous studies, the three machine learning models narrow the range of days, give more accurate predictions, and also improve the model. Both RNN and LSTM show similar accuracy for customer behavior. The current research has asserted that both machine learning and deep learning models give profound results on whether customers will purchase a product, and there is not a significant difference between machine learning and deep learning in this classification topic.
数字时代改变了企业与客户互动的方式,在线平台成为用户参与的关键接触点。在这种情况下,了解客户行为对于提升用户体验、优化营销策略和推动业务增长至关重要。本研究旨在通过采用机器学习和深度学习技术,根据客户的点击流数据探索客户进行购买的可能性。本研究使用机器学习模型随机森林(RF)、梯度提升决策树(GBDT)、极端梯度提升(XGBOOST)和深度学习模型循环神经网络(RNN)、长短期记忆(LSTM),利用真实电子商务客户点击文件中的 33,040,175 条记录和购买文件中的 1,177,769 条记录,预测客户是否会购买商品。结果表明,机器学习和深度学习都能准确预测客户的购买行为,准确率约为 72%至 75%。对于机器学习模型,当使用 6 天的滑动窗口时,预测准确率最高。在深度学习模型中,50 层的 LSTM 模型对顾客购买商品意愿的预测率最高。与之前的研究相比,这三种机器学习模型缩小了天数范围,给出了更准确的预测,同时也改进了模型。RNN 和 LSTM 对顾客行为的预测准确率相似。目前的研究表明,机器学习和深度学习模型都能对顾客是否会购买商品给出深刻的结果,机器学习和深度学习在这一分类主题上没有显著差异。
{"title":"Enhancing customer behavior prediction in e-commerce: A comparative analysis of machine learning and deep learning models","authors":"Deming Liu, Hansheng Huang, Haimei Zhang, Xinyu Luo, Zhongyang Fan","doi":"10.54254/2755-2721/55/20241475","DOIUrl":"https://doi.org/10.54254/2755-2721/55/20241475","url":null,"abstract":"The digital era has transformed the way businesses interact with their customers, with online platforms serving as crucial touchpoints for user engagement. Understanding customer behavior in this context is paramount for enhancing user experience, optimizing marketing strategies, and driving business growth. This study aims to explore the likelihood of customers making purchases based on their clickstream data by employing both machine learning and deep learning techniques. This research uses a machine learning model Random Forest (RF), Gradient Boosting Decision Trees (GBDT), Extreme Gradient Boosting (XGBOOST) and deep learning model Recurrent Neural Networks (RNN), Long Short-Term Memory (LSTM) to predict whether customers will purchase the items using 33,040,175 records in the file of the click and 1,177,769 records in the buys file from real e-commerce customers. The results show that both machine learning and deep learning can accurately forecast the purchasing behavior of customers with an accuracy of around 72 to 75 percent. For the machine learning model, attains the highest prediction accuracy when using a sliding window of 6 days. For the deep learning model, the LSTM model with 50 layers shows the highest prediction of customers willingness to purchase an item. Compared with previous studies, the three machine learning models narrow the range of days, give more accurate predictions, and also improve the model. Both RNN and LSTM show similar accuracy for customer behavior. The current research has asserted that both machine learning and deep learning models give profound results on whether customers will purchase a product, and there is not a significant difference between machine learning and deep learning in this classification topic.","PeriodicalId":502253,"journal":{"name":"Applied and Computational Engineering","volume":"10 4","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141803688","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A review of research about automatic navigation system of mobile robots based on ROS 基于 ROS 的移动机器人自动导航系统研究综述
Pub Date : 2024-07-25 DOI: 10.54254/2755-2721/79/20241666
Qiuyang Yu
With the rapid development of technology, robotic technology is continuously updated and iterated. In this process, mobile robots are gradually entering different fields and playing important roles in people's daily lives. In this context, research on path planning for mobile robots has become a priority in order to meet the various tasks that mobile robots need to handle in different situations. This paper proposes a ROS-based mobile robot autonomous navigation system, which uses the gampping algorithm to implement SLAM technology for building environment maps. It addresses the accuracy issues found in traditional map construction methods and improves localization accuracy by combining IMU with laser odometry. The paper focuses on the problem of path planning for mobile robots and mainly compares and analyzes the advantages and disadvantages of the A* algorithm and Dijkstra algorithm in path planning algorithms, aiming to find a more suitable path planning algorithm. This paper provides support for the development of the modern mobile robot field.
随着科技的飞速发展,机器人技术也在不断更新迭代。在此过程中,移动机器人逐渐进入不同领域,并在人们的日常生活中扮演着重要角色。在此背景下,为了满足移动机器人在不同情况下需要处理的各种任务,移动机器人的路径规划研究已成为当务之急。本文提出了一种基于 ROS 的移动机器人自主导航系统,该系统采用伽扑(gampping)算法实现 SLAM 技术,用于构建环境地图。该系统解决了传统地图构建方法中存在的精度问题,并通过结合 IMU 和激光里程计提高了定位精度。本文重点研究了移动机器人的路径规划问题,主要对比分析了A*算法和Dijkstra算法在路径规划算法中的优缺点,旨在找到更合适的路径规划算法。本文为现代移动机器人领域的发展提供了支持。
{"title":"A review of research about automatic navigation system of mobile robots based on ROS","authors":"Qiuyang Yu","doi":"10.54254/2755-2721/79/20241666","DOIUrl":"https://doi.org/10.54254/2755-2721/79/20241666","url":null,"abstract":"With the rapid development of technology, robotic technology is continuously updated and iterated. In this process, mobile robots are gradually entering different fields and playing important roles in people's daily lives. In this context, research on path planning for mobile robots has become a priority in order to meet the various tasks that mobile robots need to handle in different situations. This paper proposes a ROS-based mobile robot autonomous navigation system, which uses the gampping algorithm to implement SLAM technology for building environment maps. It addresses the accuracy issues found in traditional map construction methods and improves localization accuracy by combining IMU with laser odometry. The paper focuses on the problem of path planning for mobile robots and mainly compares and analyzes the advantages and disadvantages of the A* algorithm and Dijkstra algorithm in path planning algorithms, aiming to find a more suitable path planning algorithm. This paper provides support for the development of the modern mobile robot field.","PeriodicalId":502253,"journal":{"name":"Applied and Computational Engineering","volume":"48 21","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141803736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on time-series financial data prediction and analysis based on deep recurrent neural network 基于深度递归神经网络的时间序列金融数据预测与分析研究
Pub Date : 2024-07-25 DOI: 10.54254/2755-2721/69/20241498
Feng Yuan
Time series data is widely available in a variety of industries. By forecasting time series, decision-makers can better grasp future trends and make more effective decisions. Financial time series data exhibit non-stationarity and high volatility. High-frequency fluctuations in financial products such as exchange rates, bonds and equities may reflect external shocks and risks in global financial markets, which are potentially dangerous and may threaten national economic security or even trigger financial crises. For financial time series data, a deep recurrent neural network first progressively processes each data point in the time series through its recurrent unit. Each recurring unit can adjust its own weights to better predict or analyze future values. Over time, these recurrent units continuously update their internal state, resulting in a comprehensive understanding of the characteristics of the entire data sequence. In addition, we add a gating mechanism to further improve the network's ability to control the flow of information, so that the model is more effective when retaining long-term dependencies, so as to improve the accuracy of prediction and the stability of the model. Experimental results show that our recurrent neural network model shows higher prediction accuracy and stability than other baseline models on financial time series datasets.
时间序列数据广泛存在于各行各业。通过预测时间序列,决策者可以更好地把握未来趋势,做出更有效的决策。金融时间序列数据表现出非平稳性和高波动性。汇率、债券和股票等金融产品的高频波动可能反映全球金融市场的外部冲击和风险,具有潜在危险,可能威胁国家经济安全,甚至引发金融危机。对于金融时间序列数据,深度递归神经网络首先通过其递归单元逐步处理时间序列中的每个数据点。每个递归单元都可以调整自己的权重,以更好地预测或分析未来值。随着时间的推移,这些递归单元会不断更新其内部状态,从而全面了解整个数据序列的特征。此外,我们还增加了门控机制,进一步提高网络控制信息流的能力,使模型在保留长期依赖关系时更加有效,从而提高预测的准确性和模型的稳定性。实验结果表明,在金融时间序列数据集上,我们的递归神经网络模型比其他基线模型显示出更高的预测精度和稳定性。
{"title":"Research on time-series financial data prediction and analysis based on deep recurrent neural network","authors":"Feng Yuan","doi":"10.54254/2755-2721/69/20241498","DOIUrl":"https://doi.org/10.54254/2755-2721/69/20241498","url":null,"abstract":"Time series data is widely available in a variety of industries. By forecasting time series, decision-makers can better grasp future trends and make more effective decisions. Financial time series data exhibit non-stationarity and high volatility. High-frequency fluctuations in financial products such as exchange rates, bonds and equities may reflect external shocks and risks in global financial markets, which are potentially dangerous and may threaten national economic security or even trigger financial crises. For financial time series data, a deep recurrent neural network first progressively processes each data point in the time series through its recurrent unit. Each recurring unit can adjust its own weights to better predict or analyze future values. Over time, these recurrent units continuously update their internal state, resulting in a comprehensive understanding of the characteristics of the entire data sequence. In addition, we add a gating mechanism to further improve the network's ability to control the flow of information, so that the model is more effective when retaining long-term dependencies, so as to improve the accuracy of prediction and the stability of the model. Experimental results show that our recurrent neural network model shows higher prediction accuracy and stability than other baseline models on financial time series datasets.","PeriodicalId":502253,"journal":{"name":"Applied and Computational Engineering","volume":"9 9","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141804373","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cops and robbers game and applications of its variant 警察与强盗游戏及其变体的应用
Pub Date : 2024-07-25 DOI: 10.54254/2755-2721/69/20241464
Rui Bao
Pursuit-evasion game is a game with a pursuer player and an evader player, it has numerous applications including artificial intelligence, robot motion planning and so on. The cops and robbers game is a type of pursuit-evasion game played on graphs, and its variant is a very fruitful research area in graph theory. The purpose of this paper is to find the specific relations between pursuit-evasion game applications and different types of variants of the cops and robbers game. The research method is to find variants with different game rules and winning strategies and classify them. Then compare these rules and strategies with the applications of pursuit-evasion games to find their relationships. The comparison result shows that there are many differences between the variants and the applications, which lead to their inability to be directly related. The application of the pursuit-evasion game is mainly based on route planning and object distance, and generally contains multiple variant types, which is different from the current research direction of variant.
追逐-逃避博弈是一种由追逐者和逃避者组成的博弈,它在人工智能、机器人运动规划等方面有着广泛的应用。警察与强盗博弈是在图上进行的一种追逐-逃避博弈,其变体是图论中一个非常有成果的研究领域。本文的目的是找出追逐-逃避博弈应用与警察与强盗博弈不同类型变体之间的具体关系。研究方法是找到具有不同游戏规则和获胜策略的变体,并对它们进行分类。然后将这些规则和策略与逃避游戏的应用进行比较,找出它们之间的关系。比较结果表明,变体和应用之间存在许多差异,导致它们无法直接联系起来。追逃博弈的应用主要基于路线规划和目标距离,一般包含多种变体类型,这与当前变体的研究方向不同。
{"title":"Cops and robbers game and applications of its variant","authors":"Rui Bao","doi":"10.54254/2755-2721/69/20241464","DOIUrl":"https://doi.org/10.54254/2755-2721/69/20241464","url":null,"abstract":"Pursuit-evasion game is a game with a pursuer player and an evader player, it has numerous applications including artificial intelligence, robot motion planning and so on. The cops and robbers game is a type of pursuit-evasion game played on graphs, and its variant is a very fruitful research area in graph theory. The purpose of this paper is to find the specific relations between pursuit-evasion game applications and different types of variants of the cops and robbers game. The research method is to find variants with different game rules and winning strategies and classify them. Then compare these rules and strategies with the applications of pursuit-evasion games to find their relationships. The comparison result shows that there are many differences between the variants and the applications, which lead to their inability to be directly related. The application of the pursuit-evasion game is mainly based on route planning and object distance, and generally contains multiple variant types, which is different from the current research direction of variant.","PeriodicalId":502253,"journal":{"name":"Applied and Computational Engineering","volume":"46 13","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141805101","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The limits of excellence: Assessing fine-tuned ChatGPTs efficacy in stock price forecasting 卓越的极限:评估微调 ChatGPT 在股价预测中的功效
Pub Date : 2024-07-25 DOI: 10.54254/2755-2721/79/20241612
Yiyang Huang, Xiang Liu, Naichuan Zhang, Tianshu Zhao
In this study, we explore the ability of ChatGPT to predict stock market trends based on stock news headlines and real stock market data. In order to evaluate the performance of the fine-tuned model, we first obtain the prediction results of GPT-3.5 Turbo on specific stocks future trends as a comparison. We fine-tuned GPT-3.5 Turbo and conducted related training, testing and result evaluation. The experiments implemented on the two datasets Bigdata2022 and Cikm illustrate that fine-tuning can help the model to produce expected structured output according to user requirements, based on its more sophisticated understanding of the text and data in this field. However, although the models performance is improved significantly, GPT-3.5 Turbo does not demonstrate better performance compared to other traditional large language models in terms of integrating time series data and news headline data for stock forecasting. The fine-tuned ChatGPT model is expected to achieve excellent results in the stock market forecasting tasks through more in-depth research and become one of the mainstream research models in this field.
在本研究中,我们探讨了 ChatGPT 基于股票新闻标题和真实股市数据预测股市趋势的能力。为了评估微调模型的性能,我们首先获得了 GPT-3.5 Turbo 对特定股票未来趋势的预测结果作为对比。我们对 GPT-3.5 Turbo 进行了微调,并进行了相关的训练、测试和结果评估。在 Bigdata2022 和 Cikm 这两个数据集上进行的实验表明,基于对该领域文本和数据更复杂的理解,微调可以帮助模型根据用户需求产生预期的结构化输出。不过,虽然模型的性能有了显著提高,但与其他传统的大型语言模型相比,GPT-3.5 Turbo 在整合时间序列数据和新闻标题数据进行股票预测方面并没有表现出更好的性能。经过微调的 ChatGPT 模型有望通过更深入的研究在股市预测任务中取得优异成绩,并成为该领域的主流研究模型之一。
{"title":"The limits of excellence: Assessing fine-tuned ChatGPTs efficacy in stock price forecasting","authors":"Yiyang Huang, Xiang Liu, Naichuan Zhang, Tianshu Zhao","doi":"10.54254/2755-2721/79/20241612","DOIUrl":"https://doi.org/10.54254/2755-2721/79/20241612","url":null,"abstract":"In this study, we explore the ability of ChatGPT to predict stock market trends based on stock news headlines and real stock market data. In order to evaluate the performance of the fine-tuned model, we first obtain the prediction results of GPT-3.5 Turbo on specific stocks future trends as a comparison. We fine-tuned GPT-3.5 Turbo and conducted related training, testing and result evaluation. The experiments implemented on the two datasets Bigdata2022 and Cikm illustrate that fine-tuning can help the model to produce expected structured output according to user requirements, based on its more sophisticated understanding of the text and data in this field. However, although the models performance is improved significantly, GPT-3.5 Turbo does not demonstrate better performance compared to other traditional large language models in terms of integrating time series data and news headline data for stock forecasting. The fine-tuned ChatGPT model is expected to achieve excellent results in the stock market forecasting tasks through more in-depth research and become one of the mainstream research models in this field.","PeriodicalId":502253,"journal":{"name":"Applied and Computational Engineering","volume":"49 14","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141805673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient computation of eigenvalues in diffusion maps: A multi-strategy approach 扩散图特征值的高效计算:多策略方法
Pub Date : 2024-07-25 DOI: 10.54254/2755-2721/55/20241429
Yixiong Fang, Weixi Yang
In the pursuit of accelerating the computation of k-largest eigenvalues and eigenvectors, this work presents novel methodologies and insights across three main areas. 1) Speed Arnoldi Iteration Up: By observing existing algorithms, we propose an innovative approach that leverages matrix decomposition to accelerate the computation process. The implementation focuses on iteratively computing orthogonal projections and efficiently storing computed vectors in Krylov subspace. 2) Special Case: Eigenvector for =1: We examine a specific scenario concerning Markov matrices, where the largest eigenvalue is 1. The work provides a detailed proof and analysis of this property, contributing to a deeper understanding of eigenvalues and eigenvectors in the context of stochastic processes. 3) Gaussian Approximation for Markov Matrices: This section delves into the Gaussian approximation for Markov matrices, denoted by P. The work covers theoretical insights, practical challenges, computational efficiency, and empirical validation, providing a comprehensive exploration of this critical method. Together, these sections form a cohesive study aimed at enhancing the computational efficiency of significant algorithms within the field of dimensionality reduction and matrix analysis. The findings may find broad applications in various domains, including image segmentation, speaker verification, anomaly detection, and more.
为了加快 k 大特征值和特征向量的计算速度,本研究在三个主要领域提出了新颖的方法和见解。1) 加速阿诺德迭代:通过观察现有算法,我们提出了一种利用矩阵分解加速计算过程的创新方法。实现的重点是迭代计算正交投影,并将计算出的向量高效地存储在克雷洛夫子空间中。2) 特殊情况:=1 的特征向量:我们研究了马尔可夫矩阵的一个特殊情况,即最大特征值为 1。这项研究提供了对这一特性的详细证明和分析,有助于加深对随机过程中特征值和特征向量的理解。3) 马尔可夫矩阵的高斯逼近:本节深入探讨马尔可夫矩阵的高斯逼近(用 P 表示)。工作内容包括理论见解、实际挑战、计算效率和经验验证,对这一关键方法进行了全面探索。这些部分共同构成了一项具有凝聚力的研究,旨在提高降维和矩阵分析领域重要算法的计算效率。这些研究成果可广泛应用于各个领域,包括图像分割、说话人验证、异常检测等。
{"title":"Efficient computation of eigenvalues in diffusion maps: A multi-strategy approach","authors":"Yixiong Fang, Weixi Yang","doi":"10.54254/2755-2721/55/20241429","DOIUrl":"https://doi.org/10.54254/2755-2721/55/20241429","url":null,"abstract":"In the pursuit of accelerating the computation of k-largest eigenvalues and eigenvectors, this work presents novel methodologies and insights across three main areas. 1) Speed Arnoldi Iteration Up: By observing existing algorithms, we propose an innovative approach that leverages matrix decomposition to accelerate the computation process. The implementation focuses on iteratively computing orthogonal projections and efficiently storing computed vectors in Krylov subspace. 2) Special Case: Eigenvector for =1: We examine a specific scenario concerning Markov matrices, where the largest eigenvalue is 1. The work provides a detailed proof and analysis of this property, contributing to a deeper understanding of eigenvalues and eigenvectors in the context of stochastic processes. 3) Gaussian Approximation for Markov Matrices: This section delves into the Gaussian approximation for Markov matrices, denoted by P. The work covers theoretical insights, practical challenges, computational efficiency, and empirical validation, providing a comprehensive exploration of this critical method. Together, these sections form a cohesive study aimed at enhancing the computational efficiency of significant algorithms within the field of dimensionality reduction and matrix analysis. The findings may find broad applications in various domains, including image segmentation, speaker verification, anomaly detection, and more.","PeriodicalId":502253,"journal":{"name":"Applied and Computational Engineering","volume":"56 22","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141805926","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A survey of generative models used in text-to-image 文本到图像生成模型概览
Pub Date : 2024-07-25 DOI: 10.54254/2755-2721/79/20241286
Jingjing Xu, Jiahao Du, Junyi Wang
The emergence and rapid development of neural networks have been pivotal in advancing text-to-image generative models, with particular emphasis on generative adversarial networks (GANs), variational autoencoders (VAEs), and augmented reality (AR). These models have greatly enriched the field, offering diverse avenues for image generation. Critical support has been provided by databases such as MS COCO, Flickr30K, Visual Genome, and Conceptual Captions, along with essential evaluation metrics, including Inception Score (IS), Frchet Inception Distance (FID), precision, and recall. In this comprehensive review, we delve into the mechanisms and significance of each model and technique, ensuring a holistic examination of their contributions. Both GANs and VAEs stand out as significant models within image generative frameworks, each excelling in distinct aspects. Therefore, it is imperative to discuss both models in this review, as they offer complementary strengths. Additionally, we include noteworthy models such as augmented reality to provide a well-rounded assessment of the current advancements in the field. In terms of datasets, MS COCO offers a diverse and extensive collection of images, serving as a cornerstone for model training. Other datasets like Flickr 30k, Visual Genome, and Conceptual Captions contribute valuable labeled examples, further enriching the learning process for these models. The incorporation of widely recognized metrics and methodologies in the field allows for effective evaluation and comparison of their relative significance. In conclusion, the field's recent achievements owe much to the integration of its various components. VAEs and GANs, with their unique strengths, complement each other, while metrics and datasets play complementary roles in advancing the capabilities of generative models in the context of text-to-image synthesis. This survey underscores the collaborative synergy between models, metrics, and datasets, propelling the field toward new horizons.
神经网络的出现和快速发展在推动文本到图像生成模型方面发挥了关键作用,尤其是生成对抗网络(GAN)、变异自动编码器(VAE)和增强现实(AR)。这些模型极大地丰富了这一领域,为图像生成提供了多种途径。MS COCO、Flickr30K、Visual Genome 和 Conceptual Captions 等数据库提供了重要支持,同时还提供了重要的评估指标,包括入门分数(IS)、Frchet 入门距离(FID)、精确度和召回率。在这篇综合评论中,我们深入探讨了每种模型和技术的机制和意义,确保对它们的贡献进行全面考察。GANs 和 VAEs 都是图像生成框架中的重要模型,各自在不同的方面表现出色。因此,在本综述中必须讨论这两种模型,因为它们具有互补优势。此外,我们还纳入了增强现实等值得关注的模型,以便对该领域的当前进展进行全面评估。在数据集方面,MS COCO 提供了丰富多样的图像,是模型训练的基石。Flickr 30k、Visual Genome 和 Conceptual Captions 等其他数据集提供了宝贵的标注示例,进一步丰富了这些模型的学习过程。通过采用该领域广泛认可的指标和方法,可以有效评估和比较它们的相对重要性。总之,该领域最近取得的成就在很大程度上归功于其各个组成部分的整合。具有独特优势的 VAE 和 GAN 相辅相成,而度量标准和数据集则在文本到图像合成中发挥了互补作用,推动了生成模型能力的提高。这项调查强调了模型、度量和数据集之间的合作协同作用,推动这一领域迈向新的境界。
{"title":"A survey of generative models used in text-to-image","authors":"Jingjing Xu, Jiahao Du, Junyi Wang","doi":"10.54254/2755-2721/79/20241286","DOIUrl":"https://doi.org/10.54254/2755-2721/79/20241286","url":null,"abstract":"The emergence and rapid development of neural networks have been pivotal in advancing text-to-image generative models, with particular emphasis on generative adversarial networks (GANs), variational autoencoders (VAEs), and augmented reality (AR). These models have greatly enriched the field, offering diverse avenues for image generation. Critical support has been provided by databases such as MS COCO, Flickr30K, Visual Genome, and Conceptual Captions, along with essential evaluation metrics, including Inception Score (IS), Frchet Inception Distance (FID), precision, and recall. In this comprehensive review, we delve into the mechanisms and significance of each model and technique, ensuring a holistic examination of their contributions. Both GANs and VAEs stand out as significant models within image generative frameworks, each excelling in distinct aspects. Therefore, it is imperative to discuss both models in this review, as they offer complementary strengths. Additionally, we include noteworthy models such as augmented reality to provide a well-rounded assessment of the current advancements in the field. In terms of datasets, MS COCO offers a diverse and extensive collection of images, serving as a cornerstone for model training. Other datasets like Flickr 30k, Visual Genome, and Conceptual Captions contribute valuable labeled examples, further enriching the learning process for these models. The incorporation of widely recognized metrics and methodologies in the field allows for effective evaluation and comparison of their relative significance. In conclusion, the field's recent achievements owe much to the integration of its various components. VAEs and GANs, with their unique strengths, complement each other, while metrics and datasets play complementary roles in advancing the capabilities of generative models in the context of text-to-image synthesis. This survey underscores the collaborative synergy between models, metrics, and datasets, propelling the field toward new horizons.","PeriodicalId":502253,"journal":{"name":"Applied and Computational Engineering","volume":"15 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141804415","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep reinforcement learning-aided minimum current control for the DBSRC based on harmonic analysis method 基于谐波分析方法的深度强化学习辅助 DBSRC 最小电流控制
Pub Date : 2024-07-25 DOI: 10.54254/2755-2721/67/2024ma0069
Ziqiao Yu, Zhengcheng Li
Aiming to optimize the modulation efficiency for the dual-bridge series-resonant converter (DBSRC), this paper proposes an deep reinforcement learning (DRL) aided EPS (DEPS) modulation scheme for minimum current operation based on the harmonic analysis method. Using the deep deterministic policy gradient (DDPG) algorithm as an advanced DRL algorithm, the scheme obtains the optimized modulation scheme DEPS through offline training of the agent, which can adopt the extended-phase-shift (EPS) modulation scheme and consider the zero-voltage-switching (ZVS) constraints. Thus, the trained agent of the DDPG which likes an implicit function, can provide optimal phase shift angle for the DBSRC in real-time with the minimum current and soft switching performance in the continuous operation range. Compared with the existing EPS modulation schemes using First Harmonic Approximation (FHA), the DEPS modulation scheme has similar operational efficiency and performance of the converter, while also possessing the ability to obtain modulation angles in real-time based on environmental parameters. Finally, PSIM simulation verifies the effectiveness of the proposed optimization scheme.
为了优化双桥串联谐振变换器(DBSRC)的调制效率,本文提出了一种基于谐波分析方法的深度强化学习(DRL)辅助 EPS(DEPS)调制方案,用于最小电流运行。该方案将深度确定性策略梯度(DDPG)算法作为一种先进的 DRL 算法,通过离线训练代理获得优化的调制方案 DEPS,该方案可采用扩展相移(EPS)调制方案,并考虑零电压开关(ZVS)约束。因此,经过训练的 DDPG 代理(喜欢隐式函数)可以实时为 DBSRC 提供最佳相移角,并在连续工作范围内实现最小电流和软开关性能。与现有的使用一次谐波逼近法(FHA)的 EPS 调制方案相比,DEPS 调制方案具有相似的转换器运行效率和性能,同时还能根据环境参数实时获得调制角。最后,PSIM 仿真验证了所提优化方案的有效性。
{"title":"Deep reinforcement learning-aided minimum current control for the DBSRC based on harmonic analysis method","authors":"Ziqiao Yu, Zhengcheng Li","doi":"10.54254/2755-2721/67/2024ma0069","DOIUrl":"https://doi.org/10.54254/2755-2721/67/2024ma0069","url":null,"abstract":"Aiming to optimize the modulation efficiency for the dual-bridge series-resonant converter (DBSRC), this paper proposes an deep reinforcement learning (DRL) aided EPS (DEPS) modulation scheme for minimum current operation based on the harmonic analysis method. Using the deep deterministic policy gradient (DDPG) algorithm as an advanced DRL algorithm, the scheme obtains the optimized modulation scheme DEPS through offline training of the agent, which can adopt the extended-phase-shift (EPS) modulation scheme and consider the zero-voltage-switching (ZVS) constraints. Thus, the trained agent of the DDPG which likes an implicit function, can provide optimal phase shift angle for the DBSRC in real-time with the minimum current and soft switching performance in the continuous operation range. Compared with the existing EPS modulation schemes using First Harmonic Approximation (FHA), the DEPS modulation scheme has similar operational efficiency and performance of the converter, while also possessing the ability to obtain modulation angles in real-time based on environmental parameters. Finally, PSIM simulation verifies the effectiveness of the proposed optimization scheme.","PeriodicalId":502253,"journal":{"name":"Applied and Computational Engineering","volume":"6 9","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141804576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Four possible error conjectures with SPM in MATLAB 在 MATLAB 中使用 SPM 的四种可能误差猜想
Pub Date : 2024-07-25 DOI: 10.54254/2755-2721/55/20241491
Ke Ning
Flanker tasks are mainly supposed to have activation signals on the prefrontal regions and anterior cingulate cortex (ACC) because this assignment requires immediate actions from participants after seeing unique signs. The SPM in MATLAB is used to process the data of activation signals of brains from flanker task data collection. This process has four steps: preprocessing of all the subjects, first-level analysis, second-level analysis, and Region of Interest (ROI) analysis. After all the steps, the final graph should show black spots representing the activation signal on the prefrontal and anterior cingulate cortex regions. However, the spotted area differs from the areas in the proper graph, indicating the errors during processing each subject in flanker tasks in this assignment. There are four conjectures of error steps after reviewing the wrong result
侧翼任务主要是指前额叶区域和前扣带回皮层(ACC)的激活信号,因为这项任务要求参与者在看到独特标志后立即采取行动。MATLAB 中的 SPM 用于处理侧翼任务数据收集中的大脑激活信号数据。这一过程分为四个步骤:对所有受试者进行预处理、一级分析、二级分析和感兴趣区(ROI)分析。经过所有步骤后,最终的图形应显示代表前额叶和前扣带回皮层区域激活信号的黑点。然而,黑点区域与正确图形中的区域不同,这表明每个受试者在处理本任务中的侧翼任务时出现了错误。回顾错误结果后,有四种错误步骤猜想
{"title":"Four possible error conjectures with SPM in MATLAB","authors":"Ke Ning","doi":"10.54254/2755-2721/55/20241491","DOIUrl":"https://doi.org/10.54254/2755-2721/55/20241491","url":null,"abstract":"Flanker tasks are mainly supposed to have activation signals on the prefrontal regions and anterior cingulate cortex (ACC) because this assignment requires immediate actions from participants after seeing unique signs. The SPM in MATLAB is used to process the data of activation signals of brains from flanker task data collection. This process has four steps: preprocessing of all the subjects, first-level analysis, second-level analysis, and Region of Interest (ROI) analysis. After all the steps, the final graph should show black spots representing the activation signal on the prefrontal and anterior cingulate cortex regions. However, the spotted area differs from the areas in the proper graph, indicating the errors during processing each subject in flanker tasks in this assignment. There are four conjectures of error steps after reviewing the wrong result","PeriodicalId":502253,"journal":{"name":"Applied and Computational Engineering","volume":"35 7","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141805350","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Applied and Computational Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1