首页 > 最新文献

HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY最新文献

英文 中文
Hybrid knowledge-infused collaborative filtering for enhanced movie clustering and recommendation 混合知识融合协同过滤技术用于增强电影聚类和推荐功能
Pub Date : 2024-03-06 DOI: 10.46223/hcmcoujs.tech.en.14.1.2927.2024
Hong Thi Thu Phan, Vuong Luong Nguyen, Trinh Quoc Vo, Nguyen Ho Trong Pham
This article proposes an enhanced knowledge-based collaborative filtering model for movie recommendation services to address the limitations of collaborative filtering in capturing the diverse preferences and specific characteristics of movies. The proposed model integrates external knowledge sources, such as movie plots and reviews, to enrich the recommendation process. By leveraging this additional information, the model can better understand movies’ unique features and attributes, improving recommendation accuracy and relevance. The knowledge-based features are extracted and incorporated into the collaborative filtering framework, enhancing the model’s ability to match user preferences with movie characteristics. Experiments are conducted using the MovieLens dataset to evaluate the proposed model. The MAE and RMSE metrics are employed to assess the quality of recommendations. Comparative analyses are conducted against various baseline approaches, including popularity-based, CF-based, content-based, and hybrid recommendation models. The experimental results demonstrate the effectiveness of the proposed knowledge-based collaborative filtering model. The proposed model consistently outperforms the baselines, providing more accurate and personalized recommendations.
本文为电影推荐服务提出了一种基于知识的增强型协同过滤模型,以解决协同过滤在捕捉电影的不同偏好和具体特征方面的局限性。该模型整合了外部知识源,如电影情节和评论,以丰富推荐过程。通过利用这些附加信息,该模型可以更好地理解电影的独特特征和属性,从而提高推荐的准确性和相关性。基于知识的特征被提取出来并纳入协同过滤框架,从而增强了模型将用户偏好与电影特征相匹配的能力。我们使用 MovieLens 数据集进行了实验,以评估所提出的模型。采用 MAE 和 RMSE 指标来评估推荐的质量。与各种基线方法进行了比较分析,包括基于流行度的推荐模型、基于 CF 的推荐模型、基于内容的推荐模型和混合推荐模型。实验结果证明了所提出的基于知识的协同过滤模型的有效性。所提出的模型始终优于基线方法,能提供更准确、更个性化的推荐。
{"title":"Hybrid knowledge-infused collaborative filtering for enhanced movie clustering and recommendation","authors":"Hong Thi Thu Phan, Vuong Luong Nguyen, Trinh Quoc Vo, Nguyen Ho Trong Pham","doi":"10.46223/hcmcoujs.tech.en.14.1.2927.2024","DOIUrl":"https://doi.org/10.46223/hcmcoujs.tech.en.14.1.2927.2024","url":null,"abstract":"This article proposes an enhanced knowledge-based collaborative filtering model for movie recommendation services to address the limitations of collaborative filtering in capturing the diverse preferences and specific characteristics of movies. The proposed model integrates external knowledge sources, such as movie plots and reviews, to enrich the recommendation process. By leveraging this additional information, the model can better understand movies’ unique features and attributes, improving recommendation accuracy and relevance. The knowledge-based features are extracted and incorporated into the collaborative filtering framework, enhancing the model’s ability to match user preferences with movie characteristics. Experiments are conducted using the MovieLens dataset to evaluate the proposed model. The MAE and RMSE metrics are employed to assess the quality of recommendations. Comparative analyses are conducted against various baseline approaches, including popularity-based, CF-based, content-based, and hybrid recommendation models. The experimental results demonstrate the effectiveness of the proposed knowledge-based collaborative filtering model. The proposed model consistently outperforms the baselines, providing more accurate and personalized recommendations.","PeriodicalId":512408,"journal":{"name":"HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY","volume":"6 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140261909","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating BIM and computer vision for preventing Hazards at construction sites 整合建筑信息模型和计算机视觉,预防建筑工地危险
Pub Date : 2024-03-05 DOI: 10.46223/hcmcoujs.tech.en.14.1.2966.2024
Si Tran
Construction safety monitoring is vital in enhancing site safety, such as tracking entering hazardous areas and the correlation between workers and other hazard entities. Therein, computer vision-based image/video processing, one of the emerging technologies, has been actively used to automatically identify and recognize unsafe conditions. However, the construction site has various potential hazard situations during the project. Due to the site’s complexity, many visual devices simultaneously participate in monitoring. It challenges developing and operating corresponding detection algorithms at specific workplaces and times. Besides, safety information detected by computer vision must be organized before being delivered to stakeholders. Hence, this study proposes an approach for construction safety monitoring using vision intelligence technology and BIM-cloud, called BMT. The BMT comprises two modules: (1) the virtual model based on the 4D BIM-cloud model, which provides the spatial-temporal information to decide computer vision algorithm adoptions; (2) the construction physical model built the vision intelligence technologies, which is supported by (1) and deliver safety status and update into the BIM-cloud model to visualize and deliver the risk level to related employees. The efficiency of the BMT approach is validated by testing with the preliminary implementation of a prototype.
施工安全监控对加强工地安全至关重要,例如跟踪进入危险区域的情况以及工人与其他危险实体之间的关联。在这方面,基于计算机视觉的图像/视频处理作为新兴技术之一,已被积极用于自动识别和识别不安全状况。然而,建筑工地在施工过程中存在各种潜在的危险情况。由于工地的复杂性,许多视觉设备同时参与监控。这对开发和运行特定工作场所和时间的相应检测算法提出了挑战。此外,计算机视觉检测到的安全信息必须经过整理后才能传递给利益相关者。因此,本研究提出了一种利用视觉智能技术和 BIM 云进行施工安全监控的方法,称为 BMT。BMT 包括两个模块:(1)基于四维 BIM 云模型的虚拟模型,为决定计算机视觉算法的采用提供时空信息;(2)建立视觉智能技术的施工物理模型,在(1)的支持下,将安全状态和更新信息传递到 BIM 云模型中,以可视化的方式将风险等级传递给相关员工。通过对原型的初步实施测试,验证了 BMT 方法的效率。
{"title":"Integrating BIM and computer vision for preventing Hazards at construction sites","authors":"Si Tran","doi":"10.46223/hcmcoujs.tech.en.14.1.2966.2024","DOIUrl":"https://doi.org/10.46223/hcmcoujs.tech.en.14.1.2966.2024","url":null,"abstract":"Construction safety monitoring is vital in enhancing site safety, such as tracking entering hazardous areas and the correlation between workers and other hazard entities. Therein, computer vision-based image/video processing, one of the emerging technologies, has been actively used to automatically identify and recognize unsafe conditions. However, the construction site has various potential hazard situations during the project. Due to the site’s complexity, many visual devices simultaneously participate in monitoring. It challenges developing and operating corresponding detection algorithms at specific workplaces and times. Besides, safety information detected by computer vision must be organized before being delivered to stakeholders. Hence, this study proposes an approach for construction safety monitoring using vision intelligence technology and BIM-cloud, called BMT. The BMT comprises two modules: (1) the virtual model based on the 4D BIM-cloud model, which provides the spatial-temporal information to decide computer vision algorithm adoptions; (2) the construction physical model built the vision intelligence technologies, which is supported by (1) and deliver safety status and update into the BIM-cloud model to visualize and deliver the risk level to related employees. The efficiency of the BMT approach is validated by testing with the preliminary implementation of a prototype.","PeriodicalId":512408,"journal":{"name":"HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY","volume":"13 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140263579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A sum rate maximization problem in uplink MIMO with RSMA systems 带 RSMA 的上行多输入多输出系统中的总速率最大化问题
Pub Date : 2024-03-05 DOI: 10.46223/hcmcoujs.tech.en.14.1.2955.2024
Phung Truong
This study explores the problem of maximizing the sum rate in uplink multi-user Multiple-Input Multiple-Output (MIMO) using Rate-Splitting Multiple Access (RSMA) systems. The investigation revolves around the scenario where the Users (UEs) are single-antenna nodes transmitting data to a multi-antenna Base Station (BS) through the RSMA technique. The optimization process encompasses determining parameters such as UEs’ transmit powers, decoding order, and detection vector at the BS. An approach based on Deep Reinforcement Learning (DRL) is introduced to address this challenge. This DRL framework involves an action-refined stage and applies a Deep Deterministic Policy Gradient (DDPG)-based strategy. Simulation outcomes effectively demonstrate the convergence of the proposed DRL framework, where it converges after approximately 1,800 episodes. Also, the results prove the superior performance of the proposed method when compared to established benchmark strategies, where it is up to 45% and 86% higher than the local search and random schemes, respectively.
本研究探讨了在上行链路多用户多输入多输出(MIMO)系统中使用速率分割多路访问(RSMA)最大化总和速率的问题。研究围绕用户(UE)是通过 RSMA 技术向多天线基站(BS)传输数据的单天线节点这一场景展开。优化过程包括确定 UE 的发射功率、解码顺序和 BS 的检测向量等参数。为应对这一挑战,引入了一种基于深度强化学习(DRL)的方法。该 DRL 框架包括一个行动提炼阶段,并应用了基于深度确定性策略梯度(DDPG)的策略。仿真结果有效地证明了所提出的 DRL 框架的收敛性,即在大约 1,800 个事件后收敛。此外,仿真结果还证明,与已有的基准策略相比,所提出的方法具有卓越的性能,比局部搜索和随机方案分别高出 45% 和 86%。
{"title":"A sum rate maximization problem in uplink MIMO with RSMA systems","authors":"Phung Truong","doi":"10.46223/hcmcoujs.tech.en.14.1.2955.2024","DOIUrl":"https://doi.org/10.46223/hcmcoujs.tech.en.14.1.2955.2024","url":null,"abstract":"This study explores the problem of maximizing the sum rate in uplink multi-user Multiple-Input Multiple-Output (MIMO) using Rate-Splitting Multiple Access (RSMA) systems. The investigation revolves around the scenario where the Users (UEs) are single-antenna nodes transmitting data to a multi-antenna Base Station (BS) through the RSMA technique. The optimization process encompasses determining parameters such as UEs’ transmit powers, decoding order, and detection vector at the BS. An approach based on Deep Reinforcement Learning (DRL) is introduced to address this challenge. This DRL framework involves an action-refined stage and applies a Deep Deterministic Policy Gradient (DDPG)-based strategy. Simulation outcomes effectively demonstrate the convergence of the proposed DRL framework, where it converges after approximately 1,800 episodes. Also, the results prove the superior performance of the proposed method when compared to established benchmark strategies, where it is up to 45% and 86% higher than the local search and random schemes, respectively.","PeriodicalId":512408,"journal":{"name":"HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY","volume":"11 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140265203","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic matrix factorization-based collaborative filtering in movie recommendation services 电影推荐服务中基于动态矩阵因式分解的协同过滤技术
Pub Date : 2024-03-05 DOI: 10.46223/hcmcoujs.tech.en.14.1.2922.2024
Vuong Luong Nguyen, Trinh Quoc Vo, Hoai Thi Thuy Nguyen
Movies are a primary source of entertainment, but finding specific content can be challenging given the exponentially increasing number of movies produced each year. Recommendation systems are extremely useful for solving this problem. While various approaches exist, Collaborative Filtering (CF) is the most straightforward. CF leverages user input and historical preferences to determine user similarity and suggest movies. Matrix Factorization (MF) is one of the most popular Collaborative Filtering (CF) techniques. It maps users and items into a joint latent space, using a vector of latent features to represent each user or item. However, traditional MF techniques are static, while user cognition and product variety are constantly evolving. As a result, traditional MF approaches struggle to accommodate the dynamic nature of user-item interactions. To address this challenge, we propose a Dynamic Matrix Factorization CF model for movie recommendation systems (DMF-CF) that considers the dynamic changes in user interactions. To validate our approach, we conducted evaluations using the standard MovieLens dataset and compared it to state-of-the-art models. Our preliminary findings highlight the substantial benefits of DMF-CF, which outperforms recent models on the MovieLens-100K and MovieLens-1M datasets in terms of Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) metrics.
电影是人们娱乐的主要来源,但由于每年生产的电影数量呈指数级增长,要找到特定的电影内容非常具有挑战性。推荐系统对于解决这一问题非常有用。虽然有多种方法,但协同过滤(CF)是最直接的方法。协同过滤利用用户输入和历史偏好来确定用户相似度并推荐电影。矩阵因式分解(MF)是最流行的协同过滤(CF)技术之一。它将用户和项目映射到一个联合潜在空间,使用潜在特征向量来表示每个用户或项目。然而,传统的 MF 技术是静态的,而用户认知和产品种类是不断变化的。因此,传统的 MF 方法很难适应用户-物品交互的动态性质。为了应对这一挑战,我们提出了一种用于电影推荐系统的动态矩阵因式分解 CF 模型(DMF-CF),该模型考虑了用户交互的动态变化。为了验证我们的方法,我们使用标准的 MovieLens 数据集进行了评估,并将其与最先进的模型进行了比较。在 MovieLens-100K 和 MovieLens-1M 数据集上,DMF-CF 的平均绝对误差(MAE)和均方根误差(RMSE)指标优于最新模型。
{"title":"Dynamic matrix factorization-based collaborative filtering in movie recommendation services","authors":"Vuong Luong Nguyen, Trinh Quoc Vo, Hoai Thi Thuy Nguyen","doi":"10.46223/hcmcoujs.tech.en.14.1.2922.2024","DOIUrl":"https://doi.org/10.46223/hcmcoujs.tech.en.14.1.2922.2024","url":null,"abstract":"Movies are a primary source of entertainment, but finding specific content can be challenging given the exponentially increasing number of movies produced each year. Recommendation systems are extremely useful for solving this problem. While various approaches exist, Collaborative Filtering (CF) is the most straightforward. CF leverages user input and historical preferences to determine user similarity and suggest movies. Matrix Factorization (MF) is one of the most popular Collaborative Filtering (CF) techniques. It maps users and items into a joint latent space, using a vector of latent features to represent each user or item. However, traditional MF techniques are static, while user cognition and product variety are constantly evolving. As a result, traditional MF approaches struggle to accommodate the dynamic nature of user-item interactions. To address this challenge, we propose a Dynamic Matrix Factorization CF model for movie recommendation systems (DMF-CF) that considers the dynamic changes in user interactions. To validate our approach, we conducted evaluations using the standard MovieLens dataset and compared it to state-of-the-art models. Our preliminary findings highlight the substantial benefits of DMF-CF, which outperforms recent models on the MovieLens-100K and MovieLens-1M datasets in terms of Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) metrics.","PeriodicalId":512408,"journal":{"name":"HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY","volume":"22 8","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140264957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance comparison ensemble classifier’s performance in answering frequently asked questions about psychology 集合分类器在回答心理学常见问题时的性能比较
Pub Date : 2024-03-05 DOI: 10.46223/hcmcoujs.tech.en.14.1.2921.2024
Vy Thuy Tong, Hieu Chi Tran, Kiet Trung Tran
In today’s era of digital healthcare transformation, there is a growing demand for swift responses to mental health queries. To meet this need, we introduce an AI-driven chatbot system designed to automatically address frequently asked questions in psychology. Leveraging a range of classifiers including Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Naïve Bayes, our system extracts insights from expert data sources and employs natural language processing techniques like LDA Topic Modeling and Cosine similarity to generate contextually relevant responses. Through rigorous experimentation, we find that SVM surpasses Naïve Bayes and KNN in accuracy, precision, recall, and F1-score, making it our top choice for constructing the final response system. This research underscores the effectiveness of ensemble classifiers, particularly SVM, in providing accurate and valuable information to enhance mental health support in response to common psychological inquiries.
在当今的数字医疗转型时代,人们对快速回复心理健康问题的需求日益增长。为了满足这一需求,我们推出了一款人工智能驱动的聊天机器人系统,旨在自动解决心理学方面的常见问题。我们的系统利用支持向量机(SVM)、K-近邻(KNN)和奈夫贝叶斯(Naïve Bayes)等一系列分类器,从专家数据源中提取见解,并采用 LDA 主题建模和余弦相似性等自然语言处理技术,生成与上下文相关的回复。通过严格的实验,我们发现 SVM 在准确度、精确度、召回率和 F1 分数方面都超过了 Naïve Bayes 和 KNN,因此成为我们构建最终回复系统的首选。这项研究强调了集合分类器的有效性,尤其是 SVM,它能提供准确而有价值的信息,在回答常见的心理咨询时加强心理健康支持。
{"title":"Performance comparison ensemble classifier’s performance in answering frequently asked questions about psychology","authors":"Vy Thuy Tong, Hieu Chi Tran, Kiet Trung Tran","doi":"10.46223/hcmcoujs.tech.en.14.1.2921.2024","DOIUrl":"https://doi.org/10.46223/hcmcoujs.tech.en.14.1.2921.2024","url":null,"abstract":"In today’s era of digital healthcare transformation, there is a growing demand for swift responses to mental health queries. To meet this need, we introduce an AI-driven chatbot system designed to automatically address frequently asked questions in psychology. Leveraging a range of classifiers including Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Naïve Bayes, our system extracts insights from expert data sources and employs natural language processing techniques like LDA Topic Modeling and Cosine similarity to generate contextually relevant responses. Through rigorous experimentation, we find that SVM surpasses Naïve Bayes and KNN in accuracy, precision, recall, and F1-score, making it our top choice for constructing the final response system. This research underscores the effectiveness of ensemble classifiers, particularly SVM, in providing accurate and valuable information to enhance mental health support in response to common psychological inquiries.","PeriodicalId":512408,"journal":{"name":"HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY","volume":"11 11","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140263543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A study on constructing an efficient examination scheduling system 构建高效考试时间安排系统的研究
Pub Date : 2024-03-05 DOI: 10.46223/hcmcoujs.tech.en.14.1.2920.2024
Linh Nguyen Mai Vu, Hieu Chi Tran, Anh Thi Tram Nguyen
The objective of this study is to investigate the final exam scheduling process at the Ho Chi Minh City Open University and develop an automated exam scheduling application. Our primary objectives are to prevent students from having conflicting exam schedules and to ensure that no student has to take more than two exams on the same day. This research focuses on applying graph coloring algorithms to the problem of automatic exam scheduling. Our research findings indicate that the graph coloring algorithm is highly effective for automated exam scheduling. This study has the potential to expand and support the development of an automatic exam scheduling and management system, in line with our overall goals. We conduct the experiments on the practical data at HCMCOU and obtain promising results.
本研究的目的是调查胡志明市开放大学的期末考试安排流程,并开发一个自动考试安排应用程序。我们的主要目标是防止学生的考试时间发生冲突,并确保没有学生需要在同一天参加两场以上的考试。本研究的重点是将图形着色算法应用于自动考试时间安排问题。我们的研究结果表明,图着色算法对自动考试时间安排非常有效。根据我们的总体目标,这项研究有可能扩展和支持自动考试调度和管理系统的开发。我们在华中师范大学的实际数据上进行了实验,并取得了可喜的成果。
{"title":"A study on constructing an efficient examination scheduling system","authors":"Linh Nguyen Mai Vu, Hieu Chi Tran, Anh Thi Tram Nguyen","doi":"10.46223/hcmcoujs.tech.en.14.1.2920.2024","DOIUrl":"https://doi.org/10.46223/hcmcoujs.tech.en.14.1.2920.2024","url":null,"abstract":"The objective of this study is to investigate the final exam scheduling process at the Ho Chi Minh City Open University and develop an automated exam scheduling application. Our primary objectives are to prevent students from having conflicting exam schedules and to ensure that no student has to take more than two exams on the same day. This research focuses on applying graph coloring algorithms to the problem of automatic exam scheduling. Our research findings indicate that the graph coloring algorithm is highly effective for automated exam scheduling. This study has the potential to expand and support the development of an automatic exam scheduling and management system, in line with our overall goals. We conduct the experiments on the practical data at HCMCOU and obtain promising results.","PeriodicalId":512408,"journal":{"name":"HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY","volume":"16 3","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140264229","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting spelling errors in Vietnamese administrative document using large language models 利用大型语言模型检测越南行政文件中的拼写错误
Pub Date : 2024-03-05 DOI: 10.46223/hcmcoujs.tech.en.14.1.3141.2024
Huan The Phung, Nghia Van Luong
In the context of the emergence of more and more administrative documents, the need to ensure accuracy and improve the quality of these documents becomes increasingly important. This research focuses on applying advanced language models to detect spelling errors in administrative documents. Specifically, in this study, a new method using a language model based on the Transformers architecture is proposed to automatically detect and correct common spelling errors in administrative documents. This method combines the model’s ability to understand context and grammar to identify words or phrases that are likely to be misspelled. The proposed method is tested on a dataset containing real administrative documents, and the experimental results show that the proposed model is capable of detecting spelling errors with significant performance, helping to improve accuracy. and improve the quality of administrative documents. This research not only contributes to improving the quality of administrative documents but also opens up new research directions in applying language models to issues related to natural language processing in the field of administration.
在越来越多的行政文件出现的背景下,确保这些文件的准确性并提高其质量变得越来越重要。本研究的重点是应用先进的语言模型来检测行政文件中的拼写错误。具体来说,本研究提出了一种使用基于 Transformers 架构的语言模型的新方法,用于自动检测和纠正行政文件中的常见拼写错误。该方法结合了模型理解上下文和语法的能力,以识别可能拼写错误的单词或短语。实验结果表明,所提出的模型能够以显著的性能检测拼写错误,有助于提高行政文件的准确性和质量。这项研究不仅有助于提高行政文件的质量,还为将语言模型应用于行政领域的自然语言处理相关问题开辟了新的研究方向。
{"title":"Detecting spelling errors in Vietnamese administrative document using large language models","authors":"Huan The Phung, Nghia Van Luong","doi":"10.46223/hcmcoujs.tech.en.14.1.3141.2024","DOIUrl":"https://doi.org/10.46223/hcmcoujs.tech.en.14.1.3141.2024","url":null,"abstract":"In the context of the emergence of more and more administrative documents, the need to ensure accuracy and improve the quality of these documents becomes increasingly important. This research focuses on applying advanced language models to detect spelling errors in administrative documents. Specifically, in this study, a new method using a language model based on the Transformers architecture is proposed to automatically detect and correct common spelling errors in administrative documents. This method combines the model’s ability to understand context and grammar to identify words or phrases that are likely to be misspelled. The proposed method is tested on a dataset containing real administrative documents, and the experimental results show that the proposed model is capable of detecting spelling errors with significant performance, helping to improve accuracy. and improve the quality of administrative documents. This research not only contributes to improving the quality of administrative documents but also opens up new research directions in applying language models to issues related to natural language processing in the field of administration.","PeriodicalId":512408,"journal":{"name":"HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY","volume":"10 5","pages":""},"PeriodicalIF":0.0,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140264238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1