利用大型语言模型检测越南行政文件中的拼写错误

Huan The Phung, Nghia Van Luong
{"title":"利用大型语言模型检测越南行政文件中的拼写错误","authors":"Huan The Phung, Nghia Van Luong","doi":"10.46223/hcmcoujs.tech.en.14.1.3141.2024","DOIUrl":null,"url":null,"abstract":"In the context of the emergence of more and more administrative documents, the need to ensure accuracy and improve the quality of these documents becomes increasingly important. This research focuses on applying advanced language models to detect spelling errors in administrative documents. Specifically, in this study, a new method using a language model based on the Transformers architecture is proposed to automatically detect and correct common spelling errors in administrative documents. This method combines the model’s ability to understand context and grammar to identify words or phrases that are likely to be misspelled. The proposed method is tested on a dataset containing real administrative documents, and the experimental results show that the proposed model is capable of detecting spelling errors with significant performance, helping to improve accuracy. and improve the quality of administrative documents. This research not only contributes to improving the quality of administrative documents but also opens up new research directions in applying language models to issues related to natural language processing in the field of administration.","PeriodicalId":512408,"journal":{"name":"HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY","volume":"10 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Detecting spelling errors in Vietnamese administrative document using large language models\",\"authors\":\"Huan The Phung, Nghia Van Luong\",\"doi\":\"10.46223/hcmcoujs.tech.en.14.1.3141.2024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the context of the emergence of more and more administrative documents, the need to ensure accuracy and improve the quality of these documents becomes increasingly important. This research focuses on applying advanced language models to detect spelling errors in administrative documents. Specifically, in this study, a new method using a language model based on the Transformers architecture is proposed to automatically detect and correct common spelling errors in administrative documents. This method combines the model’s ability to understand context and grammar to identify words or phrases that are likely to be misspelled. The proposed method is tested on a dataset containing real administrative documents, and the experimental results show that the proposed model is capable of detecting spelling errors with significant performance, helping to improve accuracy. and improve the quality of administrative documents. This research not only contributes to improving the quality of administrative documents but also opens up new research directions in applying language models to issues related to natural language processing in the field of administration.\",\"PeriodicalId\":512408,\"journal\":{\"name\":\"HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY\",\"volume\":\"10 5\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.46223/hcmcoujs.tech.en.14.1.3141.2024\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"HO CHI MINH CITY OPEN UNIVERSITY JOURNAL OF SCIENCE - ENGINEERING AND TECHNOLOGY","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.46223/hcmcoujs.tech.en.14.1.3141.2024","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在越来越多的行政文件出现的背景下,确保这些文件的准确性并提高其质量变得越来越重要。本研究的重点是应用先进的语言模型来检测行政文件中的拼写错误。具体来说,本研究提出了一种使用基于 Transformers 架构的语言模型的新方法,用于自动检测和纠正行政文件中的常见拼写错误。该方法结合了模型理解上下文和语法的能力,以识别可能拼写错误的单词或短语。实验结果表明,所提出的模型能够以显著的性能检测拼写错误,有助于提高行政文件的准确性和质量。这项研究不仅有助于提高行政文件的质量,还为将语言模型应用于行政领域的自然语言处理相关问题开辟了新的研究方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Detecting spelling errors in Vietnamese administrative document using large language models
In the context of the emergence of more and more administrative documents, the need to ensure accuracy and improve the quality of these documents becomes increasingly important. This research focuses on applying advanced language models to detect spelling errors in administrative documents. Specifically, in this study, a new method using a language model based on the Transformers architecture is proposed to automatically detect and correct common spelling errors in administrative documents. This method combines the model’s ability to understand context and grammar to identify words or phrases that are likely to be misspelled. The proposed method is tested on a dataset containing real administrative documents, and the experimental results show that the proposed model is capable of detecting spelling errors with significant performance, helping to improve accuracy. and improve the quality of administrative documents. This research not only contributes to improving the quality of administrative documents but also opens up new research directions in applying language models to issues related to natural language processing in the field of administration.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Hybrid knowledge-infused collaborative filtering for enhanced movie clustering and recommendation Performance comparison ensemble classifier’s performance in answering frequently asked questions about psychology Integrating BIM and computer vision for preventing Hazards at construction sites A study on constructing an efficient examination scheduling system Detecting spelling errors in Vietnamese administrative document using large language models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1