首页 > 最新文献

International Conference on Algorithms, Microchips and Network Applications最新文献

英文 中文
Ethernet data distribution service data processing method 以太网数据分发业务数据处理方法
Yu Wu, Xiaoya Li, Huiyang Hu
With the rapid development of aviation electromechanical technology, new aircraft are equipped with a large number of sensors and electronic control devices. In order to meet the data transmission requirements of avionics systems, Ethernet Data Distribution (DDS) protocol has gradually become the standard solution for published/subscribed data in distributed real-time systems. Facing the new requirements of bus data processing using DDS protocol, starting from the characteristics and format of Ethernet DDS data recording, this paper gives the overall architecture and core processing method of the processing software. This paper analyzes the key problems of Ethernet DDS data processing and proposes solutions for DDS bus Interface Control Document (ICD) plane structure, multi-level parallel data processing algorithm, data detection and filtering. The method is validated and can correctly process Ethernet DDS data.
随着航空机电技术的飞速发展,新型飞机上安装了大量的传感器和电子控制装置。为了满足航空电子系统的数据传输需求,以太网数据分发(DDS)协议逐渐成为分布式实时系统中发布/订阅数据的标准解决方案。面对采用DDS协议进行总线数据处理的新要求,本文从以太网DDS数据记录的特点和格式出发,给出了处理软件的总体架构和核心处理方法。分析了以太网DDS数据处理中的关键问题,从DDS总线接口控制文件(ICD)平面结构、多级并行数据处理算法、数据检测与滤波等方面提出了解决方案。该方法经过验证,能够正确处理以太网DDS数据。
{"title":"Ethernet data distribution service data processing method","authors":"Yu Wu, Xiaoya Li, Huiyang Hu","doi":"10.1117/12.2679124","DOIUrl":"https://doi.org/10.1117/12.2679124","url":null,"abstract":"With the rapid development of aviation electromechanical technology, new aircraft are equipped with a large number of sensors and electronic control devices. In order to meet the data transmission requirements of avionics systems, Ethernet Data Distribution (DDS) protocol has gradually become the standard solution for published/subscribed data in distributed real-time systems. Facing the new requirements of bus data processing using DDS protocol, starting from the characteristics and format of Ethernet DDS data recording, this paper gives the overall architecture and core processing method of the processing software. This paper analyzes the key problems of Ethernet DDS data processing and proposes solutions for DDS bus Interface Control Document (ICD) plane structure, multi-level parallel data processing algorithm, data detection and filtering. The method is validated and can correctly process Ethernet DDS data.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123465310","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nuclear weapon prediction based on the verhulst method of comprehensive weighting and LS-SVM equidimensional information supplement 基于综合加权法和LS-SVM等维信息补充的核武器预测
Hongyi Duan, Jianan Zhang
In this paper, to predict the nuclear weapons, we first introduce evaluation indicators that affect the possession of nuclear weapons, economic indicators, scientific and technological indicators, and establish a TOPSIS evaluation model improved by the optimal assignment method to predict countries with evaluation values less than 20, as countries that will possess nuclear weapons in the next 100 years. Then, in view of the fact that the number of nuclear weapons is calculated in years and changes over time, and considering the global consensus to limit the number of nuclear weapons from 2022 when the Treaty on the Prohibition of Nuclear Weapons and other policies come into force, it is decided to build a Verhulst prediction model with saturation based on the LS-SVM algorithm, and finally to improve the accuracy and reasonableness of the model by using the metabolic data processing method of equal-dimensional neutrosophic recurrence prediction. By predicting the number of nuclear weapons, countries can make reasonable plans for future nuclear weapons production and hope to reach a global consensus, which will help to solve the nuclear crisis.
为了预测核武器,本文首先引入影响拥有核武器的评价指标、经济指标、科技指标,建立了经最优赋值法改进的TOPSIS评价模型,将评价值小于20的国家预测为未来100年内拥有核武器的国家。然后,考虑到核武器数量是按年计算的,并且随着时间的变化而变化,并考虑到从《禁止核武器条约》等政策生效的2022年开始限制核武器数量的全球共识,决定建立基于LS-SVM算法的饱和Verhulst预测模型。最后采用等维嗜中性粒细胞复发预测的代谢数据处理方法,提高模型的准确性和合理性。通过对核武器数量的预测,各国可以对未来的核武器生产做出合理的规划,并有望在全球范围内达成共识,这将有助于解决核危机。
{"title":"Nuclear weapon prediction based on the verhulst method of comprehensive weighting and LS-SVM equidimensional information supplement","authors":"Hongyi Duan, Jianan Zhang","doi":"10.1117/12.2678905","DOIUrl":"https://doi.org/10.1117/12.2678905","url":null,"abstract":"In this paper, to predict the nuclear weapons, we first introduce evaluation indicators that affect the possession of nuclear weapons, economic indicators, scientific and technological indicators, and establish a TOPSIS evaluation model improved by the optimal assignment method to predict countries with evaluation values less than 20, as countries that will possess nuclear weapons in the next 100 years. Then, in view of the fact that the number of nuclear weapons is calculated in years and changes over time, and considering the global consensus to limit the number of nuclear weapons from 2022 when the Treaty on the Prohibition of Nuclear Weapons and other policies come into force, it is decided to build a Verhulst prediction model with saturation based on the LS-SVM algorithm, and finally to improve the accuracy and reasonableness of the model by using the metabolic data processing method of equal-dimensional neutrosophic recurrence prediction. By predicting the number of nuclear weapons, countries can make reasonable plans for future nuclear weapons production and hope to reach a global consensus, which will help to solve the nuclear crisis.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123183940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Desensitization method of image data in the Internet of Vehicles based on instance segmentation 基于实例分割的车联网图像数据脱敏方法
Shuang Li, Yue Zhou, Xin Zhang, Meng Zhang
With the continuous development of intelligent connected vehicle industry, cameras and other vehicle-mounted devices are widely used, so the amount of data collection is increasing. There is a large amount of sensitive information hidden in the image data generated by connected vehicles. Once the data leakage event occurs, it may cause very serious consequences. In order to improve the security of connected vehicle data and reduce the threat of sensitive information leakage in image data, this paper provides a desensitization process of connected vehicle image data, and desensitizes sensitive information based on instance segmentation technology. In this paper, a real road image dataset is collected, and realizes desensitization of the modified dataset based on proposed framework.
随着智能网联汽车产业的不断发展,摄像头等车载设备被广泛使用,数据采集量也在不断增加。车联网产生的图像数据中隐藏着大量的敏感信息。数据泄露事件一旦发生,可能会造成非常严重的后果。为了提高车联网数据的安全性,降低图像数据中敏感信息泄露的威胁,本文提出了一种车联网图像数据的脱敏处理方法,并基于实例分割技术对敏感信息进行脱敏处理。本文以真实道路图像数据集为基础,基于所提出的框架实现了修改后数据集的脱敏。
{"title":"Desensitization method of image data in the Internet of Vehicles based on instance segmentation","authors":"Shuang Li, Yue Zhou, Xin Zhang, Meng Zhang","doi":"10.1117/12.2679260","DOIUrl":"https://doi.org/10.1117/12.2679260","url":null,"abstract":"With the continuous development of intelligent connected vehicle industry, cameras and other vehicle-mounted devices are widely used, so the amount of data collection is increasing. There is a large amount of sensitive information hidden in the image data generated by connected vehicles. Once the data leakage event occurs, it may cause very serious consequences. In order to improve the security of connected vehicle data and reduce the threat of sensitive information leakage in image data, this paper provides a desensitization process of connected vehicle image data, and desensitizes sensitive information based on instance segmentation technology. In this paper, a real road image dataset is collected, and realizes desensitization of the modified dataset based on proposed framework.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114130015","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Human fall detection scheme based on YOLO visual recognition and embedded ARM architecture 基于YOLO视觉识别和嵌入式ARM架构的人体跌倒检测方案
Zhuoya Jia, Hanbo Zhang, Yang Jia, Yunjing Zheng, Dong Li, Shaobo Jia
This paper proposes a fall detection technology based on the YOLOv5s algorithm to solve the problem of hit injury. The method is designed based on the embedded ARM development board of Orange Pi Zero 2. The camera is used to collect human data in real-time, and algorithms train the collected data and are finally verified. The experimental results show that: (1) this method has a reasonable success rate of recognition for standing, walking, and falling, but the success rate of recognition for squatting needs to be improved; (2) Compared with the OpenPose algorithm, the YOLOv5 algorithm has better accuracy, precision, and average accuracy means, but the performance in recall rate is not very good.
本文提出了一种基于YOLOv5s算法的跌倒检测技术,以解决碰撞损伤问题。该方法是基于嵌入式ARM开发板Orange Pi Zero 2设计的。该摄像机用于实时采集人体数据,算法对采集到的数据进行训练并最终验证。实验结果表明:(1)该方法对站立、行走和跌倒的识别成功率都比较合理,但对蹲下的识别成功率还有待提高;(2)与OpenPose算法相比,YOLOv5算法具有更好的正确率、精密度和平均正确率均值,但在召回率方面表现不佳。
{"title":"Human fall detection scheme based on YOLO visual recognition and embedded ARM architecture","authors":"Zhuoya Jia, Hanbo Zhang, Yang Jia, Yunjing Zheng, Dong Li, Shaobo Jia","doi":"10.1117/12.2678904","DOIUrl":"https://doi.org/10.1117/12.2678904","url":null,"abstract":"This paper proposes a fall detection technology based on the YOLOv5s algorithm to solve the problem of hit injury. The method is designed based on the embedded ARM development board of Orange Pi Zero 2. The camera is used to collect human data in real-time, and algorithms train the collected data and are finally verified. The experimental results show that: (1) this method has a reasonable success rate of recognition for standing, walking, and falling, but the success rate of recognition for squatting needs to be improved; (2) Compared with the OpenPose algorithm, the YOLOv5 algorithm has better accuracy, precision, and average accuracy means, but the performance in recall rate is not very good.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133193555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on matrix entropy value between industrial ethernet data and internet data based on information entropy 基于信息熵的工业以太网数据与互联网数据间矩阵熵值研究
Hao Jiang, Jiongxuan Jia, Jianlei Gao, Fuyan Wang, Fengjuan Xu
The adoption of network technology for control and measurement makes Industrial Control Systems (ICSs) and Internet of Things (IoTs) more and more powerful, whose communication data plays an important role to realize data intercommunication and sharing. However, communication data have different characteristics, even if the same communication protocol is used in ICS or IT, the communication characteristics and modes are quite different. In order to understand the difference of communication data between internet and industrial ethernet, in this paper, we conduct a series of experiments that use information entropy algorithm to data changes based on different industrial protocol and http protocol, which always is used to measure the uncertainty and describe the uncertainty of data packets. The experimental result analysis shows that there exists a big difference in industrial ethernet data and internet data, the industrial ethernet data are more relatively balanced and more regular than those of internet data.
网络控制与测量技术的采用使得工业控制系统(ics)和物联网(iot)的功能越来越强大,其通信数据对实现数据的互通和共享起着重要作用。然而,通信数据具有不同的特性,即使在ICS或IT中使用相同的通信协议,其通信特性和模式也大不相同。为了了解互联网和工业以太网通信数据的差异,本文对基于不同工业协议和http协议的数据变化使用信息熵算法进行了一系列实验,信息熵算法通常用于测量数据包的不确定性和描述数据包的不确定性。实验结果分析表明,工业以太网数据与互联网数据存在较大差异,工业以太网数据比互联网数据更相对均衡、更有规律。
{"title":"Research on matrix entropy value between industrial ethernet data and internet data based on information entropy","authors":"Hao Jiang, Jiongxuan Jia, Jianlei Gao, Fuyan Wang, Fengjuan Xu","doi":"10.1117/12.2679243","DOIUrl":"https://doi.org/10.1117/12.2679243","url":null,"abstract":"The adoption of network technology for control and measurement makes Industrial Control Systems (ICSs) and Internet of Things (IoTs) more and more powerful, whose communication data plays an important role to realize data intercommunication and sharing. However, communication data have different characteristics, even if the same communication protocol is used in ICS or IT, the communication characteristics and modes are quite different. In order to understand the difference of communication data between internet and industrial ethernet, in this paper, we conduct a series of experiments that use information entropy algorithm to data changes based on different industrial protocol and http protocol, which always is used to measure the uncertainty and describe the uncertainty of data packets. The experimental result analysis shows that there exists a big difference in industrial ethernet data and internet data, the industrial ethernet data are more relatively balanced and more regular than those of internet data.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133525516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An analysis algorithm for real-time monitoring of campus crowd density based on campus wireless network logs 基于校园无线网络日志的校园人群密度实时监控分析算法
Ying Xia, Shuping Wu, Hui-qun Yu
In order to implement precision management on the campus, the decisions need data support, and the crowd density on campus is one of the important parts. Based on campus wireless network logs, which is widely used on the campus, this paper proposes an analysis algorithm to obtain online wireless network user numbers in real time and draws the conclusion that the numbers of online users can represent crowd density on campus. Experimental results show that this algorithm can effectively get the numbers of online users in each area of the campus, and the campus heat map made with these data can reflect the real-time distribution of campus crowd and crowd density. This method uses log analysis method which is a general solution for some problems and has practical value for in-depth analysis.
为了在校园实施精准管理,决策需要数据支持,而校园人群密度是其中的重要组成部分。基于校园中广泛使用的校园无线网络日志,本文提出了一种实时获取校园无线网络在线用户数的分析算法,并得出在线用户数可以代表校园人群密度的结论。实验结果表明,该算法可以有效地获取校园各区域的在线用户数,利用这些数据制作的校园热图可以反映校园人群的实时分布和人群密度。该方法采用对数分析法,是解决某些问题的通解,对深入分析具有实用价值。
{"title":"An analysis algorithm for real-time monitoring of campus crowd density based on campus wireless network logs","authors":"Ying Xia, Shuping Wu, Hui-qun Yu","doi":"10.1117/12.2679233","DOIUrl":"https://doi.org/10.1117/12.2679233","url":null,"abstract":"In order to implement precision management on the campus, the decisions need data support, and the crowd density on campus is one of the important parts. Based on campus wireless network logs, which is widely used on the campus, this paper proposes an analysis algorithm to obtain online wireless network user numbers in real time and draws the conclusion that the numbers of online users can represent crowd density on campus. Experimental results show that this algorithm can effectively get the numbers of online users in each area of the campus, and the campus heat map made with these data can reflect the real-time distribution of campus crowd and crowd density. This method uses log analysis method which is a general solution for some problems and has practical value for in-depth analysis.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128804315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A calculation model of the air threat range of ship route
F. Long, Ziang He, Qi Su
Aiming at the calculation of the air threat range of the ship route, the paper described the requirements and judges whether the route segment was under threat. On this basis, an algorithm model was established for the specific threat range of the threatened route segment, and a calculation example was given for verification. The analysis and verification results showed that the calculation model proposed in the paper could quickly and accurately calculate the specific threat range of the ship route, it can provide effective auxiliary decision-making reference for commanders or operators when drawing routes.
针对船舶航路空中威胁范围的计算,阐述了航路段是否受到威胁的要求和判断。在此基础上,针对受威胁路由段的具体威胁范围,建立了算法模型,并给出了计算实例进行验证。分析与验证结果表明,本文提出的计算模型能够快速、准确地计算出舰船航路的具体威胁范围,可为指挥员或作战员在航路绘制时提供有效的辅助决策参考。
{"title":"A calculation model of the air threat range of ship route","authors":"F. Long, Ziang He, Qi Su","doi":"10.1117/12.2678896","DOIUrl":"https://doi.org/10.1117/12.2678896","url":null,"abstract":"Aiming at the calculation of the air threat range of the ship route, the paper described the requirements and judges whether the route segment was under threat. On this basis, an algorithm model was established for the specific threat range of the threatened route segment, and a calculation example was given for verification. The analysis and verification results showed that the calculation model proposed in the paper could quickly and accurately calculate the specific threat range of the ship route, it can provide effective auxiliary decision-making reference for commanders or operators when drawing routes.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129906325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research on gated recurrent unit based stock price prediction model with multi-features under low time scale 低时间尺度下基于门控循环单元的多特征股价预测模型研究
Yinan Lyu, Yuanhao You
Under the development of people's living environment, more and more people are willing to use their money to invest in financial projects such as stocks and insurance. Nowadays, science and technology are widely applied in people's life. Machine learning is one of them. Machine learning is particularly important to apply to stock forecasting to better meet the requirements of people who want to gain more benefits. The purpose of this work is to compare using GRU, LSTM, and bidirectional LSTM's MAE and RMSE on the closing price. The method of this experiment is to compare with root mean squared error (RMSE) and mean absolute error (MSE) after the input variables of the past 63 trading days passing through those three models. The results of the experiment indicate that MAE of GRU model is lowest. Still, only nine of fifteen experiments show that RMSE of GRU model is lowest, and five of fifteen experiments show that RMSE of LSTM is lowest. One of fifteen experiments expresses that RMSE of bidirectional LSTM has the lowest RMSE. Thus, GRU is considered to be the best model for stock price regression.
在人们生活环境的发展下,越来越多的人愿意用自己的钱投资于股票、保险等金融项目。如今,科学技术被广泛应用于人们的生活中。机器学习就是其中之一。将机器学习应用到股票预测中,以更好地满足想要获得更多利益的人们的要求,就显得尤为重要。这项工作的目的是比较使用GRU, LSTM和双向LSTM的MAE和RMSE对收盘价的影响。本实验的方法是比较过去63个交易日的输入变量经过这三个模型后的均方根误差(RMSE)和平均绝对误差(MSE)。实验结果表明,GRU模型的MAE最低。然而,15个实验中只有9个实验表明GRU模型的RMSE最低,15个实验中有5个实验表明LSTM的RMSE最低。15个实验中有一个表明双向LSTM的RMSE最低。因此,GRU被认为是股票价格回归的最佳模型。
{"title":"Research on gated recurrent unit based stock price prediction model with multi-features under low time scale","authors":"Yinan Lyu, Yuanhao You","doi":"10.1117/12.2636639","DOIUrl":"https://doi.org/10.1117/12.2636639","url":null,"abstract":"Under the development of people's living environment, more and more people are willing to use their money to invest in financial projects such as stocks and insurance. Nowadays, science and technology are widely applied in people's life. Machine learning is one of them. Machine learning is particularly important to apply to stock forecasting to better meet the requirements of people who want to gain more benefits. The purpose of this work is to compare using GRU, LSTM, and bidirectional LSTM's MAE and RMSE on the closing price. The method of this experiment is to compare with root mean squared error (RMSE) and mean absolute error (MSE) after the input variables of the past 63 trading days passing through those three models. The results of the experiment indicate that MAE of GRU model is lowest. Still, only nine of fifteen experiments show that RMSE of GRU model is lowest, and five of fifteen experiments show that RMSE of LSTM is lowest. One of fifteen experiments expresses that RMSE of bidirectional LSTM has the lowest RMSE. Thus, GRU is considered to be the best model for stock price regression.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125105095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An ensemble multilingual model for toxic comment classification 有毒评论分类的集成多语言模型
Gaofei Xie
The online toxic comments cause enormous harm to the society, where toxicity is defined as anything rude, disrespectful or otherwise likely to make someone leave a discussion. To have a safer, more collaborative internet, grateful contributions are made by a main area of focus on machine learning models to identify toxicity in English, whereas part of misinformation disseminates in other languages. Over the past year, pretraining multilingual language models give rise to impressive gains for cross lingual toxicity classification. This paper presents an approach to build toxicity models applying the Jigsaw Multilingual Toxic Comment Classification dataset provided by Kaggle. We set our ensemble model in three parts based on Besides, we implement subsample, Pseudo-labeling with open-subtitles, translating non-English languages to English language, and Post Processing to improve the classification accuracy indispensably. Our final model achieved an AUC of 0.9469 for the training set and 0.9485 for the validation set, demonstrating the effectiveness of performance under cross-lingual toxicity detectors.
网络有毒评论对社会造成了巨大的危害,在这里,毒性被定义为任何粗鲁、不尊重或可能导致某人离开讨论的东西。为了拥有一个更安全、更协作的互联网,机器学习模型的主要关注领域做出了可喜的贡献,以识别英语中的毒性,而部分错误信息则以其他语言传播。在过去的一年中,预训练多语言语言模型在跨语言毒性分类方面取得了令人印象深刻的进展。本文提出了一种利用Kaggle提供的Jigsaw多语言毒性评论分类数据集构建毒性模型的方法。在此基础上,我们将集成模型分为三部分,并实现了子样本、开字幕伪标注、非英语语言翻译为英语语言和后处理,以提高分类精度。我们的最终模型的训练集和验证集的AUC分别为0.9469和0.9485,证明了跨语言毒性检测器下性能的有效性。
{"title":"An ensemble multilingual model for toxic comment classification","authors":"Gaofei Xie","doi":"10.1117/12.2636419","DOIUrl":"https://doi.org/10.1117/12.2636419","url":null,"abstract":"The online toxic comments cause enormous harm to the society, where toxicity is defined as anything rude, disrespectful or otherwise likely to make someone leave a discussion. To have a safer, more collaborative internet, grateful contributions are made by a main area of focus on machine learning models to identify toxicity in English, whereas part of misinformation disseminates in other languages. Over the past year, pretraining multilingual language models give rise to impressive gains for cross lingual toxicity classification. This paper presents an approach to build toxicity models applying the Jigsaw Multilingual Toxic Comment Classification dataset provided by Kaggle. We set our ensemble model in three parts based on Besides, we implement subsample, Pseudo-labeling with open-subtitles, translating non-English languages to English language, and Post Processing to improve the classification accuracy indispensably. Our final model achieved an AUC of 0.9469 for the training set and 0.9485 for the validation set, demonstrating the effectiveness of performance under cross-lingual toxicity detectors.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126979081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A landmark building detection and recognition based on improved Faster R-RDN algorithm 基于改进的Faster R-RDN算法的地标建筑检测与识别
Wu Jun, Kai Yan, ZiBo Huang, Haiyan Tan, Xiaofang Tu, Chengjun Zhu
This paper proposes an improved Faster R-DRN (Dense Residual Network, DRN) algorithm, which is based on Faster R-CNN, using densely connected residual network DRNet to replace VGG network. This algorithm is suitable for special scenes of building recognition. It has a residual network and a deep convolution residual network structure, which can efficiently perform image detection, classification and recognition. This design optimizes the problem of algorithm overfitting due to the increase of network depth. In this paper, a comprehensive sample data set for various landmark buildings is established, and samples with different weather, different lighting, and different angles are taken to effectively improve the resistance of the training model. Combined with the optimization of the network structure and the training of targeted data sets, the final feature block diagram generated by DRNet not only does not lose the lowlevel edge texture information, but also reuses the low-level feature block diagrams in the deep convolutional network to make the fused feature block Richer feature information effectively improves the model's recognition rate for photos taken in complex environments. The experimental results show that the accuracy of this method for predicting landmark buildings can reach 82.0% of mAP, and the recognition performance of images taken in complex environments is excellent.
本文在Faster R-CNN的基础上,提出了一种改进的Faster R-DRN (Dense Residual Network, DRN)算法,使用密集连接的残差网络DRNet代替VGG网络。该算法适用于特殊场景的建筑物识别。它具有残差网络和深度卷积残差网络结构,可以有效地进行图像检测、分类和识别。本设计针对网络深度增加导致的算法过拟合问题进行了优化。本文建立了各种地标性建筑的综合样本数据集,采用不同天气、不同光照、不同角度的样本,有效提高了训练模型的阻力。结合网络结构的优化和目标数据集的训练,DRNet生成的最终特征块图不仅没有丢失底层边缘纹理信息,而且在深度卷积网络中重用底层特征块图,使融合后的特征块特征信息更加丰富,有效提高了模型对复杂环境下拍摄的照片的识别率。实验结果表明,该方法对地标性建筑的预测准确率可达到mAP的82.0%,对复杂环境下拍摄的图像具有优异的识别性能。
{"title":"A landmark building detection and recognition based on improved Faster R-RDN algorithm","authors":"Wu Jun, Kai Yan, ZiBo Huang, Haiyan Tan, Xiaofang Tu, Chengjun Zhu","doi":"10.1117/12.2636502","DOIUrl":"https://doi.org/10.1117/12.2636502","url":null,"abstract":"This paper proposes an improved Faster R-DRN (Dense Residual Network, DRN) algorithm, which is based on Faster R-CNN, using densely connected residual network DRNet to replace VGG network. This algorithm is suitable for special scenes of building recognition. It has a residual network and a deep convolution residual network structure, which can efficiently perform image detection, classification and recognition. This design optimizes the problem of algorithm overfitting due to the increase of network depth. In this paper, a comprehensive sample data set for various landmark buildings is established, and samples with different weather, different lighting, and different angles are taken to effectively improve the resistance of the training model. Combined with the optimization of the network structure and the training of targeted data sets, the final feature block diagram generated by DRNet not only does not lose the lowlevel edge texture information, but also reuses the low-level feature block diagrams in the deep convolutional network to make the fused feature block Richer feature information effectively improves the model's recognition rate for photos taken in complex environments. The experimental results show that the accuracy of this method for predicting landmark buildings can reach 82.0% of mAP, and the recognition performance of images taken in complex environments is excellent.","PeriodicalId":342847,"journal":{"name":"International Conference on Algorithms, Microchips and Network Applications","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124331830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Conference on Algorithms, Microchips and Network Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1