首页 > 最新文献

Proceedings of the 2022 6th International Conference on Deep Learning Technologies最新文献

英文 中文
Deep-Learning-Based Automated Scoring for the Severity of Toxic Comments Using Electra 基于深度学习的使用Electra的有毒评论严重性自动评分
Tiancong Zhang
With the increasing popularity of the Internet, social media plays a crucial role in people's daily communication. However, due to the anonymity of Internet, toxic comments emerge in an endless stream on the Internet, which seriously affects the health of online social environment. To effectively reduce the impact of toxic comments, automated scoring methods for the severity of toxic comments are in great demand. For that purpose, a deep-learning-based natural language processing technique is proposed using ELECTRA to automatically score the toxicity of a comment in this work. The backbone of our model is the ELECTRA discriminator, and the downstream regression task is accomplished by the following head layer. Three head layers are implemented separately: multi-layer perceptron, convolutional neural network, and attention. The dataset used for model training is from the Kaggle competition Toxic Comment Classification Challenge, and the model performance is evaluated through another Kaggle competition Jigsaw Rate Severity of Toxic Comments. By a boost from the K-Fold cross validation and an ensemble of three models with different head layers, our method can reach a competition score 0.80343. Such score ranks 71/2301 (top 3.1%) in the leaderboard and can get a silver medal in the competition. The results in this work would help filter the toxic comments and harmful text information automatically and effectively on the Internet, and could greatly reduce the cost of manual review and help build a healthier Internet environment.
随着互联网的日益普及,社交媒体在人们的日常交流中起着至关重要的作用。然而,由于网络的匿名性,网络上的不良评论层出不穷,严重影响了网络社会环境的健康发展。为了有效地减少有毒评论的影响,对有毒评论严重程度的自动评分方法有很大的需求。为此,本文提出了一种基于深度学习的自然语言处理技术,该技术使用ELECTRA自动对评论的毒性进行评分。我们的模型的主干是ELECTRA鉴别器,下游的回归任务由下面的头部层完成。三个头层分别实现:多层感知器、卷积神经网络和注意力。用于模型训练的数据集来自Kaggle竞赛有毒评论分类挑战赛,并通过另一个Kaggle竞赛有毒评论的拼图率严重性来评估模型的性能。通过K-Fold交叉验证和具有不同头部层的三个模型的集成,我们的方法可以达到0.80343的竞争分数。该成绩在排行榜上排名71/2301(前3.1%),可在比赛中获得银牌。本文的研究结果将有助于自动有效地过滤互联网上的有毒评论和有害文本信息,大大降低人工审查的成本,有助于建立一个更健康的互联网环境。
{"title":"Deep-Learning-Based Automated Scoring for the Severity of Toxic Comments Using Electra","authors":"Tiancong Zhang","doi":"10.1145/3556677.3556693","DOIUrl":"https://doi.org/10.1145/3556677.3556693","url":null,"abstract":"With the increasing popularity of the Internet, social media plays a crucial role in people's daily communication. However, due to the anonymity of Internet, toxic comments emerge in an endless stream on the Internet, which seriously affects the health of online social environment. To effectively reduce the impact of toxic comments, automated scoring methods for the severity of toxic comments are in great demand. For that purpose, a deep-learning-based natural language processing technique is proposed using ELECTRA to automatically score the toxicity of a comment in this work. The backbone of our model is the ELECTRA discriminator, and the downstream regression task is accomplished by the following head layer. Three head layers are implemented separately: multi-layer perceptron, convolutional neural network, and attention. The dataset used for model training is from the Kaggle competition Toxic Comment Classification Challenge, and the model performance is evaluated through another Kaggle competition Jigsaw Rate Severity of Toxic Comments. By a boost from the K-Fold cross validation and an ensemble of three models with different head layers, our method can reach a competition score 0.80343. Such score ranks 71/2301 (top 3.1%) in the leaderboard and can get a silver medal in the competition. The results in this work would help filter the toxic comments and harmful text information automatically and effectively on the Internet, and could greatly reduce the cost of manual review and help build a healthier Internet environment.","PeriodicalId":350340,"journal":{"name":"Proceedings of the 2022 6th International Conference on Deep Learning Technologies","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126173202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of Bitcoin Price Since COVID-19 by Using Neural Network Models 基于神经网络模型的COVID-19以来比特币价格预测
Zhiheng Jiang
After Covid-19 swept the globe and bitcoin prices suddenly soared, machine learnings were used to predict the trend of bitcoin prices, but these studies were lack of performance analysis in different time-scale span. In this paper, three neural network models are designed and used to forecast the price of bitcoin after the outbreak of COVID-19. The models A uses the high/low price, open/close price of four-days of bitcoin as input variables and the close price of the fifth day as target variable, the models B uses same variable as the model A and uses optimal weights, and the model C uses same structure as the model B, but adds the trading volume to the input variables. The results show that the model C may lower the difference between actual and calculated outputs, thus boosting the prediction accuracy. Also, it is found that the models that can work well when predicting bitcoin prices in a short time span can be obviously less precise when it comes to predicting bitcoin prices in a longer time span.
在新冠肺炎疫情席卷全球后,比特币价格突然飙升,机器学习被用于预测比特币价格走势,但这些研究缺乏不同时间尺度跨度的性能分析。本文设计了三个神经网络模型,并将其用于预测COVID-19爆发后比特币的价格。模型A以比特币4天的高价/低价、开盘价/收盘价作为输入变量,第5天的收盘价作为目标变量,模型B使用与模型A相同的变量,并使用最优权重,模型C使用与模型B相同的结构,但在输入变量中加入了交易量。结果表明,C模型可以减小实际输出与计算输出之间的差值,从而提高预测精度。此外,研究发现,可以在短时间内预测比特币价格的模型,在较长时间内预测比特币价格时,显然不那么精确。
{"title":"Prediction of Bitcoin Price Since COVID-19 by Using Neural Network Models","authors":"Zhiheng Jiang","doi":"10.1145/3556677.3556679","DOIUrl":"https://doi.org/10.1145/3556677.3556679","url":null,"abstract":"After Covid-19 swept the globe and bitcoin prices suddenly soared, machine learnings were used to predict the trend of bitcoin prices, but these studies were lack of performance analysis in different time-scale span. In this paper, three neural network models are designed and used to forecast the price of bitcoin after the outbreak of COVID-19. The models A uses the high/low price, open/close price of four-days of bitcoin as input variables and the close price of the fifth day as target variable, the models B uses same variable as the model A and uses optimal weights, and the model C uses same structure as the model B, but adds the trading volume to the input variables. The results show that the model C may lower the difference between actual and calculated outputs, thus boosting the prediction accuracy. Also, it is found that the models that can work well when predicting bitcoin prices in a short time span can be obviously less precise when it comes to predicting bitcoin prices in a longer time span.","PeriodicalId":350340,"journal":{"name":"Proceedings of the 2022 6th International Conference on Deep Learning Technologies","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131762691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Smoke Detection Algorithm Based on Improved EfficientDet 基于改进effentdet的烟雾检测算法
Zengquan Yang, Han Huang, Fuming Xia, Zhen Shi
In the early stage of fire, smoke alarm detection is an important means to prevent fire. And with the continuous construction of monitoring facilities, it is of great significance for the study of smoke video monitoring. In order to meet the detection accuracy and speed of the video, the EfficientDet target detection algorithm was improved. Firstly, the visual analysis of the smoke data set was carried out by clustering method, and the anchor frame parameters in the EfficientDet algorithm were re-calibrated by K-means clustering method. Secondly, the Bi-FPN feature fusion algorithm is improved to reduce the transverse cross-layer connection and increase the longitudinal cross-layer connection, which reduces the calculation of parameters and improves the detection accuracy. Finally, in order to solve the problem of missing detection in small smoke area, a two-channel attention mechanism is added to improve the detection accuracy.
在火灾发生初期,烟雾报警器的探测是预防火灾的重要手段。随着监控设施的不断建设,对烟雾视频监控的研究具有重要的意义。为了满足视频的检测精度和速度要求,对EfficientDet目标检测算法进行了改进。首先,采用聚类方法对烟雾数据集进行可视化分析,并采用k均值聚类方法对effentdet算法中的锚架参数进行重新标定;其次,对Bi-FPN特征融合算法进行改进,减少横向跨层连接,增加纵向跨层连接,减少了参数的计算,提高了检测精度;最后,为了解决小烟雾区域的漏检问题,增加了双通道关注机制,提高了检测精度。
{"title":"Smoke Detection Algorithm Based on Improved EfficientDet","authors":"Zengquan Yang, Han Huang, Fuming Xia, Zhen Shi","doi":"10.1145/3556677.3556678","DOIUrl":"https://doi.org/10.1145/3556677.3556678","url":null,"abstract":"In the early stage of fire, smoke alarm detection is an important means to prevent fire. And with the continuous construction of monitoring facilities, it is of great significance for the study of smoke video monitoring. In order to meet the detection accuracy and speed of the video, the EfficientDet target detection algorithm was improved. Firstly, the visual analysis of the smoke data set was carried out by clustering method, and the anchor frame parameters in the EfficientDet algorithm were re-calibrated by K-means clustering method. Secondly, the Bi-FPN feature fusion algorithm is improved to reduce the transverse cross-layer connection and increase the longitudinal cross-layer connection, which reduces the calculation of parameters and improves the detection accuracy. Finally, in order to solve the problem of missing detection in small smoke area, a two-channel attention mechanism is added to improve the detection accuracy.","PeriodicalId":350340,"journal":{"name":"Proceedings of the 2022 6th International Conference on Deep Learning Technologies","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131001289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Image-based Transfer Learning Framework for Classification of E-Commerce Products 基于图像的电子商务产品分类迁移学习框架
Vrushali Atul Surve, Pramod Pathak, Mohammed Hasanuzzaman, Rejwanul Haque, Paul Stynes
Classification of e-commerce products involves identifying the products and placing those products into the correct category. For example, men’s Nike Air Max will be in the men’s category shoes on an e-Commerce platform. Identifying the correct classification of a product from hundreds of categories is time-consuming for businesses. This research proposes an Image-based Transfer Learning Framework to classify the images into the correct category in the shortest time. The framework combines Image-based algorithms with Transfer Learning. This research compares the time to predict the category and accuracy of traditional CNN and transfer learning models such as VGG19, InceptionV3, ResNet50, and MobileNet. A visual classifier is trained CNN and transfer learning models such as VGG19, InceptionV3, ResNet50, and MobileNet. The models are trained on an e-commerce product dataset that combines the ImageNet dataset with pre-trained weights. The dataset consists of 15000 images scraped from the web. Results demonstrate that Inception V3 outperforms all other models based on a TIMING of 0.10 seconds and an accuracy of 85%.
电子商务产品的分类包括识别产品并将这些产品放入正确的类别中。例如,男士耐克Air Max将在电子商务平台上的男士类鞋中。从数百个类别中确定产品的正确分类对企业来说是非常耗时的。本研究提出了一种基于图像的迁移学习框架,在最短的时间内将图像分类到正确的类别中。该框架结合了基于图像的算法和迁移学习。本研究比较了传统CNN和迁移学习模型(如VGG19、InceptionV3、ResNet50和MobileNet)预测类别的时间和准确性。视觉分类器训练CNN和迁移学习模型,如VGG19、InceptionV3、ResNet50和MobileNet。模型在电子商务产品数据集上进行训练,该数据集结合了ImageNet数据集和预训练的权重。该数据集由15000张从网络上抓取的图像组成。结果表明,基于0.10秒的TIMING和85%的准确率,Inception V3优于所有其他模型。
{"title":"An Image-based Transfer Learning Framework for Classification of E-Commerce Products","authors":"Vrushali Atul Surve, Pramod Pathak, Mohammed Hasanuzzaman, Rejwanul Haque, Paul Stynes","doi":"10.1145/3556677.3556689","DOIUrl":"https://doi.org/10.1145/3556677.3556689","url":null,"abstract":"Classification of e-commerce products involves identifying the products and placing those products into the correct category. For example, men’s Nike Air Max will be in the men’s category shoes on an e-Commerce platform. Identifying the correct classification of a product from hundreds of categories is time-consuming for businesses. This research proposes an Image-based Transfer Learning Framework to classify the images into the correct category in the shortest time. The framework combines Image-based algorithms with Transfer Learning. This research compares the time to predict the category and accuracy of traditional CNN and transfer learning models such as VGG19, InceptionV3, ResNet50, and MobileNet. A visual classifier is trained CNN and transfer learning models such as VGG19, InceptionV3, ResNet50, and MobileNet. The models are trained on an e-commerce product dataset that combines the ImageNet dataset with pre-trained weights. The dataset consists of 15000 images scraped from the web. Results demonstrate that Inception V3 outperforms all other models based on a TIMING of 0.10 seconds and an accuracy of 85%.","PeriodicalId":350340,"journal":{"name":"Proceedings of the 2022 6th International Conference on Deep Learning Technologies","volume":"173 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123170021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Household Load Identification Based on Multi-label and Convolutional Neural Networks 基于多标签和卷积神经网络的家庭负荷识别
Zhengquan Wang, Qi Xie
In low-voltage residential electricity scenarios, simple identification algorithms are difficult to be effective because of the many types of appliances and similar power characteristics. We propose a household load identification method based on multi-label and convolutional neural networks (ML-CNN) to address these problems. Firstly, we analyze the V-I trajectory characteristics of different loads and use the binary images of V-I trajectory mapping as the study features. Secondly, we collect the original steady-state voltage and current data of the combined operation of common household appliances and build a dataset. Finally, we pre-process and multi-label the dataset and input it into the ML-CNN network structure for training and validation. The experimental results show that the average identification accuracy of the ML-CNN method is 97.63%, which is better than the load identification methods such as multi-label k-nearest neighbor (ML-KNN) and support vector machine (SVM).
在低压居民用电场景中,由于电器种类多、功率特性相似,简单的识别算法很难有效。为了解决这些问题,我们提出了一种基于多标签和卷积神经网络(ML-CNN)的家庭负荷识别方法。首先,分析了不同载荷的V-I轨迹特征,并以V-I轨迹映射的二值图像作为研究特征;其次,采集常用家电组合运行的原始稳态电压电流数据,建立数据集;最后,我们对数据集进行预处理和多标签,并将其输入到ML-CNN网络结构中进行训练和验证。实验结果表明,ML-CNN方法的平均识别准确率为97.63%,优于多标签k近邻(ML-KNN)和支持向量机(SVM)等负载识别方法。
{"title":"Household Load Identification Based on Multi-label and Convolutional Neural Networks","authors":"Zhengquan Wang, Qi Xie","doi":"10.1145/3556677.3556695","DOIUrl":"https://doi.org/10.1145/3556677.3556695","url":null,"abstract":"In low-voltage residential electricity scenarios, simple identification algorithms are difficult to be effective because of the many types of appliances and similar power characteristics. We propose a household load identification method based on multi-label and convolutional neural networks (ML-CNN) to address these problems. Firstly, we analyze the V-I trajectory characteristics of different loads and use the binary images of V-I trajectory mapping as the study features. Secondly, we collect the original steady-state voltage and current data of the combined operation of common household appliances and build a dataset. Finally, we pre-process and multi-label the dataset and input it into the ML-CNN network structure for training and validation. The experimental results show that the average identification accuracy of the ML-CNN method is 97.63%, which is better than the load identification methods such as multi-label k-nearest neighbor (ML-KNN) and support vector machine (SVM).","PeriodicalId":350340,"journal":{"name":"Proceedings of the 2022 6th International Conference on Deep Learning Technologies","volume":"53 9","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120971271","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of cancer classification algorithms based on clustering analysis 基于聚类分析的癌症分类算法比较
Jiawei Guo, Yu-shan Cai
Nowadays, omics datasets have been widely used to study cancer and other related problems, but there are many cancer subtypes in some types of cancer, and some types have not been studied, so we must use unsupervised methods for cluster analysis. Cluster analysis is the process of finding similar data points in a pile of data points and classifying them. In this paper, five omics data sets are used to compare the three clustering methods, in order to find a more suitable clustering method for omics datasets. The conclusion of this paper is that OPTICS method is a better clustering method.
目前,组学数据集已被广泛应用于癌症等相关问题的研究,但在某些类型的癌症中存在许多癌症亚型,而有些类型尚未被研究,因此我们必须使用无监督的方法进行聚类分析。聚类分析是在一堆数据点中找到相似的数据点并对其进行分类的过程。本文利用5个组学数据集对三种聚类方法进行比较,以期找到更适合组学数据集的聚类方法。本文的结论是光学聚类方法是一种较好的聚类方法。
{"title":"Comparison of cancer classification algorithms based on clustering analysis","authors":"Jiawei Guo, Yu-shan Cai","doi":"10.1145/3556677.3556684","DOIUrl":"https://doi.org/10.1145/3556677.3556684","url":null,"abstract":"Nowadays, omics datasets have been widely used to study cancer and other related problems, but there are many cancer subtypes in some types of cancer, and some types have not been studied, so we must use unsupervised methods for cluster analysis. Cluster analysis is the process of finding similar data points in a pile of data points and classifying them. In this paper, five omics data sets are used to compare the three clustering methods, in order to find a more suitable clustering method for omics datasets. The conclusion of this paper is that OPTICS method is a better clustering method.","PeriodicalId":350340,"journal":{"name":"Proceedings of the 2022 6th International Conference on Deep Learning Technologies","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132615908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semi supervised ocean mesoscale vortex detection method based on feature invariance 基于特征不变性的半监督海洋中尺度涡旋检测方法
Haiyan Liu, Bo Qin, Y. Liu
Ocean mesoscale eddy detection is an important hotspot of Marine scientific research. Over the last few years, with the development of machine learning research, eddy detection methods based on machine learning have been applied in various fields. However, the traditional scroll detection algorithm has weak generalization ability and low detection accuracy, and the fully supervised scroll detection algorithm needs a large amount of marker data, which is costly and has poor readability. In this paper, a new semi-supervised ocean mesoscale eddy detection method based on feature invariance is proposed. The fully supervised loss calculation model is optimized to solve the problem of serious imbalance of positive and negative samples in loss calculation, so as to achieve the purpose of training the model. In addition, based on the feature invariance, an interpolation consistency calculation method based on flipped image and original image is proposed, which is combined with the consistency method algorithm put forward in CSD networks to increase the precision of detection. Compared with SSD and ISD networks, the proposed meso-scale eddy detection algorithm achieves better performance, with the AP value increasing by 1.7% and 1.1%, respectively.
海洋中尺度涡旋探测是海洋科学研究的一个重要热点。近年来,随着机器学习研究的发展,基于机器学习的涡流检测方法在各个领域得到了应用。然而,传统的滚动检测算法泛化能力较弱,检测精度较低,且全监督滚动检测算法需要大量的标记数据,成本高,可读性差。提出了一种基于特征不变性的半监督海洋中尺度涡旋检测方法。对全监督损失计算模型进行优化,解决损失计算中正负样本严重失衡的问题,从而达到训练模型的目的。此外,基于特征不变性,提出了一种基于翻转图像与原始图像的插值一致性计算方法,并与CSD网络中提出的一致性方法算法相结合,提高了检测精度。与SSD和ISD网络相比,本文提出的中尺度涡流检测算法性能更好,AP值分别提高1.7%和1.1%。
{"title":"Semi supervised ocean mesoscale vortex detection method based on feature invariance","authors":"Haiyan Liu, Bo Qin, Y. Liu","doi":"10.1145/3556677.3556682","DOIUrl":"https://doi.org/10.1145/3556677.3556682","url":null,"abstract":"Ocean mesoscale eddy detection is an important hotspot of Marine scientific research. Over the last few years, with the development of machine learning research, eddy detection methods based on machine learning have been applied in various fields. However, the traditional scroll detection algorithm has weak generalization ability and low detection accuracy, and the fully supervised scroll detection algorithm needs a large amount of marker data, which is costly and has poor readability. In this paper, a new semi-supervised ocean mesoscale eddy detection method based on feature invariance is proposed. The fully supervised loss calculation model is optimized to solve the problem of serious imbalance of positive and negative samples in loss calculation, so as to achieve the purpose of training the model. In addition, based on the feature invariance, an interpolation consistency calculation method based on flipped image and original image is proposed, which is combined with the consistency method algorithm put forward in CSD networks to increase the precision of detection. Compared with SSD and ISD networks, the proposed meso-scale eddy detection algorithm achieves better performance, with the AP value increasing by 1.7% and 1.1%, respectively.","PeriodicalId":350340,"journal":{"name":"Proceedings of the 2022 6th International Conference on Deep Learning Technologies","volume":"55 4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123659024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Instance Segmentation Model to Categorize Clothes from Wild Fashion Images 基于实例分割模型的服装分类方法
Rohan Indrajeet Jadhav, Paul Stynes, Pramod Pathak, Rejwanul Haque, Mohammed Hasanuzzaman
Categorizing of clothes from wild fashion images involves identifying the type of clothes a person wears from non-studio images such as a shirt, trousers, and so on. Identifying the fashion clothes from wild images that are often grainy, unfocused, with people in different poses is a challenge. This research proposes a comparison between object detection and instance segmentation based models to categorise clothes from wild fashion images. The Object detection model is implemented using Faster Region-Based Convolutional Neural Network (RCNN). Mask RCNN is used to implement an instance segmentation model. We have trained the models on standard benchmark dataset namely deepfashion2. Results demonstrate that Instance Segmentation models such as Mask RCNN outperforms Object Detection models by 20%. Mask RCNN achieved 21.05% average precision, 73% recall across the different IoU (Intersection over Union). These results show promise for using Instance Segmentation models for faster image retrieval based e-commerce applications.
从疯狂的时尚图片中对衣服进行分类包括从非工作室图片(如衬衫、裤子等)中识别一个人穿的衣服类型。从杂乱无章的照片中识别时尚服装是一项挑战,这些照片往往是颗粒状的,没有聚焦,人们摆出不同的姿势。本文提出了一种基于对象检测和实例分割的服装分类模型的比较方法。目标检测模型采用基于更快区域的卷积神经网络(RCNN)实现。掩码RCNN是用来实现实例分割模型的。我们在标准基准数据集deepfashion2上训练了模型。结果表明,Mask RCNN等实例分割模型的性能比目标检测模型高出20%。掩模RCNN在不同IoU (Intersection over Union)上的平均准确率达到21.05%,召回率达到73%。这些结果显示了使用实例分割模型来实现更快的基于图像检索的电子商务应用的前景。
{"title":"An Instance Segmentation Model to Categorize Clothes from Wild Fashion Images","authors":"Rohan Indrajeet Jadhav, Paul Stynes, Pramod Pathak, Rejwanul Haque, Mohammed Hasanuzzaman","doi":"10.1145/3556677.3556690","DOIUrl":"https://doi.org/10.1145/3556677.3556690","url":null,"abstract":"Categorizing of clothes from wild fashion images involves identifying the type of clothes a person wears from non-studio images such as a shirt, trousers, and so on. Identifying the fashion clothes from wild images that are often grainy, unfocused, with people in different poses is a challenge. This research proposes a comparison between object detection and instance segmentation based models to categorise clothes from wild fashion images. The Object detection model is implemented using Faster Region-Based Convolutional Neural Network (RCNN). Mask RCNN is used to implement an instance segmentation model. We have trained the models on standard benchmark dataset namely deepfashion2. Results demonstrate that Instance Segmentation models such as Mask RCNN outperforms Object Detection models by 20%. Mask RCNN achieved 21.05% average precision, 73% recall across the different IoU (Intersection over Union). These results show promise for using Instance Segmentation models for faster image retrieval based e-commerce applications.","PeriodicalId":350340,"journal":{"name":"Proceedings of the 2022 6th International Conference on Deep Learning Technologies","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128832441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Hybrid Feature Selection for Efficient Detection of DDoS Attacks in IoT 物联网中高效检测DDoS攻击的混合特征选择
Liang Hong, Khadijeh Wehbi, Tulha Hasan Alsalah
The increasing Distributed Denial of Service (DDoS) attacks on the Internet of Things (IoT) is leading to the need for an efficient detection approach. Although much research has been conducted to detect DDoS attacks on traditional networks, such as machine learning (ML) based approaches that have improved accuracy and confidence, the limited bandwidth and computation resources in IoT networks restrict the application of ML, especially deep learning (DL) based solutions that require extensive input data. In order to appropriately address the security issues in the resources-constrained IoT network, this paper is aimed to reduce the input data dimensions by extracting a subset of the most relevant features from the original features and using this subset to detect DDoS attacks on IoT without degrading the detection performance. A cost-effective model is developed to clean and prepare raw data before dimensionality reduction. A hybrid feature selection that uses Mutual Information (MI), Analysis of Variance (ANOVA), Chi-Squared, L1-based feature selection, and Tree-based feature selection algorithms is designed to identify important data features and reduce the data inputs needed for detection. Simulation results show that detection accuracy is improved with the combination of features chosen by the proposed hybrid feature selection approach. The training time is much less than the combination of each individual feature selection method.
物联网(IoT)上越来越多的分布式拒绝服务(DDoS)攻击导致需要一种有效的检测方法。尽管已经进行了大量研究来检测传统网络上的DDoS攻击,例如基于机器学习(ML)的方法提高了准确性和置信度,但物联网网络中有限的带宽和计算资源限制了ML的应用,特别是需要大量输入数据的基于深度学习(DL)的解决方案。为了适当解决资源受限的物联网网络中的安全问题,本文旨在通过从原始特征中提取最相关特征的子集,并使用该子集在不降低检测性能的情况下检测物联网上的DDoS攻击,从而降低输入数据维度。在降维之前,开发了一个具有成本效益的模型来清理和准备原始数据。混合特征选择使用互信息(MI)、方差分析(ANOVA)、卡方、基于l1的特征选择和基于树的特征选择算法,旨在识别重要的数据特征并减少检测所需的数据输入。仿真结果表明,结合所提出的混合特征选择方法所选择的特征,可以提高检测精度。其训练时间远小于各个单独特征选择方法的组合。
{"title":"Hybrid Feature Selection for Efficient Detection of DDoS Attacks in IoT","authors":"Liang Hong, Khadijeh Wehbi, Tulha Hasan Alsalah","doi":"10.1145/3556677.3556687","DOIUrl":"https://doi.org/10.1145/3556677.3556687","url":null,"abstract":"The increasing Distributed Denial of Service (DDoS) attacks on the Internet of Things (IoT) is leading to the need for an efficient detection approach. Although much research has been conducted to detect DDoS attacks on traditional networks, such as machine learning (ML) based approaches that have improved accuracy and confidence, the limited bandwidth and computation resources in IoT networks restrict the application of ML, especially deep learning (DL) based solutions that require extensive input data. In order to appropriately address the security issues in the resources-constrained IoT network, this paper is aimed to reduce the input data dimensions by extracting a subset of the most relevant features from the original features and using this subset to detect DDoS attacks on IoT without degrading the detection performance. A cost-effective model is developed to clean and prepare raw data before dimensionality reduction. A hybrid feature selection that uses Mutual Information (MI), Analysis of Variance (ANOVA), Chi-Squared, L1-based feature selection, and Tree-based feature selection algorithms is designed to identify important data features and reduce the data inputs needed for detection. Simulation results show that detection accuracy is improved with the combination of features chosen by the proposed hybrid feature selection approach. The training time is much less than the combination of each individual feature selection method.","PeriodicalId":350340,"journal":{"name":"Proceedings of the 2022 6th International Conference on Deep Learning Technologies","volume":"290 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121264942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Topology-oriented 3D ocean flow field feature classification and tracking algorithm 面向拓扑的三维海洋流场特征分类与跟踪算法
Y. Liu, Bo Qin, Haiyan Liu
The tracking analysis of ocean feature phenomena exists many problems, such as incomplete topological structure information extraction and unclear time-varying law information display, etc. In this paper, a topology-oriented 3D ocean flow field feature classification and tracking algorithm is proposed to solve the problem of flow field feature tracking in different scales. The algorithm consists of three parts: Initially, the adaptive circular sampling space manner is optimized and improved to adapt to the extraction of flow field feature regions at different scales in view of the imprecise definition of traditional feature regions. Secondly, feature seed points were screened by setting information entropy threshold and denoised by template detection method. Eventually, combined with the eigenvalues of Jacobian matrix at critical points, the extracted two-dimensional feature regions are classified, and the continuous three-dimensional flow field features are visually tracked. By analyzing the experimental results of ocean flow field data of different depth and dimension, the validity and feasibility of topological feature structure classification and tracking algorithm are proved.
海洋地物现象的跟踪分析存在拓扑结构信息提取不完整、时变规律信息显示不清等问题。针对不同尺度下的流场特征跟踪问题,提出了一种面向拓扑的三维海洋流场特征分类与跟踪算法。该算法由三部分组成:首先针对传统特征区域定义不精确的问题,对自适应圆形采样空间方式进行优化和改进,以适应不同尺度下流场特征区域的提取;其次,通过设置信息熵阈值筛选特征种子点,采用模板检测方法去噪;最后结合雅可比矩阵在临界点处的特征值,对提取的二维特征区域进行分类,可视化跟踪连续的三维流场特征。通过对不同深度、不同维度海洋流场数据的实验结果分析,证明了拓扑特征结构分类与跟踪算法的有效性和可行性。
{"title":"Topology-oriented 3D ocean flow field feature classification and tracking algorithm","authors":"Y. Liu, Bo Qin, Haiyan Liu","doi":"10.1145/3556677.3556683","DOIUrl":"https://doi.org/10.1145/3556677.3556683","url":null,"abstract":"The tracking analysis of ocean feature phenomena exists many problems, such as incomplete topological structure information extraction and unclear time-varying law information display, etc. In this paper, a topology-oriented 3D ocean flow field feature classification and tracking algorithm is proposed to solve the problem of flow field feature tracking in different scales. The algorithm consists of three parts: Initially, the adaptive circular sampling space manner is optimized and improved to adapt to the extraction of flow field feature regions at different scales in view of the imprecise definition of traditional feature regions. Secondly, feature seed points were screened by setting information entropy threshold and denoised by template detection method. Eventually, combined with the eigenvalues of Jacobian matrix at critical points, the extracted two-dimensional feature regions are classified, and the continuous three-dimensional flow field features are visually tracked. By analyzing the experimental results of ocean flow field data of different depth and dimension, the validity and feasibility of topological feature structure classification and tracking algorithm are proved.","PeriodicalId":350340,"journal":{"name":"Proceedings of the 2022 6th International Conference on Deep Learning Technologies","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127513010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the 2022 6th International Conference on Deep Learning Technologies
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1