首页 > 最新文献

2020 International Conference on Machine Learning and Cybernetics (ICMLC)最新文献

英文 中文
Video-Based Vehicle Flow Detection Algorithm 基于视频的车辆流量检测算法
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469561
Cheng Xu, G. Ji, Bin Zhao
In recent years, urban traffic congestion has become increasingly serious. More and more scholars have begun to study about intelligent transportation system, and the real-time detection of vehicle flow is one of the most valuable research issues. In this paper, we propose an algorithm called VFDV (Vehicle Flow Detection algorithm based on Video) that can detect vehicle flow in real time. This algorithm uses road video surveillance as the source data and extracts valid images from it to detect vehicle flow. Different from the traditional methods that use vehicle recognition method to detect vehicle flow, algorithm VFDV uses a classification algorithm to detect vehicle flow. Compared with traditional algorithms, our algorithm achieves higher accuracy. In the verification phase, the video taken at the real intersection is used as the data source. Experiments on real dataset are designed to verify the effectiveness and superiority of the proposed algorithm.
近年来,城市交通拥堵问题日益严重。越来越多的学者开始对智能交通系统进行研究,其中车辆流量的实时检测是最有价值的研究问题之一。本文提出了一种基于视频的车辆流量检测算法VFDV (Vehicle Flow Detection algorithm based on Video),可以实时检测车辆流量。该算法以道路视频监控为源数据,提取有效图像进行车辆流量检测。与传统的车辆识别检测车辆流量的方法不同,VFDV算法采用分类算法检测车辆流量。与传统算法相比,我们的算法达到了更高的精度。在验证阶段,使用在真实路口拍摄的视频作为数据源。在实际数据集上进行了实验,验证了该算法的有效性和优越性。
{"title":"Video-Based Vehicle Flow Detection Algorithm","authors":"Cheng Xu, G. Ji, Bin Zhao","doi":"10.1109/ICMLC51923.2020.9469561","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469561","url":null,"abstract":"In recent years, urban traffic congestion has become increasingly serious. More and more scholars have begun to study about intelligent transportation system, and the real-time detection of vehicle flow is one of the most valuable research issues. In this paper, we propose an algorithm called VFDV (Vehicle Flow Detection algorithm based on Video) that can detect vehicle flow in real time. This algorithm uses road video surveillance as the source data and extracts valid images from it to detect vehicle flow. Different from the traditional methods that use vehicle recognition method to detect vehicle flow, algorithm VFDV uses a classification algorithm to detect vehicle flow. Compared with traditional algorithms, our algorithm achieves higher accuracy. In the verification phase, the video taken at the real intersection is used as the data source. Experiments on real dataset are designed to verify the effectiveness and superiority of the proposed algorithm.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128087115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic Segmentation of Brain Tumor from 3D Structural MRI Using U-Net Autoencoder 基于U-Net自编码器的三维结构MRI脑肿瘤语义分割
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469580
Maisha Farzana, Md. Jahid Hossain Any, Md. Tanzim Reza, M. Parvez
Automated semantic segmentation of brain tumors from 3D MRI images plays a significant role in medical image processing, monitoring and diagnosis. Early detection of these brain tumors is highly requisite for the treatment, diagnosis and surgical pre-planning of the anomalies. The physicians normally follow the manual way of delineation for diagnosis of tumors which is time consuming and requires too much knowledge of anatomy. To resolve these limitations, convolutional neural network (CNN) based U-Net autoencoder model is proposed which performs automated segmentation of brain tumors from 3D MRI brain images by extracting the key features of the tumor. Additionally, Image normalization, image augmentation, image binarization etc. are applied for data pre-processing. Later on, the model is applied to the new 3D MRI brain images to test the accuracy of it. Applying the proposed method, the accuracy is obtained upto 96.06% considering the 18 subjects. Finally, this approach is a well-structured model for segmenting the tumor region from MRI brain images as compare to the other existing models which may assist the physicians for better diagnosis and therefore, opening the door for more precise therapy and better treatment to the patient.
三维MRI图像中脑肿瘤的语义自动分割在医学图像处理、监测和诊断中具有重要意义。早期发现这些脑肿瘤对于异常的治疗、诊断和手术前计划是非常必要的。医生通常采用手工绘制的方法来诊断肿瘤,这既费时又需要太多的解剖学知识。为了解决这些局限性,提出了基于卷积神经网络(CNN)的U-Net自编码器模型,该模型通过提取肿瘤的关键特征,从三维MRI脑图像中实现脑肿瘤的自动分割。此外,还采用图像归一化、图像增强、图像二值化等方法进行数据预处理。随后,将该模型应用于新的3D MRI脑图像,以测试其准确性。应用所提出的方法,对18个被试进行分析,准确率达到96.06%。最后,与其他现有模型相比,该方法是一个结构良好的模型,可以从MRI脑图像中分割肿瘤区域,这可以帮助医生更好地诊断,从而为更精确的治疗和更好的治疗打开大门。
{"title":"Semantic Segmentation of Brain Tumor from 3D Structural MRI Using U-Net Autoencoder","authors":"Maisha Farzana, Md. Jahid Hossain Any, Md. Tanzim Reza, M. Parvez","doi":"10.1109/ICMLC51923.2020.9469580","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469580","url":null,"abstract":"Automated semantic segmentation of brain tumors from 3D MRI images plays a significant role in medical image processing, monitoring and diagnosis. Early detection of these brain tumors is highly requisite for the treatment, diagnosis and surgical pre-planning of the anomalies. The physicians normally follow the manual way of delineation for diagnosis of tumors which is time consuming and requires too much knowledge of anatomy. To resolve these limitations, convolutional neural network (CNN) based U-Net autoencoder model is proposed which performs automated segmentation of brain tumors from 3D MRI brain images by extracting the key features of the tumor. Additionally, Image normalization, image augmentation, image binarization etc. are applied for data pre-processing. Later on, the model is applied to the new 3D MRI brain images to test the accuracy of it. Applying the proposed method, the accuracy is obtained upto 96.06% considering the 18 subjects. Finally, this approach is a well-structured model for segmenting the tumor region from MRI brain images as compare to the other existing models which may assist the physicians for better diagnosis and therefore, opening the door for more precise therapy and better treatment to the patient.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132706135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Novel Class Detection Using Hybrid Ensemble 基于混合集成的新型类检测
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469587
Diptangshu Pandit, Li Zhang, Kamlesh Mistry, Richard M. Jiang
In this research, we propose a hybrid meta-classifier for novel class detection. It is able to efficiently detect the arrival of novel unseen classes as well as tackle real-time data stream classification. Specifically, the proposed hybrid meta-classifler includes three ensemble models, i.e. class-specific, cluster-specific and complementary boosting ensemble classifiers. Distinctive training strategies are also proposed for the generation of effective and diversified ensemble classifiers. The weights of the above ensemble models and the threshold of the novel class confidence are subsequently optimized using a modified Firefly Algorithm, to enhance performance. The above proposed ensemble and optimization algorithms cooperate with each other to conduct the detection of novel unseen classes. Several UCI databases are employed for evaluation, i.e. The KDD Cup, Image Segmentation, Soybean Large, Glass and Iris databases. In comparison with the baseline meta-algorithms such as Boosting, Bagging and Stacking, our approach shows significantly enhanced performance with the increment of the number of novel classes for all the test data sets, which poses great challenges to the existing baseline ensemble methods.
在这项研究中,我们提出了一种混合元分类器用于新的类检测。它能够有效地检测到新的未见类的到来,并处理实时数据流分类。具体而言,所提出的混合元分类器包括三种集成模型,即类特定集成分类器、簇特定集成分类器和互补增强集成分类器。为了生成有效的、多样化的集成分类器,本文还提出了不同的训练策略。随后,使用改进的萤火虫算法对上述集成模型的权重和新类置信度的阈值进行优化,以提高性能。本文提出的集成算法与优化算法相互配合,进行新的未见类的检测。我们使用了几个UCI数据库进行评价,分别是KDD Cup、Image Segmentation、Soybean Large、Glass和Iris数据库。与Boosting、Bagging和Stacking等基线元算法相比,我们的方法随着所有测试数据集的新类数量的增加,性能显著提高,这对现有的基线集成方法提出了很大的挑战。
{"title":"Novel Class Detection Using Hybrid Ensemble","authors":"Diptangshu Pandit, Li Zhang, Kamlesh Mistry, Richard M. Jiang","doi":"10.1109/ICMLC51923.2020.9469587","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469587","url":null,"abstract":"In this research, we propose a hybrid meta-classifier for novel class detection. It is able to efficiently detect the arrival of novel unseen classes as well as tackle real-time data stream classification. Specifically, the proposed hybrid meta-classifler includes three ensemble models, i.e. class-specific, cluster-specific and complementary boosting ensemble classifiers. Distinctive training strategies are also proposed for the generation of effective and diversified ensemble classifiers. The weights of the above ensemble models and the threshold of the novel class confidence are subsequently optimized using a modified Firefly Algorithm, to enhance performance. The above proposed ensemble and optimization algorithms cooperate with each other to conduct the detection of novel unseen classes. Several UCI databases are employed for evaluation, i.e. The KDD Cup, Image Segmentation, Soybean Large, Glass and Iris databases. In comparison with the baseline meta-algorithms such as Boosting, Bagging and Stacking, our approach shows significantly enhanced performance with the increment of the number of novel classes for all the test data sets, which poses great challenges to the existing baseline ensemble methods.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"190 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133644214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applying a Genetic Algorithm to Determine Premium Rate of Occupational Accident Insurance 应用遗传算法确定职业意外保险费率
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469570
Jia-Ching Ying, Chi-Kai Chan, Yen-Ting Chang
At present, the Occupational Accident Labor Insurance premium rate is calculated based on the business categories in Taiwan. The premium rate is calculated as a combination of the experience rate and the manual rate for each business category. The traditional actuarial methods are based on many hypotheses to calculate future actual claims and adjust the rate for each business category. Unfortunately, with such adjustments, the risk level of the insured in the business category will be affected. To accurately estimate the size of actual losses for specific industries, we propose a genetic algorithm applied grouping to determine the premium rate for occupational accidents. The proposed approach has been evaluated using the real-world dataset from the Bureau of Labor Insurance in Taiwan that includes occupational accident insurance data from 2009 to 2015. The results demonstrate that the method is practicable at predicting the applicable premium. The proposed method differs from Taiwan's prevailing occupational accident premium rate calculation method. Moreover, it is efficient at selecting the best group of the Standard Industrial Classification from the genetic algorithm. Lastly, the accuracy of the estimates of the total claim amounts are analyzed.
目前,我国职业意外劳动保险费率是按业务类别计算。保险费率是根据每个业务类别的经验费率和人工费率的组合计算的。传统的精算方法是基于许多假设来计算未来的实际索赔并调整每个业务类别的费率。不幸的是,通过这样的调整,业务类别中的被保险人的风险水平将受到影响。为了准确估计特定行业的实际损失规模,我们提出了一种应用分组的遗传算法来确定职业事故的保险费率。本文使用台湾劳动保险局的真实数据集(包括2009年至2015年的职业意外保险数据)对所提出的方法进行了评估。结果表明,该方法在预测适用溢价方面是可行的。本方法不同于台湾现行的职业意外保险费率计算方法。此外,它还能有效地从遗传算法中选择出标准工业分类的最佳组。最后,对索赔总额估计的准确性进行了分析。
{"title":"Applying a Genetic Algorithm to Determine Premium Rate of Occupational Accident Insurance","authors":"Jia-Ching Ying, Chi-Kai Chan, Yen-Ting Chang","doi":"10.1109/ICMLC51923.2020.9469570","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469570","url":null,"abstract":"At present, the Occupational Accident Labor Insurance premium rate is calculated based on the business categories in Taiwan. The premium rate is calculated as a combination of the experience rate and the manual rate for each business category. The traditional actuarial methods are based on many hypotheses to calculate future actual claims and adjust the rate for each business category. Unfortunately, with such adjustments, the risk level of the insured in the business category will be affected. To accurately estimate the size of actual losses for specific industries, we propose a genetic algorithm applied grouping to determine the premium rate for occupational accidents. The proposed approach has been evaluated using the real-world dataset from the Bureau of Labor Insurance in Taiwan that includes occupational accident insurance data from 2009 to 2015. The results demonstrate that the method is practicable at predicting the applicable premium. The proposed method differs from Taiwan's prevailing occupational accident premium rate calculation method. Moreover, it is efficient at selecting the best group of the Standard Industrial Classification from the genetic algorithm. Lastly, the accuracy of the estimates of the total claim amounts are analyzed.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124364689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Development of a Drawing Assistant System for Blind Users Using an Object-Oriented Graphic Description Language 用面向对象的图形描述语言开发盲人绘图辅助系统
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469041
N. Takagi, Takashi Suzuki, Tomoyuki Araki
With the development of information processing systems such as screen readers, the accessibility of various information to visually impaired people has been dramatically improved in recent years. In addition, InftyReader, an OCR system for mathematical expression recognition, is making it easier to access scientific documents with mathematical expressions. However, there are still large barriers for blind people to access visual information such as graphs and diagrams. In particular, it is almost impossible for blind people to generate precise figures without the assistance of sighted people. Therefore, we are developing a graphic description language available for the blind. In conventional graphic description languages, it was necessary to accurately specify the coordinates of parameters when drawing elementary shapes. By introducing an object-oriented idea, our language enables users not to specify many coordinates. We have produced an experimental drawing assistant system using our language. In this paper, we outline our language and system, and show the results of experiments which were conducted to verify the effectiveness of our system.
近年来,随着屏幕阅读器等信息处理系统的发展,视障人士获取各种信息的能力得到了极大的提高。此外,用于数学表达式识别的OCR系统InftyReader使访问带有数学表达式的科学文档变得更加容易。然而,对于盲人来说,获取图形和图表等视觉信息仍然存在很大的障碍。特别是,如果没有明眼人的帮助,盲人几乎不可能生成精确的数字。因此,我们正在开发一种可供盲人使用的图形描述语言。在传统的图形描述语言中,在绘制初等形状时,需要精确地指定参数的坐标。通过引入面向对象的思想,我们的语言使用户不必指定许多坐标。我们用自己的语言开发了一个实验性的绘图辅助系统。在本文中,我们概述了我们的语言和系统,并展示了实验结果,以验证我们的系统的有效性。
{"title":"Development of a Drawing Assistant System for Blind Users Using an Object-Oriented Graphic Description Language","authors":"N. Takagi, Takashi Suzuki, Tomoyuki Araki","doi":"10.1109/ICMLC51923.2020.9469041","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469041","url":null,"abstract":"With the development of information processing systems such as screen readers, the accessibility of various information to visually impaired people has been dramatically improved in recent years. In addition, InftyReader, an OCR system for mathematical expression recognition, is making it easier to access scientific documents with mathematical expressions. However, there are still large barriers for blind people to access visual information such as graphs and diagrams. In particular, it is almost impossible for blind people to generate precise figures without the assistance of sighted people. Therefore, we are developing a graphic description language available for the blind. In conventional graphic description languages, it was necessary to accurately specify the coordinates of parameters when drawing elementary shapes. By introducing an object-oriented idea, our language enables users not to specify many coordinates. We have produced an experimental drawing assistant system using our language. In this paper, we outline our language and system, and show the results of experiments which were conducted to verify the effectiveness of our system.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126138443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bert-Pair-Networks for Sentiment Classification 情感分类的bert - pair网络
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469534
Ziwen Wang, Haiming Wu, Han Liu, Qianhua Cai
BERT has demonstrated excellent performance in natural language processing due to the training on large amounts of text corpus in an unsupervised way. However, this model is trained to predict the next sentence, and thus it is good at dealing with sentence pair tasks but may not be sufficiently good for other tasks. In our paper, we introduce a novel representation framework BERT-pair-Networks (p-BERTs) for sentiment classification, where p-BERTs involve adopting BERT to encode sentences for sentiment classification as a classic task of single sentence classification, using the auxiliary sentence, and a feature extraction layer on the top. Results on three datasets show that our method achieves considerably improved performance.
BERT以无监督的方式对大量文本语料库进行训练,在自然语言处理中表现出优异的性能。然而,这个模型被训练来预测下一个句子,因此它很擅长处理句子对任务,但对于其他任务可能不够好。在本文中,我们引入了一种新的情感分类表示框架BERT-pair- networks (p-BERTs),其中p-BERTs涉及将BERT作为单句分类的经典任务对句子进行情感分类编码,使用辅助句,并在其顶部添加特征提取层。在三个数据集上的结果表明,我们的方法取得了显著的性能提升。
{"title":"Bert-Pair-Networks for Sentiment Classification","authors":"Ziwen Wang, Haiming Wu, Han Liu, Qianhua Cai","doi":"10.1109/ICMLC51923.2020.9469534","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469534","url":null,"abstract":"BERT has demonstrated excellent performance in natural language processing due to the training on large amounts of text corpus in an unsupervised way. However, this model is trained to predict the next sentence, and thus it is good at dealing with sentence pair tasks but may not be sufficiently good for other tasks. In our paper, we introduce a novel representation framework BERT-pair-Networks (p-BERTs) for sentiment classification, where p-BERTs involve adopting BERT to encode sentences for sentiment classification as a classic task of single sentence classification, using the auxiliary sentence, and a feature extraction layer on the top. Results on three datasets show that our method achieves considerably improved performance.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"452 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121833253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Novel Chinese Reading Comprehension Model Based on Attention Mechanism and Convolutional Neural Networks 一种基于注意机制和卷积神经网络的汉语阅读理解模型
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469533
C. Fahn, Yi-Lun Wang, Chu-Ping Lee
This paper presents a novel machine reading comprehension model based on deep learning techniques in Chinese environment. In our manner, the training process can be performed using a general-level GPU, and the convergence of the training process can be accelerated for a shorter period of time. In the architectural design, two main constituting parts include Self-Attention Mechanism and Convolutional Neural Networks. To enhance the interaction between an article and questions, we carry out the operation of Context-Query Attention twice, so that our model is more effectively for acquiring the information of the questions related to the article and converges faster while training. In the experiment, we adopt the Delta Reading Comprehension Dataset for model evaluation in Chinese environment. The experimental results reveal that our model is able to reach the accuracy of 64.9% for EM and 79.0% for Fl. The convergence time is less than 1 hour using the Titan XP GPU, and the memory usage is comparatively lower. The training performance is about 3 times faster than other models with state- of-the-art architecture.
提出了一种基于深度学习技术的中文环境下机器阅读理解模型。在我们的方法中,训练过程可以使用通用级GPU来执行,并且可以在更短的时间内加速训练过程的收敛。在架构设计中,自注意机制和卷积神经网络是两个主要组成部分。为了增强文章与问题之间的交互性,我们进行了两次上下文查询关注操作,使我们的模型能够更有效地获取文章相关问题的信息,并且在训练时收敛速度更快。在实验中,我们采用Delta阅读理解数据集进行中文环境下的模型评价。实验结果表明,我们的模型在EM和Fl上的准确率分别达到64.9%和79.0%,在Titan XP GPU上的收敛时间小于1小时,并且内存占用相对较低。训练性能比其他具有最先进架构的模型快3倍左右。
{"title":"A Novel Chinese Reading Comprehension Model Based on Attention Mechanism and Convolutional Neural Networks","authors":"C. Fahn, Yi-Lun Wang, Chu-Ping Lee","doi":"10.1109/ICMLC51923.2020.9469533","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469533","url":null,"abstract":"This paper presents a novel machine reading comprehension model based on deep learning techniques in Chinese environment. In our manner, the training process can be performed using a general-level GPU, and the convergence of the training process can be accelerated for a shorter period of time. In the architectural design, two main constituting parts include Self-Attention Mechanism and Convolutional Neural Networks. To enhance the interaction between an article and questions, we carry out the operation of Context-Query Attention twice, so that our model is more effectively for acquiring the information of the questions related to the article and converges faster while training. In the experiment, we adopt the Delta Reading Comprehension Dataset for model evaluation in Chinese environment. The experimental results reveal that our model is able to reach the accuracy of 64.9% for EM and 79.0% for Fl. The convergence time is less than 1 hour using the Titan XP GPU, and the memory usage is comparatively lower. The training performance is about 3 times faster than other models with state- of-the-art architecture.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129317782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hyperspectral Image Classification Approach Based on Wasserstein Generative Adversarial Networks 基于Wasserstein生成对抗网络的高光谱图像分类方法
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469586
Naigeng Chen, Chenming Li
Hyperspectral image classification is an important research direction in the application of remote sensing technology. In the process of labeling different types of objects based on spectral information and geometric spatial characteristics, noise interference often exists in continuous multi-band spectral information, which brings great troubles to spectral feature extraction. Besides, far from enough spectral samples will restrict the classification performance of the algorithm to some extent. In order to solve the problem of small amount of original spectral sample data and noisy signal, Wasserstein generative adversarial networks (WGAN) is used to generate samples similar to the original spectrum, and spectral features are extracted from the samples. In the case of small samples, the original materials are provided for the classification of hyperspectral images and a semi-supervised classification model WGAN-CNN for hyperspectral images based on Wasserstein generation antagonistic network is proposed in this paper. This model combines with CNN classifier and completes the classification of terrain objects according to the label for the synthesized samples. The proposed method is compared with several classical hyperspectral image classification methods in classification accuracy. WGAN-CNN can achieve higher classification accuracy in the case of small sample size, which proves the effectiveness of the proposed method.
高光谱图像分类是遥感技术应用中的一个重要研究方向。在基于光谱信息和几何空间特征对不同类型目标进行标记的过程中,连续的多波段光谱信息中往往存在噪声干扰,给光谱特征提取带来很大的困扰。此外,光谱样本的不足也会在一定程度上限制算法的分类性能。为了解决原始频谱样本数据量少、信号有噪声的问题,采用Wasserstein生成对抗网络(WGAN)生成与原始频谱相似的样本,并从样本中提取频谱特征。在小样本情况下,为高光谱图像分类提供了原始材料,提出了一种基于Wasserstein生成对抗网络的高光谱图像半监督分类模型WGAN-CNN。该模型结合CNN分类器,根据合成样本的标签完成地形目标的分类。将该方法与几种经典的高光谱图像分类方法在分类精度上进行了比较。在小样本量的情况下,WGAN-CNN可以达到较高的分类精度,证明了所提方法的有效性。
{"title":"Hyperspectral Image Classification Approach Based on Wasserstein Generative Adversarial Networks","authors":"Naigeng Chen, Chenming Li","doi":"10.1109/ICMLC51923.2020.9469586","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469586","url":null,"abstract":"Hyperspectral image classification is an important research direction in the application of remote sensing technology. In the process of labeling different types of objects based on spectral information and geometric spatial characteristics, noise interference often exists in continuous multi-band spectral information, which brings great troubles to spectral feature extraction. Besides, far from enough spectral samples will restrict the classification performance of the algorithm to some extent. In order to solve the problem of small amount of original spectral sample data and noisy signal, Wasserstein generative adversarial networks (WGAN) is used to generate samples similar to the original spectrum, and spectral features are extracted from the samples. In the case of small samples, the original materials are provided for the classification of hyperspectral images and a semi-supervised classification model WGAN-CNN for hyperspectral images based on Wasserstein generation antagonistic network is proposed in this paper. This model combines with CNN classifier and completes the classification of terrain objects according to the label for the synthesized samples. The proposed method is compared with several classical hyperspectral image classification methods in classification accuracy. WGAN-CNN can achieve higher classification accuracy in the case of small sample size, which proves the effectiveness of the proposed method.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"302 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132096272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Multiple Deep Learner Approach for X-Ray Image-Based Pneumonia Detection 基于x射线图像的肺炎检测的多重深度学习方法
Pub Date : 2020-12-02 DOI: 10.1109/ICMLC51923.2020.9469043
Zonglin Yang, Qiang Zhao
Pneumonia is a lung disease caused by bacterial or viral infection. Early diagnosis is an important factor for successful treatment. In this study, we use three well-known convolutional neural network models, namely Faster RCNN ResNet-101, Mask RCNN ResNet-101, and Mask RCNN ResNet-50 for detection of pneumonia. We use data augmentation, transfer learning and fine-tuning in the training stage. Experimental results show that different networks have different characteristics on the same dataset. Therefore, we propose a multiple deep learner approach to improve the prediction performance via combination of different object detection models. As a result, the proposed approach can find more opacity areas of the lungs where the early symptoms are not evident. While maintaining the prediction accuracy, the proposed method can predict the bounding box size more precisely with a higher confidence score.
肺炎是一种由细菌或病毒感染引起的肺部疾病。早期诊断是成功治疗的重要因素。在本研究中,我们使用了三个著名的卷积神经网络模型,即Faster RCNN ResNet-101、Mask RCNN ResNet-101和Mask RCNN ResNet-50来检测肺炎。我们在训练阶段使用数据增强、迁移学习和微调。实验结果表明,不同的网络在同一数据集上具有不同的特征。因此,我们提出了一种多深度学习方法,通过组合不同的目标检测模型来提高预测性能。因此,该方法可以发现更多早期症状不明显的肺不透明区域。在保持预测精度的同时,该方法能够以较高的置信度更精确地预测边界盒大小。
{"title":"A Multiple Deep Learner Approach for X-Ray Image-Based Pneumonia Detection","authors":"Zonglin Yang, Qiang Zhao","doi":"10.1109/ICMLC51923.2020.9469043","DOIUrl":"https://doi.org/10.1109/ICMLC51923.2020.9469043","url":null,"abstract":"Pneumonia is a lung disease caused by bacterial or viral infection. Early diagnosis is an important factor for successful treatment. In this study, we use three well-known convolutional neural network models, namely Faster RCNN ResNet-101, Mask RCNN ResNet-101, and Mask RCNN ResNet-50 for detection of pneumonia. We use data augmentation, transfer learning and fine-tuning in the training stage. Experimental results show that different networks have different characteristics on the same dataset. Therefore, we propose a multiple deep learner approach to improve the prediction performance via combination of different object detection models. As a result, the proposed approach can find more opacity areas of the lungs where the early symptoms are not evident. While maintaining the prediction accuracy, the proposed method can predict the bounding box size more precisely with a higher confidence score.","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127824933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
[Copyright notice] (版权)
Pub Date : 2020-12-02 DOI: 10.1109/icmlc51923.2020.9469563
{"title":"[Copyright notice]","authors":"","doi":"10.1109/icmlc51923.2020.9469563","DOIUrl":"https://doi.org/10.1109/icmlc51923.2020.9469563","url":null,"abstract":"","PeriodicalId":170815,"journal":{"name":"2020 International Conference on Machine Learning and Cybernetics (ICMLC)","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128218982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2020 International Conference on Machine Learning and Cybernetics (ICMLC)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1