首页 > 最新文献

Proceedings of the International Conference on Research in Adaptive and Convergent Systems最新文献

英文 中文
A study of the estimation of Stroke ASPECTS Scores based on NCCT brain scan images using deep learning 基于NCCT脑扫描图像的脑卒中方面评分的深度学习估计研究
Su-min Jung, T. Whangbo
Stroke is a high-risk disease causing death, permanent disability in patients, and is the leading cause of death worldwide. Stroke can be quickly examined for disease through CT, an imaging diagnostic tool. However, the diagnosis of Ischemic Stroke using a CT image has the advantage of being able to take a picture in a short time with less restrictions in place, but there is a problem that diagnosis through an image is very difficult. In this paper, we propose a deep learning system capable of learning and classifying ischemic stroke diseases that are small datasets and difficult to learn about image data. We propose a preprocessing algorithm optimized for ischemic stroke based on Non-Contrast CT data in Middle Cerebral Artery (MCA) area.
中风是一种导致患者死亡和永久性残疾的高风险疾病,是世界范围内死亡的主要原因。通过CT(一种成像诊断工具)可以快速检查中风的疾病。然而,利用CT图像诊断缺血性中风的优点是可以在短时间内拍摄一张照片,限制较少,但也存在一个问题,即通过图像进行诊断非常困难。在本文中,我们提出了一个深度学习系统,能够对小数据集和难以学习的图像数据进行缺血性中风疾病的学习和分类。提出了一种基于大脑中动脉(MCA)区域非对比CT数据的缺血性脑卒中预处理算法。
{"title":"A study of the estimation of Stroke ASPECTS Scores based on NCCT brain scan images using deep learning","authors":"Su-min Jung, T. Whangbo","doi":"10.1145/3400286.3418268","DOIUrl":"https://doi.org/10.1145/3400286.3418268","url":null,"abstract":"Stroke is a high-risk disease causing death, permanent disability in patients, and is the leading cause of death worldwide. Stroke can be quickly examined for disease through CT, an imaging diagnostic tool. However, the diagnosis of Ischemic Stroke using a CT image has the advantage of being able to take a picture in a short time with less restrictions in place, but there is a problem that diagnosis through an image is very difficult. In this paper, we propose a deep learning system capable of learning and classifying ischemic stroke diseases that are small datasets and difficult to learn about image data. We propose a preprocessing algorithm optimized for ischemic stroke based on Non-Contrast CT data in Middle Cerebral Artery (MCA) area.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"94 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117296966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using Synthesized Data to Train Deep Neural Net with Few Data 利用综合数据训练少数据深度神经网络
Cheng-Shao Chiang, C.-S. Shih
As Computer-Assisted Surgery (CAS) getting popular, more and more research has been conducted to help surgeons operate. We aim at the semantic segmentation in the endoscopy surgery scenario because semantic segmentation is the first step for a computer to grasp what shows up in the vision of an endoscope. However, modern Deep Learning algorithms need myriads of training data. Since data of the endoscopy surgery scene is relatively scarce, the performance of existing algorithms is thus rather limited. Therefore, we tried to solve the problem of training a semantic segmentation network with few data in this work. We propose a proof-of-concept system offering the ability to enlarge the dataset and improve the performance. The system aims to synthesize a pair of training data in a single pass and provides a sufficient amount of data to train a network. We evaluated our method using the dataset provided by MICCAI 2018 Robotic Scene Segmentation Sub-Challenge. Our method yielded 11.79% mIoU improvement in recognizing anatomical objects and 2.2% mIoU in recognizing surgical instruments. Recognizing anatomical objects accurately would definitely benefit CAS. Preliminary results suggest our method helps the classifier become more robust and accurate even if not having large amount of data.
随着计算机辅助手术(CAS)的普及,越来越多的研究被用于帮助外科医生进行手术。我们的目标是内窥镜手术场景中的语义分割,因为语义分割是计算机掌握内窥镜视觉中显示的内容的第一步。然而,现代深度学习算法需要大量的训练数据。由于内窥镜手术场景的数据相对较少,现有算法的性能相当有限。因此,我们在这项工作中试图解决在数据较少的情况下训练语义分割网络的问题。我们提出了一个概念验证系统,提供了扩大数据集和提高性能的能力。该系统旨在一次合成一对训练数据,并提供足够的数据量来训练网络。我们使用MICCAI 2018机器人场景分割子挑战提供的数据集评估了我们的方法。该方法在解剖物体识别上的mIoU提高了11.79%,在手术器械识别上的mIoU提高了2.2%。准确识别解剖对象对CAS无疑是有益的。初步结果表明,即使没有大量的数据,我们的方法也可以帮助分类器变得更加鲁棒和准确。
{"title":"Using Synthesized Data to Train Deep Neural Net with Few Data","authors":"Cheng-Shao Chiang, C.-S. Shih","doi":"10.1145/3400286.3418244","DOIUrl":"https://doi.org/10.1145/3400286.3418244","url":null,"abstract":"As Computer-Assisted Surgery (CAS) getting popular, more and more research has been conducted to help surgeons operate. We aim at the semantic segmentation in the endoscopy surgery scenario because semantic segmentation is the first step for a computer to grasp what shows up in the vision of an endoscope. However, modern Deep Learning algorithms need myriads of training data. Since data of the endoscopy surgery scene is relatively scarce, the performance of existing algorithms is thus rather limited. Therefore, we tried to solve the problem of training a semantic segmentation network with few data in this work. We propose a proof-of-concept system offering the ability to enlarge the dataset and improve the performance. The system aims to synthesize a pair of training data in a single pass and provides a sufficient amount of data to train a network. We evaluated our method using the dataset provided by MICCAI 2018 Robotic Scene Segmentation Sub-Challenge. Our method yielded 11.79% mIoU improvement in recognizing anatomical objects and 2.2% mIoU in recognizing surgical instruments. Recognizing anatomical objects accurately would definitely benefit CAS. Preliminary results suggest our method helps the classifier become more robust and accurate even if not having large amount of data.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130903970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Content-Based Collaborative Filtering using Word Embedding: A Case Study on Movie Recommendation 基于内容的基于词嵌入的协同过滤:以电影推荐为例
Luong Vuong Nguyen, Tri-Hai Nguyen, Jason J. Jung
The lack of sufficient ratings will reduce effectively modeling user reference and finding trustworthy similar users in collaborative filtering (CF)-based recommendation systems, also known as a cold-start problem. To solve this problem and improve the efficiency of recommendation systems, we propose a new content-based CF approach based on item similarity. We apply the model in the movie domain and extract features such as genres, directors, actors, and plots of the movies. We use the Jaccard coefficient index to covert the extracted features such as genres, directors, actors to the vectors while the plot feature is converted to the semantic vectors. Then, the similarity of the movies is calculated by soft cosine measure based on vectorized features. We apply the word embedding model (i.e., Word2Vec) for representing the plots feature as semantic vectors instead of using traditional models such as a binary bag of words and a TF-IDF vector space. Experiment results show the superiority of the proposed system in terms of accuracy, precision, recall, and F1 scores in cold-start conditions compared to the baseline systems.
在基于协同过滤(CF)的推荐系统中,缺乏足够的评分将降低用户参考的有效建模和寻找值得信赖的相似用户,也称为冷启动问题。为了解决这一问题并提高推荐系统的效率,我们提出了一种基于项目相似度的基于内容的CF方法。我们将该模型应用于电影领域,并提取电影的类型、导演、演员和情节等特征。我们使用Jaccard系数指数将提取的类型、导演、演员等特征转换为向量,同时将情节特征转换为语义向量。然后,基于矢量化特征,通过软余弦度量计算电影的相似度;我们使用词嵌入模型(即Word2Vec)来将地块特征表示为语义向量,而不是使用传统的模型,如二进制词包和TF-IDF向量空间。实验结果表明,与基线系统相比,该系统在冷启动条件下的准确率、精密度、召回率和F1分数方面具有优势。
{"title":"Content-Based Collaborative Filtering using Word Embedding: A Case Study on Movie Recommendation","authors":"Luong Vuong Nguyen, Tri-Hai Nguyen, Jason J. Jung","doi":"10.1145/3400286.3418253","DOIUrl":"https://doi.org/10.1145/3400286.3418253","url":null,"abstract":"The lack of sufficient ratings will reduce effectively modeling user reference and finding trustworthy similar users in collaborative filtering (CF)-based recommendation systems, also known as a cold-start problem. To solve this problem and improve the efficiency of recommendation systems, we propose a new content-based CF approach based on item similarity. We apply the model in the movie domain and extract features such as genres, directors, actors, and plots of the movies. We use the Jaccard coefficient index to covert the extracted features such as genres, directors, actors to the vectors while the plot feature is converted to the semantic vectors. Then, the similarity of the movies is calculated by soft cosine measure based on vectorized features. We apply the word embedding model (i.e., Word2Vec) for representing the plots feature as semantic vectors instead of using traditional models such as a binary bag of words and a TF-IDF vector space. Experiment results show the superiority of the proposed system in terms of accuracy, precision, recall, and F1 scores in cold-start conditions compared to the baseline systems.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130423259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Data Augmentation and D-vector Representation Methods for Speaker Change Detection 说话人变化检测的数据增强和d向量表示方法
Jisu Park, Shin Cha, Seongbae Eun, J. Park, Young-Sun Yun
Speaker Change Detection (SCD) is the process that detects speaker changes during a conversation. The conversation can be divided into homogeneous segments using a typical SCD system or speaker diarization system in which the segments are partitioned according to a speaker identity. When the d-vectors are used to identify or verify the speakers with deep neural network model, they are often considered insufficient to train model for detecting the speaker changes by using only acoustic information. There are few dedicated datasets for system training, so the progress of the SCD study is slow and the performance is poor. Therefore, we presented data augmentation method based on TIMIT dataset to suit for the system, and we also proposed several methods to represent d-vectors for SCD systems and their preliminary results. In the proposed data augmentation method, the boundary information of speakers is transformed into probability according to the offset in a given frame and collected in the segment. To model the boundaries of the speakers, we concatenate two random speech sentences dedicated to speech recognition system. The preliminary experimental results, specifically recall percentage, shows the possibility of the proposed approaches. In the future, we will add linguistic information to the proposed classification system, or improve the system to use hybrid system of d-vector and frame vectors, or convolutional networks.
说话人变化检测(SCD)是在会话过程中检测说话人变化的过程。使用典型的SCD系统或根据说话人身份划分的说话人分组系统,可以将会话划分为同质段。当d向量用于深度神经网络模型识别或验证说话人时,通常认为仅使用声学信息不足以训练检测说话人变化的模型。由于用于系统训练的专用数据集较少,SCD研究进展缓慢,性能较差。因此,我们提出了适合该系统的基于TIMIT数据集的数据增强方法,并提出了几种用于SCD系统的d向量表示方法及其初步结果。在本文提出的数据增强方法中,根据给定帧中的偏移量将说话人的边界信息转换为概率,并在片段中收集。为了模拟说话者的边界,我们将两个随机的语音句子连接在一起,用于语音识别系统。初步的实验结果,特别是召回率,表明了所提出的方法的可能性。在未来,我们将在提出的分类系统中添加语言信息,或者使用d向量和帧向量的混合系统或卷积网络来改进系统。
{"title":"Data Augmentation and D-vector Representation Methods for Speaker Change Detection","authors":"Jisu Park, Shin Cha, Seongbae Eun, J. Park, Young-Sun Yun","doi":"10.1145/3400286.3418270","DOIUrl":"https://doi.org/10.1145/3400286.3418270","url":null,"abstract":"Speaker Change Detection (SCD) is the process that detects speaker changes during a conversation. The conversation can be divided into homogeneous segments using a typical SCD system or speaker diarization system in which the segments are partitioned according to a speaker identity. When the d-vectors are used to identify or verify the speakers with deep neural network model, they are often considered insufficient to train model for detecting the speaker changes by using only acoustic information. There are few dedicated datasets for system training, so the progress of the SCD study is slow and the performance is poor. Therefore, we presented data augmentation method based on TIMIT dataset to suit for the system, and we also proposed several methods to represent d-vectors for SCD systems and their preliminary results. In the proposed data augmentation method, the boundary information of speakers is transformed into probability according to the offset in a given frame and collected in the segment. To model the boundaries of the speakers, we concatenate two random speech sentences dedicated to speech recognition system. The preliminary experimental results, specifically recall percentage, shows the possibility of the proposed approaches. In the future, we will add linguistic information to the proposed classification system, or improve the system to use hybrid system of d-vector and frame vectors, or convolutional networks.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133299049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Accelerating Variant Calling with Parallelized DeepVariant 用并行化DeepVariant加速变量调用
Chih-Han Yang, Jhih-Wun Zeng, C. Liu, Shih-Hao Hung
Due to the rapid evolution of the next-generation sequencing (NGS) technology, the sequence of an individual's genome can be determined from billions of short reads at a decreasing cost, which has advanced the fields of medical research and precision medicine with the ability to correlate mutations between genomes. Analysis of genome sequences, especially variant calling, is exceedingly computationally intensive, as it demands large storage capacity, computing power, and high-speed network to reduce the processing time. In the case of DeepVariant, an open-source software package which employs a deep neural network (DNN) to calls genetic variants, it took four hours to complete the analysis on a workstation with a high-performance GPU device to accelerate the DNN. Therefore, we profiled the performance of DeepVariant and refactored the code to reduce the time and cost of the NGS pipeline with a series of code optimization works. As a result, our distributed version of DeepVariant can finish the same job within 8 minutes on 8 dual-CPU nodes and 8 GPUs, which outperforms commercial versions in the market.
由于下一代测序(NGS)技术的快速发展,可以以更低的成本从数十亿个短读数中确定个体基因组的序列,这推动了医学研究和精准医学领域的发展,能够将基因组之间的突变联系起来。基因组序列的分析,特别是对变异的调用,需要大量的存储容量和计算能力,需要高速的网络来缩短处理时间。DeepVariant是一个开源软件包,它使用深度神经网络(DNN)来调用遗传变异,在一个工作站上用高性能GPU设备来加速DNN,花了四个小时完成分析。因此,我们对DeepVariant的性能进行了分析,并对代码进行了重构,通过一系列的代码优化工作来减少NGS管道的时间和成本。因此,我们的分布式版本DeepVariant可以在8个双cpu节点和8个gpu上在8分钟内完成相同的工作,优于市场上的商业版本。
{"title":"Accelerating Variant Calling with Parallelized DeepVariant","authors":"Chih-Han Yang, Jhih-Wun Zeng, C. Liu, Shih-Hao Hung","doi":"10.1145/3400286.3418243","DOIUrl":"https://doi.org/10.1145/3400286.3418243","url":null,"abstract":"Due to the rapid evolution of the next-generation sequencing (NGS) technology, the sequence of an individual's genome can be determined from billions of short reads at a decreasing cost, which has advanced the fields of medical research and precision medicine with the ability to correlate mutations between genomes. Analysis of genome sequences, especially variant calling, is exceedingly computationally intensive, as it demands large storage capacity, computing power, and high-speed network to reduce the processing time. In the case of DeepVariant, an open-source software package which employs a deep neural network (DNN) to calls genetic variants, it took four hours to complete the analysis on a workstation with a high-performance GPU device to accelerate the DNN. Therefore, we profiled the performance of DeepVariant and refactored the code to reduce the time and cost of the NGS pipeline with a series of code optimization works. As a result, our distributed version of DeepVariant can finish the same job within 8 minutes on 8 dual-CPU nodes and 8 GPUs, which outperforms commercial versions in the market.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123425916","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Design and Research of Permanent Magnet Synchronous Motor Controller and Protection System Based on FPGA 基于FPGA的永磁同步电机控制器及保护系统的设计与研究
G. Peng, Yufeng Chen, Zhengtao Xiang, Kai Che, Jinliang Zhang, Lianbing Xu
Motors have a wide range of applications in various aspects such as automotive, medical, industrial production, etc. Commonly used motors can generally be divided into DC motors and AC motors. Permanent Magnet Synchronous Motor (PMSM) is a type of AC motors with strong starting ability, high peak efficiency and high reliability, and with greater application value. This paper mainly studies the control technology of PMSM, and designs a set of motor controller and its protection system based on FPGA. First, based on the development and simulation platform of FPGA, the important algorithms of the motor controller and protection system are simulated and analysed. Then the system is built in the actual environment. Through actual testing, the speed control of the motor can be accurately achieved and various fault protection and instructions of the motor controller can be realized. The system is based on the development platform of FPGA, which with fast running speed, high flexibility, short development cycle, high resource utilization rate, and strong portability.
电机在汽车、医疗、工业生产等各个方面有着广泛的应用。常用的电机一般可分为直流电机和交流电机。永磁同步电机(PMSM)是一种起动能力强、峰值效率高、可靠性高的交流电机,具有较大的应用价值。本文主要研究了永磁同步电机的控制技术,设计了一套基于FPGA的电机控制器及其保护系统。首先,基于FPGA开发仿真平台,对电机控制器和保护系统的重要算法进行了仿真分析。然后在实际环境中构建系统。通过实际测试,可以准确实现电机的转速控制,实现电机控制器的各种故障保护和指令。该系统基于FPGA开发平台,具有运行速度快、灵活性高、开发周期短、资源利用率高、可移植性强等特点。
{"title":"Design and Research of Permanent Magnet Synchronous Motor Controller and Protection System Based on FPGA","authors":"G. Peng, Yufeng Chen, Zhengtao Xiang, Kai Che, Jinliang Zhang, Lianbing Xu","doi":"10.1145/3400286.3418247","DOIUrl":"https://doi.org/10.1145/3400286.3418247","url":null,"abstract":"Motors have a wide range of applications in various aspects such as automotive, medical, industrial production, etc. Commonly used motors can generally be divided into DC motors and AC motors. Permanent Magnet Synchronous Motor (PMSM) is a type of AC motors with strong starting ability, high peak efficiency and high reliability, and with greater application value. This paper mainly studies the control technology of PMSM, and designs a set of motor controller and its protection system based on FPGA. First, based on the development and simulation platform of FPGA, the important algorithms of the motor controller and protection system are simulated and analysed. Then the system is built in the actual environment. Through actual testing, the speed control of the motor can be accurately achieved and various fault protection and instructions of the motor controller can be realized. The system is based on the development platform of FPGA, which with fast running speed, high flexibility, short development cycle, high resource utilization rate, and strong portability.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"75 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122610249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Kernel-controlled DQN based CNN Pruning for Model Compression and Acceleration 基于核控制DQN的CNN剪枝模型压缩与加速
Romancha Khatri, Kwanghee Won
Apart from the accuracy, the size of Convolutional Neural Networks (CNN) model is another principal factor for facilitating the deployment of models on memory, power and budget constrained devices. Conventional compression techniques require human expert to setup parameters to explore the design space and iterative based pruning requires heavy training which is sub-optimal and time consuming. Given a CNN model, we propose deep reinforcement learning [8] DQN based automated compression which effectively turned off kernels on each layer by observing its significance. Observing accuracy, compression ratio and convergence rate, proposed DQN model can automatically re- activate the healthiest kernels back to train it again to regain accuracy which greatly ameliorate the model compression quality. Based on experiments on MNIST [3] dataset, our method can compress convolution layers for VGG-like [10] model up to 60% with 0.5% increase in test accuracy within less than a half the number of initial amount of training (speed-up up to 2.5×), state- of-the-art results of dropping 80% of kernels (compressed 86% parameters) with increase in accuracy by 0.14%. Further dropping 84% of kernels (compressed 94% parameters) with the loss of 0.4% accuracy. The first proposed Auto-AEC (Accuracy-Ensured Compression) model can compress the network by preserving original accuracy or increase in accuracy of the model, whereas, the second proposed Auto-CECA (Compression-Ensured Considering the Accuracy) model can compress to the maximum by preserving original accuracy or minimal drop of accuracy. We further analyze effectiveness of kernels on different layers based on how our model explores and exploits in various stages of training.
除了精度之外,卷积神经网络(CNN)模型的大小是促进模型在内存,功率和预算受限的设备上部署的另一个主要因素。传统的压缩技术需要人类专家设置参数来探索设计空间,而基于迭代的剪枝需要大量的训练,这是次优的且耗时的。给定一个CNN模型,我们提出了基于深度强化学习[8]DQN的自动压缩,通过观察每一层的显著性,有效地关闭了每一层的核。通过观察准确率、压缩率和收敛率,该DQN模型可以自动重新激活最健康的核重新训练,从而获得精度,大大改善了模型的压缩质量。基于MNIST[3]数据集的实验,我们的方法可以在不到初始训练量的一半的时间内将vgg -类[10]模型的卷积层压缩到60%,测试精度提高0.5%(加速高达2.5倍),最先进的结果是减少80%的核(压缩86%的参数),精度提高0.14%。进一步减少84%的核(压缩94%的参数),损失0.4%的精度。第一种Auto-AEC (accuracy - assured Compression)模型可以通过保持原始精度或提高模型精度来压缩网络,而第二种Auto-CECA (Compression- assured Considering The accuracy)模型可以通过保持原始精度或降低精度来最大限度地压缩网络。基于模型在不同训练阶段的探索和利用方式,我们进一步分析了不同层上核的有效性。
{"title":"Kernel-controlled DQN based CNN Pruning for Model Compression and Acceleration","authors":"Romancha Khatri, Kwanghee Won","doi":"10.1145/3400286.3418258","DOIUrl":"https://doi.org/10.1145/3400286.3418258","url":null,"abstract":"Apart from the accuracy, the size of Convolutional Neural Networks (CNN) model is another principal factor for facilitating the deployment of models on memory, power and budget constrained devices. Conventional compression techniques require human expert to setup parameters to explore the design space and iterative based pruning requires heavy training which is sub-optimal and time consuming. Given a CNN model, we propose deep reinforcement learning [8] DQN based automated compression which effectively turned off kernels on each layer by observing its significance. Observing accuracy, compression ratio and convergence rate, proposed DQN model can automatically re- activate the healthiest kernels back to train it again to regain accuracy which greatly ameliorate the model compression quality. Based on experiments on MNIST [3] dataset, our method can compress convolution layers for VGG-like [10] model up to 60% with 0.5% increase in test accuracy within less than a half the number of initial amount of training (speed-up up to 2.5×), state- of-the-art results of dropping 80% of kernels (compressed 86% parameters) with increase in accuracy by 0.14%. Further dropping 84% of kernels (compressed 94% parameters) with the loss of 0.4% accuracy. The first proposed Auto-AEC (Accuracy-Ensured Compression) model can compress the network by preserving original accuracy or increase in accuracy of the model, whereas, the second proposed Auto-CECA (Compression-Ensured Considering the Accuracy) model can compress to the maximum by preserving original accuracy or minimal drop of accuracy. We further analyze effectiveness of kernels on different layers based on how our model explores and exploits in various stages of training.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"186 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132584112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Gender Classification from Fingerprint-images using Deep Learning Approach 基于深度学习方法的指纹图像性别分类
Beanbonyka Rim, Junseob Kim, Min Hong
Accurate gender classification from fingerprint-images brings benefits to various forensic, security and authentication analysis. Those benefits help to narrow down the space for searching and speed up the process for matching for applications such as automatic fingerprint identification systems (AFIS). However, achieving high prediction accuracy without human intervention (such as preprocessing and hand-crafted feature extraction) is currently and potentially a challenge. Therefore, this paper presents a deep learning method to automatically and conveniently estimate gender from fingerprint-images. In particular, the VGG-19, ResNet-50 and EfficientNet-B3 model were exploited to train from scratch. The raw images of fingerprints were fed into the networks for end-to-end learning. The networks trained on 8,000 images, validated on 1,520 images and tested on 360 images. Our experimental results showed that by comparing between those state-of-the-art models (VGG-19, ResNet-50 and EfficientNet-B3), EfficientNet-B3 model achieved the best accuracy of 97.89%, 69.86% and 63.05% for training, validating, and testing, respectively.
从指纹图像中准确的性别分类为各种法医、安全和认证分析带来了好处。这些优点有助于缩小搜索空间,加快自动指纹识别系统(AFIS)等应用程序的匹配过程。然而,在没有人为干预(如预处理和手工特征提取)的情况下实现高预测精度目前和潜在是一个挑战。为此,本文提出了一种基于深度学习的指纹图像性别自动估计方法。特别是,VGG-19, ResNet-50和EfficientNet-B3模型被利用来从头开始训练。指纹的原始图像被输入到网络中进行端到端学习。这些网络在8000张图片上进行训练,在1520张图片上进行验证,在360张图片上进行测试。实验结果表明,通过对比VGG-19、ResNet-50和EfficientNet-B3三种最先进的模型,在训练、验证和测试中,EfficientNet-B3模型的准确率分别达到了97.89%、69.86%和63.05%。
{"title":"Gender Classification from Fingerprint-images using Deep Learning Approach","authors":"Beanbonyka Rim, Junseob Kim, Min Hong","doi":"10.1145/3400286.3418237","DOIUrl":"https://doi.org/10.1145/3400286.3418237","url":null,"abstract":"Accurate gender classification from fingerprint-images brings benefits to various forensic, security and authentication analysis. Those benefits help to narrow down the space for searching and speed up the process for matching for applications such as automatic fingerprint identification systems (AFIS). However, achieving high prediction accuracy without human intervention (such as preprocessing and hand-crafted feature extraction) is currently and potentially a challenge. Therefore, this paper presents a deep learning method to automatically and conveniently estimate gender from fingerprint-images. In particular, the VGG-19, ResNet-50 and EfficientNet-B3 model were exploited to train from scratch. The raw images of fingerprints were fed into the networks for end-to-end learning. The networks trained on 8,000 images, validated on 1,520 images and tested on 360 images. Our experimental results showed that by comparing between those state-of-the-art models (VGG-19, ResNet-50 and EfficientNet-B3), EfficientNet-B3 model achieved the best accuracy of 97.89%, 69.86% and 63.05% for training, validating, and testing, respectively.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"117 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131822569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Multimodal Neuroimaging Game Theoretic Data Fusion in Adversarial Conditions 对抗条件下的多模态神经成像博弈论数据融合
C. Esposito, Oscar Tamburis, Chang Choi
This paper proposes the application of three key methods to multimodal neuroimaging data fusion. The first step is to classify neurodegenerative brain diseases in the considered scans from the available neuroimaging techniques. We propose to classify scans by selecting relevant disease detection features utilizing a gametheoretic approach and evidence combination. We applied a filtering feature selection based on a coalitional game. The second step is to aggregate the classifiers' outcomes by leveraging an improvement of the Dempster-Shafer combination rule obtained by applying evolutionary game theory to determine a final decision from the various classifiers' results, also considering the subjective doctor opinion. Last, the overall solution can be deployed in a distributed manner. The robustness of the interactions is achievable by modeling them as a signaling game to determine when rejecting those messages suspected of being malicious.
本文提出了三种关键方法在多模态神经影像数据融合中的应用。第一步是从可用的神经成像技术中对考虑的扫描中的神经退行性脑疾病进行分类。我们建议通过使用博弈论方法和证据组合来选择相关的疾病检测特征来对扫描进行分类。我们应用了一个基于联盟博弈的过滤特征选择。第二步是利用进化博弈论得到的Dempster-Shafer组合规则的改进来汇总分类器的结果,从各个分类器的结果中确定最终决策,同时考虑主观医生意见。最后,整个解决方案可以以分布式方式部署。通过将交互建模为信号游戏来确定何时拒绝那些被怀疑是恶意的消息,可以实现交互的鲁棒性。
{"title":"Multimodal Neuroimaging Game Theoretic Data Fusion in Adversarial Conditions","authors":"C. Esposito, Oscar Tamburis, Chang Choi","doi":"10.1145/3400286.3418269","DOIUrl":"https://doi.org/10.1145/3400286.3418269","url":null,"abstract":"This paper proposes the application of three key methods to multimodal neuroimaging data fusion. The first step is to classify neurodegenerative brain diseases in the considered scans from the available neuroimaging techniques. We propose to classify scans by selecting relevant disease detection features utilizing a gametheoretic approach and evidence combination. We applied a filtering feature selection based on a coalitional game. The second step is to aggregate the classifiers' outcomes by leveraging an improvement of the Dempster-Shafer combination rule obtained by applying evolutionary game theory to determine a final decision from the various classifiers' results, also considering the subjective doctor opinion. Last, the overall solution can be deployed in a distributed manner. The robustness of the interactions is achievable by modeling them as a signaling game to determine when rejecting those messages suspected of being malicious.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125374530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Channel-Wise Attention and Channel Combination for Knowledge Distillation 知识升华的渠道关注与渠道组合
C. Han, K. Lee
Knowledge distillation is a strategy to build machine learning models efficiently by making use of knowledge embedded in a pretrained model. Teacher-student framework is a well-known one to use knowledge distillation, where a teacher network usually contains knowledge for a specific task and a student network is constructed in a simpler architecture inheriting the knowledge of the teacher network. This paper proposes a new approach that uses an attention mechanism to extract knowledge from a teacher network. The attention function plays the role of determining which channels of feature maps in the teacher network to be used for training the student network so that the student network can only learn useful features. This approach allows a new model to learn useful features considering the model complexity.
知识蒸馏是一种利用预训练模型中嵌入的知识高效构建机器学习模型的策略。师生框架是一种著名的知识蒸馏框架,其中教师网络通常包含特定任务的知识,学生网络在继承教师网络知识的基础上以更简单的架构构建。本文提出了一种利用注意力机制从教师网络中提取知识的新方法。注意函数的作用是决定教师网络中哪些特征映射通道用于训练学生网络,使学生网络只学习有用的特征。这种方法允许新模型在考虑模型复杂性的情况下学习有用的特征。
{"title":"Channel-Wise Attention and Channel Combination for Knowledge Distillation","authors":"C. Han, K. Lee","doi":"10.1145/3400286.3418273","DOIUrl":"https://doi.org/10.1145/3400286.3418273","url":null,"abstract":"Knowledge distillation is a strategy to build machine learning models efficiently by making use of knowledge embedded in a pretrained model. Teacher-student framework is a well-known one to use knowledge distillation, where a teacher network usually contains knowledge for a specific task and a student network is constructed in a simpler architecture inheriting the knowledge of the teacher network. This paper proposes a new approach that uses an attention mechanism to extract knowledge from a teacher network. The attention function plays the role of determining which channels of feature maps in the teacher network to be used for training the student network so that the student network can only learn useful features. This approach allows a new model to learn useful features considering the model complexity.","PeriodicalId":326100,"journal":{"name":"Proceedings of the International Conference on Research in Adaptive and Convergent Systems","volume":"485 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121122424","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings of the International Conference on Research in Adaptive and Convergent Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1