首页 > 最新文献

2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)最新文献

英文 中文
A comparative study of voice conversion techniques: A review 语音转换技术的比较研究综述
Kadria Ezzine, M. Frikha
Speaker identity, the sound of a person's voice, is one of the most important characteristics in human communication. Voice conversion (VC) is an emergent problem in voice and speech processing that deals with the process of modifying a speaker's identity. More particularly, the speech signal spoken by the source speaker is modified to sound a sifit had been pronounced by another speaker, referred to as the target speaker. A variety of VC techniques has been proposed since the first appearance of the voice conversion problem. The choice among those techniques represents a compromise between the similarity of the converted voice to the target voice and the quality of the output speech signal, both rated by the used technique. In this paper, we review a comprehensive state-of-the-art of voice conversion techniques while pointing out their advantages and disadvantages. These techniques will be applied in significant and most versatile areas of speech technology; applications that are far beyond speech synthesis.
说话人的身份,即一个人的声音,是人类交际中最重要的特征之一。语音转换(VC)是语音和语音处理中的一个新兴问题,涉及到说话人身份的修改过程。更具体地说,源说话者发出的语音信号被修改成另一个说话者(称为目标说话者)发出的声音。自从语音转换问题首次出现以来,各种VC技术已经被提出。在这些技术之间的选择代表了转换语音与目标语音的相似性和输出语音信号的质量之间的折衷,两者都由所使用的技术评定。在本文中,我们回顾了语音转换技术的综合技术,同时指出了它们的优点和缺点。这些技术将应用于语音技术的重要和最通用的领域;远远超出语音合成的应用。
{"title":"A comparative study of voice conversion techniques: A review","authors":"Kadria Ezzine, M. Frikha","doi":"10.1109/ATSIP.2017.8075528","DOIUrl":"https://doi.org/10.1109/ATSIP.2017.8075528","url":null,"abstract":"Speaker identity, the sound of a person's voice, is one of the most important characteristics in human communication. Voice conversion (VC) is an emergent problem in voice and speech processing that deals with the process of modifying a speaker's identity. More particularly, the speech signal spoken by the source speaker is modified to sound a sifit had been pronounced by another speaker, referred to as the target speaker. A variety of VC techniques has been proposed since the first appearance of the voice conversion problem. The choice among those techniques represents a compromise between the similarity of the converted voice to the target voice and the quality of the output speech signal, both rated by the used technique. In this paper, we review a comprehensive state-of-the-art of voice conversion techniques while pointing out their advantages and disadvantages. These techniques will be applied in significant and most versatile areas of speech technology; applications that are far beyond speech synthesis.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"62 14","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121006974","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Tumor extraction and elimination of pectoral muscle based on hidden Markov and region growing: Applied based MIAS 基于隐马尔可夫和区域生长的胸肌肿瘤提取与消除:基于MIAS的应用
Soukaina El Idrissi El Kaitouni, A. Abbad, H. Tairi
In this article, we propose an automatic method for the detection and extraction of the tumor on mammogram images. Most methods of detection of a tumor require the extraction of a large number of texture features from multiple calculations. The study first examines a technique of preprocessing images to obtain the Otsu thresholding method to eliminate items that do not belong in. After performing the thresholding, we estimate the number of base classes of technical LBP (Local Binary Pattern). To automate the initialization task, the classification proposed by applying dynamic k-means and improve the classes obtained by the method of Markov. Then we calculate the correlation between these classes and the original image, we deduce the class that contains the tumor and muscle pectoral. Finally, it uses the method of growing the region to eliminate pectoral muscle. The result obtained by this approach shows the quality and accuracy of extracting parts of the tumor compared to existing approaches in the literature.
在本文中,我们提出了一种自动检测和提取乳房x线照片上肿瘤的方法。大多数检测肿瘤的方法需要从多次计算中提取大量的纹理特征。本研究首先探讨了一种预处理图像的技术,以获得Otsu阈值法来消除不属于的项目。在执行阈值分割后,我们估计了技术LBP(局部二值模式)的基类数量。为了实现初始化任务的自动化,提出了采用动态k-means的分类方法,并对马尔可夫方法得到的分类进行了改进。然后我们计算这些类别与原始图像之间的相关性,我们推断出包含肿瘤和胸肌的类别。最后,它使用生长区域的方法来消除胸肌。与文献中已有的方法相比,该方法获得的结果显示了肿瘤部分提取的质量和准确性。
{"title":"Tumor extraction and elimination of pectoral muscle based on hidden Markov and region growing: Applied based MIAS","authors":"Soukaina El Idrissi El Kaitouni, A. Abbad, H. Tairi","doi":"10.1109/ATSIP.2017.8075583","DOIUrl":"https://doi.org/10.1109/ATSIP.2017.8075583","url":null,"abstract":"In this article, we propose an automatic method for the detection and extraction of the tumor on mammogram images. Most methods of detection of a tumor require the extraction of a large number of texture features from multiple calculations. The study first examines a technique of preprocessing images to obtain the Otsu thresholding method to eliminate items that do not belong in. After performing the thresholding, we estimate the number of base classes of technical LBP (Local Binary Pattern). To automate the initialization task, the classification proposed by applying dynamic k-means and improve the classes obtained by the method of Markov. Then we calculate the correlation between these classes and the original image, we deduce the class that contains the tumor and muscle pectoral. Finally, it uses the method of growing the region to eliminate pectoral muscle. The result obtained by this approach shows the quality and accuracy of extracting parts of the tumor compared to existing approaches in the literature.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114975070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Embedded approach for a Riemannian-based framework of analyzing 3D faces 基于黎曼的三维人脸分析框架的嵌入式方法
T. Frikha, Faten Chaabane, Boukhchim Said, Hassen Drira, Mohamed Abid, C. Amar, Lifl Lille
Developing multimedia embedded applications continues to flourish. In fact, a biometric facial recognition system can be used not only on PCs abut also in embedded systems, it is a potential enhancer to meet security and surveillance needs. The analysis of facial recognition consists offoursteps: face analysis, face expressions’ recognition, missing data completion and full face recognition. This paper proposes a hardware architecture based on an adaptation approach foran algorithm which has proven good face detection and recognition in 3D space. The proposed application was tested using a co design technique based on a mixed Hardware Software architecture: the FPGA platform.
开发多媒体嵌入式应用程序继续蓬勃发展。事实上,生物识别面部识别系统不仅可以用于个人电脑,也可以用于嵌入式系统,它是满足安全和监控需求的潜在增强器。人脸识别的分析包括人脸分析、人脸表情识别、缺失数据补全和全人脸识别四个步骤。本文提出了一种基于自适应算法的硬件架构,该算法在三维空间中具有良好的人脸检测和识别效果。采用基于混合硬件软件架构的协同设计技术:FPGA平台对所提出的应用程序进行了测试。
{"title":"Embedded approach for a Riemannian-based framework of analyzing 3D faces","authors":"T. Frikha, Faten Chaabane, Boukhchim Said, Hassen Drira, Mohamed Abid, C. Amar, Lifl Lille","doi":"10.1109/ATSIP.2017.8075548","DOIUrl":"https://doi.org/10.1109/ATSIP.2017.8075548","url":null,"abstract":"Developing multimedia embedded applications continues to flourish. In fact, a biometric facial recognition system can be used not only on PCs abut also in embedded systems, it is a potential enhancer to meet security and surveillance needs. The analysis of facial recognition consists offoursteps: face analysis, face expressions’ recognition, missing data completion and full face recognition. This paper proposes a hardware architecture based on an adaptation approach foran algorithm which has proven good face detection and recognition in 3D space. The proposed application was tested using a co design technique based on a mixed Hardware Software architecture: the FPGA platform.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121030135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Landmine detection improvement using one-class SVM for unbalanced data 基于一类支持向量机的非平衡数据地雷探测改进
Khaoula Tbarki, S. B. Said, Riadh Ksantini, Z. Lachiri
Ground Penetrating Radar (GPR) has been a precious tool for humanitarian demining. The GPR scans the ground and delivers a three-dimensional matrix representing three types of data: Ascan, Bscan and Cscan. The Ascan data represents the response from a reflection signal of a pulse emitted by the GPR at a given position. In the proposed landmine detection method, the Ascan data is normalized and then classified using Kernel based One Class Support Vector Machine (OSVM). In fact, OSVM has the main advantage of handling unbalanced data, where is not the case for multiclass SVM. Our landmine detection method was tested and evaluated on the MACADAM database which is composed of 11 scenarios of landmines and 3 scenarios of inoffensive objects (wood stick, SodaCan, pine, stone). Experimental results have shown the superiority of the RBF kernel OSVM over others kernel functions based multiclass SVM in term of classification accuracy especially, as landmine data is unbalanced.
探地雷达一直是人道主义排雷的宝贵工具。探地雷达扫描地面并提供三维矩阵,表示三种类型的数据:Ascan, Bscan和Cscan。Ascan数据表示探地雷达在给定位置发出的脉冲反射信号的响应。在提出的地雷探测方法中,对Ascan数据进行归一化,然后使用基于核的一类支持向量机(OSVM)进行分类。事实上,OSVM的主要优势在于处理不平衡数据,而多类SVM则不具备这一点。我们的地雷探测方法在MACADAM数据库上进行了测试和评估,该数据库由11个地雷场景和3个无害物体场景(木棒、苏打棒、松树、石头)组成。实验结果表明,RBF核OSVM在分类精度方面优于其他基于核函数的多类支持向量机,特别是在地雷数据不平衡的情况下。
{"title":"Landmine detection improvement using one-class SVM for unbalanced data","authors":"Khaoula Tbarki, S. B. Said, Riadh Ksantini, Z. Lachiri","doi":"10.1109/ATSIP.2017.8075597","DOIUrl":"https://doi.org/10.1109/ATSIP.2017.8075597","url":null,"abstract":"Ground Penetrating Radar (GPR) has been a precious tool for humanitarian demining. The GPR scans the ground and delivers a three-dimensional matrix representing three types of data: Ascan, Bscan and Cscan. The Ascan data represents the response from a reflection signal of a pulse emitted by the GPR at a given position. In the proposed landmine detection method, the Ascan data is normalized and then classified using Kernel based One Class Support Vector Machine (OSVM). In fact, OSVM has the main advantage of handling unbalanced data, where is not the case for multiclass SVM. Our landmine detection method was tested and evaluated on the MACADAM database which is composed of 11 scenarios of landmines and 3 scenarios of inoffensive objects (wood stick, SodaCan, pine, stone). Experimental results have shown the superiority of the RBF kernel OSVM over others kernel functions based multiclass SVM in term of classification accuracy especially, as landmine data is unbalanced.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128141699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A fusion-based blind image quality metric for blurred stereoscopic images 基于融合的模糊立体图像盲图像质量度量
A. Chetouani
Blur is certainly one of the most encountered and the most annoying degradation types in image. It is due to several causes such as compression, motion, filtering and so on. In order to estimate the quality of this kind of degraded images, several metrics have been proposed in the literature. In this paper, we focus our attention on stereoscopic images and we propose a fusion-based blind stereoscopic image quality metric for blur degradation. In order to characterize the considered degradation type, some relevant features are first computed. Note that these features are extracted from a cyclopean image (CI) derived from the stereoscopic image. The final index quality is given by combined all features through a Support Vector Machine (SVM) model used as a regression tool. The 3D LIVE and the IEEE image databases have been used to evaluate our method. The achieved performance has been compared to the state-of-the-art.
模糊无疑是图像中最常见和最令人讨厌的退化类型之一。这是由于压缩、运动、滤波等几种原因造成的。为了估计这类退化图像的质量,文献中提出了几个度量。本文以立体图像为研究对象,提出了一种基于融合的盲立体图像质量指标。为了表征所考虑的退化类型,首先计算一些相关特征。注意,这些特征是从从立体图像派生的单眼图像(CI)中提取的。通过支持向量机(SVM)模型作为回归工具,将所有特征组合在一起,得到最终的指标质量。使用3D LIVE和IEEE图像数据库对我们的方法进行了评估。所取得的成绩已被与最先进的技术相比较。
{"title":"A fusion-based blind image quality metric for blurred stereoscopic images","authors":"A. Chetouani","doi":"10.1109/ATSIP.2017.8075530","DOIUrl":"https://doi.org/10.1109/ATSIP.2017.8075530","url":null,"abstract":"Blur is certainly one of the most encountered and the most annoying degradation types in image. It is due to several causes such as compression, motion, filtering and so on. In order to estimate the quality of this kind of degraded images, several metrics have been proposed in the literature. In this paper, we focus our attention on stereoscopic images and we propose a fusion-based blind stereoscopic image quality metric for blur degradation. In order to characterize the considered degradation type, some relevant features are first computed. Note that these features are extracted from a cyclopean image (CI) derived from the stereoscopic image. The final index quality is given by combined all features through a Support Vector Machine (SVM) model used as a regression tool. The 3D LIVE and the IEEE image databases have been used to evaluate our method. The achieved performance has been compared to the state-of-the-art.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130424300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Automatic body segmentation from computed tomography image 计算机断层扫描图像的人体自动分割
O. Dorgham
Medical imaging segmentation provides vital information for surgical diagnosis, and usually demands an accurate segmentation. A fully automated computed tomography image segmentation method is proposed. This method is unsupervised and automatic estimation of the required parameters for identifying the human body as a region of interest. The proposed methodology consists of four steps: First, a body region of interest is masked by a method based on thresholding and basic morphological operations. Second, a body region of interest is identified using chain codes and a method for collecting adjacent contours. Next, the identification of background non-regions of interest is performed using an entropy algorithm. Finally, the human body segment is identified using a GrabCut algorithm. According to the visual evaluation results, segmentation of the human body, from the Computed Tomography images, was seen to be precise and accurate. The analysis provided evidence that the human body segmentation method could be applied to segmenting other organs, registering different image modalities or speeding-up the generation of digitally reconstructed radiographs.
医学影像分割为外科诊断提供了重要的信息,通常需要精确的分割。提出了一种全自动计算机断层图像分割方法。该方法是一种无监督和自动估计所需参数的方法,用于识别人体作为感兴趣的区域。提出的方法包括四个步骤:首先,通过基于阈值和基本形态学操作的方法来掩盖感兴趣的身体区域;其次,使用链码和收集相邻轮廓的方法识别感兴趣的身体区域。接下来,使用熵算法进行背景非感兴趣区域的识别。最后,使用GrabCut算法对人体片段进行识别。从视觉评价结果来看,ct图像对人体的分割是精确和准确的。分析表明,人体分割方法可以应用于其他器官的分割、不同图像模态的配准或加速数字重建x线照片的生成。
{"title":"Automatic body segmentation from computed tomography image","authors":"O. Dorgham","doi":"10.1109/ATSIP.2017.8075612","DOIUrl":"https://doi.org/10.1109/ATSIP.2017.8075612","url":null,"abstract":"Medical imaging segmentation provides vital information for surgical diagnosis, and usually demands an accurate segmentation. A fully automated computed tomography image segmentation method is proposed. This method is unsupervised and automatic estimation of the required parameters for identifying the human body as a region of interest. The proposed methodology consists of four steps: First, a body region of interest is masked by a method based on thresholding and basic morphological operations. Second, a body region of interest is identified using chain codes and a method for collecting adjacent contours. Next, the identification of background non-regions of interest is performed using an entropy algorithm. Finally, the human body segment is identified using a GrabCut algorithm. According to the visual evaluation results, segmentation of the human body, from the Computed Tomography images, was seen to be precise and accurate. The analysis provided evidence that the human body segmentation method could be applied to segmenting other organs, registering different image modalities or speeding-up the generation of digitally reconstructed radiographs.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133700572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Hybrid context dependent CD-DNN-HMM keywords spotting on continuous speech 基于上下文的CD-DNN-HMM连续语音关键字识别
Hinda Dridi, K. Ouni
In this paper we describe a systematic procedure to implement two-stage based keywords spotting system (KWS). In first stage, a phonetic decoding of continuous speech is obtained using a CD-DNN-HMM model built with the Kaldi toolkit. In second stage, these results of phonetic transcriptions will serve to construct a system to search the keywords embedded in continuous speech using the classification and regression tree (CART) implemented with the software MATLAB. The work will be done using the TIMIT data base.
本文描述了一个基于两阶段的关键词识别系统的系统实现过程。首先,利用Kaldi工具箱构建CD-DNN-HMM模型,对连续语音进行语音解码。在第二阶段,这些语音转录结果将用于构建一个系统,使用MATLAB软件实现的分类与回归树(CART)来搜索嵌入在连续语音中的关键词。这项工作将使用TIMIT数据库完成。
{"title":"Hybrid context dependent CD-DNN-HMM keywords spotting on continuous speech","authors":"Hinda Dridi, K. Ouni","doi":"10.1109/ATSIP.2017.8075541","DOIUrl":"https://doi.org/10.1109/ATSIP.2017.8075541","url":null,"abstract":"In this paper we describe a systematic procedure to implement two-stage based keywords spotting system (KWS). In first stage, a phonetic decoding of continuous speech is obtained using a CD-DNN-HMM model built with the Kaldi toolkit. In second stage, these results of phonetic transcriptions will serve to construct a system to search the keywords embedded in continuous speech using the classification and regression tree (CART) implemented with the software MATLAB. The work will be done using the TIMIT data base.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124897936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Use of ropes histograms as joints trajectories representation for human motion recognition 绳索直方图作为关节轨迹表示在人体运动识别中的应用
Z. Nejim, Makrem Mestiri, H. Amiri
In this paper, a new approach for 3D skeleton-based human motion recognition is discussed. First, we opted to represent the movement as a set of body joints trajectories. Those trajectories are then converted into ropes histograms. The motion records are obtained using the Kinect motion sensor. The classification phase consists in comparing those histograms with ropes histograms of a set of reference motions. This method is then tested on a random dataset of recorded motions and have presented an accuracy rate of 85%.
本文讨论了一种基于骨骼的三维人体运动识别新方法。首先,我们选择将运动表示为一组身体关节轨迹。然后将这些轨迹转换为绳索直方图。通过Kinect运动传感器获取运动记录。分类阶段包括将这些直方图与一组参考运动的绳索直方图进行比较。然后在随机记录的运动数据集上对该方法进行了测试,准确率达到85%。
{"title":"Use of ropes histograms as joints trajectories representation for human motion recognition","authors":"Z. Nejim, Makrem Mestiri, H. Amiri","doi":"10.1109/ATSIP.2017.8075532","DOIUrl":"https://doi.org/10.1109/ATSIP.2017.8075532","url":null,"abstract":"In this paper, a new approach for 3D skeleton-based human motion recognition is discussed. First, we opted to represent the movement as a set of body joints trajectories. Those trajectories are then converted into ropes histograms. The motion records are obtained using the Kinect motion sensor. The classification phase consists in comparing those histograms with ropes histograms of a set of reference motions. This method is then tested on a random dataset of recorded motions and have presented an accuracy rate of 85%.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126677052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Breast tumor classification based on deep convolutional neural networks 基于深度卷积神经网络的乳腺肿瘤分类
I. Bakkouri, K. Afdel
This paper presents a novel deep learning approach focused on the classification of tumors in mammograms as malignant or benign. It is a modern machine learning method which promises to create models that learn from large dataset and make accurate predictions. In this study, we propose a discriminative objective for supervised feature learning by training a Convolutional Neural Network (CNN). Choosing CNN involves input image with a fixed-length and as a consequence, we equip our networks with a scaling process based on Gaussian pyramids for obtaining regions of interest with normalized size. The dataset used in this research is augmented with applying the geometric transformation techniques in order to prevent overfitting and create a robust deep learning model. We perform classification with Softmax layer. It is used to train CNN for classification. We evaluate our methodology on both of the publicly available dataset DDSM and BCDR. In comparison with the current state-of-the-art methods, the experiments show that our proposed system provides good results, achieving high accuracy of 97.28% that will assist radiologists in making diagnostic decisions without increasing false negatives.
本文提出了一种新的深度学习方法,专注于乳房x光片中肿瘤的恶性或良性分类。它是一种现代机器学习方法,有望创建从大型数据集学习并做出准确预测的模型。在本研究中,我们通过训练卷积神经网络(CNN)提出了一个有监督特征学习的判别目标。选择CNN涉及固定长度的输入图像,因此,我们为网络配备了基于高斯金字塔的缩放过程,以获得具有标准化大小的感兴趣区域。本研究中使用的数据集通过应用几何变换技术进行增强,以防止过拟合并创建鲁棒的深度学习模型。我们使用Softmax层进行分类。用于训练CNN进行分类。我们在公开可用的数据集DDSM和BCDR上评估了我们的方法。与目前最先进的方法相比,实验表明,我们提出的系统提供了良好的结果,达到97.28%的高精度,这将有助于放射科医生做出诊断决策,而不会增加假阴性。
{"title":"Breast tumor classification based on deep convolutional neural networks","authors":"I. Bakkouri, K. Afdel","doi":"10.1109/ATSIP.2017.8075562","DOIUrl":"https://doi.org/10.1109/ATSIP.2017.8075562","url":null,"abstract":"This paper presents a novel deep learning approach focused on the classification of tumors in mammograms as malignant or benign. It is a modern machine learning method which promises to create models that learn from large dataset and make accurate predictions. In this study, we propose a discriminative objective for supervised feature learning by training a Convolutional Neural Network (CNN). Choosing CNN involves input image with a fixed-length and as a consequence, we equip our networks with a scaling process based on Gaussian pyramids for obtaining regions of interest with normalized size. The dataset used in this research is augmented with applying the geometric transformation techniques in order to prevent overfitting and create a robust deep learning model. We perform classification with Softmax layer. It is used to train CNN for classification. We evaluate our methodology on both of the publicly available dataset DDSM and BCDR. In comparison with the current state-of-the-art methods, the experiments show that our proposed system provides good results, achieving high accuracy of 97.28% that will assist radiologists in making diagnostic decisions without increasing false negatives.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116617720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Usefulness of time-frequency treatment to an acoustic signal 声信号时频处理的有效性
A. Elhanaoui, G. Maze, E. Aassif, D. Decultot
In this document, smoothed pseudo Wigner-Ville (SPWD) and reassigned spectrogram (RSPD) time-frequency distributions were developed and used to analyze an acoustic signal scattered from an thin cylindrical metallic tube immersed in water. In the work, the studied tube is made of two parts. The obtained results suggest that the time-frequency methods are suitable to find various resonances of circumferential waves that are propagated around the shell, and give better concordance with the theoretical results.
本文建立了光滑伪Wigner-Ville (SPWD)和重分配谱图(RSPD)时频分布,并将其用于分析浸入水中的金属细圆柱管的声信号。在工作中,所研究的管子由两部分组成。计算结果表明,时频法适用于计算绕壳传播的周波的各种共振,并与理论结果有较好的一致性。
{"title":"Usefulness of time-frequency treatment to an acoustic signal","authors":"A. Elhanaoui, G. Maze, E. Aassif, D. Decultot","doi":"10.1109/ATSIP.2017.8075578","DOIUrl":"https://doi.org/10.1109/ATSIP.2017.8075578","url":null,"abstract":"In this document, smoothed pseudo Wigner-Ville (SPWD) and reassigned spectrogram (RSPD) time-frequency distributions were developed and used to analyze an acoustic signal scattered from an thin cylindrical metallic tube immersed in water. In the work, the studied tube is made of two parts. The obtained results suggest that the time-frequency methods are suitable to find various resonances of circumferential waves that are propagated around the shell, and give better concordance with the theoretical results.","PeriodicalId":259951,"journal":{"name":"2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)","volume":"216 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122090984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2017 International Conference on Advanced Technologies for Signal and Image Processing (ATSIP)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1