首页 > 最新文献

2021 6th International Conference on Computer Science and Engineering (UBMK)最新文献

英文 中文
Turkish Speech Recognition in Call Centers 呼叫中心中的土耳其语语音识别
Pub Date : 2021-09-15 DOI: 10.1109/UBMK52708.2021.9558890
Hatice Altınok, Serdar Kurdal, Murtaza Mehdi Yucal
Bu çalışmada çagrı merkezlerinde kullanılan otomatik konuşma tanıma sisteminde Türkçe ve yabancı kökenli sözeükleri taniyabilen bir yapıdan bahsedilmektedir. Çagrı merkezlerinin farkli sektör ve alanlara hizmet vermesinden kaynaklı Türkçe kökenli konuşmalarin yanı sıra birçok yabancı kökenli kelimelerin geçtigi konuşmalara da sahiptir. Sadece Türkçe için geliştirilen konuşma tanıma sistemleri bu merkezlerin ihtiyaeını tam anlamıyla karşılayamadigi için, Türkçe ve yabancı kelimelerin bir arada yer aldığı hibrit bir yapı oluşturulmuştur. Bu yapıda seslere karşılık gelen metinler okunuldugu gibi yazılmak yerine, okunuşun düzgün söylendigi varsayilarak yazilmiştir. Böylelikle metine çevrilen konuşmanın anlamlandırılmasındaki geçen zaman azalmıştir. Söyleyiş sözlügünde düzgün söylemlerin karşısına okunuş varyasyonları eklenerek degişik agızdaki konuşmalarin ve yabancı kökenli kelimelerin tanınması saglanmıştır. Gerçekleştirilen çalişma sonueunda uygulanan yöntemler sayesinde öneeki sisteme göre başarım oramnin arttığı gözlemlenmiştir.
{"title":"Turkish Speech Recognition in Call Centers","authors":"Hatice Altınok, Serdar Kurdal, Murtaza Mehdi Yucal","doi":"10.1109/UBMK52708.2021.9558890","DOIUrl":"https://doi.org/10.1109/UBMK52708.2021.9558890","url":null,"abstract":"Bu çalışmada çagrı merkezlerinde kullanılan otomatik konuşma tanıma sisteminde Türkçe ve yabancı kökenli sözeükleri taniyabilen bir yapıdan bahsedilmektedir. Çagrı merkezlerinin farkli sektör ve alanlara hizmet vermesinden kaynaklı Türkçe kökenli konuşmalarin yanı sıra birçok yabancı kökenli kelimelerin geçtigi konuşmalara da sahiptir. Sadece Türkçe için geliştirilen konuşma tanıma sistemleri bu merkezlerin ihtiyaeını tam anlamıyla karşılayamadigi için, Türkçe ve yabancı kelimelerin bir arada yer aldığı hibrit bir yapı oluşturulmuştur. Bu yapıda seslere karşılık gelen metinler okunuldugu gibi yazılmak yerine, okunuşun düzgün söylendigi varsayilarak yazilmiştir. Böylelikle metine çevrilen konuşmanın anlamlandırılmasındaki geçen zaman azalmıştir. Söyleyiş sözlügünde düzgün söylemlerin karşısına okunuş varyasyonları eklenerek degişik agızdaki konuşmalarin ve yabancı kökenli kelimelerin tanınması saglanmıştır. Gerçekleştirilen çalişma sonueunda uygulanan yöntemler sayesinde öneeki sisteme göre başarım oramnin arttığı gözlemlenmiştir.","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115816111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Research of Cluster Analysis Methods for Group Solutions of the Pattern Recognition Problem 模式识别问题群解的聚类分析方法研究
Pub Date : 2021-09-15 DOI: 10.1109/UBMK52708.2021.9558884
L. Cherikbayeva, A. Yerimbetova, Elmira Daiyrbayeva
This paper proposes the study of cluster analysis methods for solving the problem of pattern recognition, including group solution methods. The study selected methods for solving the problem of cluster analysis based on a group solution with incomplete training information, investigated and developed models of group solutions based on existing known algorithms. The novelty of the work consists in a combination of algorithms for collective cluster analysis and nuclear classification methods. Numerical experiments on test problems and a real hyperspectral image demonstrate the effectiveness of the proposed method, including in the presence of noisy data.
本文提出了解决模式识别问题的聚类分析方法的研究,包括群解方法。本研究选择了基于不完全训练信息的群解解决聚类分析问题的方法,并在现有已知算法的基础上研究开发了群解的模型。这项工作的新颖性在于集体聚类分析和核分类方法的算法组合。在测试问题和真实高光谱图像上的数值实验证明了该方法的有效性,包括在存在噪声数据的情况下。
{"title":"Research of Cluster Analysis Methods for Group Solutions of the Pattern Recognition Problem","authors":"L. Cherikbayeva, A. Yerimbetova, Elmira Daiyrbayeva","doi":"10.1109/UBMK52708.2021.9558884","DOIUrl":"https://doi.org/10.1109/UBMK52708.2021.9558884","url":null,"abstract":"This paper proposes the study of cluster analysis methods for solving the problem of pattern recognition, including group solution methods. The study selected methods for solving the problem of cluster analysis based on a group solution with incomplete training information, investigated and developed models of group solutions based on existing known algorithms. The novelty of the work consists in a combination of algorithms for collective cluster analysis and nuclear classification methods. Numerical experiments on test problems and a real hyperspectral image demonstrate the effectiveness of the proposed method, including in the presence of noisy data.","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121353783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Learning and Predicting Asset Management 学习和预测资产管理
Pub Date : 2021-09-15 DOI: 10.1109/UBMK52708.2021.9558904
Kağan Küçük, Fatih Kahraman, M. Kamasak, E. Adali
Instant exchange rates offered to customers are the most critical issues in the banking industry. It is very important for both the bank and the customer that the offers given are at the appropriate level. In this study, approximately 5 months of data were used and estimation models were designed for the estimation of the currency offers given to the customers. The study was conducted over 18 different currencies. In the study, dependent variables were determined as customer segment, instant exchange rate, day information, time information and volatility value. The independent variable is the exchange rate margin. The training was carried out with daily data and using RF, GBM, ANN, DNN and CNN algorithms. Random search algorithm was used to find the hyperparameters of the algorithms and the results of the model training were compared. The models with the lowest error values were selected to be used in the estimation phase. Mean Square Error (MSE) and Mean Absolute Error (MAE) functions were used to measure performance. It has been observed that artificial neural networks and convolutional neural network algorithms reveal better results than other algorithms according to the trainings carried out on three different models. Estimation time for 18 currencies is about 3 seconds.
向客户提供即时汇率是银行业最关键的问题。对于银行和客户来说,提供适当的报价是非常重要的。在本研究中,使用了大约5个月的数据,并设计了估计模型来估计提供给客户的货币报价。这项研究针对18种不同的货币进行。在研究中,因变量被确定为客户细分,即时汇率,日信息,时间信息和波动值。自变量是汇率保证金。使用日常数据,使用RF、GBM、ANN、DNN和CNN算法进行训练。采用随机搜索算法寻找算法的超参数,并对模型训练结果进行比较。选择误差值最小的模型用于估计阶段。均方误差(MSE)和平均绝对误差(MAE)函数用于测量性能。通过对三种不同模型的训练,观察到人工神经网络和卷积神经网络算法比其他算法表现出更好的结果。18种货币的估算时间约为3秒。
{"title":"Learning and Predicting Asset Management","authors":"Kağan Küçük, Fatih Kahraman, M. Kamasak, E. Adali","doi":"10.1109/UBMK52708.2021.9558904","DOIUrl":"https://doi.org/10.1109/UBMK52708.2021.9558904","url":null,"abstract":"Instant exchange rates offered to customers are the most critical issues in the banking industry. It is very important for both the bank and the customer that the offers given are at the appropriate level. In this study, approximately 5 months of data were used and estimation models were designed for the estimation of the currency offers given to the customers. The study was conducted over 18 different currencies. In the study, dependent variables were determined as customer segment, instant exchange rate, day information, time information and volatility value. The independent variable is the exchange rate margin. The training was carried out with daily data and using RF, GBM, ANN, DNN and CNN algorithms. Random search algorithm was used to find the hyperparameters of the algorithms and the results of the model training were compared. The models with the lowest error values were selected to be used in the estimation phase. Mean Square Error (MSE) and Mean Absolute Error (MAE) functions were used to measure performance. It has been observed that artificial neural networks and convolutional neural network algorithms reveal better results than other algorithms according to the trainings carried out on three different models. Estimation time for 18 currencies is about 3 seconds.","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126780242","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Classification of Covid-19 X-ray Images Using Tridiagonal Matrix Enhanced Multivariance Products Representation (TMEMPR) 基于三对角矩阵增强多方差乘积表示(TMEMPR)的Covid-19 x射线图像分类
Pub Date : 2021-09-15 DOI: 10.1109/UBMK52708.2021.9558982
Furkan Eren, Zeynep Gündoğar
Medical images are crucial data sources for diseases that can not be diagnosed easily. X-rays, one of the medical images, have high resolution. Processing high-resolution images leads to a few problems such as difficulties in data storage, computational load, and the time required to process high-dimensional data. It is vital to be able to diagnose diseases fast and accurately. In this study, a data set consisting of lung X-rays of patients with and without COVID-19 symptoms was taken into consideration. Disease diagnosis from these images can be summarized in two steps as preprocessing and classification. The preprocessing step covers the feature extraction process and for this the recently developed decomposition-based method, Tridiagonal Matrix Enhanced Multivariance Products Representation (TMEMPR), is proposed as a feature extraction method. The classification of images is the second step where the methods of Random Forests and Support Vector Machines are applied. Also, the X-ray images have been reduced by 99,9% with TMEMPR and with several state-of-the-art feature extraction methods such as Discrete Wavelet Transform (DWT), Discrete Cosine Transform (DCT). The results are examined with regard to different feature extraction methods and it is observed that a higher accuracy rate is achieved when the TMEMPR method is used.
对于不易诊断的疾病,医学图像是重要的数据来源。x射线是医学图像中的一种,分辨率很高。高分辨率图像的处理存在数据存储困难、计算量大、处理高维数据耗时等问题。能够快速准确地诊断疾病是至关重要的。在本研究中,考虑了有COVID-19症状和无COVID-19症状患者的肺部x射线数据集。从这些图像进行疾病诊断可分为预处理和分类两个步骤。预处理步骤包括特征提取过程,为此提出了基于分解的三对角矩阵增强多方差乘积表示(TMEMPR)方法作为特征提取方法。图像分类是第二步,其中应用了随机森林和支持向量机的方法。此外,使用TMEMPR和几种最先进的特征提取方法(如离散小波变换(DWT),离散余弦变换(DCT)), x射线图像已经减少了99.9%。对不同特征提取方法的结果进行了检验,发现使用TMEMPR方法可以获得更高的准确率。
{"title":"Classification of Covid-19 X-ray Images Using Tridiagonal Matrix Enhanced Multivariance Products Representation (TMEMPR)","authors":"Furkan Eren, Zeynep Gündoğar","doi":"10.1109/UBMK52708.2021.9558982","DOIUrl":"https://doi.org/10.1109/UBMK52708.2021.9558982","url":null,"abstract":"Medical images are crucial data sources for diseases that can not be diagnosed easily. X-rays, one of the medical images, have high resolution. Processing high-resolution images leads to a few problems such as difficulties in data storage, computational load, and the time required to process high-dimensional data. It is vital to be able to diagnose diseases fast and accurately. In this study, a data set consisting of lung X-rays of patients with and without COVID-19 symptoms was taken into consideration. Disease diagnosis from these images can be summarized in two steps as preprocessing and classification. The preprocessing step covers the feature extraction process and for this the recently developed decomposition-based method, Tridiagonal Matrix Enhanced Multivariance Products Representation (TMEMPR), is proposed as a feature extraction method. The classification of images is the second step where the methods of Random Forests and Support Vector Machines are applied. Also, the X-ray images have been reduced by 99,9% with TMEMPR and with several state-of-the-art feature extraction methods such as Discrete Wavelet Transform (DWT), Discrete Cosine Transform (DCT). The results are examined with regard to different feature extraction methods and it is observed that a higher accuracy rate is achieved when the TMEMPR method is used.","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126951108","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A Static Dictionary-Based Approach To Compressing Short Texts 基于静态字典的短文本压缩方法
Pub Date : 2021-09-15 DOI: 10.1109/UBMK52708.2021.9559035
Murat Aslanyürek, A. Mesut
In this study, Static Dictionary Compression (SDC) method, which is an approach developed to compress short texts, is proposed. The word-based static dictionaries used in this approach were obtained from clusters formed as a result of running a clustering method repeatedly until certain criteria are met. Short text is compressed with the dictionary that has the largest number of words in common with it. It has been shown by tests conducted with datasets containing short texts in 6 different languages that the proposed method compresses better than the general purpose compression methods Gzip, Bzip2, Zstd and PPMd. In the tests made with the data set containing only English short texts, it has been shown that the SDC method can compress better than the smza, shoco and b64pack methods used to compress short texts, and Brotli, which gives good results in short texts because it uses a static dictionary.
本文提出了静态字典压缩(Static Dictionary Compression, SDC)方法,这是一种针对短文本的压缩方法。该方法中使用的基于单词的静态字典是从重复运行聚类方法直至满足某些标准所形成的聚类中获得的。短文本使用与之有最多共同单词的字典进行压缩。对包含6种不同语言的短文本的数据集进行的测试表明,该方法的压缩效果优于通用的压缩方法Gzip、Bzip2、Zstd和PPMd。在仅包含英文短文本的数据集上进行的测试表明,SDC方法可以比用于压缩短文本的smza, shoco和b64pack方法以及Brotli方法更好地压缩短文本,Brotli方法由于使用静态字典而在短文本中获得了良好的结果。
{"title":"A Static Dictionary-Based Approach To Compressing Short Texts","authors":"Murat Aslanyürek, A. Mesut","doi":"10.1109/UBMK52708.2021.9559035","DOIUrl":"https://doi.org/10.1109/UBMK52708.2021.9559035","url":null,"abstract":"In this study, Static Dictionary Compression (SDC) method, which is an approach developed to compress short texts, is proposed. The word-based static dictionaries used in this approach were obtained from clusters formed as a result of running a clustering method repeatedly until certain criteria are met. Short text is compressed with the dictionary that has the largest number of words in common with it. It has been shown by tests conducted with datasets containing short texts in 6 different languages that the proposed method compresses better than the general purpose compression methods Gzip, Bzip2, Zstd and PPMd. In the tests made with the data set containing only English short texts, it has been shown that the SDC method can compress better than the smza, shoco and b64pack methods used to compress short texts, and Brotli, which gives good results in short texts because it uses a static dictionary.","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116533791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Early Stage Fault Prediction via Inter-Project Rule Transfer 基于项目间规则传递的早期故障预测
Pub Date : 2021-09-15 DOI: 10.1109/UBMK52708.2021.9558920
Nazli Ece Uykur, Begum Mutlu, E. Sezer
Software fault protection can be first achieved by fault prediction. The earlier the fault prediction can be done in the software development life-cycle, the lower the damage and repair costs caused by the defects that will occur. Machine learning is one well-known method for the decision-making part of automatic software fault prediction. However, the applicability of machine learning methods is low due to the lack of data in the early stages of development processes. In this study, the data needed in the design of rule-base was obtained from counterpart projects, and the fault prediction problem was evaluated by the fuzzy rule-based systems’ point of view since these systems have portability utility which allows rule transfer between different problems with similar goals in the same domain. Briefly, this study aims to show that early-stage fault prediction is possible with the portability characteristics of fuzzy systems sourced from the inter-project rule transfer. Several experiments have been performed by using the software metrics datasets of 5 software projects to support this idea. Fuzzy systems obtained from several combinations of these datasets were evaluated by their prediction accuracy. The results show that more accurate rules can be obtained from previously completed software projects, and the use of rule bases gathered from those projects’ software metrics repositories can be transfered to predict the faulty modules of the current software project.
软件故障保护首先可以通过故障预测实现。在软件开发生命周期中,越早地进行故障预测,将发生的缺陷造成的损害和修复成本就越低。机器学习是软件故障自动预测决策部分的一种知名方法。然而,由于在开发过程的早期阶段缺乏数据,机器学习方法的适用性很低。本研究从对应项目中获取规则库设计所需的数据,基于模糊规则的系统具有可移植性,允许同一领域内具有相似目标的不同问题之间的规则迁移,因此从模糊规则系统的角度对故障预测问题进行评价。简而言之,本研究旨在表明,基于项目间规则传递的模糊系统的可移植性特征可以实现早期故障预测。通过使用5个软件项目的软件度量数据集进行了几个实验来支持这个想法。从这些数据集的几个组合得到的模糊系统评估其预测精度。结果表明,可以从以前完成的软件项目中获得更准确的规则,并且可以使用从这些项目的软件度量存储库中收集的规则库来预测当前软件项目的故障模块。
{"title":"Early Stage Fault Prediction via Inter-Project Rule Transfer","authors":"Nazli Ece Uykur, Begum Mutlu, E. Sezer","doi":"10.1109/UBMK52708.2021.9558920","DOIUrl":"https://doi.org/10.1109/UBMK52708.2021.9558920","url":null,"abstract":"Software fault protection can be first achieved by fault prediction. The earlier the fault prediction can be done in the software development life-cycle, the lower the damage and repair costs caused by the defects that will occur. Machine learning is one well-known method for the decision-making part of automatic software fault prediction. However, the applicability of machine learning methods is low due to the lack of data in the early stages of development processes. In this study, the data needed in the design of rule-base was obtained from counterpart projects, and the fault prediction problem was evaluated by the fuzzy rule-based systems’ point of view since these systems have portability utility which allows rule transfer between different problems with similar goals in the same domain. Briefly, this study aims to show that early-stage fault prediction is possible with the portability characteristics of fuzzy systems sourced from the inter-project rule transfer. Several experiments have been performed by using the software metrics datasets of 5 software projects to support this idea. Fuzzy systems obtained from several combinations of these datasets were evaluated by their prediction accuracy. The results show that more accurate rules can be obtained from previously completed software projects, and the use of rule bases gathered from those projects’ software metrics repositories can be transfered to predict the faulty modules of the current software project.","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114437004","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Finite State Machine Model for Uzbek Language Morphological Analyzer 乌兹别克语形态分析器的有限状态机模型
Pub Date : 2021-09-15 DOI: 10.1109/UBMK52708.2021.9559023
Khamroyeva Shahlo Mirdjanovna
In this article, we discuss the development of a model of the Uzbek language FST (finite state transducer) in the creation of a morphological analyzer of the Uzbek language. There are key factors for automatic morphological analysis, such as stem, base, prefix, suffix, spelling rules. To do this, you need to create a database of word-formers in Uzbek (pre-/post-stem), lexical and syntactic suffixes, particles in the form of suffixes.
在本文中,我们讨论了乌兹别克语FST(有限状态换能器)模型在乌兹别克语形态分析仪创建中的发展。词干、词根、前缀、后缀、拼写规则等是词形自动分析的关键因素。要做到这一点,您需要创建一个数据库,其中包含乌兹别克语的词源(pre-/post-stem)、词法和句法后缀、后缀形式的小词。
{"title":"Finite State Machine Model for Uzbek Language Morphological Analyzer","authors":"Khamroyeva Shahlo Mirdjanovna","doi":"10.1109/UBMK52708.2021.9559023","DOIUrl":"https://doi.org/10.1109/UBMK52708.2021.9559023","url":null,"abstract":"In this article, we discuss the development of a model of the Uzbek language FST (finite state transducer) in the creation of a morphological analyzer of the Uzbek language. There are key factors for automatic morphological analysis, such as stem, base, prefix, suffix, spelling rules. To do this, you need to create a database of word-formers in Uzbek (pre-/post-stem), lexical and syntactic suffixes, particles in the form of suffixes.","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117333555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diagnosing Autism Spectrum Disorder Using Machine Learning Techniques 使用机器学习技术诊断自闭症谱系障碍
Pub Date : 2021-09-15 DOI: 10.1109/UBMK52708.2021.9558975
Hidayet Takçi, Saliha Yeşilyurt
Autism is a generalized pervasive developmental disorder that can be characterized by language and communication disorders. Screening tests are often used to diagnose such a disorder; however, they are usually time-consuming and costly tests. In recent years, machine learning methods have been frequently utilized for this purpose due to their performance and efficiency. This paper employs the most eight prominent machine learning algorithms and presents an empirical evaluation of their performances in diagnosing autism disorder on four different benchmark datasets, which are up-to-date and originate from the QCHAT, AQ-10-child, and AQ-10-adult screening tests. In doing so, we also utilize precision, sensitivity, specificity, and classification accuracy metrics to scrutinize their performances. According to the experimental results, the best outcomes are obtained with C-SVC, a classifier based on a support vector machine. More importantly, in terms of C-SVC performance metrics even lead to 100% in all datasets. Multivariate logistic regression has been taken second place. On the other hand, the lowest results are obtained with the C4.5 algorithm, a decision tree-based algorithm.
自闭症是一种广泛性广泛性发育障碍,其特征是语言和沟通障碍。筛查测试通常用于诊断这种疾病;然而,它们通常是耗时且昂贵的测试。近年来,机器学习方法由于其性能和效率而被频繁地用于此目的。本文采用了最著名的八种机器学习算法,并在四个不同的基准数据集上对它们在诊断自闭症障碍方面的表现进行了实证评估,这些数据集是最新的,来自QCHAT、aq -10儿童和aq -10成人筛查测试。在此过程中,我们还利用精度、灵敏度、特异性和分类准确性指标来仔细检查它们的性能。实验结果表明,基于支持向量机的C-SVC分类器效果最好。更重要的是,就C-SVC性能指标而言,甚至在所有数据集中都达到100%。多元逻辑回归已被排在第二位。另一方面,基于决策树的C4.5算法获得的结果最低。
{"title":"Diagnosing Autism Spectrum Disorder Using Machine Learning Techniques","authors":"Hidayet Takçi, Saliha Yeşilyurt","doi":"10.1109/UBMK52708.2021.9558975","DOIUrl":"https://doi.org/10.1109/UBMK52708.2021.9558975","url":null,"abstract":"Autism is a generalized pervasive developmental disorder that can be characterized by language and communication disorders. Screening tests are often used to diagnose such a disorder; however, they are usually time-consuming and costly tests. In recent years, machine learning methods have been frequently utilized for this purpose due to their performance and efficiency. This paper employs the most eight prominent machine learning algorithms and presents an empirical evaluation of their performances in diagnosing autism disorder on four different benchmark datasets, which are up-to-date and originate from the QCHAT, AQ-10-child, and AQ-10-adult screening tests. In doing so, we also utilize precision, sensitivity, specificity, and classification accuracy metrics to scrutinize their performances. According to the experimental results, the best outcomes are obtained with C-SVC, a classifier based on a support vector machine. More importantly, in terms of C-SVC performance metrics even lead to 100% in all datasets. Multivariate logistic regression has been taken second place. On the other hand, the lowest results are obtained with the C4.5 algorithm, a decision tree-based algorithm.","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116311617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
3D Video Quality Evaluation Based on SSIM Model Improvement 基于SSIM模型改进的三维视频质量评价
Pub Date : 2021-09-15 DOI: 10.1109/UBMK52708.2021.9558955
G. Yilmaz, G. Akar
In order to provide improved multimedia services to the end users, developing objective models efficiently predicting 3 Dimensional (3D) video Quality of Experience (QoE) can currently be considered as one of the most significant research areas. Nevertheless, there is currently no model standardized and widely utilized by the researchers due to its efficient and reliable assessment of the 3D video quality. Therefore, highly exploited 2 Dimensional (2D) video quality assessment models such as Structural SIMilarity Index (SSIM) are preferred for the 3D video quality evaluation. However, providing efficiency and reliability for the 3D video quality assessment using the 2D video quality assessment models can only be ensured if they include 3D video related features effecting Human Visual System (HVS). Under the light of these information, the SSIM model is improved for the 3D video quality assessment using perceptually significant feature, contrast and motion characteristics having impact on the HVS in this study. The results obtained by utilizing the improved SSIM model clearly present that the model is quite competent to provide enhanced multimedia services to the end users.
为了向终端用户提供更好的多媒体服务,开发能够有效预测三维视频体验质量的客观模型是当前最重要的研究方向之一。然而,目前还没有一种能够有效、可靠地评估3D视频质量的模型被研究者标准化并广泛使用。因此,高度利用二维(2D)视频质量评估模型,如结构相似指数(SSIM),是首选的3D视频质量评估。然而,使用2D视频质量评估模型进行3D视频质量评估时,只有包含影响人类视觉系统(HVS)的3D视频相关特征,才能保证评估的效率和可靠性。根据这些信息,本研究利用影响HVS的感知显著特征、对比度和运动特征对SSIM模型进行改进,用于3D视频质量评估。利用改进的SSIM模型获得的结果清楚地表明,该模型能够为最终用户提供增强的多媒体服务。
{"title":"3D Video Quality Evaluation Based on SSIM Model Improvement","authors":"G. Yilmaz, G. Akar","doi":"10.1109/UBMK52708.2021.9558955","DOIUrl":"https://doi.org/10.1109/UBMK52708.2021.9558955","url":null,"abstract":"In order to provide improved multimedia services to the end users, developing objective models efficiently predicting 3 Dimensional (3D) video Quality of Experience (QoE) can currently be considered as one of the most significant research areas. Nevertheless, there is currently no model standardized and widely utilized by the researchers due to its efficient and reliable assessment of the 3D video quality. Therefore, highly exploited 2 Dimensional (2D) video quality assessment models such as Structural SIMilarity Index (SSIM) are preferred for the 3D video quality evaluation. However, providing efficiency and reliability for the 3D video quality assessment using the 2D video quality assessment models can only be ensured if they include 3D video related features effecting Human Visual System (HVS). Under the light of these information, the SSIM model is improved for the 3D video quality assessment using perceptually significant feature, contrast and motion characteristics having impact on the HVS in this study. The results obtained by utilizing the improved SSIM model clearly present that the model is quite competent to provide enhanced multimedia services to the end users.","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125724234","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detection of Malware with Deep Learning Method 基于深度学习方法的恶意软件检测
Pub Date : 2021-09-15 DOI: 10.1109/UBMK52708.2021.9559020
Ümit Emre Köse, R. Samet
Nowadays, many studies are done on the detection of malicious software. Static, Dynamic and Hybrid analysis methods are used to collect data for malware detection. With these methods, data is created by reading the information in the file without running the malicious software, or by examining the places it affects such as changes on the network at runtime, api calls. With the advancement of today’s technology, these data are combined with Machine learning algorithms or architectures of Deep Learning to detect malware. Detection of malicious software On the data set containing malicious software, it was detected by using CNN and ANN neural networks. While close to 10,000 datasets showed a success rate of close to 99%, datasets close to 50,000 achieved close to 97% success. In our study, a success rate of 98.1% was achieved for nearly 50,000 data sets. Among the studies researched, malware detection was made with higher accuracy than the studies using data sets closest to 50,000.
目前,人们对恶意软件的检测进行了大量的研究。采用静态、动态和混合分析方法收集数据进行恶意软件检测。使用这些方法,通过在不运行恶意软件的情况下读取文件中的信息,或通过检查其影响的位置(如运行时网络上的更改、api调用)来创建数据。随着当今技术的进步,这些数据与机器学习算法或深度学习架构相结合,以检测恶意软件。在包含恶意软件的数据集上,分别使用CNN和ANN神经网络进行检测。接近1万个数据集的成功率接近99%,而接近5万个数据集的成功率接近97%。在我们的研究中,近5万个数据集的成功率达到了98.1%。在所研究的研究中,恶意软件检测的准确性高于使用接近50,000个数据集的研究。
{"title":"Detection of Malware with Deep Learning Method","authors":"Ümit Emre Köse, R. Samet","doi":"10.1109/UBMK52708.2021.9559020","DOIUrl":"https://doi.org/10.1109/UBMK52708.2021.9559020","url":null,"abstract":"Nowadays, many studies are done on the detection of malicious software. Static, Dynamic and Hybrid analysis methods are used to collect data for malware detection. With these methods, data is created by reading the information in the file without running the malicious software, or by examining the places it affects such as changes on the network at runtime, api calls. With the advancement of today’s technology, these data are combined with Machine learning algorithms or architectures of Deep Learning to detect malware. Detection of malicious software On the data set containing malicious software, it was detected by using CNN and ANN neural networks. While close to 10,000 datasets showed a success rate of close to 99%, datasets close to 50,000 achieved close to 97% success. In our study, a success rate of 98.1% was achieved for nearly 50,000 data sets. Among the studies researched, malware detection was made with higher accuracy than the studies using data sets closest to 50,000.","PeriodicalId":106516,"journal":{"name":"2021 6th International Conference on Computer Science and Engineering (UBMK)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2021-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125770817","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2021 6th International Conference on Computer Science and Engineering (UBMK)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1