首页 > 最新文献

2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)最新文献

英文 中文
Classification of Radicalism Content from Twitter Written in Indonesian Language using Long Short Term Memory 用长短期记忆分类印尼文推特激进主义内容
Pub Date : 2019-10-01 DOI: 10.1109/ICICoS48119.2019.8982498
N. Idris, Widyawan, T. B. Adji
Twitter was one of the most influential social media among users. It might give an either positive or negative impact. One of the negative impacts was the presence of radicalism content. In Indonesia radicalism was often connected to the issue of SARA (ethnicity, religion, race, and intergroup relations). It remained a public issue, requiring an analysis to process information related to radicalism. The research aimed to classify radical contents. The classification based on the types of radicalism and non-radicalism. Data were classified using LSTM. In finding higher accuracy, word2vec was used to transform words into vectors. The accuracy showed using LSTM method was compared with that obtained using SVM and k-NN. The two latest methods were the methods used by previous researchers regarding Indonesian radical contents of Twitter. Referring to the findings, LSTM showed higher accuracy 81.60%.
Twitter是用户中最具影响力的社交媒体之一。它可能会产生积极或消极的影响。其中一个负面影响是激进主义内容的存在。在印度尼西亚,激进主义经常与SARA(种族、宗教、种族和群体间关系)问题联系在一起。它仍然是一个公共问题,需要分析处理与激进主义有关的信息。本研究旨在对自由基含量进行分类。这种分类基于激进主义和非激进主义的类型。使用LSTM对数据进行分类。为了获得更高的精度,使用word2vec将单词转换为向量。将LSTM方法与SVM和k-NN方法的准确率进行了比较。这两种最新的方法是之前研究人员对Twitter上印尼激进内容使用的方法。LSTM的准确率更高,为81.60%。
{"title":"Classification of Radicalism Content from Twitter Written in Indonesian Language using Long Short Term Memory","authors":"N. Idris, Widyawan, T. B. Adji","doi":"10.1109/ICICoS48119.2019.8982498","DOIUrl":"https://doi.org/10.1109/ICICoS48119.2019.8982498","url":null,"abstract":"Twitter was one of the most influential social media among users. It might give an either positive or negative impact. One of the negative impacts was the presence of radicalism content. In Indonesia radicalism was often connected to the issue of SARA (ethnicity, religion, race, and intergroup relations). It remained a public issue, requiring an analysis to process information related to radicalism. The research aimed to classify radical contents. The classification based on the types of radicalism and non-radicalism. Data were classified using LSTM. In finding higher accuracy, word2vec was used to transform words into vectors. The accuracy showed using LSTM method was compared with that obtained using SVM and k-NN. The two latest methods were the methods used by previous researchers regarding Indonesian radical contents of Twitter. Referring to the findings, LSTM showed higher accuracy 81.60%.","PeriodicalId":105407,"journal":{"name":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","volume":"403 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127573827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Diphtheria Case Number Forecasting using Radial Basis Function Neural Network 基于径向基函数神经网络的白喉病例数预测
Pub Date : 2019-10-01 DOI: 10.1109/ICICoS48119.2019.8982403
Wiwik Anggraeni, Dina Nandika, Faizal Mahananto, Yeyen Sudiarti, Cut Alna Fadhilla
In Indonesia, diphtheria ranks fourth as a deadly disease after cardiovascular, tuberculosis, and pneumonia. The death rate of diphtheria is estimated to 21% with symptoms of malaise, anorexia, sore throat, and increased body temperature. The diphtheria cases which was reported in 2014 showed that East Java occupied the highest number for diphtheria cases which reached until 295, contributed to 74% cases of 22 provinces in Indonesia. In the mid-2017 until mid-2018, the Ministry of Health of the Republic of Indonesia announced that there has been an ongoing diphtheria outbreak in Indonesia. The number of diphtheria cases in East Java were highly raising up at the end of 2018. Forecasting is needed to reduce the number of diphtheria cases. The method used for forecasting is the Radial Basis Function Neural Network. Several variables are involved, including Immunization Coverage, Population Density, and Number of Cases. It is observed from the experimental results that the best model indicates only one variable involved, which is Number of Cases. This model is used to forecast the number of cases of diphtheria in Malang Regency, Surabaya City, and Sumenep Regency. The results showed that RBFNN method has a good performance for forecasting in Malang with MASE value of 0.84, Surabaya with MASE value of 0.817, and Sumenep with MASE value of 0.820, which all MASE values are less than 1.
在印度尼西亚,白喉是继心血管病、肺结核和肺炎之后的第四大致命疾病。白喉的死亡率估计为21%,症状为不适、厌食、喉咙痛和体温升高。2014年报告的白喉病例表明,东爪哇的白喉病例数量最多,达到295例,占印度尼西亚22个省的74%。在2017年年中至2018年年中,印度尼西亚共和国卫生部宣布,印度尼西亚发生了持续的白喉疫情。2018年底,东爪哇省白喉病例数大幅上升。需要进行预测以减少白喉病例数。用于预测的方法是径向基函数神经网络。涉及几个变量,包括免疫覆盖率、人口密度和病例数。从实验结果可以看出,最佳模型只涉及一个变量,即案例数。该模型用于预测玛琅县、泗水市和苏梅内普县的白喉病例数。结果表明,RBFNN方法在马琅、泗水和苏梅内普3个地区均具有较好的预测效果,MASE值分别为0.84、0.817和0.820,MASE值均小于1。
{"title":"Diphtheria Case Number Forecasting using Radial Basis Function Neural Network","authors":"Wiwik Anggraeni, Dina Nandika, Faizal Mahananto, Yeyen Sudiarti, Cut Alna Fadhilla","doi":"10.1109/ICICoS48119.2019.8982403","DOIUrl":"https://doi.org/10.1109/ICICoS48119.2019.8982403","url":null,"abstract":"In Indonesia, diphtheria ranks fourth as a deadly disease after cardiovascular, tuberculosis, and pneumonia. The death rate of diphtheria is estimated to 21% with symptoms of malaise, anorexia, sore throat, and increased body temperature. The diphtheria cases which was reported in 2014 showed that East Java occupied the highest number for diphtheria cases which reached until 295, contributed to 74% cases of 22 provinces in Indonesia. In the mid-2017 until mid-2018, the Ministry of Health of the Republic of Indonesia announced that there has been an ongoing diphtheria outbreak in Indonesia. The number of diphtheria cases in East Java were highly raising up at the end of 2018. Forecasting is needed to reduce the number of diphtheria cases. The method used for forecasting is the Radial Basis Function Neural Network. Several variables are involved, including Immunization Coverage, Population Density, and Number of Cases. It is observed from the experimental results that the best model indicates only one variable involved, which is Number of Cases. This model is used to forecast the number of cases of diphtheria in Malang Regency, Surabaya City, and Sumenep Regency. The results showed that RBFNN method has a good performance for forecasting in Malang with MASE value of 0.84, Surabaya with MASE value of 0.817, and Sumenep with MASE value of 0.820, which all MASE values are less than 1.","PeriodicalId":105407,"journal":{"name":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131011238","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comparative Performance Evaluation of Random Forest Feature Selection on Classification of Hepatocellular Carcinoma Gene Expression Data 随机森林特征选择在肝癌基因表达数据分类中的比较性能评价
Pub Date : 2019-10-01 DOI: 10.1109/ICICoS48119.2019.8982435
M. Latief, T. Siswantining, A. Bustamam, Devvi Sarwinda
Hepatocellular carcinoma is one of the cancers that cause death in the world. We get hepatocellular carcinoma data in the form of microarray data gene expression obtained from the National Center for Biotechnology Information website consisting of 40 samples and 54675 features. The main purpose of this research is to compare the performance evaluation of Hepatocellular Carcinoma by applying feature selection to several classification algorithms. Random Forest feature selection method will be paired with several classification algorithms such as Support Vector Classification, Neural Network Classification, Random Forest, Logistic Regression, and Naïve Bayes. This study uses 5-fold cross-validation as an evaluation method. The results showed that Random Forest algorithm, Neural Network, Vector Machine Classification, and Naive Bayes show higher classification performance evaluation than without using random forest feature selection, while the Logistic Regression model provides a higher performance evaluation without using Random Forest feature selection. Support Vector Classification offers the highest performance evaluation compared to four other algorithms using feature selection, but Logistic Regression provides higher performance evaluation compared to different classification algorithms without feature selection.
肝细胞癌是世界上导致死亡的癌症之一。我们从国家生物技术信息中心网站上获得了40个样本和54675个特征的微阵列数据,以基因表达的形式获得肝细胞癌数据。本研究的主要目的是通过将特征选择应用于几种分类算法来比较肝癌的性能评价。随机森林特征选择方法将与几种分类算法配对,如支持向量分类、神经网络分类、随机森林、逻辑回归和Naïve贝叶斯。本研究采用5倍交叉验证作为评价方法。结果表明,随机森林算法、神经网络、向量机分类和朴素贝叶斯的分类性能评价高于未使用随机森林特征选择的分类性能评价,而未使用随机森林特征选择的逻辑回归模型的分类性能评价更高。与使用特征选择的其他四种算法相比,支持向量分类提供了最高的性能评估,但与不使用特征选择的其他分类算法相比,逻辑回归提供了更高的性能评估。
{"title":"A Comparative Performance Evaluation of Random Forest Feature Selection on Classification of Hepatocellular Carcinoma Gene Expression Data","authors":"M. Latief, T. Siswantining, A. Bustamam, Devvi Sarwinda","doi":"10.1109/ICICoS48119.2019.8982435","DOIUrl":"https://doi.org/10.1109/ICICoS48119.2019.8982435","url":null,"abstract":"Hepatocellular carcinoma is one of the cancers that cause death in the world. We get hepatocellular carcinoma data in the form of microarray data gene expression obtained from the National Center for Biotechnology Information website consisting of 40 samples and 54675 features. The main purpose of this research is to compare the performance evaluation of Hepatocellular Carcinoma by applying feature selection to several classification algorithms. Random Forest feature selection method will be paired with several classification algorithms such as Support Vector Classification, Neural Network Classification, Random Forest, Logistic Regression, and Naïve Bayes. This study uses 5-fold cross-validation as an evaluation method. The results showed that Random Forest algorithm, Neural Network, Vector Machine Classification, and Naive Bayes show higher classification performance evaluation than without using random forest feature selection, while the Logistic Regression model provides a higher performance evaluation without using Random Forest feature selection. Support Vector Classification offers the highest performance evaluation compared to four other algorithms using feature selection, but Logistic Regression provides higher performance evaluation compared to different classification algorithms without feature selection.","PeriodicalId":105407,"journal":{"name":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132883196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Chlorophyll A and B Content Measurement System of Velvet Apple Leaf in Hyperspectral Imaging 丝绒苹果叶片叶绿素A和B含量的高光谱成像测量系统
Pub Date : 2019-10-01 DOI: 10.1109/ICICoS48119.2019.8982485
Femilia Putri Mayranti, A. H. Saputro, W. Handayani
Pigments are a vital role in plants. Pigments can consist of several chemical structures, such as chlorophyll. Chlorophyll is a green pigment of plants can help to process photosynthetic. Chlorophyll divided into chlorophyll a and b. In this study, the authors were measured chlorophyll a and b content using hyperspectral imaging. Hyperspectral imaging had 224 full wavelengths in range 400 until 1000 nm. To measure that content, not all of 224 bands had important information of chlorophyll a and b. So that using DT method for wavelength selection had increased the performance system. The number of optimal wavelengths for chlorophyll a and b is 28 and 40 wavelengths. Comparing with several algorithms, i.e. PLSR and DT, PLSR model for full bands has the performance each chlorophyll a and b of 0.90 both for R2; also 3.25 and 3.46 for RPD. DT model for full bands has the performance each chlorophyll a and b of 0.94 and 0.96 for R2; also 4.57 and 5.02 for RPD. Then, DT with wavelength selection has improved the performance system each chlorophyll a and b of 0.99 and 0.99 for R2; also 12.00 and 13.09 for RPD.
色素在植物中起着至关重要的作用。色素可以由几种化学结构组成,比如叶绿素。叶绿素是一种绿色色素,可以帮助植物进行光合作用。叶绿素分为叶绿素a和叶绿素b。在本研究中,作者利用高光谱成像技术测量了叶绿素a和b的含量。高光谱成像在400到1000纳米范围内有224个全波长。在测定叶绿素a和叶绿素b含量时,并非所有224个波段都含有叶绿素a和叶绿素b的重要信息,因此采用DT法进行波长选择提高了系统的性能。叶绿素a和b的最佳波长分别为28和40个波长。与PLSR和DT算法相比,全波段PLSR模型在R2下的叶绿素a和叶绿素b的性能均为0.90;RPD也是3.25和3.46。全波段DT模型的叶绿素a和叶绿素b的R2分别为0.94和0.96;RPD分别为4.57和5.02。然后,波长选择DT提高了系统的性能,每个叶绿素a和b的R2分别为0.99和0.99;RPD收费为12.00及13.09。
{"title":"Chlorophyll A and B Content Measurement System of Velvet Apple Leaf in Hyperspectral Imaging","authors":"Femilia Putri Mayranti, A. H. Saputro, W. Handayani","doi":"10.1109/ICICoS48119.2019.8982485","DOIUrl":"https://doi.org/10.1109/ICICoS48119.2019.8982485","url":null,"abstract":"Pigments are a vital role in plants. Pigments can consist of several chemical structures, such as chlorophyll. Chlorophyll is a green pigment of plants can help to process photosynthetic. Chlorophyll divided into chlorophyll a and b. In this study, the authors were measured chlorophyll a and b content using hyperspectral imaging. Hyperspectral imaging had 224 full wavelengths in range 400 until 1000 nm. To measure that content, not all of 224 bands had important information of chlorophyll a and b. So that using DT method for wavelength selection had increased the performance system. The number of optimal wavelengths for chlorophyll a and b is 28 and 40 wavelengths. Comparing with several algorithms, i.e. PLSR and DT, PLSR model for full bands has the performance each chlorophyll a and b of 0.90 both for R2; also 3.25 and 3.46 for RPD. DT model for full bands has the performance each chlorophyll a and b of 0.94 and 0.96 for R2; also 4.57 and 5.02 for RPD. Then, DT with wavelength selection has improved the performance system each chlorophyll a and b of 0.99 and 0.99 for R2; also 12.00 and 13.09 for RPD.","PeriodicalId":105407,"journal":{"name":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128872669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Ranking of Game Mechanics for Gamification in Mobile Payment Using AHP-TOPSIS: Uses and Gratification Perspective 基于AHP-TOPSIS的移动支付游戏化机制排名:使用和满足视角
Pub Date : 2019-10-01 DOI: 10.1109/ICICoS48119.2019.8982458
Mutia Fadhila Putri, A. Hidayanto, E. S. Negara, Z. Abidin, P. Utari, N. Budi
Game mechanics are the most visible part of gamification that designed for interaction with the game state to produce an engaging experience. In a product with gamification, choosing the game mechanics has become the primary focus because it must be appropriate with the product goal. This study aims to investigate the most suitable game mechanics for gamification in mobile payment. AHP-TOPSIS methods used for the process of ranking and selecting the best game mechanics. The criteria and sub-criteria have been determined based on the Uses and Gratification perspective. Hedonic, utilitarian, and social gratification identified as criteria. While enjoyment, passing the time, ease of use, self-presentation, information quality, economic rewards, social value, and social interaction identified as sub-criteria. The questionnaire consists of pairwise comparison, and compatibility assessment of criteria and sub-criteria was conduct and distribute to collecting data from respondents. The results from the processes of AHP-TOPSIS identified feedback as the most suitable game mechanics for gamification in mobile payment. The consistency ratio from consistency checking is CR= 0.013514, and this is acceptable as consistent with the value of CR< 0. 1.
游戏机制是游戏化中最明显的部分,旨在与游戏状态进行交互,从而产生引人入胜的体验。在带有游戏化的产品中,选择游戏机制已成为主要焦点,因为它必须与产品目标相匹配。本研究旨在探讨最适合移动支付游戏化的游戏机制。AHP-TOPSIS方法用于排名和选择最佳游戏机制的过程。标准和子标准是根据使用和满足的观点确定的。享乐主义,功利主义和社会满足被确定为标准。而享受、打发时间、易用性、自我呈现、信息质量、经济回报、社会价值和社会互动则被确定为次级标准。问卷采用两两比较的方法,对标准和子标准进行相容性评估,并分发给被调查者收集数据。AHP-TOPSIS过程的结果表明,反馈是最适合移动支付游戏化的游戏机制。一致性检查得到的一致性比率CR= 0.013514,与CR< 0一致,可以接受。1.
{"title":"Ranking of Game Mechanics for Gamification in Mobile Payment Using AHP-TOPSIS: Uses and Gratification Perspective","authors":"Mutia Fadhila Putri, A. Hidayanto, E. S. Negara, Z. Abidin, P. Utari, N. Budi","doi":"10.1109/ICICoS48119.2019.8982458","DOIUrl":"https://doi.org/10.1109/ICICoS48119.2019.8982458","url":null,"abstract":"Game mechanics are the most visible part of gamification that designed for interaction with the game state to produce an engaging experience. In a product with gamification, choosing the game mechanics has become the primary focus because it must be appropriate with the product goal. This study aims to investigate the most suitable game mechanics for gamification in mobile payment. AHP-TOPSIS methods used for the process of ranking and selecting the best game mechanics. The criteria and sub-criteria have been determined based on the Uses and Gratification perspective. Hedonic, utilitarian, and social gratification identified as criteria. While enjoyment, passing the time, ease of use, self-presentation, information quality, economic rewards, social value, and social interaction identified as sub-criteria. The questionnaire consists of pairwise comparison, and compatibility assessment of criteria and sub-criteria was conduct and distribute to collecting data from respondents. The results from the processes of AHP-TOPSIS identified feedback as the most suitable game mechanics for gamification in mobile payment. The consistency ratio from consistency checking is CR= 0.013514, and this is acceptable as consistent with the value of CR< 0. 1.","PeriodicalId":105407,"journal":{"name":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115500917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Aspect Based Sentiment Analysis in E-Commerce User Reviews Using Latent Dirichlet Allocation (LDA) and Sentiment Lexicon 基于潜在狄利克雷分配和情感词典的面向方面的电子商务用户评论情感分析
Pub Date : 2019-10-01 DOI: 10.1109/ICICoS48119.2019.8982522
Eko Wahyudi, R. Kusumaningrum
User ratings on products sold bye-commerce greatly influence the number of purchases. Positive ratings will encourage other buyers to participate in buying the product. While negative ratings given by users will reduce the interest in purchasing products. Nonconformities between rating and user reviews sometimes provide a wrong assessment of a product. This happens because buyers also provide reviews on the quality of delivery services from e-commerce. Based on that issue, the utilization of the Latent Dirichlet Allocation (LDA) could be used on sentiment analysis of the user reviews. Sentiment analysis of the user reviews aims to facilitate e-commerce in informing the product quality as rating supporters that have been given by users. This research aims to determine the classification performance of sentiment analysis on e-commerce user reviews using the LDA algorithm with input data in the form of e-commerce user reviews. Then, compare the application of sentiment analysis of the user reviews with the use of general training data and per category training data. The result of this research showed that in the first iteration the best architecture was produced by the application of LDA with a combination of parameters of alpha 0.001, beta 0.001, and number of topics 15. The architecture had 67,5% accuracy level. From the best architecture then training data input is given based on each product review category. The result showed that the combination of the usage of general data and per category data indicate an increase in the average accuracy of 0,82 % from the three-test data. Therefore, in order to produce the best performance of building a classification model of sentiment analysis of the user reviews, it should be performed by applying LDA with a combination of general data and per category data usage
用户对电子商务销售产品的评价对购买量有很大影响。积极的评价会鼓励其他买家参与购买产品。而用户给出的负面评价会降低购买产品的兴趣。评级和用户评论之间的不一致有时会导致对产品的错误评估。这是因为买家也会对电子商务的送货服务质量进行评论。在此基础上,利用潜在狄利克雷分配(Latent Dirichlet Allocation, LDA)对用户评论进行情感分析。用户评论情感分析的目的是为了方便电子商务告知产品质量,作为用户给予的评级支持。本研究旨在以电子商务用户评论的形式输入数据,利用LDA算法确定情感分析对电子商务用户评论的分类性能。然后,将用户评论情感分析的应用与一般训练数据和分类训练数据的应用进行比较。研究结果表明,在第一次迭代中,LDA的应用产生了最佳的体系结构,参数为alpha 0.001, beta 0.001,主题数15。该体系结构具有67.5%的精度水平。从最佳体系结构中,然后根据每个产品审查类别给出训练数据输入。结果表明,综合使用一般数据和每类数据表明,三次测试数据的平均准确率提高了0.82%。因此,为了获得构建用户评论情感分析分类模型的最佳性能,应该结合一般数据和每个类别数据的使用情况,应用LDA来执行
{"title":"Aspect Based Sentiment Analysis in E-Commerce User Reviews Using Latent Dirichlet Allocation (LDA) and Sentiment Lexicon","authors":"Eko Wahyudi, R. Kusumaningrum","doi":"10.1109/ICICoS48119.2019.8982522","DOIUrl":"https://doi.org/10.1109/ICICoS48119.2019.8982522","url":null,"abstract":"User ratings on products sold bye-commerce greatly influence the number of purchases. Positive ratings will encourage other buyers to participate in buying the product. While negative ratings given by users will reduce the interest in purchasing products. Nonconformities between rating and user reviews sometimes provide a wrong assessment of a product. This happens because buyers also provide reviews on the quality of delivery services from e-commerce. Based on that issue, the utilization of the Latent Dirichlet Allocation (LDA) could be used on sentiment analysis of the user reviews. Sentiment analysis of the user reviews aims to facilitate e-commerce in informing the product quality as rating supporters that have been given by users. This research aims to determine the classification performance of sentiment analysis on e-commerce user reviews using the LDA algorithm with input data in the form of e-commerce user reviews. Then, compare the application of sentiment analysis of the user reviews with the use of general training data and per category training data. The result of this research showed that in the first iteration the best architecture was produced by the application of LDA with a combination of parameters of alpha 0.001, beta 0.001, and number of topics 15. The architecture had 67,5% accuracy level. From the best architecture then training data input is given based on each product review category. The result showed that the combination of the usage of general data and per category data indicate an increase in the average accuracy of 0,82 % from the three-test data. Therefore, in order to produce the best performance of building a classification model of sentiment analysis of the user reviews, it should be performed by applying LDA with a combination of general data and per category data usage","PeriodicalId":105407,"journal":{"name":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124255246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Detection of the Emergence of Exudate on the Image of Retina Using Extreme Learning Machine Method 利用极限学习机方法检测视网膜图像上渗出物的出现
Pub Date : 2019-10-01 DOI: 10.1109/ICICoS48119.2019.8982492
Zolanda Anggraeni, H. A. Wibawa
Diabetic retinopathy is a health problem that cause damage to the retinal blood vessels and occurs in more than half of people who suffer from diabetes. It is estimated that around 28 million people experience loss of sight for this reason. Thus, the system for detecting early signs of diabetic retinopathy will be very helpful and one of first signs of the onset of symptoms of diabetic retinopathy is the appearance of exudates in the retinal image of the eye. To build an exudate emergence detection system, in this study use the method of extreme learning machine (ELM) which has a fast learning speed. This system uses the gray level co-occurrence matrix feature extraction with 6 features, namely contrast, homogeneity, correlation, ASM, energy and dissimilarity. To get the best model, six scenarios are used by distinguishing the preprocessing flow. The pre processing stage carried out by all scenarios is optic disc removal, green channel separation, contrast limited adaptive histogram equalization (CLAHE) followed by two different preprocessing lines, namely applying brightness and dilation and erosion operations. Then the second path is radon transform, top-hat filtering, discrete wavelet transform and dilation and erosion. The best model results reached the best accuracy value of 65% with a combination of multiquadric activation functions and 30 hidden neurons.
糖尿病视网膜病变是一种导致视网膜血管损伤的健康问题,超过一半的糖尿病患者都会出现这种疾病。据估计,大约有2800万人因此丧失视力。因此,检测糖尿病视网膜病变早期症状的系统将非常有帮助,糖尿病视网膜病变症状发作的第一个迹象是眼睛视网膜图像中渗出物的出现。为了构建一个渗出物紧急检测系统,本研究采用了学习速度快的极限学习机(ELM)方法。该系统采用灰度共现矩阵特征提取,具有对比度、同质性、相关性、ASM、能量和不相似性6个特征。为了得到最佳模型,通过区分预处理流程,使用了6种场景。所有场景进行的预处理阶段是视盘去除、绿色通道分离、对比度有限的自适应直方图均衡化(CLAHE),然后是两条不同的预处理线,即应用亮度和膨胀侵蚀操作。第二条路径是氡变换、顶帽滤波、离散小波变换和膨胀侵蚀。采用多重二次激活函数和30个隐藏神经元相结合的方法,模型结果达到65%的最佳准确率值。
{"title":"Detection of the Emergence of Exudate on the Image of Retina Using Extreme Learning Machine Method","authors":"Zolanda Anggraeni, H. A. Wibawa","doi":"10.1109/ICICoS48119.2019.8982492","DOIUrl":"https://doi.org/10.1109/ICICoS48119.2019.8982492","url":null,"abstract":"Diabetic retinopathy is a health problem that cause damage to the retinal blood vessels and occurs in more than half of people who suffer from diabetes. It is estimated that around 28 million people experience loss of sight for this reason. Thus, the system for detecting early signs of diabetic retinopathy will be very helpful and one of first signs of the onset of symptoms of diabetic retinopathy is the appearance of exudates in the retinal image of the eye. To build an exudate emergence detection system, in this study use the method of extreme learning machine (ELM) which has a fast learning speed. This system uses the gray level co-occurrence matrix feature extraction with 6 features, namely contrast, homogeneity, correlation, ASM, energy and dissimilarity. To get the best model, six scenarios are used by distinguishing the preprocessing flow. The pre processing stage carried out by all scenarios is optic disc removal, green channel separation, contrast limited adaptive histogram equalization (CLAHE) followed by two different preprocessing lines, namely applying brightness and dilation and erosion operations. Then the second path is radon transform, top-hat filtering, discrete wavelet transform and dilation and erosion. The best model results reached the best accuracy value of 65% with a combination of multiquadric activation functions and 30 hidden neurons.","PeriodicalId":105407,"journal":{"name":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","volume":"265 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124316218","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Twitter Storytelling Generator Using Latent Dirichlet Allocation and Hidden Markov Model POS-TAG (Part-of-Speech Tagging) 基于潜在狄利克雷分配和隐马尔可夫模型POS-TAG(词性标注)的Twitter故事生成器
Pub Date : 2019-10-01 DOI: 10.1109/ICICoS48119.2019.8982411
Yasir Abdur Rohman, R. Kusumaningrum
Twitter active users in Indonesia reached 50 million users from a total worldwide of 284 million in 2015. In January 2019, active users on Twitter increased by 52% compared to 2018 where active users were only 27%.A large number of users causes the number of tweet documents increases. Tweet documents that contain information such as user activity, news, story can be processed into valuable information for journalists. All of the information collected then arranged based on related tweets into a storytelling that will become news/article. The whole process is still done manually by collecting one by one for each tweet and most of the tweet documents are collected from the trending topic. Actually, that should be done automatically by collecting tweets that have the same topic. Therefore, this research proposes a method of Twitter storytelling generator that combines Latent Dirichlet Allocation (LDA) and Hidden Markov Model POS-TAG (Part-of-Speech Tagging), so it can generate twitter storytelling based on the certain topic. We implemented two scenarios of the experiment. The first experimental calculating the value of perplexity on LDA and HMM POS-TAG, yielding the lowest perplexity value of 6.31 with alpha 0.001, beta 0.001, and the number of topics 4. While the second experimental calculating the value of ROUGE-1, ROUGE-2, BLEU-1, and BLEU-2 on the results of Twitter storytelling generator, yielding the best ROUGE-1 value is 0.470 with the beta cap value of 0.1 and the best ROUGE-2 value is 0.149 with the beta cap value of 0.001. Meanwhile, the best BLEU-1 value is 0.617 on the topic 1 and the best BLEU-2 value is 0.432 on the topic 3. Twitter storytelling generator using the proposed method has good performance when HMM POS-TAG can tagging the tweet documents correctly.
2015年,Twitter在印尼的活跃用户达到5000万,而全球用户总数为2.84亿。2019年1月,Twitter的活跃用户比2018年增长了52%,而2018年的活跃用户仅为27%。大量的用户导致tweet文档的数量增加。包含用户活动、新闻、故事等信息的Tweet文档可以被处理成对记者有价值的信息。所有收集到的信息,然后根据相关的推文排列成一个故事,将成为新闻/文章。整个过程仍然是手动完成的,每条推文都是一个接一个地收集,大部分推文文档都是从趋势主题中收集的。实际上,这应该通过收集具有相同主题的推文来自动完成。因此,本研究提出了一种结合Latent Dirichlet Allocation (LDA)和Hidden Markov Model POS-TAG (Part-of-Speech Tagging)的Twitter故事生成器方法,可以基于特定主题生成Twitter故事生成器。我们在实验中实现了两个场景。第一个实验计算LDA和HMM POS-TAG上的perplexity值,得到最小perplexity值为6.31,alpha值为0.001,beta值为0.001,题目数为4。第二个实验计算了ROUGE-1、ROUGE-2、blue -1和blue -2对Twitter故事生成器结果的值,得到最佳的ROUGE-1值为0.470,beta帽值为0.1,最佳的ROUGE-2值为0.149,beta帽值为0.001。同时,主题1的最佳BLEU-1值为0.617,主题3的最佳BLEU-2值为0.432。当HMM POS-TAG能够正确标记推文文档时,使用该方法的推文故事生成器具有良好的性能。
{"title":"Twitter Storytelling Generator Using Latent Dirichlet Allocation and Hidden Markov Model POS-TAG (Part-of-Speech Tagging)","authors":"Yasir Abdur Rohman, R. Kusumaningrum","doi":"10.1109/ICICoS48119.2019.8982411","DOIUrl":"https://doi.org/10.1109/ICICoS48119.2019.8982411","url":null,"abstract":"Twitter active users in Indonesia reached 50 million users from a total worldwide of 284 million in 2015. In January 2019, active users on Twitter increased by 52% compared to 2018 where active users were only 27%.A large number of users causes the number of tweet documents increases. Tweet documents that contain information such as user activity, news, story can be processed into valuable information for journalists. All of the information collected then arranged based on related tweets into a storytelling that will become news/article. The whole process is still done manually by collecting one by one for each tweet and most of the tweet documents are collected from the trending topic. Actually, that should be done automatically by collecting tweets that have the same topic. Therefore, this research proposes a method of Twitter storytelling generator that combines Latent Dirichlet Allocation (LDA) and Hidden Markov Model POS-TAG (Part-of-Speech Tagging), so it can generate twitter storytelling based on the certain topic. We implemented two scenarios of the experiment. The first experimental calculating the value of perplexity on LDA and HMM POS-TAG, yielding the lowest perplexity value of 6.31 with alpha 0.001, beta 0.001, and the number of topics 4. While the second experimental calculating the value of ROUGE-1, ROUGE-2, BLEU-1, and BLEU-2 on the results of Twitter storytelling generator, yielding the best ROUGE-1 value is 0.470 with the beta cap value of 0.1 and the best ROUGE-2 value is 0.149 with the beta cap value of 0.001. Meanwhile, the best BLEU-1 value is 0.617 on the topic 1 and the best BLEU-2 value is 0.432 on the topic 3. Twitter storytelling generator using the proposed method has good performance when HMM POS-TAG can tagging the tweet documents correctly.","PeriodicalId":105407,"journal":{"name":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124360147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Sumatra Traditional Food Image Classification Using Classical Machine Learning 使用经典机器学习的苏门答腊岛传统食物图像分类
Pub Date : 2019-10-01 DOI: 10.1109/ICICoS48119.2019.8982447
Puteri Khatya Fahira, A. Wibisono, H. Wisesa, Zulia Putri Rahmadhani, P. Mursanto, A. Nurhadiyatna
Indonesia is a country rich in culture. One of Indonesia's culturaldiversity is on traditional foods. Traditional food not only has a role in the cultural aspect, but also has an influence on biodiversity. Unfortunately, the current diet of people endangers the existence of traditional foods, which indirectly will also affect Indonesia's food security. Indonesia Local Food Database is one solution proposed to prevent this problem, where the database will play a role to monitor food systems in Indonesia. In this research, database development will focus on collecting data for Sumatra traditionalfood, and also building a model for image classification which will later become one of the main features of the database. Some features like color and texture are extracted from the image. These features are used for classification using 5 classical machine learning models. Evaluation results show performance that as good as deep learning approach.
印度尼西亚是一个文化丰富的国家。印尼的文化多样性之一是传统食物。传统食物不仅在文化方面有作用,而且对生物多样性也有影响。不幸的是,目前人们的饮食危及传统食品的存在,这也将间接影响印度尼西亚的粮食安全。印度尼西亚地方食品数据库是为防止这一问题而提出的一种解决方案,该数据库将在监测印度尼西亚的食品系统方面发挥作用。在本研究中,数据库的开发将侧重于收集苏门答腊岛传统食物的数据,并建立图像分类模型,这将成为数据库的主要功能之一。从图像中提取颜色和纹理等特征。这些特征使用5个经典机器学习模型进行分类。评估结果表明,该方法的性能与深度学习方法相当。
{"title":"Sumatra Traditional Food Image Classification Using Classical Machine Learning","authors":"Puteri Khatya Fahira, A. Wibisono, H. Wisesa, Zulia Putri Rahmadhani, P. Mursanto, A. Nurhadiyatna","doi":"10.1109/ICICoS48119.2019.8982447","DOIUrl":"https://doi.org/10.1109/ICICoS48119.2019.8982447","url":null,"abstract":"Indonesia is a country rich in culture. One of Indonesia's culturaldiversity is on traditional foods. Traditional food not only has a role in the cultural aspect, but also has an influence on biodiversity. Unfortunately, the current diet of people endangers the existence of traditional foods, which indirectly will also affect Indonesia's food security. Indonesia Local Food Database is one solution proposed to prevent this problem, where the database will play a role to monitor food systems in Indonesia. In this research, database development will focus on collecting data for Sumatra traditionalfood, and also building a model for image classification which will later become one of the main features of the database. Some features like color and texture are extracted from the image. These features are used for classification using 5 classical machine learning models. Evaluation results show performance that as good as deep learning approach.","PeriodicalId":105407,"journal":{"name":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117102385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Examining the Acceptance of Virtual Assistant - Vanika for University Students 大学生对虚拟助手Vanika的接受度调查
Pub Date : 2019-10-01 DOI: 10.1109/ICICoS48119.2019.8982513
Devina Gunadi, Ridwan Sanjaya, Bernardinus Harnadi
Finding the right information in easily manner is a need of the community. Virtual assistant is a mobile application to fill this necessity in easily, effective, and efficient manners. Vanika was the virtual assistant developed by Soegijapranata Catholic University (SCU) to serve students and prospective students in finding the right information about the SCU. This study aims to examine the acceptance of virtual assistant-Vanika among them. The answers of two hundred questionnaires were collected from students and prospective students of SCU after they utilized Vanika at least twice. The result reveals that all of factors Information Service Quality, Perceived Ease of Use, and Perceived Usefulness have statistically correlation on Behavioral Intention and each other. Only Information Service Quality has statistically significant direct effect on Behavioral Intention. Prospective students frequently spend more time in searching university information than students, conversely, student perceived easier in using Vanika than prospective students. The last, males spend more time in searching university information than females, and females have more intention to use Vanika than males. The practical implications derived from the findings are the actions to increase individual's acceptance to use Vanika by means of increasing factors relating to Information Service Quality, Perceived Ease of Use, Perceived Usefulness, Age, and Gender.
以方便的方式找到正确的信息是社会的需要。虚拟助手是一个移动应用程序,以方便,有效和高效的方式填补这一需求。Vanika是Soegijapranata天主教大学(SCU)开发的虚拟助手,旨在帮助学生和未来的学生找到关于SCU的正确信息。本研究旨在考察他们对虚拟助理vanika的接受程度。在使用Vanika至少两次后,收集了200份问卷的答案。结果表明,信息服务质量、感知易用性和感知有用性三者对行为意向和彼此之间均有统计学上的相关关系。只有信息服务质量对行为意向有显著的直接影响。在校生通常比在校生花费更多的时间来搜索大学信息,相反,在校生比在校生更容易使用Vanika。最后,男性比女性花费更多的时间搜索大学信息,女性比男性更倾向于使用Vanika。从研究结果中得出的实际意义是通过增加与信息服务质量、感知易用性、感知有用性、年龄和性别相关的因素来提高个人对Vanika的接受度。
{"title":"Examining the Acceptance of Virtual Assistant - Vanika for University Students","authors":"Devina Gunadi, Ridwan Sanjaya, Bernardinus Harnadi","doi":"10.1109/ICICoS48119.2019.8982513","DOIUrl":"https://doi.org/10.1109/ICICoS48119.2019.8982513","url":null,"abstract":"Finding the right information in easily manner is a need of the community. Virtual assistant is a mobile application to fill this necessity in easily, effective, and efficient manners. Vanika was the virtual assistant developed by Soegijapranata Catholic University (SCU) to serve students and prospective students in finding the right information about the SCU. This study aims to examine the acceptance of virtual assistant-Vanika among them. The answers of two hundred questionnaires were collected from students and prospective students of SCU after they utilized Vanika at least twice. The result reveals that all of factors Information Service Quality, Perceived Ease of Use, and Perceived Usefulness have statistically correlation on Behavioral Intention and each other. Only Information Service Quality has statistically significant direct effect on Behavioral Intention. Prospective students frequently spend more time in searching university information than students, conversely, student perceived easier in using Vanika than prospective students. The last, males spend more time in searching university information than females, and females have more intention to use Vanika than males. The practical implications derived from the findings are the actions to increase individual's acceptance to use Vanika by means of increasing factors relating to Information Service Quality, Perceived Ease of Use, Perceived Usefulness, Age, and Gender.","PeriodicalId":105407,"journal":{"name":"2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125933550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
2019 3rd International Conference on Informatics and Computational Sciences (ICICoS)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1