C. Wirdiastuti, U. Syafitri, M. Sumertajaya, E. Rohaeti, M. Rafi
{"title":"Application of lasso for identification of functional groups with significant contributions to antioxidant activities of Centella asiatica","authors":"C. Wirdiastuti, U. Syafitri, M. Sumertajaya, E. Rohaeti, M. Rafi","doi":"10.28919/cmbn/7843","DOIUrl":null,"url":null,"abstract":": High-dimensional data has more variables than observations (p>>n). In this case, modeling with regression analysis becomes ineffective because it will violate the multicollinearity assumption. The least absolute shrinkage and selection operator (LASSO) can handle high-dimensional data and multicollinearity because LASSO works by reducing the parameters of variables with significant effects and selecting variables with minor effects. In its application, several variables have the same characteristics. Reducing and selecting variables in the form of groups can solve the problem so that the group LASSO can be used as a solution. This study used data on antioxidant activity in C. asiatica. It is a plant that contains antioxidants. The spectroscopic technique can find important information about antioxidants, namely the Fourier transformed infrared spectrophotometer (FTIR). FTIR is a spectroscopic technique based on molecular vibrations subjected to infrared so that it can characterize molecules with functional groups. FTIR data has large dimensions and multicollinearity. This study has 1866 explanatory","PeriodicalId":44079,"journal":{"name":"Communications in Mathematical Biology and Neuroscience","volume":null,"pages":null},"PeriodicalIF":0.5000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Mathematical Biology and Neuroscience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.28919/cmbn/7843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
: High-dimensional data has more variables than observations (p>>n). In this case, modeling with regression analysis becomes ineffective because it will violate the multicollinearity assumption. The least absolute shrinkage and selection operator (LASSO) can handle high-dimensional data and multicollinearity because LASSO works by reducing the parameters of variables with significant effects and selecting variables with minor effects. In its application, several variables have the same characteristics. Reducing and selecting variables in the form of groups can solve the problem so that the group LASSO can be used as a solution. This study used data on antioxidant activity in C. asiatica. It is a plant that contains antioxidants. The spectroscopic technique can find important information about antioxidants, namely the Fourier transformed infrared spectrophotometer (FTIR). FTIR is a spectroscopic technique based on molecular vibrations subjected to infrared so that it can characterize molecules with functional groups. FTIR data has large dimensions and multicollinearity. This study has 1866 explanatory
期刊介绍:
Communications in Mathematical Biology and Neuroscience (CMBN) is a peer-reviewed open access international journal, which is aimed to provide a publication forum for important research in all aspects of mathematical biology and neuroscience. This journal will accept high quality articles containing original research results and survey articles of exceptional merit.