C. Lungu, S. Ersali, Beata Szefler, Atena Pîrvan-Moldovan, S. Basak, M. Diudea
{"title":"Cluj描述符探索的大数据集的维度","authors":"C. Lungu, S. Ersali, Beata Szefler, Atena Pîrvan-Moldovan, S. Basak, M. Diudea","doi":"10.24193/SUBBCHEM.2017.3.16","DOIUrl":null,"url":null,"abstract":"Dimensionality of a relatively big data set (95 compounds) observed for toxicity (mutagenicity) was explored in order to compute QSAR models. Distinct molecular descriptors were used. Dimensionality of data, using PCA, correlation plots and clustering, was evaluated. Analyzing data dimensionality allowed model optimization. Docking studies and PCA were used in order to expand data dimensionality. Pearson correlation coefficient (r) values, obtained for both perceptive and predictive models, were satisfactory.","PeriodicalId":22005,"journal":{"name":"Studia Universitatis Babes-bolyai Chemia","volume":"32 1","pages":"197-204"},"PeriodicalIF":0.5000,"publicationDate":"2017-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Dimensionality of big data sets explored by Cluj descriptors\",\"authors\":\"C. Lungu, S. Ersali, Beata Szefler, Atena Pîrvan-Moldovan, S. Basak, M. Diudea\",\"doi\":\"10.24193/SUBBCHEM.2017.3.16\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Dimensionality of a relatively big data set (95 compounds) observed for toxicity (mutagenicity) was explored in order to compute QSAR models. Distinct molecular descriptors were used. Dimensionality of data, using PCA, correlation plots and clustering, was evaluated. Analyzing data dimensionality allowed model optimization. Docking studies and PCA were used in order to expand data dimensionality. Pearson correlation coefficient (r) values, obtained for both perceptive and predictive models, were satisfactory.\",\"PeriodicalId\":22005,\"journal\":{\"name\":\"Studia Universitatis Babes-bolyai Chemia\",\"volume\":\"32 1\",\"pages\":\"197-204\"},\"PeriodicalIF\":0.5000,\"publicationDate\":\"2017-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Studia Universitatis Babes-bolyai Chemia\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.24193/SUBBCHEM.2017.3.16\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Studia Universitatis Babes-bolyai Chemia","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.24193/SUBBCHEM.2017.3.16","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Dimensionality of big data sets explored by Cluj descriptors
Dimensionality of a relatively big data set (95 compounds) observed for toxicity (mutagenicity) was explored in order to compute QSAR models. Distinct molecular descriptors were used. Dimensionality of data, using PCA, correlation plots and clustering, was evaluated. Analyzing data dimensionality allowed model optimization. Docking studies and PCA were used in order to expand data dimensionality. Pearson correlation coefficient (r) values, obtained for both perceptive and predictive models, were satisfactory.
期刊介绍:
Studia Universitatis Babes-Bolyai, Seria Chemia publishes fundamental studies in all areas of chemistry and chemical engineering.
Coverage includes experimental and theoretical reports on quantitative studies of structure and thermodynamics, kinetics, mechanisms of reactions, inorganic, organic, organometallic chemistry, biochemistry, computational chemistry, solid-state phenomena, surface chemistry, chemical technology and environmental chemistry.