Behnam Moeini, Tahereh G. Avval, Neal Gallagher, M. Linford
{"title":"表面分析洞察笔记。X射线光电子能谱图像的主成分分析。预处理的重要性","authors":"Behnam Moeini, Tahereh G. Avval, Neal Gallagher, M. Linford","doi":"10.1002/sia.7252","DOIUrl":null,"url":null,"abstract":"This Insight Note follows two previous Insight Notes on X‐ray photoelectron spectroscopy (XPS) image analysis that dealt with the importance of analyzing the raw data and the use of summary statistics. As a next step in the exploratory data analysis (EDA) of XPS images, we now show principal component analysis (PCA) of an XPS image. PCA is appropriate when the spectra in a data set are correlated to some degree and the noise in the spectra is unimportant. In these cases, PCA can significantly reduce the dimensionality and complexity of data sets. Preprocessing is an important part of many PCAs. Its usefulness is illustrated with a small, mock data set, where the potential pitfalls of not preprocessing are shown. PCAs of XPS image data that were not preprocessed and preprocessed by mean centering are illustrated. Scree plots, which are used to determine the number of abstract factors (principal components, PCs) that describe a data set, are shown. The spectra in our XPS image are quite noisy, which is consistent with the moderate, but still significant, amount of variance that is captured by the first two PCs in our PCA. With both preprocessing methods, the loadings on PC1 and PC2 are remarkably smooth. The loadings on the next six PCs also appear to contain some chemical information. Scores images generated using both no preprocessing and preprocessing by mean centering reveal many of the same general features in the data set that were found in our two previous Insight Notes.","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2023-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Surface analysis insight note. Principal component analysis (PCA) of an X‐ray photoelectron spectroscopy image. The importance of preprocessing\",\"authors\":\"Behnam Moeini, Tahereh G. Avval, Neal Gallagher, M. Linford\",\"doi\":\"10.1002/sia.7252\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This Insight Note follows two previous Insight Notes on X‐ray photoelectron spectroscopy (XPS) image analysis that dealt with the importance of analyzing the raw data and the use of summary statistics. As a next step in the exploratory data analysis (EDA) of XPS images, we now show principal component analysis (PCA) of an XPS image. PCA is appropriate when the spectra in a data set are correlated to some degree and the noise in the spectra is unimportant. In these cases, PCA can significantly reduce the dimensionality and complexity of data sets. Preprocessing is an important part of many PCAs. Its usefulness is illustrated with a small, mock data set, where the potential pitfalls of not preprocessing are shown. PCAs of XPS image data that were not preprocessed and preprocessed by mean centering are illustrated. Scree plots, which are used to determine the number of abstract factors (principal components, PCs) that describe a data set, are shown. The spectra in our XPS image are quite noisy, which is consistent with the moderate, but still significant, amount of variance that is captured by the first two PCs in our PCA. With both preprocessing methods, the loadings on PC1 and PC2 are remarkably smooth. The loadings on the next six PCs also appear to contain some chemical information. Scores images generated using both no preprocessing and preprocessing by mean centering reveal many of the same general features in the data set that were found in our two previous Insight Notes.\",\"PeriodicalId\":1,\"journal\":{\"name\":\"Accounts of Chemical Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":16.4000,\"publicationDate\":\"2023-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accounts of Chemical Research\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1002/sia.7252\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1002/sia.7252","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Surface analysis insight note. Principal component analysis (PCA) of an X‐ray photoelectron spectroscopy image. The importance of preprocessing
This Insight Note follows two previous Insight Notes on X‐ray photoelectron spectroscopy (XPS) image analysis that dealt with the importance of analyzing the raw data and the use of summary statistics. As a next step in the exploratory data analysis (EDA) of XPS images, we now show principal component analysis (PCA) of an XPS image. PCA is appropriate when the spectra in a data set are correlated to some degree and the noise in the spectra is unimportant. In these cases, PCA can significantly reduce the dimensionality and complexity of data sets. Preprocessing is an important part of many PCAs. Its usefulness is illustrated with a small, mock data set, where the potential pitfalls of not preprocessing are shown. PCAs of XPS image data that were not preprocessed and preprocessed by mean centering are illustrated. Scree plots, which are used to determine the number of abstract factors (principal components, PCs) that describe a data set, are shown. The spectra in our XPS image are quite noisy, which is consistent with the moderate, but still significant, amount of variance that is captured by the first two PCs in our PCA. With both preprocessing methods, the loadings on PC1 and PC2 are remarkably smooth. The loadings on the next six PCs also appear to contain some chemical information. Scores images generated using both no preprocessing and preprocessing by mean centering reveal many of the same general features in the data set that were found in our two previous Insight Notes.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.