Raquel Alonso-Perez , James M.D. Day , D. Graham Pearson , Yan Luo , Manuel A. Palacios , Raju Sudhakar , Aaron Palke
{"title":"通过指纹识别和机器学习方法探索祖母绿的全球地球化学出处","authors":"Raquel Alonso-Perez , James M.D. Day , D. Graham Pearson , Yan Luo , Manuel A. Palacios , Raju Sudhakar , Aaron Palke","doi":"10.1016/j.aiig.2024.100085","DOIUrl":null,"url":null,"abstract":"<div><p>Emeralds – the green colored variety of beryl – occur as gem-quality specimens in over fifty deposits globally. While digital traceability methods for emerald have limitations, sample-based approaches offer robust alternatives, particularly for determining the geographic origin of emerald. Three factors make emerald suitable for provenance studies and hence for developing models for origin determination. First, the diverse elemental chemistry of emerald at minor (<1 wt%) and trace levels (<1 to 100’s ppmw) exhibits unique inter-element fractionations between global deposits. Second, minimally destructive techniques, including laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS), enable measurement of these diagnostic elemental signatures. Third, when applied to extensive datasets, machine learning (ML) techniques enable the creation of predictive models and statistical discrimination with adequate characterization of the deposits. This study employs a carefully selected dataset comprising more than 1000 LA-ICP-MS analyses of gem-quality emeralds, enriched with new analyses. This dataset represents the largest available for global emerald deposits. We conducted unsupervised exploratory analysis using Principal Component Analysis (PCA). For machine learning-based classification, we employed Support Vector Machine Classification (SVM-C), achieving an initial accuracy rate of 79%. This was enhanced to 96.8% through the use of hierarchical SVM-C with PCA filters as our modeling approach. The ML models were trained using the concentrations of eight statistically significant elements (Li, V, Cr, Fe, Sc, Ga, Rb, Cs). By leveraging high-quality LA-ICP-MS data and ML techniques, accurate identification of the geographical origin of emerald becomes possible. These models are important for accurate provenance of emerald, and from a geochemical perspective, for understanding the formation environments of beryl-bearing pegmatites and shales.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"5 ","pages":"Article 100085"},"PeriodicalIF":0.0000,"publicationDate":"2024-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544124000261/pdfft?md5=8ac6027d08bf9d1a8618e5ecdb9f25b3&pid=1-s2.0-S2666544124000261-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Exploring emerald global geochemical provenance through fingerprinting and machine learning methods\",\"authors\":\"Raquel Alonso-Perez , James M.D. Day , D. Graham Pearson , Yan Luo , Manuel A. Palacios , Raju Sudhakar , Aaron Palke\",\"doi\":\"10.1016/j.aiig.2024.100085\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Emeralds – the green colored variety of beryl – occur as gem-quality specimens in over fifty deposits globally. While digital traceability methods for emerald have limitations, sample-based approaches offer robust alternatives, particularly for determining the geographic origin of emerald. Three factors make emerald suitable for provenance studies and hence for developing models for origin determination. First, the diverse elemental chemistry of emerald at minor (<1 wt%) and trace levels (<1 to 100’s ppmw) exhibits unique inter-element fractionations between global deposits. Second, minimally destructive techniques, including laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS), enable measurement of these diagnostic elemental signatures. Third, when applied to extensive datasets, machine learning (ML) techniques enable the creation of predictive models and statistical discrimination with adequate characterization of the deposits. This study employs a carefully selected dataset comprising more than 1000 LA-ICP-MS analyses of gem-quality emeralds, enriched with new analyses. This dataset represents the largest available for global emerald deposits. We conducted unsupervised exploratory analysis using Principal Component Analysis (PCA). For machine learning-based classification, we employed Support Vector Machine Classification (SVM-C), achieving an initial accuracy rate of 79%. This was enhanced to 96.8% through the use of hierarchical SVM-C with PCA filters as our modeling approach. The ML models were trained using the concentrations of eight statistically significant elements (Li, V, Cr, Fe, Sc, Ga, Rb, Cs). By leveraging high-quality LA-ICP-MS data and ML techniques, accurate identification of the geographical origin of emerald becomes possible. These models are important for accurate provenance of emerald, and from a geochemical perspective, for understanding the formation environments of beryl-bearing pegmatites and shales.</p></div>\",\"PeriodicalId\":100124,\"journal\":{\"name\":\"Artificial Intelligence in Geosciences\",\"volume\":\"5 \",\"pages\":\"Article 100085\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666544124000261/pdfft?md5=8ac6027d08bf9d1a8618e5ecdb9f25b3&pid=1-s2.0-S2666544124000261-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Artificial Intelligence in Geosciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666544124000261\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666544124000261","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploring emerald global geochemical provenance through fingerprinting and machine learning methods
Emeralds – the green colored variety of beryl – occur as gem-quality specimens in over fifty deposits globally. While digital traceability methods for emerald have limitations, sample-based approaches offer robust alternatives, particularly for determining the geographic origin of emerald. Three factors make emerald suitable for provenance studies and hence for developing models for origin determination. First, the diverse elemental chemistry of emerald at minor (<1 wt%) and trace levels (<1 to 100’s ppmw) exhibits unique inter-element fractionations between global deposits. Second, minimally destructive techniques, including laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS), enable measurement of these diagnostic elemental signatures. Third, when applied to extensive datasets, machine learning (ML) techniques enable the creation of predictive models and statistical discrimination with adequate characterization of the deposits. This study employs a carefully selected dataset comprising more than 1000 LA-ICP-MS analyses of gem-quality emeralds, enriched with new analyses. This dataset represents the largest available for global emerald deposits. We conducted unsupervised exploratory analysis using Principal Component Analysis (PCA). For machine learning-based classification, we employed Support Vector Machine Classification (SVM-C), achieving an initial accuracy rate of 79%. This was enhanced to 96.8% through the use of hierarchical SVM-C with PCA filters as our modeling approach. The ML models were trained using the concentrations of eight statistically significant elements (Li, V, Cr, Fe, Sc, Ga, Rb, Cs). By leveraging high-quality LA-ICP-MS data and ML techniques, accurate identification of the geographical origin of emerald becomes possible. These models are important for accurate provenance of emerald, and from a geochemical perspective, for understanding the formation environments of beryl-bearing pegmatites and shales.