首页 > 最新文献

Machine learning with applications最新文献

英文 中文
Document Layout Error Rate (DLER) metric to evaluate image segmentation methods 评估图像分割方法的文档布局错误率 (DLER) 指标
Pub Date : 2024-11-19 DOI: 10.1016/j.mlwa.2024.100606
Ari Vesalainen , Mikko Tolonen , Laura Ruotsalainen
Scholarly editions play a crucial role in humanities research, particularly in the study of literature and historical documents. The primary objective of these editions is to reconstruct the original text or provide insights into the author’s intentions. Traditionally, crafting a critical edition required a lifetime of dedication. However, thanks to recent advancements in deep learning and computer vision, modern text recognition tools can now be used to expedite this process. A key part of these tools is document layout analysis (DLA), where image segmentation methods are used to detect different text elements. Most existing DLA solutions have focused on evaluating the accuracy of these methods, often neglecting to study the practical consequences of method selection. In this study, we have developed a new metric, the Document Layout Error Rate (DLER), which evaluates the performance of fine-grained DLA methods within the overall pipeline. This metric helps identify the method with the lowest error rate, thereby minimizing the manual effort required for corrections. We applied this evaluation method to assess four different methods and their efficacy for the DLA task in the context of David Hume’s History of England.
学术版本在人文学科研究,尤其是文学和历史文献研究中发挥着至关重要的作用。这些版本的主要目的是重构原文或深入了解作者的意图。传统上,制作批判性版本需要一生的奉献。然而,得益于深度学习和计算机视觉领域的最新进展,现在可以使用现代文本识别工具来加快这一过程。这些工具的一个关键部分是文档排版分析(DLA),其中使用图像分割方法来检测不同的文本元素。大多数现有的 DLA 解决方案都侧重于评估这些方法的准确性,往往忽视了对方法选择的实际影响的研究。在本研究中,我们开发了一种新指标--文档布局错误率(DLER),用于评估整个管道中细粒度 DLA 方法的性能。该指标有助于确定错误率最低的方法,从而最大限度地减少人工修正所需的工作量。我们在大卫-休谟(David Hume)的《英国史》(History of England)中应用了这种评估方法来评估四种不同的方法及其在 DLA 任务中的功效。
{"title":"Document Layout Error Rate (DLER) metric to evaluate image segmentation methods","authors":"Ari Vesalainen ,&nbsp;Mikko Tolonen ,&nbsp;Laura Ruotsalainen","doi":"10.1016/j.mlwa.2024.100606","DOIUrl":"10.1016/j.mlwa.2024.100606","url":null,"abstract":"<div><div>Scholarly editions play a crucial role in humanities research, particularly in the study of literature and historical documents. The primary objective of these editions is to reconstruct the original text or provide insights into the author’s intentions. Traditionally, crafting a critical edition required a lifetime of dedication. However, thanks to recent advancements in deep learning and computer vision, modern text recognition tools can now be used to expedite this process. A key part of these tools is document layout analysis (DLA), where image segmentation methods are used to detect different text elements. Most existing DLA solutions have focused on evaluating the accuracy of these methods, often neglecting to study the practical consequences of method selection. In this study, we have developed a new metric, the Document Layout Error Rate (DLER), which evaluates the performance of fine-grained DLA methods within the overall pipeline. This metric helps identify the method with the lowest error rate, thereby minimizing the manual effort required for corrections. We applied this evaluation method to assess four different methods and their efficacy for the DLA task in the context of David Hume’s <em>History of England</em>.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"18 ","pages":"Article 100606"},"PeriodicalIF":0.0,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142698987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Supervised machine learning for microbiomics: Bridging the gap between current and best practices 用于微生物组学的有监督机器学习:缩小当前实践与最佳实践之间的差距
Pub Date : 2024-11-14 DOI: 10.1016/j.mlwa.2024.100607
Natasha Katherine Dudek , Mariami Chakhvadze , Saba Kobakhidze , Omar Kantidze , Yuriy Gankin
Machine learning (ML) is poised to drive innovations in clinical microbiomics, such as in disease diagnostics and prognostics. However, the successful implementation of ML in these domains necessitates the development of reproducible, interpretable models that meet the rigorous performance standards set by regulatory agencies. This study aims to identify key areas in need of improvement in current ML practices within microbiomics, with a focus on bridging the gap between existing methodologies and the requirements for clinical application. To do so, we analyze 100 peer-reviewed articles from 2021 to 2022. Within this corpus, datasets have a median size of 161.5 samples, with over one-third containing fewer than 100 samples, signaling a high potential for overfitting. Limited demographic data further raises concerns about generalizability and fairness, with 24% of studies omitting participants' country of residence, and attributes like race/ethnicity, education, and income rarely reported (11%, 2%, and 0%, respectively). Methodological issues are also common; for instance, for 86% of studies we could not confidently rule out test set omission and data leakage, suggesting a strong potential for inflated performance estimates across the literature. Reproducibility is a concern, with 78% of studies abstaining from sharing their ML code publicly. Based on this analysis, we provide guidance to avoid common pitfalls that can hinder model performance, generalizability, and trustworthiness. An interactive tutorial on applying ML to microbiomics data accompanies the discussion, to help establish and reinforce best practices within the community.
机器学习(ML)有望推动临床微生物组学的创新,如疾病诊断和预后。然而,要在这些领域成功实施机器学习,就必须开发出可重复、可解释的模型,以满足监管机构制定的严格性能标准。本研究旨在确定微生物组学中当前 ML 实践中需要改进的关键领域,重点是缩小现有方法与临床应用要求之间的差距。为此,我们分析了 2021 年至 2022 年的 100 篇同行评审文章。在这一语料库中,数据集的中位数为 161.5 个样本,其中超过三分之一的数据集包含的样本少于 100 个,这表明过度拟合的可能性很大。有限的人口统计学数据进一步引发了对普遍性和公平性的担忧,24% 的研究遗漏了参与者的居住国,种族/民族、教育程度和收入等属性也很少被报告(分别为 11%、2% 和 0%)。方法论问题也很常见;例如,在 86% 的研究中,我们无法有把握地排除测试集遗漏和数据泄漏的可能性,这表明在所有文献中都很有可能出现夸大绩效估计值的情况。可重复性也是一个令人担忧的问题,有 78% 的研究放弃公开共享其 ML 代码。基于这一分析,我们提供了避免常见陷阱的指南,这些陷阱可能会妨碍模型的性能、可推广性和可信度。在讨论的同时,我们还提供了将 ML 应用于微生物组学数据的互动教程,以帮助在社区内建立和加强最佳实践。
{"title":"Supervised machine learning for microbiomics: Bridging the gap between current and best practices","authors":"Natasha Katherine Dudek ,&nbsp;Mariami Chakhvadze ,&nbsp;Saba Kobakhidze ,&nbsp;Omar Kantidze ,&nbsp;Yuriy Gankin","doi":"10.1016/j.mlwa.2024.100607","DOIUrl":"10.1016/j.mlwa.2024.100607","url":null,"abstract":"<div><div>Machine learning (ML) is poised to drive innovations in clinical microbiomics, such as in disease diagnostics and prognostics. However, the successful implementation of ML in these domains necessitates the development of reproducible, interpretable models that meet the rigorous performance standards set by regulatory agencies. This study aims to identify key areas in need of improvement in current ML practices within microbiomics, with a focus on bridging the gap between existing methodologies and the requirements for clinical application. To do so, we analyze 100 peer-reviewed articles from 2021 to 2022. Within this corpus, datasets have a median size of 161.5 samples, with over one-third containing fewer than 100 samples, signaling a high potential for overfitting. Limited demographic data further raises concerns about generalizability and fairness, with 24% of studies omitting participants' country of residence, and attributes like race/ethnicity, education, and income rarely reported (11%, 2%, and 0%, respectively). Methodological issues are also common; for instance, for 86% of studies we could not confidently rule out test set omission and data leakage, suggesting a strong potential for inflated performance estimates across the literature. Reproducibility is a concern, with 78% of studies abstaining from sharing their ML code publicly. Based on this analysis, we provide guidance to avoid common pitfalls that can hinder model performance, generalizability, and trustworthiness. An interactive tutorial on applying ML to microbiomics data accompanies the discussion, to help establish and reinforce best practices within the community.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"18 ","pages":"Article 100607"},"PeriodicalIF":0.0,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142698988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Playing with words: Comparing the vocabulary and lexical diversity of ChatGPT and humans 玩文字游戏比较 ChatGPT 和人类的词汇量和词汇多样性
Pub Date : 2024-11-12 DOI: 10.1016/j.mlwa.2024.100602
Pedro Reviriego , Javier Conde , Elena Merino-Gómez , Gonzalo Martínez , José Alberto Hernández
The introduction of Artificial Intelligence (AI) generative language models such as GPT (Generative Pre-trained Transformer) and conversational tools such as ChatGPT has triggered a revolution that can transform how text is generated. This has many implications, for example, as AI-generated text becomes a significant fraction of the text, would this affect the language capabilities of readers and also the training of newer AI tools? Would it affect the evolution of languages? Focusing on one specific aspect of the language: words; will the use of tools such as ChatGPT increase or reduce the vocabulary used or the lexical diversity? This has implications for words, as those not included in AI-generated content will tend to be less and less popular and may eventually be lost. In this work, we perform an initial comparison of the vocabulary and lexical diversity of ChatGPT and humans when performing the same tasks. In more detail, two datasets containing the answers to different types of questions answered by ChatGPT and humans, and a third dataset in which ChatGPT paraphrases sentences and questions are used. The analysis shows that ChatGPT-3.5 tends to use fewer distinct words and lower diversity than humans while ChatGPT-4 has a similar lexical diversity as humans and in some cases even larger. These results are very preliminary and additional datasets and ChatGPT configurations have to be evaluated to extract more general conclusions. Therefore, further research is needed to understand how the use of ChatGPT and more broadly generative AI tools will affect the vocabulary and lexical diversity in different types of text and languages.
人工智能(AI)生成语言模型(如 GPT(预训练生成转换器))和会话工具(如 ChatGPT)的引入引发了一场革命,可以改变文本的生成方式。这将产生许多影响,例如,当人工智能生成的文本成为文本的重要组成部分时,这是否会影响读者的语言能力以及更新的人工智能工具的训练?它会影响语言的进化吗?聚焦语言的一个具体方面:词汇;使用 ChatGPT 等工具会增加还是减少词汇量或词汇多样性?这对词汇会产生影响,因为那些未包含在人工智能生成的内容中的词汇会越来越少,最终可能会消失。在这项工作中,我们初步比较了 ChatGPT 和人类在执行相同任务时的词汇量和词汇多样性。具体来说,我们使用了两个数据集,其中包含 ChatGPT 和人类回答的不同类型问题的答案,以及 ChatGPT 对句子和问题进行转述的第三个数据集。分析表明,与人类相比,ChatGPT-3.5 倾向于使用较少的独特词汇和较低的多样性,而 ChatGPT-4 的词汇多样性与人类相似,在某些情况下甚至更大。这些结果还只是初步的,还需要对更多的数据集和 ChatGPT 配置进行评估,才能得出更普遍的结论。因此,还需要进一步研究,以了解 ChatGPT 和更广泛的生成式人工智能工具的使用将如何影响不同类型文本和语言的词汇量和词汇多样性。
{"title":"Playing with words: Comparing the vocabulary and lexical diversity of ChatGPT and humans","authors":"Pedro Reviriego ,&nbsp;Javier Conde ,&nbsp;Elena Merino-Gómez ,&nbsp;Gonzalo Martínez ,&nbsp;José Alberto Hernández","doi":"10.1016/j.mlwa.2024.100602","DOIUrl":"10.1016/j.mlwa.2024.100602","url":null,"abstract":"<div><div>The introduction of Artificial Intelligence (AI) generative language models such as GPT (Generative Pre-trained Transformer) and conversational tools such as ChatGPT has triggered a revolution that can transform how text is generated. This has many implications, for example, as AI-generated text becomes a significant fraction of the text, would this affect the language capabilities of readers and also the training of newer AI tools? Would it affect the evolution of languages? Focusing on one specific aspect of the language: words; will the use of tools such as ChatGPT increase or reduce the vocabulary used or the lexical diversity? This has implications for words, as those not included in AI-generated content will tend to be less and less popular and may eventually be lost. In this work, we perform an initial comparison of the vocabulary and lexical diversity of ChatGPT and humans when performing the same tasks. In more detail, two datasets containing the answers to different types of questions answered by ChatGPT and humans, and a third dataset in which ChatGPT paraphrases sentences and questions are used. The analysis shows that ChatGPT-3.5 tends to use fewer distinct words and lower diversity than humans while ChatGPT-4 has a similar lexical diversity as humans and in some cases even larger. These results are very preliminary and additional datasets and ChatGPT configurations have to be evaluated to extract more general conclusions. Therefore, further research is needed to understand how the use of ChatGPT and more broadly generative AI tools will affect the vocabulary and lexical diversity in different types of text and languages.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"18 ","pages":"Article 100602"},"PeriodicalIF":0.0,"publicationDate":"2024-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142698989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A survey on knowledge distillation: Recent advancements 知识提炼调查:最新进展
Pub Date : 2024-11-10 DOI: 10.1016/j.mlwa.2024.100605
Amir Moslemi , Anna Briskina , Zubeka Dang , Jason Li
Deep learning has achieved notable success across academia, medicine, and industry. Its ability to identify complex patterns in large-scale data and to manage millions of parameters has made it highly advantageous. However, deploying deep learning models presents a significant challenge due to their high computational demands. Knowledge distillation (KD) has emerged as a key technique for model compression and efficient knowledge transfer, enabling the deployment of deep learning models on resource-limited devices without compromising performance. This survey examines recent advancements in KD, highlighting key innovations in architectures, training paradigms, and application domains. We categorize contemporary KD methods into traditional approaches, such as response-based, feature-based, and relation-based knowledge distillation, and novel advanced paradigms, including self-distillation, cross-modal distillation, and adversarial distillation strategies. Additionally, we discuss emerging challenges, particularly in the context of distillation under limited data scenarios, privacy-preserving KD, and the interplay with other model compression techniques like quantization. Our survey also explores applications across computer vision, natural language processing, and multimodal tasks, where KD has driven performance improvements and enhanced model compression. This review aims to provide researchers and practitioners with a comprehensive understanding of the state-of-the-art in knowledge distillation, bridging foundational concepts with the latest methodologies and practical implications.
深度学习在学术界、医学界和工业界都取得了显著的成就。深度学习能够识别大规模数据中的复杂模式并管理数百万个参数,这使其具有极大的优势。然而,由于对计算的要求很高,部署深度学习模型是一项重大挑战。知识蒸馏(KD)已成为模型压缩和高效知识传输的一项关键技术,可在资源有限的设备上部署深度学习模型,同时不影响性能。本调查研究了知识蒸馏的最新进展,突出了架构、训练范式和应用领域的关键创新。我们将当代 KD 方法分为传统方法(如基于响应、基于特征和基于关系的知识蒸馏)和新型高级范式(包括自蒸馏、跨模态蒸馏和对抗性蒸馏策略)。此外,我们还讨论了新出现的挑战,特别是在有限数据场景下的蒸馏、保护隐私的 KD 以及与量化等其他模型压缩技术的相互作用等方面。我们的调查还探讨了计算机视觉、自然语言处理和多模态任务中的应用,在这些应用中,KD 推动了性能的提高并加强了模型压缩。本综述旨在让研究人员和从业人员全面了解知识蒸馏的最新进展,将基础概念与最新方法和实际影响联系起来。
{"title":"A survey on knowledge distillation: Recent advancements","authors":"Amir Moslemi ,&nbsp;Anna Briskina ,&nbsp;Zubeka Dang ,&nbsp;Jason Li","doi":"10.1016/j.mlwa.2024.100605","DOIUrl":"10.1016/j.mlwa.2024.100605","url":null,"abstract":"<div><div>Deep learning has achieved notable success across academia, medicine, and industry. Its ability to identify complex patterns in large-scale data and to manage millions of parameters has made it highly advantageous. However, deploying deep learning models presents a significant challenge due to their high computational demands. Knowledge distillation (KD) has emerged as a key technique for model compression and efficient knowledge transfer, enabling the deployment of deep learning models on resource-limited devices without compromising performance. This survey examines recent advancements in KD, highlighting key innovations in architectures, training paradigms, and application domains. We categorize contemporary KD methods into traditional approaches, such as response-based, feature-based, and relation-based knowledge distillation, and novel advanced paradigms, including self-distillation, cross-modal distillation, and adversarial distillation strategies. Additionally, we discuss emerging challenges, particularly in the context of distillation under limited data scenarios, privacy-preserving KD, and the interplay with other model compression techniques like quantization. Our survey also explores applications across computer vision, natural language processing, and multimodal tasks, where KD has driven performance improvements and enhanced model compression. This review aims to provide researchers and practitioners with a comprehensive understanding of the state-of-the-art in knowledge distillation, bridging foundational concepts with the latest methodologies and practical implications.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"18 ","pages":"Article 100605"},"PeriodicalIF":0.0,"publicationDate":"2024-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142698990","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Texas rural land market integration: A causal analysis using machine learning applications 得克萨斯州农村土地市场一体化:利用机器学习应用进行因果分析
Pub Date : 2024-11-08 DOI: 10.1016/j.mlwa.2024.100604
Tian Su , Senarath Dharmasena , David Leatham , Charles Gilliland
Texas rural land markets have several special features that makes it unique from other rural land markets in the United States. In 2021, Texas agricultural land, including buildings, is valued at $299.88 billion, which is almost 10% of the nation's total agricultural real estate value and 83% of the state's land is categorized as rural. In addition, due to its size and geologic features, Texas’ diverse landscape creates complex and widely divergent conditions affecting ownership and marketing of the land. Despite this complexity, lack of granular level and reliable transactional data on land sales has prevented thorough investigation into Texas land markets to uncover various interdependencies. Using quarterly transactional land value data from 1966 to 2017, this study uses cutting-edge machine learning algorithms and probabilistic graphical models to uncover causal interaction patterns of different land markets in Texas. The results reveal that Texas rural land markets are interdependent. Current and potential landholders and lenders can use the results from this work to aid strategic decision making. Financial institutions and investment groups could be made aware of the trend of one land market relative to other markets and adjust their holdings accordingly. Landowners may better understand changes in net wealth, which affect their ability to borrow capital and operate efficiently. Moreover, lenders may also benefit from the information to manage collateral and thus maintain the stability of their operation.
得克萨斯州的农村土地市场有几个特点,使其有别于美国其他农村土地市场。2021 年,得克萨斯州农业用地(包括建筑物)的价值为 2,998.8 亿美元,几乎占全国农业房地产总价值的 10%,而该州 83% 的土地被归类为农村土地。此外,得克萨斯州幅员辽阔,地质特征各异,地形复杂多样,影响了土地的所有权和销售。尽管情况复杂,但由于缺乏细粒度的、可靠的土地销售交易数据,因此无法对得克萨斯州的土地市场进行深入调查,以揭示各种相互依存关系。本研究利用 1966 年至 2017 年的季度交易地价数据,采用最先进的机器学习算法和概率图形模型,揭示了德克萨斯州不同土地市场的因果互动模式。研究结果表明,得克萨斯州农村土地市场是相互依存的。当前和潜在的土地所有者及贷款人可以利用这项研究的结果来帮助做出战略决策。金融机构和投资集团可以了解一个土地市场相对于其他市场的趋势,并据此调整其持有的土地。土地所有者可以更好地了解净财富的变化,这将影响他们借贷资本和有效经营的能力。此外,贷款人也可从信息中获益,以管理抵押品,从而保持其经营的稳定性。
{"title":"Texas rural land market integration: A causal analysis using machine learning applications","authors":"Tian Su ,&nbsp;Senarath Dharmasena ,&nbsp;David Leatham ,&nbsp;Charles Gilliland","doi":"10.1016/j.mlwa.2024.100604","DOIUrl":"10.1016/j.mlwa.2024.100604","url":null,"abstract":"<div><div>Texas rural land markets have several special features that makes it unique from other rural land markets in the United States. In 2021, Texas agricultural land, including buildings, is valued at $299.88 billion, which is almost 10% of the nation's total agricultural real estate value and 83% of the state's land is categorized as rural. In addition, due to its size and geologic features, Texas’ diverse landscape creates complex and widely divergent conditions affecting ownership and marketing of the land. Despite this complexity, lack of granular level and reliable transactional data on land sales has prevented thorough investigation into Texas land markets to uncover various interdependencies. Using quarterly transactional land value data from 1966 to 2017, this study uses cutting-edge machine learning algorithms and probabilistic graphical models to uncover causal interaction patterns of different land markets in Texas. The results reveal that Texas rural land markets are interdependent. Current and potential landholders and lenders can use the results from this work to aid strategic decision making. Financial institutions and investment groups could be made aware of the trend of one land market relative to other markets and adjust their holdings accordingly. Landowners may better understand changes in net wealth, which affect their ability to borrow capital and operate efficiently. Moreover, lenders may also benefit from the information to manage collateral and thus maintain the stability of their operation.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"18 ","pages":"Article 100604"},"PeriodicalIF":0.0,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142650979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Applications of cluster-based transfer learning in image and localization tasks 基于聚类的迁移学习在图像和定位任务中的应用
Pub Date : 2024-11-07 DOI: 10.1016/j.mlwa.2024.100601
Liuyi Yang, Patrick Finnerty, Chikara Ohta
Transfer learning can address the issue of insufficient labels in machine learning. Using knowledge in a labeled domain (source domain) can assist in acquiring and learning knowledge in a domain (target domain) that lacks some or all labels. In this paper, we propose a new cluster-based semi-supervised transfer learning (CBSSTL) under a new assumption that samples in the target domain are unlabeled but contain cluster information. Furthermore, we propose a new transfer learning framework and a method for fine-tuning parameters. We tested and compared the proposed method with other unsupervised and semi-supervised transfer learning methods on well-known image datasets. The experimental results demonstrate the effectiveness of the proposed method. Additionally, we created a localization dataset for transfer learning. Finally, we tested and analyzed the proposed method on this dataset. Its particularly challenging nature makes it difficult for our method to work effectively.
迁移学习可以解决机器学习中标签不足的问题。利用有标签领域(源领域)的知识可以帮助获取和学习缺乏部分或全部标签的领域(目标领域)的知识。在本文中,我们提出了一种新的基于聚类的半监督迁移学习(CBSSTL),其新假设是:目标域中的样本没有标签,但包含聚类信息。此外,我们还提出了一种新的迁移学习框架和参数微调方法。我们在著名的图像数据集上对所提出的方法与其他无监督和半监督迁移学习方法进行了测试和比较。实验结果证明了所提方法的有效性。此外,我们还创建了一个用于迁移学习的定位数据集。最后,我们在该数据集上测试并分析了所提出的方法。该数据集特别具有挑战性,这使得我们的方法难以有效发挥作用。
{"title":"Applications of cluster-based transfer learning in image and localization tasks","authors":"Liuyi Yang,&nbsp;Patrick Finnerty,&nbsp;Chikara Ohta","doi":"10.1016/j.mlwa.2024.100601","DOIUrl":"10.1016/j.mlwa.2024.100601","url":null,"abstract":"<div><div>Transfer learning can address the issue of insufficient labels in machine learning. Using knowledge in a labeled domain (source domain) can assist in acquiring and learning knowledge in a domain (target domain) that lacks some or all labels. In this paper, we propose a new cluster-based semi-supervised transfer learning (CBSSTL) under a new assumption that samples in the target domain are unlabeled but contain cluster information. Furthermore, we propose a new transfer learning framework and a method for fine-tuning parameters. We tested and compared the proposed method with other unsupervised and semi-supervised transfer learning methods on well-known image datasets. The experimental results demonstrate the effectiveness of the proposed method. Additionally, we created a localization dataset for transfer learning. Finally, we tested and analyzed the proposed method on this dataset. Its particularly challenging nature makes it difficult for our method to work effectively.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"18 ","pages":"Article 100601"},"PeriodicalIF":0.0,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142650984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vulnerability detection using BERT based LLM model with transparency obligation practice towards trustworthy AI 使用基于 BERT 的 LLM 模型进行漏洞检测,履行透明义务,实现可信赖的人工智能
Pub Date : 2024-11-02 DOI: 10.1016/j.mlwa.2024.100598
Jean Haurogné , Nihala Basheer , Shareeful Islam
Vulnerabilities in the source code are one of the main causes of potential threats in software-intensive systems. There are a large number of vulnerabilities published each day, and effective vulnerability detection is critical to identifying and mitigating these vulnerabilities. AI has emerged as a promising solution to enhance vulnerability detection, offering the ability to analyse vast amounts of data and identify patterns indicative of potential threats. However, AI-based methods often face several challenges, specifically when dealing with large datasets and understanding the specific context of the problem. Large Language Model (LLM) is now widely considered to tackle more complex tasks and handle large datasets, which also exhibits limitations in terms of explaining the model outcome and existing works focus on providing overview of explainability and transparency. This research introduces a novel transparency obligation practice for vulnerability detection using BERT based LLMs. We address the black-box nature of LLMs by employing XAI techniques, unique combination of SHAP, LIME, heat map. We propose an architecture that combines the BERT model with transparency obligation practices, which ensures the assurance of transparency throughout the entire LLM life cycle. An experiment is performed with a large source code dataset to demonstrate the applicability of the proposed approach. The result shows higher accuracy of 91.8 % for the vulnerability detection and model explainability outcome is highly influenced by “vulnerable”, “function”, "mysql_tmpdir_list", “strmov” tokens using both SHAP and LIME framework. Heatmap of attention weights, highlights the local token interactions that aid in understanding the model's decision points.
源代码中的漏洞是造成软件密集型系统潜在威胁的主要原因之一。每天都有大量漏洞发布,有效的漏洞检测对于识别和缓解这些漏洞至关重要。人工智能已成为加强漏洞检测的一种前景广阔的解决方案,它能够分析海量数据并识别表明潜在威胁的模式。然而,基于人工智能的方法往往面临一些挑战,特别是在处理大型数据集和了解问题的具体背景时。目前,人们普遍认为大型语言模型(LLM)可以解决更复杂的任务和处理大型数据集,但它在解释模型结果方面也表现出局限性,现有的工作主要集中在提供可解释性和透明度方面的概述。本研究介绍了使用基于 BERT 的 LLM 进行漏洞检测的新型透明度义务实践。我们采用 XAI 技术、SHAP、LIME 和热图的独特组合,解决了 LLM 的黑箱性质。我们提出了一种将 BERT 模型与透明度义务实践相结合的架构,可确保整个 LLM 生命周期的透明度。我们使用大型源代码数据集进行了实验,以证明所提方法的适用性。结果表明,使用 SHAP 和 LIME 框架,漏洞检测的准确率高达 91.8%,模型的可解释性结果受 "vulnerable"、"function"、"mysql_tmpdir_list "和 "strmov "令牌的影响很大。注意力权重热图,突出显示了有助于理解模型决策点的局部标记相互作用。
{"title":"Vulnerability detection using BERT based LLM model with transparency obligation practice towards trustworthy AI","authors":"Jean Haurogné ,&nbsp;Nihala Basheer ,&nbsp;Shareeful Islam","doi":"10.1016/j.mlwa.2024.100598","DOIUrl":"10.1016/j.mlwa.2024.100598","url":null,"abstract":"<div><div>Vulnerabilities in the source code are one of the main causes of potential threats in software-intensive systems. There are a large number of vulnerabilities published each day, and effective vulnerability detection is critical to identifying and mitigating these vulnerabilities. AI has emerged as a promising solution to enhance vulnerability detection, offering the ability to analyse vast amounts of data and identify patterns indicative of potential threats. However, AI-based methods often face several challenges, specifically when dealing with large datasets and understanding the specific context of the problem. Large Language Model (LLM) is now widely considered to tackle more complex tasks and handle large datasets, which also exhibits limitations in terms of explaining the model outcome and existing works focus on providing overview of explainability and transparency. This research introduces a novel transparency obligation practice for vulnerability detection using BERT based LLMs. We address the black-box nature of LLMs by employing XAI techniques, unique combination of SHAP, LIME, heat map. We propose an architecture that combines the BERT model with transparency obligation practices, which ensures the assurance of transparency throughout the entire LLM life cycle. An experiment is performed with a large source code dataset to demonstrate the applicability of the proposed approach. The result shows higher accuracy of 91.8 % for the vulnerability detection and model explainability outcome is highly influenced by “vulnerable”, “function”, \"mysql_tmpdir_list\", “strmov” tokens using both SHAP and LIME framework. Heatmap of attention weights, highlights the local token interactions that aid in understanding the model's decision points.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"18 ","pages":"Article 100598"},"PeriodicalIF":0.0,"publicationDate":"2024-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142650982","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Moral decision making: Explainable insights into the role of working memory in autonomous driving 道德决策:关于工作记忆在自动驾驶中作用的可解释见解
Pub Date : 2024-10-31 DOI: 10.1016/j.mlwa.2024.100599
Amandeep Singh, Yovela Murzello, Hyowon Lee, Shene Abdalla, Siby Samuel
The intersection of Artificial Intelligence (AI) and moral philosophy presents unique challenges in the development of autonomous vehicles, particularly in scenarios requiring split-second ethical decisions. This study examines the relationship between working memory (WM) and moral judgments in simulated AV scenarios, quantifying the effects of varying cognitive load on utilitarian decision-making under different time constraints. We experimented with 336 participants, each completing 16 simulated driving trials presenting unique ethical dilemmas. Results reveal a complex interplay between cognitive load and ethical choices. Under high temporal pressure (1-second response window), utilitarian decisions decreased significantly from 92.77 % to 70.08 %. Extended time constraints led to increased utilitarian choices. Statistical analyses validated these findings across diverse ethical contexts. Chi-square tests revealed significant associations between WM load and utilitarian decisions in 1-second conditions, particularly for high-stakes scenarios. Logistic regression showed that WM significantly decreased the likelihood of utilitarian decisions in these scenarios. Six supervised machine learning models were employed, with Gaussian Naive Bayes achieving the highest predictive accuracy (82.2 % to 97.0 %) in distinguishing utilitarian decisions. Partial Dependence analysis revealed a strong negative correlation between WM and utilitarian decisions, especially in the 1-second interval. The 2-second interval emerged as potentially optimal for balancing time constraints and cognitive load. These findings contribute to the theoretical understanding of ethical decision-making under cognitive load and provide practical insights for developing ethically aligned autonomous systems, with implications for improving safety, optimizing takeover protocols, and enhancing the ethical reasoning capabilities of autonomous driving systems.
人工智能(AI)与道德哲学的交叉为自动驾驶汽车的开发带来了独特的挑战,尤其是在需要瞬间做出道德决定的场景中。本研究探讨了在模拟自动驾驶汽车场景中工作记忆(WM)与道德判断之间的关系,量化了在不同时间限制下不同认知负荷对功利决策的影响。我们对 336 名参与者进行了实验,每个人都完成了 16 次模拟驾驶试验,这些试验呈现了独特的道德困境。结果显示,认知负荷与道德选择之间存在复杂的相互作用。在高时间压力下(1 秒钟反应窗口),功利性决策从 92.77% 显著下降到 70.08%。时间限制延长则导致功利性选择增加。统计分析在不同的伦理环境中验证了这些发现。卡方检验显示,在 1 秒钟的条件下,WM 负荷与功利性决策之间存在显著关联,尤其是在高风险情景下。逻辑回归表明,在这些情景中,WM 会显著降低做出功利性决策的可能性。研究人员采用了六种有监督的机器学习模型,其中高斯直觉贝叶斯模型在区分功利性决策方面的预测准确率最高(82.2% 到 97.0%)。偏倚分析表明,WM 和功利性决策之间存在很强的负相关,尤其是在 1 秒的时间间隔内。2秒钟的时间间隔可能是平衡时间限制和认知负荷的最佳时间间隔。这些发现有助于从理论上理解认知负荷下的道德决策,并为开发符合道德规范的自动驾驶系统提供了实用见解,对提高安全性、优化接管协议和增强自动驾驶系统的道德推理能力具有重要意义。
{"title":"Moral decision making: Explainable insights into the role of working memory in autonomous driving","authors":"Amandeep Singh,&nbsp;Yovela Murzello,&nbsp;Hyowon Lee,&nbsp;Shene Abdalla,&nbsp;Siby Samuel","doi":"10.1016/j.mlwa.2024.100599","DOIUrl":"10.1016/j.mlwa.2024.100599","url":null,"abstract":"<div><div>The intersection of Artificial Intelligence (AI) and moral philosophy presents unique challenges in the development of autonomous vehicles, particularly in scenarios requiring split-second ethical decisions. This study examines the relationship between working memory (WM) and moral judgments in simulated AV scenarios, quantifying the effects of varying cognitive load on utilitarian decision-making under different time constraints. We experimented with 336 participants, each completing 16 simulated driving trials presenting unique ethical dilemmas. Results reveal a complex interplay between cognitive load and ethical choices. Under high temporal pressure (1-second response window), utilitarian decisions decreased significantly from 92.77 % to 70.08 %. Extended time constraints led to increased utilitarian choices. Statistical analyses validated these findings across diverse ethical contexts. Chi-square tests revealed significant associations between WM load and utilitarian decisions in 1-second conditions, particularly for high-stakes scenarios. Logistic regression showed that WM significantly decreased the likelihood of utilitarian decisions in these scenarios. Six supervised machine learning models were employed, with Gaussian Naive Bayes achieving the highest predictive accuracy (82.2 % to 97.0 %) in distinguishing utilitarian decisions. Partial Dependence analysis revealed a strong negative correlation between WM and utilitarian decisions, especially in the 1-second interval. The 2-second interval emerged as potentially optimal for balancing time constraints and cognitive load. These findings contribute to the theoretical understanding of ethical decision-making under cognitive load and provide practical insights for developing ethically aligned autonomous systems, with implications for improving safety, optimizing takeover protocols, and enhancing the ethical reasoning capabilities of autonomous driving systems.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"18 ","pages":"Article 100599"},"PeriodicalIF":0.0,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142650983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning applications for real-time and early detection of fall armyworm, African armyworm, and maize stem borer 深度学习在实时和早期检测秋虫、非洲象鼻虫和玉米螟中的应用
Pub Date : 2024-10-30 DOI: 10.1016/j.mlwa.2024.100596
Ivan Oyege , Harriet Sibitenda , Maruthi Sridhar Balaji Bhaskar
The application of artificial intelligence for identifying Fall armyworm (Spodoptera frugiperda), African armyworm (Spodoptera exempta), and Maize stem borer (Busseola fusca) is critical due to the threats they pose to global food production. This study aims to evaluate and identify the most accurate and robust DL models in detecting and classifying these three significant agricultural pests. Seven traditional DL models: Convolutional Neural Network, Visual Geometry Group (VGG16), Residual Networks (ResNet50), MobileNetV2, InceptionV3, Deep Neural Network (DNN), and InceptionResNetV2 and the advanced You Look Only Once (YOLOv8) model were trained and tested using pest image datasets. The results showed that all traditional models except DNN had high accuracies ranging from 93.17% (InceptionResNetV2) to 99.43% (MobileNet) in training and testing, with losses ranging from 1.71% (MobileNetV2) to 24.99% (InceptionResNetV2). DNN had a slightly lower accuracy range of 55.27% to 56.39% and a loss range of 85.02% to 89.96% in training and testing. YOLOv8 emerged as the best and most robust model in the pest detection and classification tasks, achieving Precision and Recall scores ranging from 98.4% to 100% on single-class and multi-class classifications, making it highly suitable for real-world pest management applications. This research pioneers the use of DL for the classification and detection of maize stem borer, African armyworm and Fall armyworm, unique and separately addressing a critical gap in agricultural pest management in corn. With early and accurate pest identification, crop protection measures can be implemented efficiently. The findings lead to reduced crop damage and enhanced food security.
人工智能在识别秋季军虫(Spodoptera frugiperda)、非洲军虫(Spodoptera exempta)和玉米螟(Busseola fusca)方面的应用至关重要,因为它们对全球粮食生产构成威胁。本研究旨在评估和确定在检测和分类这三种重要农业害虫方面最准确、最稳健的 DL 模型。七个传统的 DL 模型:使用害虫图像数据集对卷积神经网络、视觉几何组 (VGG16)、残差网络 (ResNet50)、MobileNetV2、InceptionV3、深度神经网络 (DNN) 和 InceptionResNetV2 以及先进的 You Look Only Once (YOLOv8) 模型进行了训练和测试。结果显示,除 DNN 外,所有传统模型在训练和测试中的准确率都很高,从 93.17%(InceptionResNetV2)到 99.43%(MobileNet)不等,损失率从 1.71%(MobileNetV2)到 24.99%(InceptionResNetV2)不等。DNN 的准确率略低,为 55.27% 至 56.39%,在训练和测试中的损失率为 85.02% 至 89.96%。在害虫检测和分类任务中,YOLOv8 成为最佳和最稳健的模型,在单类和多类分类中取得了 98.4% 到 100% 的精确度和召回率,非常适合实际害虫管理应用。这项研究开创性地将 DL 用于玉米螟、非洲大食心虫和秋季大食心虫的分类和检测,独特地分别解决了玉米农业害虫管理中的一个关键缺口。有了早期准确的害虫识别,就可以高效地实施作物保护措施。这些发现可减少作物损失,提高粮食安全。
{"title":"Deep learning applications for real-time and early detection of fall armyworm, African armyworm, and maize stem borer","authors":"Ivan Oyege ,&nbsp;Harriet Sibitenda ,&nbsp;Maruthi Sridhar Balaji Bhaskar","doi":"10.1016/j.mlwa.2024.100596","DOIUrl":"10.1016/j.mlwa.2024.100596","url":null,"abstract":"<div><div>The application of artificial intelligence for identifying Fall armyworm (<em>Spodoptera frugiperda</em>), African armyworm (<em>Spodoptera exempta</em>), and Maize stem borer (<em>Busseola fusca</em>) is critical due to the threats they pose to global food production. This study aims to evaluate and identify the most accurate and robust DL models in detecting and classifying these three significant agricultural pests. Seven traditional DL models: Convolutional Neural Network, Visual Geometry Group (VGG16), Residual Networks (ResNet50), MobileNetV2, InceptionV3, Deep Neural Network (DNN), and InceptionResNetV2 and the advanced You Look Only Once (YOLOv8) model were trained and tested using pest image datasets. The results showed that all traditional models except DNN had high accuracies ranging from 93.17% (InceptionResNetV2) to 99.43% (MobileNet) in training and testing, with losses ranging from 1.71% (MobileNetV2) to 24.99% (InceptionResNetV2). DNN had a slightly lower accuracy range of 55.27% to 56.39% and a loss range of 85.02% to 89.96% in training and testing. YOLOv8 emerged as the best and most robust model in the pest detection and classification tasks, achieving Precision and Recall scores ranging from 98.4% to 100% on single-class and multi-class classifications, making it highly suitable for real-world pest management applications. This research pioneers the use of DL for the classification and detection of maize stem borer, African armyworm and Fall armyworm, unique and separately addressing a critical gap in agricultural pest management in corn. With early and accurate pest identification, crop protection measures can be implemented efficiently. The findings lead to reduced crop damage and enhanced food security.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"18 ","pages":"Article 100596"},"PeriodicalIF":0.0,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142572647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing SMOTE for imbalanced data with abnormal minority instances 针对异常少数实例的不平衡数据增强 SMOTE
Pub Date : 2024-10-29 DOI: 10.1016/j.mlwa.2024.100597
Surani Matharaarachchi , Mike Domaratzki , Saman Muthukumarana
Imbalanced datasets are frequent in machine learning, where certain classes are markedly underrepresented compared to others. This imbalance often results in sub-optimal model performance, as classifiers tend to favour the majority class. A significant challenge arises when abnormal instances, such as outliers, exist within the minority class, diminishing the effectiveness of traditional re-sampling methods like the Synthetic Minority Over-sampling Technique (SMOTE). This manuscript addresses this critical issue by introducing four SMOTE extensions: Distance ExtSMOTE, Dirichlet ExtSMOTE, FCRP SMOTE, and BGMM SMOTE. These methods leverage a weighted average of neighbouring instances to enhance the quality of synthetic samples and mitigate the impact of outliers. Comprehensive experiments conducted on diverse simulated and real-world imbalanced datasets demonstrate that the proposed methods improve classification performance compared to the original SMOTE and its most competitive variants. Notably, we demonstrate that Dirichlet ExtSMOTE outperforms most other proposed and existing SMOTE variants in terms of achieving better F1 score, MCC, and PR-AUC. Our results underscore the effectiveness of these advanced SMOTE extensions in tackling class imbalance, particularly in the presence of abnormal instances, offering robust solutions for real-world applications.
不平衡数据集在机器学习中经常出现,与其他类别相比,某些类别的代表性明显不足。这种不平衡往往会导致模型性能达不到最优,因为分类器往往倾向于大多数类别。当少数类中存在异常实例(如离群值)时,就会出现一个重大挑战,从而降低传统再采样方法(如合成少数群体过度采样技术(SMOTE))的有效性。本手稿通过引入四种 SMOTE 扩展来解决这一关键问题:距离 ExtSMOTE、Dirichlet ExtSMOTE、FCRP SMOTE 和 BGMM SMOTE。这些方法利用相邻实例的加权平均来提高合成样本的质量,并减轻异常值的影响。在各种模拟和真实世界不平衡数据集上进行的综合实验表明,与原始 SMOTE 及其最具竞争力的变体相比,所提出的方法提高了分类性能。值得注意的是,我们证明了 Dirichlet ExtSMOTE 在获得更好的 F1 分数、MCC 和 PR-AUC 方面优于大多数其他提出的和现有的 SMOTE 变体。我们的研究结果凸显了这些先进的 SMOTE 扩展在解决类不平衡方面的有效性,尤其是在存在异常实例的情况下,为现实世界的应用提供了稳健的解决方案。
{"title":"Enhancing SMOTE for imbalanced data with abnormal minority instances","authors":"Surani Matharaarachchi ,&nbsp;Mike Domaratzki ,&nbsp;Saman Muthukumarana","doi":"10.1016/j.mlwa.2024.100597","DOIUrl":"10.1016/j.mlwa.2024.100597","url":null,"abstract":"<div><div>Imbalanced datasets are frequent in machine learning, where certain classes are markedly underrepresented compared to others. This imbalance often results in sub-optimal model performance, as classifiers tend to favour the majority class. A significant challenge arises when abnormal instances, such as outliers, exist within the minority class, diminishing the effectiveness of traditional re-sampling methods like the Synthetic Minority Over-sampling Technique (SMOTE). This manuscript addresses this critical issue by introducing four SMOTE extensions: Distance ExtSMOTE, Dirichlet ExtSMOTE, FCRP SMOTE, and BGMM SMOTE. These methods leverage a weighted average of neighbouring instances to enhance the quality of synthetic samples and mitigate the impact of outliers. Comprehensive experiments conducted on diverse simulated and real-world imbalanced datasets demonstrate that the proposed methods improve classification performance compared to the original SMOTE and its most competitive variants. Notably, we demonstrate that Dirichlet ExtSMOTE outperforms most other proposed and existing SMOTE variants in terms of achieving better F1 score, MCC, and PR-AUC. Our results underscore the effectiveness of these advanced SMOTE extensions in tackling class imbalance, particularly in the presence of abnormal instances, offering robust solutions for real-world applications.</div></div>","PeriodicalId":74093,"journal":{"name":"Machine learning with applications","volume":"18 ","pages":"Article 100597"},"PeriodicalIF":0.0,"publicationDate":"2024-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142578485","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Machine learning with applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1