{"title":"COEFF-KANs: A Paradigm to Address the Electrolyte Field with KANs","authors":"Xinhe Li, Zhuoying Feng, Yezeng Chen, Weichen Dai, Zixu He, Yi Zhou, Shuhong Jiao","doi":"arxiv-2407.20265","DOIUrl":null,"url":null,"abstract":"To reduce the experimental validation workload for chemical researchers and\naccelerate the design and optimization of high-energy-density lithium metal\nbatteries, we aim to leverage models to automatically predict Coulombic\nEfficiency (CE) based on the composition of liquid electrolytes. There are\nmainly two representative paradigms in existing methods: machine learning and\ndeep learning. However, the former requires intelligent input feature selection\nand reliable computational methods, leading to error propagation from feature\nestimation to model prediction, while the latter (e.g. MultiModal-MoLFormer)\nfaces challenges of poor predictive performance and overfitting due to limited\ndiversity in augmented data. To tackle these issues, we propose a novel method\nCOEFF (COlumbic EFficiency prediction via Fine-tuned models), which consists of\ntwo stages: pre-training a chemical general model and fine-tuning on downstream\ndomain data. Firstly, we adopt the publicly available MoLFormer model to obtain\nfeature vectors for each solvent and salt in the electrolyte. Then, we perform\na weighted average of embeddings for each token across all molecules, with\nweights determined by the respective electrolyte component ratios. Finally, we\ninput the obtained electrolyte features into a Multi-layer Perceptron or\nKolmogorov-Arnold Network to predict CE. Experimental results on a real-world\ndataset demonstrate that our method achieves SOTA for predicting CE compared to\nall baselines. Data and code used in this work will be made publicly available\nafter the paper is published.","PeriodicalId":501309,"journal":{"name":"arXiv - CS - Computational Engineering, Finance, and Science","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computational Engineering, Finance, and Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.20265","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
To reduce the experimental validation workload for chemical researchers and
accelerate the design and optimization of high-energy-density lithium metal
batteries, we aim to leverage models to automatically predict Coulombic
Efficiency (CE) based on the composition of liquid electrolytes. There are
mainly two representative paradigms in existing methods: machine learning and
deep learning. However, the former requires intelligent input feature selection
and reliable computational methods, leading to error propagation from feature
estimation to model prediction, while the latter (e.g. MultiModal-MoLFormer)
faces challenges of poor predictive performance and overfitting due to limited
diversity in augmented data. To tackle these issues, we propose a novel method
COEFF (COlumbic EFficiency prediction via Fine-tuned models), which consists of
two stages: pre-training a chemical general model and fine-tuning on downstream
domain data. Firstly, we adopt the publicly available MoLFormer model to obtain
feature vectors for each solvent and salt in the electrolyte. Then, we perform
a weighted average of embeddings for each token across all molecules, with
weights determined by the respective electrolyte component ratios. Finally, we
input the obtained electrolyte features into a Multi-layer Perceptron or
Kolmogorov-Arnold Network to predict CE. Experimental results on a real-world
dataset demonstrate that our method achieves SOTA for predicting CE compared to
all baselines. Data and code used in this work will be made publicly available
after the paper is published.