{"title":"CellAgent: An LLM-driven Multi-Agent Framework for Automated Single-cell Data Analysis","authors":"Yihang Xiao, Jinyi Liu, Yan Zheng, Xiaohan Xie, Jianye Hao, Mingzhi Li, Ruitao Wang, Fei Ni, Yuxiao Li, Jintian Luo, Shaoqing Jiao, Jiajie Peng","doi":"arxiv-2407.09811","DOIUrl":null,"url":null,"abstract":"Single-cell RNA sequencing (scRNA-seq) data analysis is crucial for\nbiological research, as it enables the precise characterization of cellular\nheterogeneity. However, manual manipulation of various tools to achieve desired\noutcomes can be labor-intensive for researchers. To address this, we introduce\nCellAgent (http://cell.agent4science.cn/), an LLM-driven multi-agent framework,\nspecifically designed for the automatic processing and execution of scRNA-seq\ndata analysis tasks, providing high-quality results with no human intervention.\nFirstly, to adapt general LLMs to the biological field, CellAgent constructs\nLLM-driven biological expert roles - planner, executor, and evaluator - each\nwith specific responsibilities. Then, CellAgent introduces a hierarchical\ndecision-making mechanism to coordinate these biological experts, effectively\ndriving the planning and step-by-step execution of complex data analysis tasks.\nFurthermore, we propose a self-iterative optimization mechanism, enabling\nCellAgent to autonomously evaluate and optimize solutions, thereby guaranteeing\noutput quality. We evaluate CellAgent on a comprehensive benchmark dataset\nencompassing dozens of tissues and hundreds of distinct cell types. Evaluation\nresults consistently show that CellAgent effectively identifies the most\nsuitable tools and hyperparameters for single-cell analysis tasks, achieving\noptimal performance. This automated framework dramatically reduces the workload\nfor science data analyses, bringing us into the \"Agent for Science\" era.","PeriodicalId":501070,"journal":{"name":"arXiv - QuanBio - Genomics","volume":"106 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.09811","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Single-cell RNA sequencing (scRNA-seq) data analysis is crucial for
biological research, as it enables the precise characterization of cellular
heterogeneity. However, manual manipulation of various tools to achieve desired
outcomes can be labor-intensive for researchers. To address this, we introduce
CellAgent (http://cell.agent4science.cn/), an LLM-driven multi-agent framework,
specifically designed for the automatic processing and execution of scRNA-seq
data analysis tasks, providing high-quality results with no human intervention.
Firstly, to adapt general LLMs to the biological field, CellAgent constructs
LLM-driven biological expert roles - planner, executor, and evaluator - each
with specific responsibilities. Then, CellAgent introduces a hierarchical
decision-making mechanism to coordinate these biological experts, effectively
driving the planning and step-by-step execution of complex data analysis tasks.
Furthermore, we propose a self-iterative optimization mechanism, enabling
CellAgent to autonomously evaluate and optimize solutions, thereby guaranteeing
output quality. We evaluate CellAgent on a comprehensive benchmark dataset
encompassing dozens of tissues and hundreds of distinct cell types. Evaluation
results consistently show that CellAgent effectively identifies the most
suitable tools and hyperparameters for single-cell analysis tasks, achieving
optimal performance. This automated framework dramatically reduces the workload
for science data analyses, bringing us into the "Agent for Science" era.