Tree-Based Learning on Amperometric Time Series Data Demonstrates High Accuracy for Classification

Jeyashree Krishnan, Zeyu Lian, Pieter E. Oomen, Xiulan He, Soodabeh Majdi, Andreas Schuppert, Andrew Ewing
{"title":"Tree-Based Learning on Amperometric Time Series Data Demonstrates High Accuracy for Classification","authors":"Jeyashree Krishnan, Zeyu Lian, Pieter E. Oomen, Xiulan He, Soodabeh Majdi, Andreas Schuppert, Andrew Ewing","doi":"arxiv-2302.02650","DOIUrl":null,"url":null,"abstract":"Elucidating exocytosis processes provide insights into cellular\nneurotransmission mechanisms, and may have potential in neurodegenerative\ndiseases research. Amperometry is an established electrochemical method for the\ndetection of neurotransmitters released from and stored inside cells. An\nimportant aspect of the amperometry method is the sub-millisecond temporal\nresolution of the current recordings which leads to several hundreds of\ngigabytes of high-quality data. In this study, we present a universal method\nfor the classification with respect to diverse amperometric datasets using\ndata-driven approaches in computational science. We demonstrate a very high\nprediction accuracy (greater than or equal to 95%). This includes an end-to-end\nsystematic machine learning workflow for amperometric time series datasets\nconsisting of pre-processing; feature extraction; model identification;\ntraining and testing; followed by feature importance evaluation - all\nimplemented. We tested the method on heterogeneous amperometric time series\ndatasets generated using different experimental approaches, chemical\nstimulations, electrode types, and varying recording times. We identified a\ncertain overarching set of common features across these datasets which enables\naccurate predictions. Further, we showed that information relevant for the\nclassification of amperometric traces are neither in the spiky segments alone,\nnor can it be retrieved from just the temporal structure of spikes. In fact,\nthe transients between spikes and the trace baselines carry essential\ninformation for a successful classification, thereby strongly demonstrating\nthat an effective feature representation of amperometric time series requires\nthe full time series. To our knowledge, this is one of the first studies that\npropose a scheme for machine learning, and in particular, supervised learning\non full amperometry time series data.","PeriodicalId":501170,"journal":{"name":"arXiv - QuanBio - Subcellular Processes","volume":"82 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Subcellular Processes","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2302.02650","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Elucidating exocytosis processes provide insights into cellular neurotransmission mechanisms, and may have potential in neurodegenerative diseases research. Amperometry is an established electrochemical method for the detection of neurotransmitters released from and stored inside cells. An important aspect of the amperometry method is the sub-millisecond temporal resolution of the current recordings which leads to several hundreds of gigabytes of high-quality data. In this study, we present a universal method for the classification with respect to diverse amperometric datasets using data-driven approaches in computational science. We demonstrate a very high prediction accuracy (greater than or equal to 95%). This includes an end-to-end systematic machine learning workflow for amperometric time series datasets consisting of pre-processing; feature extraction; model identification; training and testing; followed by feature importance evaluation - all implemented. We tested the method on heterogeneous amperometric time series datasets generated using different experimental approaches, chemical stimulations, electrode types, and varying recording times. We identified a certain overarching set of common features across these datasets which enables accurate predictions. Further, we showed that information relevant for the classification of amperometric traces are neither in the spiky segments alone, nor can it be retrieved from just the temporal structure of spikes. In fact, the transients between spikes and the trace baselines carry essential information for a successful classification, thereby strongly demonstrating that an effective feature representation of amperometric time series requires the full time series. To our knowledge, this is one of the first studies that propose a scheme for machine learning, and in particular, supervised learning on full amperometry time series data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于树形学习的安培时间序列数据分类精度高
阐明胞吐过程提供了对细胞神经传递机制的见解,并可能在神经退行性疾病的研究中具有潜力。安培法是一种建立的电化学方法,用于检测从细胞释放和储存的神经递质。安培法的一个重要方面是当前记录的亚毫秒时间分辨率,这导致数百千兆字节的高质量数据。在这项研究中,我们提出了一种通用的方法来分类相对于不同的安培数据集使用数据驱动的方法在计算科学。我们证明了非常高的预测精度(大于或等于95%)。这包括一个端到端系统的机器学习工作流,用于安培时间序列数据集,包括预处理;特征提取;模型识别、培训和测试;其次是特征重要性评估- allimplemented。我们在使用不同实验方法、化学刺激、电极类型和不同记录时间生成的异构安培时间序列数据集上测试了该方法。我们确定了这些数据集的某些共同特征,这些特征可以进行准确的预测。此外,我们发现与安培痕迹分类相关的信息既不单独存在于尖峰片段中,也不能仅从尖峰的时间结构中检索。事实上,峰值和轨迹基线之间的瞬变为成功的分类提供了必要的信息,从而有力地证明了安培时间序列的有效特征表示需要完整的时间序列。据我们所知,这是第一个提出机器学习方案的研究之一,特别是在全安培时间序列数据上的监督学习。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Causal loop and Stock-Flow Modeling of Signal Transduction Pathways Transient contacts between filaments impart its elasticity to branched actin Ultra-rapid, Quantitative, Label-free Antibiotic Susceptibility Testing via Optically Detected Purine Metabolites Intracellular order formation through stepwise phase transitions A Novel Use of Pseudospectra in Mathematical Biology: Understanding HPA Axis Sensitivity
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1