Biased Hypothesis Formation From Projection Pursuit

John Patterson, Chris S. Avery, Tyler Grear, D. Jacobs
{"title":"Biased Hypothesis Formation From Projection Pursuit","authors":"John Patterson, Chris S. Avery, Tyler Grear, D. Jacobs","doi":"10.54364/aaiml.2021.1114","DOIUrl":null,"url":null,"abstract":"The effect of bias on hypothesis formation is characterized for an automated data-driven projection pursuit neural network to extract and select features for binary classification of data streams. This intelligent exploratory process partitions a complete vector state space into disjoint subspaces to create working hypotheses quantified by similarities and differences observed between two groups of labeled data streams. Data streams are typically time sequenced, and may exhibit complex spatio-temporal patterns. For example, given atomic trajectories from molecular dynamics simulation, the machine's task is to quantify dynamical mechanisms that promote function by comparing protein mutants, some known to function while others are nonfunctional. Utilizing synthetic two-dimensional molecules that mimic the dynamics of functional and nonfunctional proteins, biases are identified and controlled in both the machine learning model and selected training data under different contexts. The refinement of a working hypothesis converges to a statistically robust multivariate perception of the data based on a context-dependent perspective. Including diverse perspectives during data exploration enhances interpretability of the multivariate characterization of similarities and differences.","PeriodicalId":373878,"journal":{"name":"Adv. Artif. Intell. Mach. Learn.","volume":"193 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Adv. Artif. Intell. Mach. Learn.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54364/aaiml.2021.1114","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The effect of bias on hypothesis formation is characterized for an automated data-driven projection pursuit neural network to extract and select features for binary classification of data streams. This intelligent exploratory process partitions a complete vector state space into disjoint subspaces to create working hypotheses quantified by similarities and differences observed between two groups of labeled data streams. Data streams are typically time sequenced, and may exhibit complex spatio-temporal patterns. For example, given atomic trajectories from molecular dynamics simulation, the machine's task is to quantify dynamical mechanisms that promote function by comparing protein mutants, some known to function while others are nonfunctional. Utilizing synthetic two-dimensional molecules that mimic the dynamics of functional and nonfunctional proteins, biases are identified and controlled in both the machine learning model and selected training data under different contexts. The refinement of a working hypothesis converges to a statistically robust multivariate perception of the data based on a context-dependent perspective. Including diverse perspectives during data exploration enhances interpretability of the multivariate characterization of similarities and differences.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从投射追踪中形成有偏差的假设
利用数据驱动的自动投影寻踪神经网络对数据流进行特征提取和选择,研究了偏差对假设形成的影响。这种智能探索过程将一个完整的向量状态空间划分为不相交的子空间,通过观察两组标记数据流之间的相似性和差异性来创建工作假设。数据流通常是按时间顺序排列的,并且可能表现出复杂的时空模式。例如,给定分子动力学模拟的原子轨迹,该机器的任务是通过比较蛋白质突变体来量化促进功能的动力学机制,其中一些已知具有功能,而另一些则无功能。利用模拟功能和非功能蛋白质动态的合成二维分子,在机器学习模型和不同背景下选择的训练数据中识别和控制偏差。工作假设的细化收敛到基于上下文依赖视角的数据的统计稳健的多变量感知。在数据探索过程中包含不同的视角,增强了相似性和差异性的多元特征的可解释性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
FishRecGAN: An End to End GAN Based Network for Fisheye Rectification and Calibration Should ChatGPT and Bard Share Revenue with Their Data Providers? A New Business Model for the AI Era Structural Vibration Signal Denoising Using Stacking Ensemble of Hybrid CNN-RNN A Comparison of Methods for Neural Network Aggregation One-class Damage Detector Using Deeper Fully Convolutional Data Descriptions for Civil Application
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1