首页 > 最新文献

Journal of Integrative Bioinformatics最新文献

英文 中文
Evaluating molecular representations in machine learning models for drug response prediction and interpretability. 评估用于药物反应预测和可解释性的机器学习模型中的分子表征。
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-08-26 eCollection Date: 2022-09-01 DOI: 10.1515/jib-2022-0006
Delora Baptista, João Correia, Bruno Pereira, Miguel Rocha

Machine learning (ML) is increasingly being used to guide drug discovery processes. When applying ML approaches to chemical datasets, molecular descriptors and fingerprints are typically used to represent compounds as numerical vectors. However, in recent years, end-to-end deep learning (DL) methods that can learn feature representations directly from line notations or molecular graphs have been proposed as alternatives to using precomputed features. This study set out to investigate which compound representation methods are the most suitable for drug sensitivity prediction in cancer cell lines. Twelve different representations were benchmarked on 5 compound screening datasets, using DeepMol, a new chemoinformatics package developed by our research group, to perform these analyses. The results of this study show that the predictive performance of end-to-end DL models is comparable to, and at times surpasses, that of models trained on molecular fingerprints, even when less training data is available. This study also found that combining several compound representation methods into an ensemble can improve performance. Finally, we show that a post hoc feature attribution method can boost the explainability of the DL models.

机器学习(ML)越来越多地用于指导药物发现过程。在将 ML 方法应用于化学数据集时,分子描述符和指纹通常用于将化合物表示为数字向量。然而,近年来,有人提出了端到端深度学习(DL)方法,这种方法可以直接从线条符号或分子图中学习特征表示,作为使用预计算特征的替代方法。本研究旨在调查哪种化合物表示方法最适合预测癌细胞系的药物敏感性。我们在 5 个化合物筛选数据集上对 12 种不同的表示方法进行了基准测试,并使用我们研究小组开发的新型化学信息学软件包 DeepMol 进行了这些分析。这项研究的结果表明,端到端 DL 模型的预测性能可与分子指纹训练的模型相媲美,有时甚至超过后者,即使在训练数据较少的情况下也是如此。这项研究还发现,将几种复合表示方法组合在一起可以提高性能。最后,我们展示了一种事后特征归因方法可以提高 DL 模型的可解释性。
{"title":"Evaluating molecular representations in machine learning models for drug response prediction and interpretability.","authors":"Delora Baptista, João Correia, Bruno Pereira, Miguel Rocha","doi":"10.1515/jib-2022-0006","DOIUrl":"10.1515/jib-2022-0006","url":null,"abstract":"<p><p>Machine learning (ML) is increasingly being used to guide drug discovery processes. When applying ML approaches to chemical datasets, molecular descriptors and fingerprints are typically used to represent compounds as numerical vectors. However, in recent years, end-to-end deep learning (DL) methods that can learn feature representations directly from line notations or molecular graphs have been proposed as alternatives to using precomputed features. This study set out to investigate which compound representation methods are the most suitable for drug sensitivity prediction in cancer cell lines. Twelve different representations were benchmarked on 5 compound screening datasets, using DeepMol, a new chemoinformatics package developed by our research group, to perform these analyses. The results of this study show that the predictive performance of end-to-end DL models is comparable to, and at times surpasses, that of models trained on molecular fingerprints, even when less training data is available. This study also found that combining several compound representation methods into an ensemble can improve performance. Finally, we show that a <i>post hoc</i> feature attribution method can boost the explainability of the DL models.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":"19 3","pages":""},"PeriodicalIF":1.5,"publicationDate":"2022-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9521826/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33438674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design X Bioinformatics: a community-driven initiative to connect bioinformatics and design. 设计X生物信息学:一个社区驱动的倡议,连接生物信息学和设计。
IF 1.9 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-07-22 DOI: 10.1515/jib-2022-0037
Björn Sommer, Daisuke Inoue, Marc Baaden

Bioinformatics applies computer science approaches to the analysis of biological data. It is widely known for its genomics-based analysis approaches that have supported, for example, the 1000 Genomes Project. In addition, bioinformatics relates to many other areas, such as analysis of microscopic images (e.g., organelle localization), molecular modelling (e.g., proteins, biological membranes), and visualization of biological networks (e.g., protein-protein interaction networks, metabolism). Design is a highly interdisciplinary field that incorporates aspects such as aesthetic, economic, functional, philosophical, and/or socio-political considerations into the creative process and is usually determined by context. While visualization plays a critical role in bioinformatics, as reflected in a number of conferences and workshops in the field, design in bioinformatics-related research contexts in particular is not as well studied. With this special issue in conjunction with an international workshop, we aim to bring together bioinformaticians from different fields with designers, design researchers, and medical and scientific illustrators to discuss future challenges in the context of bioinformatics and design.

生物信息学应用计算机科学方法来分析生物数据。它以其基于基因组学的分析方法而闻名,例如,支持了1000基因组计划。此外,生物信息学涉及许多其他领域,如微观图像分析(例如,细胞器定位),分子建模(例如,蛋白质,生物膜)和生物网络可视化(例如,蛋白质-蛋白质相互作用网络,代谢)。设计是一个高度跨学科的领域,它将美学、经济、功能、哲学和/或社会政治等方面的考虑融入到创作过程中,通常由背景决定。虽然可视化在生物信息学中起着至关重要的作用,正如该领域的一些会议和研讨会所反映的那样,但与生物信息学相关的研究环境中的设计尤其没有得到很好的研究。通过这期特刊与国际研讨会的结合,我们的目标是将来自不同领域的生物信息学家与设计师、设计研究人员、医学和科学插画家聚集在一起,讨论生物信息学和设计背景下的未来挑战。
{"title":"Design X Bioinformatics: a community-driven initiative to connect bioinformatics and design.","authors":"Björn Sommer,&nbsp;Daisuke Inoue,&nbsp;Marc Baaden","doi":"10.1515/jib-2022-0037","DOIUrl":"https://doi.org/10.1515/jib-2022-0037","url":null,"abstract":"<p><p>Bioinformatics applies computer science approaches to the analysis of biological data. It is widely known for its genomics-based analysis approaches that have supported, for example, the 1000 Genomes Project. In addition, bioinformatics relates to many other areas, such as analysis of microscopic images (e.g., organelle localization), molecular modelling (e.g., proteins, biological membranes), and visualization of biological networks (e.g., protein-protein interaction networks, metabolism). Design is a highly interdisciplinary field that incorporates aspects such as aesthetic, economic, functional, philosophical, and/or socio-political considerations into the creative process and is usually determined by context. While visualization plays a critical role in bioinformatics, as reflected in a number of conferences and workshops in the field, design in bioinformatics-related research contexts in particular is not as well studied. With this special issue in conjunction with an international workshop, we aim to bring together bioinformaticians from different fields with designers, design researchers, and medical and scientific illustrators to discuss future challenges in the context of bioinformatics and design.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":"19 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2022-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9377699/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40527885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A hybrid of Bees algorithm and regulatory on/off minimization for optimizing lactate and succinate production. 混合蜜蜂算法和调节开/关最小化优化乳酸和琥珀酸盐生产。
IF 1.9 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-07-19 eCollection Date: 2022-09-01 DOI: 10.1515/jib-2022-0003
Mohd Izzat Yong, Mohd Saberi Mohamad, Yee Wen Choon, Weng Howe Chan, Hasyiya Karimah Adli, Khairul Nizar Syazwan Wsw, Nooraini Yusoff, Muhammad Akmal Remli

Metabolic engineering has expanded in importance and employment in recent years and is now extensively applied particularly in the production of biomass from microbes. Metabolic network models have been employed extravagantly in computational processes developed to enhance metabolic production and suggest changes in organisms. The crucial issue has been the unrealistic flux distribution presented in prior work on rational modelling framework adopting Optknock and OptGene. In order to address the problem, a hybrid of Bees Algorithm and Regulatory On/Off Minimization (BAROOM) is used. By employing Escherichia coli as the model organism, the most excellent set of genes in E. coli that can be removed and advance the production of succinate can be decided. Evidences shows that BAROOM outperforms alternative strategies used to escalate in succinate production in model organisms like E. coli by selecting the best set of genes to be removed.

近年来,代谢工程的重要性和应用范围不断扩大,目前已广泛应用于微生物生物质的生产。代谢网络模型已被大量应用于计算过程中,这些计算过程旨在提高代谢产生并提示生物体的变化。关键问题是在之前的工作中,采用Optknock和OptGene的合理建模框架提出了不切实际的通量分布。为了解决这个问题,使用了蜜蜂算法和调节开/关最小化(BAROOM)的混合算法。利用大肠杆菌作为模式生物,可以确定大肠杆菌中最优秀的一组基因,可以去除并促进琥珀酸盐的生产。有证据表明,BAROOM通过选择要去除的最佳基因集,优于用于在大肠杆菌等模式生物中增加琥珀酸盐产量的替代策略。
{"title":"A hybrid of Bees algorithm and regulatory on/off minimization for optimizing lactate and succinate production.","authors":"Mohd Izzat Yong,&nbsp;Mohd Saberi Mohamad,&nbsp;Yee Wen Choon,&nbsp;Weng Howe Chan,&nbsp;Hasyiya Karimah Adli,&nbsp;Khairul Nizar Syazwan Wsw,&nbsp;Nooraini Yusoff,&nbsp;Muhammad Akmal Remli","doi":"10.1515/jib-2022-0003","DOIUrl":"https://doi.org/10.1515/jib-2022-0003","url":null,"abstract":"<p><p>Metabolic engineering has expanded in importance and employment in recent years and is now extensively applied particularly in the production of biomass from microbes. Metabolic network models have been employed extravagantly in computational processes developed to enhance metabolic production and suggest changes in organisms. The crucial issue has been the unrealistic flux distribution presented in prior work on rational modelling framework adopting Optknock and OptGene. In order to address the problem, a hybrid of Bees Algorithm and Regulatory On/Off Minimization (BAROOM) is used. By employing <i>Escherichia coli</i> as the model organism, the most excellent set of genes in <i>E. coli</i> that can be removed and advance the production of succinate can be decided. Evidences shows that BAROOM outperforms alternative strategies used to escalate in succinate production in model organisms like <i>E. coli</i> by selecting the best set of genes to be removed.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.9,"publicationDate":"2022-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9521821/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40518181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic curation of LTR retrotransposon libraries from plant genomes through machine learning. 通过机器学习从植物基因组中自动管理LTR反转录转座子文库。
IF 1.9 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-07-12 eCollection Date: 2022-09-01 DOI: 10.1515/jib-2021-0036
Simon Orozco-Arias, Mariana S Candamil-Cortes, Paula A Jaimes, Estiven Valencia-Castrillon, Reinel Tabares-Soto, Gustavo Isaza, Romain Guyot

Transposable elements are mobile sequences that can move and insert themselves into chromosomes, activating under internal or external stimuli, giving the organism the ability to adapt to the environment. Annotating transposable elements in genomic data is currently considered a crucial task to understand key aspects of organisms such as phenotype variability, species evolution, and genome size, among others. Because of the way they replicate, LTR retrotransposons are the most common transposable elements in plants, accounting in some cases for up to 80% of all DNA information. To annotate these elements, a reference library is usually created, a curation process is performed, eliminating TE fragments and false positives and then annotated in the genome using the homology method. However, the curation process can take weeks, requires extensive manual work and the execution of multiple time-consuming bioinformatics software. Here, we propose a machine learning-based approach to perform this process automatically on plant genomes, obtaining up to 91.18% F1-score. This approach was tested with four plant species, obtaining up to 93.6% F1-score (Oryza granulata) in only 22.61 s, where bioinformatics methods took approximately 6 h. This acceleration demonstrates that the ML-based approach is efficient and could be used in massive sequencing projects.

转座因子是一种可移动的序列,它可以移动并插入到染色体中,在内部或外部刺激下激活,使生物体具有适应环境的能力。在基因组数据中标注转座因子目前被认为是理解生物体关键方面的关键任务,如表型变异性、物种进化和基因组大小等。由于它们复制的方式,LTR逆转录转座子是植物中最常见的转座子,在某些情况下占所有DNA信息的80%。为了标注这些元素,通常创建一个参考文库,执行一个管理过程,消除TE片段和假阳性,然后使用同源性方法在基因组中进行标注。然而,管理过程可能需要数周时间,需要大量的手工工作和多个耗时的生物信息学软件的执行。在这里,我们提出了一种基于机器学习的方法来对植物基因组自动执行这一过程,获得高达91.18%的f1得分。该方法在4种植物中进行了测试,仅用22.61 s就获得了93.6%的f1分数(Oryza granulata),而生物信息学方法大约需要6小时。这表明基于ml的方法是有效的,可以用于大规模测序项目。
{"title":"Automatic curation of LTR retrotransposon libraries from plant genomes through machine learning.","authors":"Simon Orozco-Arias,&nbsp;Mariana S Candamil-Cortes,&nbsp;Paula A Jaimes,&nbsp;Estiven Valencia-Castrillon,&nbsp;Reinel Tabares-Soto,&nbsp;Gustavo Isaza,&nbsp;Romain Guyot","doi":"10.1515/jib-2021-0036","DOIUrl":"https://doi.org/10.1515/jib-2021-0036","url":null,"abstract":"<p><p>Transposable elements are mobile sequences that can move and insert themselves into chromosomes, activating under internal or external stimuli, giving the organism the ability to adapt to the environment. Annotating transposable elements in genomic data is currently considered a crucial task to understand key aspects of organisms such as phenotype variability, species evolution, and genome size, among others. Because of the way they replicate, LTR retrotransposons are the most common transposable elements in plants, accounting in some cases for up to 80% of all DNA information. To annotate these elements, a reference library is usually created, a curation process is performed, eliminating TE fragments and false positives and then annotated in the genome using the homology method. However, the curation process can take weeks, requires extensive manual work and the execution of multiple time-consuming bioinformatics software. Here, we propose a machine learning-based approach to perform this process automatically on plant genomes, obtaining up to 91.18% F1-score. This approach was tested with four plant species, obtaining up to 93.6% F1-score (<i>Oryza granulata</i>) in only 22.61 s, where bioinformatics methods took approximately 6 h. This acceleration demonstrates that the ML-based approach is efficient and could be used in massive sequencing projects.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.9,"publicationDate":"2022-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9521825/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40498603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Colors in the representation of biological structures. 表现生物结构的颜色
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-07-04 DOI: 10.1515/jib-2022-0021
Monica Zoppè

Among the many properties of proteins, sugars, nucleic acids, membranes and other cellular components, color is not present. At the same time, we humans have a natural ability of recognizing and appreciating colors, and use them generously, with the aim of both delivering information and pleasing the eyes. In this article, I suggest how we can conciliate these two situations, with the contribution of biologists, artists, and computer graphics and perception experts. The concept can be developed in a series of initiatives involving the community, including discussion sessions, technical challenges, experimental studies and outreach activities.

在蛋白质、糖类、核酸、细胞膜和其他细胞成分的众多特性中,颜色是不存在的。与此同时,我们人类天生就有识别和欣赏色彩的能力,并大量使用色彩,目的是传递信息和愉悦视觉。在本文中,我将结合生物学家、艺术家、计算机制图和感知专家的意见,提出如何调和这两种情况的建议。这一概念可以通过一系列社区参与的活动来发展,包括讨论会、技术挑战、实验研究和推广活动。
{"title":"Colors in the representation of biological structures.","authors":"Monica Zoppè","doi":"10.1515/jib-2022-0021","DOIUrl":"10.1515/jib-2022-0021","url":null,"abstract":"<p><p>Among the many properties of proteins, sugars, nucleic acids, membranes and other cellular components, color is not present. At the same time, we humans have a natural ability of recognizing and appreciating colors, and use them generously, with the aim of both delivering information and pleasing the eyes. In this article, I suggest how we can conciliate these two situations, with the contribution of biologists, artists, and computer graphics and perception experts. The concept can be developed in a series of initiatives involving the community, including discussion sessions, technical challenges, experimental studies and outreach activities.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":"19 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9377705/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40562034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design considerations for representing systems biology information with the Systems Biology Graphical Notation. 用系统生物学图形符号表示系统生物学信息的设计考虑。
IF 1.9 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-07-04 DOI: 10.1515/jib-2022-0024
Falk Schreiber, Tobias Czauderna

Visual representations are commonly used to explore, analyse, and communicate information and knowledge in systems biology and beyond. Such visualisations not only need to be accurate but should also be aesthetically pleasing and informative. Using the example of the Systems Biology Graphical Notation (SBGN) we will investigate design considerations for graphically presenting information from systems biology, in particular regarding the use of glyphs for types of information, the style of graph layout for network representation, and the concept of bricks for visual network creation.

在系统生物学和其他领域,视觉表示通常用于探索、分析和交流信息和知识。这样的可视化不仅需要准确,而且还应该美观和信息丰富。以系统生物学图形符号(SBGN)为例,我们将研究以图形方式呈现系统生物学信息的设计考虑因素,特别是关于信息类型的字形使用,网络表示的图形布局风格,以及用于视觉网络创建的砖块概念。
{"title":"Design considerations for representing systems biology information with the Systems Biology Graphical Notation.","authors":"Falk Schreiber,&nbsp;Tobias Czauderna","doi":"10.1515/jib-2022-0024","DOIUrl":"https://doi.org/10.1515/jib-2022-0024","url":null,"abstract":"<p><p>Visual representations are commonly used to explore, analyse, and communicate information and knowledge in systems biology and beyond. Such visualisations not only need to be accurate but should also be aesthetically pleasing and informative. Using the example of the Systems Biology Graphical Notation (SBGN) we will investigate design considerations for graphically presenting information from systems biology, in particular regarding the use of glyphs for types of information, the style of graph layout for network representation, and the concept of bricks for visual network creation.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":"19 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2022-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9377698/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40470351","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Design - a new way to look at old molecules. 设计——一种看待旧分子的新方法。
IF 1.9 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-07-01 DOI: 10.1515/jib-2022-0020
Davide Spalvieri, Anne-Marine Mauviel, Matthieu Lambert, Nicolas Férey, Sophie Sacquin-Mora, Matthieu Chavent, Marc Baaden

We discuss how design enriches molecular science, particularly structural biology and bioinformatics. We present two use cases, one in academic practice and the other to design for outreach. The first case targets the representation of ion channels and their dynamic properties. In the second, we document a transition process from a research environment to general-purpose designs. Several testimonials from practitioners are given. By describing the design process of abstracted shapes, exploded views of molecular structures, motion-averaged slices, 360-degree panoramic projections, and experiments with lit sphere shading, we document how designers help make scientific data accessible without betraying its meaning, and how a creative mind adds value over purely data-driven visualizations. A similar conclusion was drawn for public outreach, as we found that comic-book-style drawings are better suited for communicating science to a broad audience.

我们讨论设计如何丰富分子科学,特别是结构生物学和生物信息学。我们提出了两个用例,一个用于学术实践,另一个用于外展设计。第一种情况针对离子通道的表示及其动态特性。在第二部分,我们记录了从研究环境到通用设计的过渡过程。给出了一些从业人员的证言。通过描述抽象形状的设计过程,分子结构的爆炸视图,运动平均切片,360度全景投影和点亮球体阴影的实验,我们记录了设计师如何帮助使科学数据易于访问而不背叛其含义,以及创造性思维如何在纯数据驱动的可视化中增加价值。在公众宣传方面也得出了类似的结论,因为我们发现漫画风格的绘画更适合向广大受众传播科学。
{"title":"Design - a new way to look at old molecules.","authors":"Davide Spalvieri,&nbsp;Anne-Marine Mauviel,&nbsp;Matthieu Lambert,&nbsp;Nicolas Férey,&nbsp;Sophie Sacquin-Mora,&nbsp;Matthieu Chavent,&nbsp;Marc Baaden","doi":"10.1515/jib-2022-0020","DOIUrl":"https://doi.org/10.1515/jib-2022-0020","url":null,"abstract":"<p><p>We discuss how design enriches molecular science, particularly structural biology and bioinformatics. We present two use cases, one in academic practice and the other to design for outreach. The first case targets the representation of ion channels and their dynamic properties. In the second, we document a transition process from a research environment to general-purpose designs. Several testimonials from practitioners are given. By describing the design process of abstracted shapes, exploded views of molecular structures, motion-averaged slices, 360-degree panoramic projections, and experiments with lit sphere shading, we document how designers help make scientific data accessible without betraying its meaning, and how a creative mind adds value over purely data-driven visualizations. A similar conclusion was drawn for public outreach, as we found that comic-book-style drawings are better suited for communicating science to a broad audience.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":"19 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9377703/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40563696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Integrative illustration of a JCVI-syn3A minimal cell. JCVI-syn3A 最小细胞的综合图示。
IF 1.5 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-06-27 DOI: 10.1515/jib-2022-0013
David S Goodsell

Data from genomics, proteomics, structural biology and cryo-electron microscopy are integrated into a structural illustration of a cross section through an entire JCVI-syn3.0 minimal cell. The illustration is designed with several goals: to inspire excitement in science, to depict the underlying scientific results accurately, and to be feasible in traditional media. Design choices to achieve these goals include reduction of visual complexity with simplified representations, use of orthographic projection to retain scale relationships, and an approach to color that highlights functional compartments of the cell. Given that this simple cell provides an attractive laboratory for exploring the central processes needed for life, several functional narratives are included in the illustration, including division of the cell and the first depiction of an entire cellular proteome. The illustration lays the foundation for 3D molecular modeling of this cell.

来自基因组学、蛋白质组学、结构生物学和低温电子显微镜的数据被整合到整个 JCVI-syn3.0 最小细胞横截面的结构插图中。插图的设计有几个目标:激发对科学的兴趣,准确描述基本的科学成果,并在传统媒体中可行。为实现这些目标,我们在设计上作了如下选择:用简化的表现手法降低视觉复杂性,使用正投影法保留比例关系,以及用色彩突出细胞的功能区。鉴于这个简单的细胞为探索生命所需的核心过程提供了一个极具吸引力的实验室,插图中包含了几个功能性叙述,包括细胞的分裂和对整个细胞蛋白质组的首次描述。该插图为该细胞的三维分子建模奠定了基础。
{"title":"Integrative illustration of a JCVI-syn3A minimal cell.","authors":"David S Goodsell","doi":"10.1515/jib-2022-0013","DOIUrl":"10.1515/jib-2022-0013","url":null,"abstract":"<p><p>Data from genomics, proteomics, structural biology and cryo-electron microscopy are integrated into a structural illustration of a cross section through an entire JCVI-syn3.0 minimal cell. The illustration is designed with several goals: to inspire excitement in science, to depict the underlying scientific results accurately, and to be feasible in traditional media. Design choices to achieve these goals include reduction of visual complexity with simplified representations, use of orthographic projection to retain scale relationships, and an approach to color that highlights functional compartments of the cell. Given that this simple cell provides an attractive laboratory for exploring the central processes needed for life, several functional narratives are included in the illustration, including division of the cell and the first depiction of an entire cellular proteome. The illustration lays the foundation for 3D molecular modeling of this cell.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":"19 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2022-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9377704/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40395611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Considering best practices in color palettes for molecular visualizations. 考虑分子可视化调色板的最佳实践。
IF 1.9 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-06-22 DOI: 10.1515/jib-2022-0016
Laura Garrison, Stefan Bruckner

Biomedical illustration and visualization techniques provide a window into complex molecular worlds that are difficult to capture through experimental means alone. Biomedical illustrators frequently employ color to help tell a molecular story, e.g., to identify key molecules in a signaling pathway. Currently, color use for molecules is largely arbitrary and often chosen based on the client, cultural factors, or personal taste. The study of molecular dynamics is relatively young, and some stakeholders argue that color use guidelines would throttle the growth of the field. Instead, content authors have ample creative freedom to choose an aesthetic that, e.g., supports the story they want to tell. However, such creative freedom comes at a price. The color design process is challenging, particularly for those without a background in color theory. The result is a semantically inconsistent color space that reduces the interpretability and effectiveness of molecular visualizations as a whole. Our contribution in this paper is threefold. We first discuss some of the factors that contribute to this array of color palettes. Second, we provide a brief sampling of color palettes used in both industry and research sectors. Lastly, we suggest considerations for developing best practices around color palettes applied to molecular visualization.

生物医学插图和可视化技术提供了一个窗口,进入复杂的分子世界,很难通过单独的实验手段捕获。生物医学插画家经常使用颜色来帮助讲述分子故事,例如,识别信号通路中的关键分子。目前,分子的颜色使用在很大程度上是任意的,通常根据客户、文化因素或个人品味来选择。分子动力学的研究相对年轻,一些利益相关者认为颜色使用指南会阻碍该领域的发展。相反,内容作者有足够的创作自由来选择美学,例如,支持他们想要讲述的故事。然而,这种创作自由是有代价的。色彩设计过程是具有挑战性的,特别是对于那些没有色彩理论背景的人。其结果是一个语义上不一致的色彩空间,降低了整体分子可视化的可解释性和有效性。我们在这篇论文中的贡献有三个方面。我们首先讨论一些促成这种调色板阵列的因素。其次,我们提供了在工业和研究部门使用的调色板的简要抽样。最后,我们建议围绕调色板开发应用于分子可视化的最佳实践。
{"title":"Considering best practices in color palettes for molecular visualizations.","authors":"Laura Garrison,&nbsp;Stefan Bruckner","doi":"10.1515/jib-2022-0016","DOIUrl":"https://doi.org/10.1515/jib-2022-0016","url":null,"abstract":"<p><p>Biomedical illustration and visualization techniques provide a window into complex molecular worlds that are difficult to capture through experimental means alone. Biomedical illustrators frequently employ color to help tell a molecular story, e.g., to identify key molecules in a signaling pathway. Currently, color use for molecules is largely arbitrary and often chosen based on the client, cultural factors, or personal taste. The study of molecular dynamics is relatively young, and some stakeholders argue that color use guidelines would throttle the growth of the field. Instead, content authors have ample creative freedom to choose an aesthetic that, e.g., supports the story they want to tell. However, such creative freedom comes at a price. The color design process is challenging, particularly for those without a background in color theory. The result is a semantically inconsistent color space that reduces the interpretability and effectiveness of molecular visualizations as a whole. Our contribution in this paper is threefold. We first discuss some of the factors that contribute to this array of color palettes. Second, we provide a brief sampling of color palettes used in both industry and research sectors. Lastly, we suggest considerations for developing best practices around color palettes applied to molecular visualization.</p>","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":"19 2","pages":""},"PeriodicalIF":1.9,"publicationDate":"2022-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9377702/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"40192929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Integrative analysis of chemical properties and functions of drugs for adverse drug reaction prediction based on multi-label deep neural network 基于多标签深度神经网络的药物化学性质和功能综合分析用于药物不良反应预测
IF 1.9 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY Pub Date : 2022-05-19 DOI: 10.1515/jib-2022-0007
Pranab Das, Yogita, V. Pal
Abstract The prediction of adverse drug reactions (ADR) is an important step of drug discovery and design process. Different drug properties have been employed for ADR prediction but the prediction capability of drug properties and drug functions in integrated manner is yet to be explored. In the present work, a multi-label deep neural network and MLSMOTE based methodology has been proposed for ADR prediction. The proposed methodology has been applied on SMILES Strings data of drugs, 17 molecular descriptors data of drugs and drug functions data individually and in integrated manner for ADR prediction. The experimental results shows that the SMILES Strings + drug functions has outperformed other types of data with regards to ADR prediction capability.
摘要药物不良反应(ADR)的预测是药物发现和设计过程中的重要步骤。不同的药物性质已被用于ADR预测,但药物性质和药物功能的综合预测能力尚待探索。在本工作中,提出了一种基于多标签深度神经网络和MLSMOTE的ADR预测方法。该方法已分别应用于药物的SMILES字符串数据、药物的17个分子描述符数据和药物功能数据,并以集成的方式进行ADR预测。实验结果表明,SMILES Strings+药物功能在ADR预测能力方面优于其他类型的数据。
{"title":"Integrative analysis of chemical properties and functions of drugs for adverse drug reaction prediction based on multi-label deep neural network","authors":"Pranab Das, Yogita, V. Pal","doi":"10.1515/jib-2022-0007","DOIUrl":"https://doi.org/10.1515/jib-2022-0007","url":null,"abstract":"Abstract The prediction of adverse drug reactions (ADR) is an important step of drug discovery and design process. Different drug properties have been employed for ADR prediction but the prediction capability of drug properties and drug functions in integrated manner is yet to be explored. In the present work, a multi-label deep neural network and MLSMOTE based methodology has been proposed for ADR prediction. The proposed methodology has been applied on SMILES Strings data of drugs, 17 molecular descriptors data of drugs and drug functions data individually and in integrated manner for ADR prediction. The experimental results shows that the SMILES Strings + drug functions has outperformed other types of data with regards to ADR prediction capability.","PeriodicalId":53625,"journal":{"name":"Journal of Integrative Bioinformatics","volume":" ","pages":""},"PeriodicalIF":1.9,"publicationDate":"2022-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49603676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Journal of Integrative Bioinformatics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1