in silico Plants最新文献_第8页

Global sensitivity-based modelling approach to identify suitable Eucalyptus traits for adaptation to climate variability and change 基于全球敏感性的桉树适应气候变率和变化的建模方法

IF 3.1 Q1 AGRONOMY

in silico Plants

Pub Date : 2020-01-01 DOI: 10.1093/insilicoplants/diaa003

E. F. Elli, N. Huth, P. Sentelhas, R. Carneiro, C. Alvares

Eucalyptus-breeding efforts have been made to identify clones of superior performance for growth and yield and how they will interact with global climate changes. This study performs a global sensitivity analysis for assessing the impact of genetic traits on Eucalyptus yield across contrasting environments in Brazil under present and future climate scenarios. The APSIM Next Generation Eucalyptus model was used to perform the simulations of stemwood biomass (t ha−1) for 7-year rotations across 23 locations in Brazil. Projections for the period from 2020 to 2049 using three global circulation models under intermediate (RCP4.5) and high (RCP8.5) greenhouse gas emission scenarios were performed. The Morris sensitivity method was used to perform a global sensitivity analysis to identify the influence of plant traits on stemwood biomass. Traits for radiation use efficiency, leaf partitioning, canopy light capture and fine root partitioning were the most important, impacting the Eucalyptus yield substantially in all environments under the present climate. Some of the traits targeted now by breeders for current climate will remain important under future climates. However, breeding should place a greater emphasis on photosynthetic temperature response for Eucalyptus in some regions. Global sensitivity analysis was found to be a powerful tool for identifying suitable Eucalyptus traits for adaptation to climate variability and change. This approach can improve breeding strategies by better understanding the gene × environment interactions for forest productivity.

桉树育种工作已经进行，以确定生长和产量性能优越的克隆，以及它们将如何与全球气候变化相互作用。本研究对巴西在当前和未来气候情景下不同环境下遗传性状对桉树产量的影响进行了全球敏感性分析。利用APSIM下一代桉树模型对巴西23个地点的茎材生物量(t ha - 1)进行了7年轮作的模拟。利用中(RCP4.5)和高(RCP8.5)温室气体排放情景下的3种全球环流模式对2020 - 2049年进行了预估。采用Morris敏感性方法进行全局敏感性分析，以确定植物性状对茎材生物量的影响。在当前气候条件下，辐射利用效率、叶片分配、冠层光捕获和细根分配是影响桉树产量的最重要性状。育种者现在针对当前气候的一些性状在未来气候下仍然很重要。然而，在某些地区，桉树的育种应更加重视光合温度响应。全局敏感性分析是鉴别桉树适应气候变率和变化的有效工具。该方法可以通过更好地了解基因与环境相互作用对森林生产力的影响来改进育种策略。

{"title":"Global sensitivity-based modelling approach to identify suitable Eucalyptus traits for adaptation to climate variability and change","authors":"E. F. Elli, N. Huth, P. Sentelhas, R. Carneiro, C. Alvares","doi":"10.1093/insilicoplants/diaa003","DOIUrl":"https://doi.org/10.1093/insilicoplants/diaa003","url":null,"abstract":"\u0000 Eucalyptus-breeding efforts have been made to identify clones of superior performance for growth and yield and how they will interact with global climate changes. This study performs a global sensitivity analysis for assessing the impact of genetic traits on Eucalyptus yield across contrasting environments in Brazil under present and future climate scenarios. The APSIM Next Generation Eucalyptus model was used to perform the simulations of stemwood biomass (t ha−1) for 7-year rotations across 23 locations in Brazil. Projections for the period from 2020 to 2049 using three global circulation models under intermediate (RCP4.5) and high (RCP8.5) greenhouse gas emission scenarios were performed. The Morris sensitivity method was used to perform a global sensitivity analysis to identify the influence of plant traits on stemwood biomass. Traits for radiation use efficiency, leaf partitioning, canopy light capture and fine root partitioning were the most important, impacting the Eucalyptus yield substantially in all environments under the present climate. Some of the traits targeted now by breeders for current climate will remain important under future climates. However, breeding should place a greater emphasis on photosynthetic temperature response for Eucalyptus in some regions. Global sensitivity analysis was found to be a powerful tool for identifying suitable Eucalyptus traits for adaptation to climate variability and change. This approach can improve breeding strategies by better understanding the gene × environment interactions for forest productivity.","PeriodicalId":36138,"journal":{"name":"in silico Plants","volume":" ","pages":""},"PeriodicalIF":3.1,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/insilicoplants/diaa003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47467376","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

Within- and cross-species predictions of plant specialized metabolism genes using transfer learning. 利用迁移学习对植物特化代谢基因的种内和跨种预测。

IF 3.1 Q1 AGRONOMY

in silico Plants

Pub Date : 2020-01-01 Epub Date: 2020-07-30 DOI: 10.1093/insilicoplants/diaa005

Bethany M Moore, Peipei Wang, Pengxiang Fan, Aaron Lee, Bryan Leong, Yann-Ru Lou, Craig A Schenck, Koichi Sugimoto, Robert Last, Melissa D Lehti-Shiu, Cornelius S Barry, Shin-Han Shiu

Plant specialized metabolites mediate interactions between plants and the environment and have significant agronomical/pharmaceutical value. Most genes involved in specialized metabolism (SM) are unknown because of the large number of metabolites and the challenge in differentiating SM genes from general metabolism (GM) genes. Plant models like Arabidopsis thaliana have extensive, experimentally derived annotations, whereas many non-model species do not. Here we employed a machine learning strategy, transfer learning, where knowledge from A. thaliana is transferred to predict gene functions in cultivated tomato with fewer experimentally annotated genes. The first tomato SM/GM prediction model using only tomato data performs well (F-measure = 0.74, compared with 0.5 for random and 1.0 for perfect predictions), but from manually curating 88 SM/GM genes, we found many mis-predicted entries were likely mis-annotated. When the SM/GM prediction models built with A. thaliana data were used to filter out genes where the A. thaliana-based model predictions disagreed with tomato annotations, the new tomato model trained with filtered data improved significantly (F-measure = 0.92). Our study demonstrates that SM/GM genes can be better predicted by leveraging cross-species information. Additionally, our findings provide an example for transfer learning in genomics where knowledge can be transferred from an information-rich species to an information-poor one.

植物特化代谢物介导植物与环境之间的相互作用，具有重要的农学/药学价值。由于代谢产物数量众多，而且很难从一般代谢(GM)基因中区分出特殊代谢(SM)基因，因此大多数参与特殊代谢(SM)的基因都是未知的。拟南芥等植物模型具有广泛的实验推导的注释，而许多非模式物种则没有。在这里，我们采用了一种机器学习策略，即迁移学习，将拟南芥的知识转移到具有较少实验注释基因的栽培番茄中来预测基因功能。仅使用番茄数据的第一个番茄SM/GM预测模型表现良好(F-measure = 0.74，而随机预测为0.5，完美预测为1.0)，但从手动管理的88个SM/GM基因中，我们发现许多错误预测的条目可能是错误注释。利用拟沙拟兰数据建立的SM/GM预测模型，过滤掉拟沙拟兰模型预测与番茄注释不一致的基因，过滤后的新番茄模型得到显著改善(F-measure = 0.92)。我们的研究表明，利用跨物种信息可以更好地预测SM/GM基因。此外，我们的发现为基因组学中的迁移学习提供了一个例子，其中知识可以从信息丰富的物种转移到信息贫乏的物种。

{"title":"Within- and cross-species predictions of plant specialized metabolism genes using transfer learning.","authors":"Bethany M Moore, Peipei Wang, Pengxiang Fan, Aaron Lee, Bryan Leong, Yann-Ru Lou, Craig A Schenck, Koichi Sugimoto, Robert Last, Melissa D Lehti-Shiu, Cornelius S Barry, Shin-Han Shiu","doi":"10.1093/insilicoplants/diaa005","DOIUrl":"https://doi.org/10.1093/insilicoplants/diaa005","url":null,"abstract":"Plant specialized metabolites mediate interactions between plants and the environment and have significant agronomical/pharmaceutical value. Most genes involved in specialized metabolism (SM) are unknown because of the large number of metabolites and the challenge in differentiating SM genes from general metabolism (GM) genes. Plant models like Arabidopsis thaliana have extensive, experimentally derived annotations, whereas many non-model species do not. Here we employed a machine learning strategy, transfer learning, where knowledge from A. thaliana is transferred to predict gene functions in cultivated tomato with fewer experimentally annotated genes. The first tomato SM/GM prediction model using only tomato data performs well (F-measure = 0.74, compared with 0.5 for random and 1.0 for perfect predictions), but from manually curating 88 SM/GM genes, we found many mis-predicted entries were likely mis-annotated. When the SM/GM prediction models built with A. thaliana data were used to filter out genes where the A. thaliana-based model predictions disagreed with tomato annotations, the new tomato model trained with filtered data improved significantly (F-measure = 0.92). Our study demonstrates that SM/GM genes can be better predicted by leveraging cross-species information. Additionally, our findings provide an example for transfer learning in genomics where knowledge can be transferred from an information-rich species to an information-poor one.","PeriodicalId":36138,"journal":{"name":"in silico Plants","volume":"2 1","pages":"diaa005"},"PeriodicalIF":3.1,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/insilicoplants/diaa005","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"38738446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Within- and cross-species predictions of plant specialized metabolism genes using transfer learning 利用迁移学习预测植物特殊代谢基因的种内和跨种预测

IF 3.1 Q1 AGRONOMY

in silico Plants

Pub Date : 2020-01-01 DOI: 10.1101/2020.01.13.112102

Bethany M. Moore, Peipei Wang, P. Fan, Aaron Lee, Bryan J. Leong, Y. Lou, Craig A. Schenck, K. Sugimoto, R. Last, Melissa D. Lehti-Shiu, Cornelius S. Barry, Shin-Han Shiu

Plant specialized metabolites mediate interactions between plants and the environment and have significant agronomical/pharmaceutical value. Most genes involved in specialized metabolism (SM) are unknown because of the large number of metabolites and the challenge in differentiating SM genes from general metabolism (GM) genes. Plant models like Arabidopsis thaliana have extensive, experimentally derived annotations, whereas many non-model species do not. Here we employed a machine learning strategy, transfer learning, where knowledge from A. thaliana is transferred to predict gene functions in cultivated tomato with fewer experimentally annotated genes. The first tomato SM/GM prediction model using only tomato data performs well (F-measure=0.74, compared with 0.5 for random and 1.0 for perfect predictions), but from manually curating 88 SM/GM genes, we found many mis-predicted entries were likely mis-annotated. When the SM/GM prediction models built with A. thaliana data were used to filter out genes where the A. thaliana-based model predictions disagreed with tomato annotations, the new tomato model trained with filtered data improved significantly (F-measure=0.92). Our study demonstrates that SM/GM genes can be better predicted by leveraging cross-species information. Additionally, our findings provide an example for transfer learning in genomics where knowledge can be transferred from an information-rich species to an information-poor one.

植物专用代谢产物介导植物与环境之间的相互作用，具有重要的农业/药用价值。大多数参与专门代谢（SM）的基因都是未知的，因为有大量的代谢产物，并且在区分SM基因和一般代谢（GM）基因方面存在挑战。像拟南芥这样的植物模型有广泛的实验来源的注释，而许多非模型物种没有。在这里，我们采用了一种机器学习策略，即转移学习，将拟南芥的知识转移到具有较少实验注释基因的栽培番茄中，以预测其基因功能。第一个仅使用番茄数据的番茄SM/GM预测模型表现良好（F-measure=0.74，而随机预测为0.5，完美预测为1.0），但通过手动管理88个SM/GM基因，我们发现许多错误预测的条目可能被错误注释。当使用拟南芥数据构建的SM/GM预测模型来筛选出基于拟南芥的模型预测与番茄注释不一致的基因时，使用过滤数据训练的新番茄模型显著改进（F-measure=0.92）。我们的研究表明，利用跨物种信息可以更好地预测SM/GM基因。此外，我们的发现为基因组学中的迁移学习提供了一个例子，即知识可以从信息丰富的物种转移到信息贫乏的物种。

{"title":"Within- and cross-species predictions of plant specialized metabolism genes using transfer learning","authors":"Bethany M. Moore, Peipei Wang, P. Fan, Aaron Lee, Bryan J. Leong, Y. Lou, Craig A. Schenck, K. Sugimoto, R. Last, Melissa D. Lehti-Shiu, Cornelius S. Barry, Shin-Han Shiu","doi":"10.1101/2020.01.13.112102","DOIUrl":"https://doi.org/10.1101/2020.01.13.112102","url":null,"abstract":"Plant specialized metabolites mediate interactions between plants and the environment and have significant agronomical/pharmaceutical value. Most genes involved in specialized metabolism (SM) are unknown because of the large number of metabolites and the challenge in differentiating SM genes from general metabolism (GM) genes. Plant models like Arabidopsis thaliana have extensive, experimentally derived annotations, whereas many non-model species do not. Here we employed a machine learning strategy, transfer learning, where knowledge from A. thaliana is transferred to predict gene functions in cultivated tomato with fewer experimentally annotated genes. The first tomato SM/GM prediction model using only tomato data performs well (F-measure=0.74, compared with 0.5 for random and 1.0 for perfect predictions), but from manually curating 88 SM/GM genes, we found many mis-predicted entries were likely mis-annotated. When the SM/GM prediction models built with A. thaliana data were used to filter out genes where the A. thaliana-based model predictions disagreed with tomato annotations, the new tomato model trained with filtered data improved significantly (F-measure=0.92). Our study demonstrates that SM/GM genes can be better predicted by leveraging cross-species information. Additionally, our findings provide an example for transfer learning in genomics where knowledge can be transferred from an information-rich species to an information-poor one.","PeriodicalId":36138,"journal":{"name":"in silico Plants","volume":" ","pages":""},"PeriodicalIF":3.1,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48756679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

A new tool for discovering transcriptional regulators of co-expressed genes predicts gene regulatory networks that mediate ethylene-controlled root development 一种发现共表达基因转录调控因子的新工具预测了介导乙烯控制的根系发育的基因调控网络

IF 3.1 Q1 AGRONOMY

in silico Plants

Pub Date : 2020-01-01 DOI: 10.1093/insilicoplants/diaa006

Alexandria F. Harkey, Kira N Sims, G. Muday

Gene regulatory networks (GRNs) are defined by a cascade of transcriptional events by which signals, such as hormones or environmental cues, change development. To understand these networks, it is necessary to link specific transcription factors (TFs) to the downstream gene targets whose expression they regulate. Although multiple methods provide information on the targets of a single TF, moving from groups of co-expressed genes to the TF that controls them is more difficult. To facilitate this bottom-up approach, we have developed a web application named TF DEACoN. This application uses a publicly available Arabidopsis thaliana DNA Affinity Purification (DAP-Seq) data set to search for TFs that show enriched binding to groups of co-regulated genes. We used TF DEACoN to examine groups of transcripts regulated by treatment with the ethylene precursor 1-aminocyclopropane-1-carboxylic acid (ACC), using a transcriptional data set performed with high temporal resolution. We demonstrate the utility of this application when co-regulated genes are divided by timing of response or cell-type-specific information, which provides more information on TF/target relationships than when less defined and larger groups of co-regulated genes are used. This approach predicted TFs that may participate in ethylene-modulated root development including the TF NAM (NO APICAL MERISTEM). We used a genetic approach to show that a mutation in NAM reduces the negative regulation of lateral root development by ACC. The combination of filtering and TF DEACoN used here can be applied to any group of co-regulated genes to predict GRNs that control coordinated transcriptional responses.

基因调控网络（GRNs）是由一系列转录事件定义的，通过这些事件，激素或环境线索等信号会改变发育。为了理解这些网络，有必要将特定的转录因子（TF）与它们调节表达的下游基因靶点联系起来。尽管多种方法提供了关于单个TF靶点的信息，但从共表达基因组转移到控制它们的TF更为困难。为了促进这种自下而上的方法，我们开发了一个名为TFDEACON的web应用程序。该应用程序使用公开可用的拟南芥DNA亲和纯化（DAP-Seq）数据集来搜索显示与共调节基因组富集结合的转录因子。我们使用TF DEACoN来检测由乙烯前体1-氨基环丙烷-1-羧酸（ACC）处理调节的转录物组，使用高时间分辨率的转录数据集。当共调节基因按反应时间或细胞类型特异性信息划分时，我们证明了这种应用的实用性，这比使用定义较少和较大的共调节基因组时提供了更多关于TF/靶标关系的信息。该方法预测了可能参与乙烯调节根系发育的TF，包括TF NAM（无APICAL MERISTEM）。我们使用遗传学方法表明，NAM的突变减少了ACC对侧根发育的负调控。这里使用的过滤和TF DEACoN的组合可以应用于任何一组共调控基因，以预测控制协调转录反应的GRN。

{"title":"A new tool for discovering transcriptional regulators of co-expressed genes predicts gene regulatory networks that mediate ethylene-controlled root development","authors":"Alexandria F. Harkey, Kira N Sims, G. Muday","doi":"10.1093/insilicoplants/diaa006","DOIUrl":"https://doi.org/10.1093/insilicoplants/diaa006","url":null,"abstract":"\u0000 Gene regulatory networks (GRNs) are defined by a cascade of transcriptional events by which signals, such as hormones or environmental cues, change development. To understand these networks, it is necessary to link specific transcription factors (TFs) to the downstream gene targets whose expression they regulate. Although multiple methods provide information on the targets of a single TF, moving from groups of co-expressed genes to the TF that controls them is more difficult. To facilitate this bottom-up approach, we have developed a web application named TF DEACoN. This application uses a publicly available Arabidopsis thaliana DNA Affinity Purification (DAP-Seq) data set to search for TFs that show enriched binding to groups of co-regulated genes. We used TF DEACoN to examine groups of transcripts regulated by treatment with the ethylene precursor 1-aminocyclopropane-1-carboxylic acid (ACC), using a transcriptional data set performed with high temporal resolution. We demonstrate the utility of this application when co-regulated genes are divided by timing of response or cell-type-specific information, which provides more information on TF/target relationships than when less defined and larger groups of co-regulated genes are used. This approach predicted TFs that may participate in ethylene-modulated root development including the TF NAM (NO APICAL MERISTEM). We used a genetic approach to show that a mutation in NAM reduces the negative regulation of lateral root development by ACC. The combination of filtering and TF DEACoN used here can be applied to any group of co-regulated genes to predict GRNs that control coordinated transcriptional responses.","PeriodicalId":36138,"journal":{"name":"in silico Plants","volume":" ","pages":""},"PeriodicalIF":3.1,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/insilicoplants/diaa006","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49176515","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Reuse of process-based models: automatic transformation into many programming languages and simulation platforms 基于过程的模型的重用：自动转换为许多编程语言和模拟平台

IF 3.1 Q1 AGRONOMY

in silico Plants

Pub Date : 2020-01-01 DOI: 10.1093/insilicoplants/diaa007

Cyrille Ahmed Midingoyi, C. Pradal, I. Athanasiadis, M. Donatelli, Andreas Enders, D. Fumagalli, Frédérick Garçia, D. Holzworth, G. Hoogenboom, C. Porter, H. Raynal, P. Thorburn, P. Martre

The diversity of plant and crop process-based modelling platforms in terms of implementation language, software design and architectural constraints limits the reusability of the model components outside the platform in which they were originally developed, making model reuse a persistent issue. To facilitate the intercomparison and improvement of process-based models and the exchange of model components, several groups in the field joined to create the Agricultural Model Exchange Initiative (AMEI). Agricultural Model Exchange Initiative proposes a centralized framework for exchanging and reusing model components. It provides a modular and declarative approach to describe the specification of unit models and their composition. A model algorithm is associated with each model specification, which implements its mathematical behaviour. This paper focuses on the expression of the model algorithm independently of the platform specificities, and how the model algorithm can be seamlessly integrated into different platforms. We define CyML, a Cython-derived language with minimum specifications to implement model component algorithms. We also propose CyMLT, an extensible source-to-source transformation system that transforms CyML source code into different target languages such as Fortran, C#, C++, Java and Python, and into different programming paradigms. CyMLT is also able to generate model components to target modelling platforms such as DSSAT, BioMA, Record, SIMPLACE and OpenAlea. We demonstrate our reuse approach with a simple unit model and the capacity to extend CyMLT with other languages and platforms. The approach we present here will help to improve the reproducibility, exchange and reuse of process-based models.

基于植物和作物过程的建模平台在实现语言、软件设计和体系结构约束方面的多样性限制了模型组件在最初开发的平台之外的可重用性，使模型重用成为一个持久的问题。为了促进基于过程的模型的相互比较和改进以及模型组件的交换，该领域的几个小组联合起来创建了农业模型交换倡议（AMEI）。农业模型交换倡议提出了一个用于交换和重用模型组件的集中式框架。它提供了一种模块化和声明性的方法来描述单元模型的规范及其组成。模型算法与每个模型规范相关联，实现其数学行为。本文的重点是独立于平台特性的模型算法的表达，以及如何将模型算法无缝集成到不同的平台中。我们定义了CyML，这是一种Cython派生的语言，具有实现模型组件算法的最低规范。我们还提出了CyMLT，这是一个可扩展的源代码到源代码转换系统，可以将CyML源代码转换为不同的目标语言，如Fortran、C#、C++、Java和Python，以及不同的编程范式。CyMLT还能够为DSSAT、BioMA、Record、SIMPLACE和OpenAlea等建模平台生成模型组件。我们用一个简单的单元模型展示了我们的重用方法，以及用其他语言和平台扩展CyMLT的能力。我们在这里提出的方法将有助于提高基于过程的模型的再现性、交换和重用。

{"title":"Reuse of process-based models: automatic transformation into many programming languages and simulation platforms","authors":"Cyrille Ahmed Midingoyi, C. Pradal, I. Athanasiadis, M. Donatelli, Andreas Enders, D. Fumagalli, Frédérick Garçia, D. Holzworth, G. Hoogenboom, C. Porter, H. Raynal, P. Thorburn, P. Martre","doi":"10.1093/insilicoplants/diaa007","DOIUrl":"https://doi.org/10.1093/insilicoplants/diaa007","url":null,"abstract":"\u0000 The diversity of plant and crop process-based modelling platforms in terms of implementation language, software design and architectural constraints limits the reusability of the model components outside the platform in which they were originally developed, making model reuse a persistent issue. To facilitate the intercomparison and improvement of process-based models and the exchange of model components, several groups in the field joined to create the Agricultural Model Exchange Initiative (AMEI). Agricultural Model Exchange Initiative proposes a centralized framework for exchanging and reusing model components. It provides a modular and declarative approach to describe the specification of unit models and their composition. A model algorithm is associated with each model specification, which implements its mathematical behaviour. This paper focuses on the expression of the model algorithm independently of the platform specificities, and how the model algorithm can be seamlessly integrated into different platforms. We define CyML, a Cython-derived language with minimum specifications to implement model component algorithms. We also propose CyMLT, an extensible source-to-source transformation system that transforms CyML source code into different target languages such as Fortran, C#, C++, Java and Python, and into different programming paradigms. CyMLT is also able to generate model components to target modelling platforms such as DSSAT, BioMA, Record, SIMPLACE and OpenAlea. We demonstrate our reuse approach with a simple unit model and the capacity to extend CyMLT with other languages and platforms. The approach we present here will help to improve the reproducibility, exchange and reuse of process-based models.","PeriodicalId":36138,"journal":{"name":"in silico Plants","volume":" ","pages":""},"PeriodicalIF":3.1,"publicationDate":"2020-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/insilicoplants/diaa007","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44940072","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

Gillespie-Lindenmayer systems for stochastic simulation of morphogenesis 形态发生随机模拟的Gillespie-Lindenmayer系统

IF 3.1 Q1 AGRONOMY

in silico Plants

Pub Date : 2019-01-01 DOI: 10.1093/insilicoplants/diz009

M. Cieslak, P. Prusinkiewicz

Lindenmayer systems (L-systems) provide a useful framework for modelling the development of multicellular structures and organisms. The parametric extension of L-systems allows for incorporating molecular-level processes into the models. Until now, the dynamics of these processes has been expressed using differential equations, implying continuously valued concentrations of the substances involved. This assumption is not satisfied, however, when the numbers of molecules are small. A further extension that accounts for the stochastic effects arising in this case is thus needed. We integrate L-systems and the Gillespie’s Stochastic Simulation Algorithm to simulate stochastic processes in fixed and developing linear structures. We illustrate the resulting formalism with stochastic implementations of diffusion-decay, reaction-diffusion and auxin-transport-driven morphogenetic processes. Our method and software can be used to simulate molecular and higher-level spatially explicit stochastic processes in static and developing structures, and study their behaviour in the presence of stochastic perturbations.

林登迈尔系统(l -系统)为多细胞结构和生物体的发育建模提供了一个有用的框架。l系统的参数扩展允许将分子水平过程纳入模型。到目前为止，这些过程的动力学都是用微分方程来表示的，这意味着所涉及的物质的浓度是连续的。然而，当分子数量很小时，这个假设就不成立了。因此，需要进一步的扩展，以解释在这种情况下产生的随机效应。我们整合了l系统和Gillespie随机模拟算法来模拟固定和发展中的线性结构中的随机过程。我们用扩散-衰变、反应-扩散和生长素运输驱动的形态发生过程的随机实现来说明由此产生的形式主义。我们的方法和软件可用于模拟静态和发展结构中的分子和更高水平的空间显式随机过程，并研究它们在随机扰动存在下的行为。

引用次数: 4

Making our plant modelling community more than the sum of its parts: a personal perspective 使我们的植物建模社区不仅仅是各部分的总和:个人视角

IF 3.1 Q1 AGRONOMY

in silico Plants

Pub Date : 2019-01-01 DOI: 10.1093/INSILICOPLANTS/DIY002

S. Long

The rise of mathematical modelling represents a transition in any scientific area towards quantitative and unequivocal presentation of hypotheses and theory. In Physics, phenomena are now predicted from mathematical models and computer simulations of those models well before observations are made to confirm these predictions. Indeed the largest endeavours in Physics, such as particle physics accelerators and astrophysics observational platforms, are created to test the most profound predictions of such models. Given the huge complexity of living organisms coupled with massive species and even phenotype within genotype diversity, we are far from the same level of advancement, yet need to approach it. Complexity of our systems means that many biological modelling efforts will remain, largely, based on emergent properties and phenomena. Nevertheless, complete models of the full complexity of single-celled organisms are beginning to transition to eukaryotes (Beard et al. 2012; Service 2016). Within narrow areas of gene function, we are already seeing successful projections from gene expression to prediction of growth and development of whole plants (Chew et al. 2014). Modelling provides a framework in which we can precisely organize and test our quantitative knowledge and hypotheses about how a plant process or combination of processes works and then test these against reality. As such, it provides a data-hypothesis-test-learn cycle to improve our understanding of plants and their use. Equally, the rapid growth of high-throughput ‘omics facilities is delivering ever-increasing amounts of data for which our capacity and ability to analyse and interpret lags. Mathematical models coupled with high-performance computing provide a means to deliver this needed acceleration. Simultaneously it should provide the means to predict which data is needed most, so providing feedback and focus for ‘omics approaches. This wealth of data also provides unprecedented opportunities for improving the precision of models by high-speed data to model linkage. Similarly, numerical and text mining knowledge discovery offer much to improving mathematical modelling of plant processes, with opportunities for automated improvement of representation and parameterization (Fer et al. 2018). In parallel, computer simulation of mathematical models has evolved from printouts of numbers to 3D representations of the growth and development of organs, whole plants and even communities of plants that can be indistinguishable from the real thing (Fig. 1). This facilitates identification of emergent phenomena while providing unprecedented opportunities in revolutionizing plant science education (Prusinkiewicz et al. 2007; Prusinkiewicz and Runions 2012; Runions et al. 2017).

数学建模的兴起代表着任何科学领域向定量和明确地提出假设和理论的转变。在物理学中，在进行观测以证实这些预测之前，就已经通过数学模型和这些模型的计算机模拟来预测现象了。事实上，物理学中最大的努力，如粒子物理加速器和天体物理学观测平台，都是为了测试这些模型最深刻的预测。考虑到活生物体的巨大复杂性，加上基因型多样性中的大量物种甚至表型，我们还远远没有达到同样的进步水平，但需要接近它。我们系统的复杂性意味着，许多生物建模工作将在很大程度上仍然基于新出现的特性和现象。然而，单细胞生物完全复杂性的完整模型正开始向真核生物过渡（Beard等人，2012；Service 2016）。在基因功能的狭窄领域内，我们已经看到了从基因表达到整个植物生长发育预测的成功预测（Chew等人，2014）。建模提供了一个框架，在这个框架中，我们可以精确地组织和测试我们关于植物过程或过程组合如何工作的定量知识和假设，然后对照现实进行测试。因此，它提供了一个数据假设检验学习周期，以提高我们对植物及其使用的理解。同样，高通量组学设施的快速增长正在提供越来越多的数据，而我们分析和解释这些数据的能力和能力滞后。数学模型与高性能计算相结合，提供了一种提供所需加速度的方法。同时，它应该提供预测最需要哪些数据的方法，从而为“组学方法”提供反馈和关注。这种丰富的数据也为通过高速数据到模型的链接来提高模型的精度提供了前所未有的机会。同样，数值和文本挖掘知识发现为改进工厂过程的数学建模提供了很大的帮助，并为自动化改进表示和参数化提供了机会（Fer等人，2018）。与此同时，数学模型的计算机模拟已经从数字的打印输出发展到器官、整个植物甚至植物群落的生长和发育的3D表示，这些都与真实事物无法区分（图1）。这有助于识别新兴现象，同时为植物科学教育的革命性发展提供了前所未有的机会（Prusinkiewicz等人，2007；Prusinkievicz和Runions 2012；Runions等人2017）。

{"title":"Making our plant modelling community more than the sum of its parts: a personal perspective","authors":"S. Long","doi":"10.1093/INSILICOPLANTS/DIY002","DOIUrl":"https://doi.org/10.1093/INSILICOPLANTS/DIY002","url":null,"abstract":"The rise of mathematical modelling represents a transition in any scientific area towards quantitative and unequivocal presentation of hypotheses and theory. In Physics, phenomena are now predicted from mathematical models and computer simulations of those models well before observations are made to confirm these predictions. Indeed the largest endeavours in Physics, such as particle physics accelerators and astrophysics observational platforms, are created to test the most profound predictions of such models. Given the huge complexity of living organisms coupled with massive species and even phenotype within genotype diversity, we are far from the same level of advancement, yet need to approach it. Complexity of our systems means that many biological modelling efforts will remain, largely, based on emergent properties and phenomena. Nevertheless, complete models of the full complexity of single-celled organisms are beginning to transition to eukaryotes (Beard et al. 2012; Service 2016). Within narrow areas of gene function, we are already seeing successful projections from gene expression to prediction of growth and development of whole plants (Chew et al. 2014). Modelling provides a framework in which we can precisely organize and test our quantitative knowledge and hypotheses about how a plant process or combination of processes works and then test these against reality. As such, it provides a data-hypothesis-test-learn cycle to improve our understanding of plants and their use. Equally, the rapid growth of high-throughput ‘omics facilities is delivering ever-increasing amounts of data for which our capacity and ability to analyse and interpret lags. Mathematical models coupled with high-performance computing provide a means to deliver this needed acceleration. Simultaneously it should provide the means to predict which data is needed most, so providing feedback and focus for ‘omics approaches. This wealth of data also provides unprecedented opportunities for improving the precision of models by high-speed data to model linkage. Similarly, numerical and text mining knowledge discovery offer much to improving mathematical modelling of plant processes, with opportunities for automated improvement of representation and parameterization (Fer et al. 2018). In parallel, computer simulation of mathematical models has evolved from printouts of numbers to 3D representations of the growth and development of organs, whole plants and even communities of plants that can be indistinguishable from the real thing (Fig. 1). This facilitates identification of emergent phenomena while providing unprecedented opportunities in revolutionizing plant science education (Prusinkiewicz et al. 2007; Prusinkiewicz and Runions 2012; Runions et al. 2017).","PeriodicalId":36138,"journal":{"name":"in silico Plants","volume":" ","pages":""},"PeriodicalIF":3.1,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/INSILICOPLANTS/DIY002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44411236","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 6

On the dynamic determinants of reproductive failure under drought in maize 干旱条件下玉米生殖失败的动态决定因素研究

IF 3.1 Q1 AGRONOMY

in silico Plants

Pub Date : 2019-01-01 DOI: 10.1093/INSILICOPLANTS/DIZ003

C. Messina, G. Hammer, G. McLean, M. Cooper, E. V. van Oosterom, F. Tardieu, S. Chapman, A. Doherty, C. Gho

Reproductive failure under drought in maize (Zea mays) is a major cause of instability in global food systems. While there has been extensive research on maize reproductive physiology, it has not been formalized in mathematical form to enable the study and prediction of emergent phenotypes, physiological epistasis and pleiotropy. We developed a quantitative synthesis organized as a dynamical model for cohorting of reproductive structures along the ear while accounting for carbon and water supply and demand balances. The model can simulate the dynamics of silk initiation, elongation, fertilization and kernel growth, and can generate well-known emergent phenotypes such as the relationship between plant growth, anthesis-silking interval, kernel number and yield, as well as ear phenotypes under drought (e.g. tip kernel abortion). Simulation of field experiments with controlled drought conditions showed that predictions tracked well the observed response of yield and yield components to timing of water deficit. This framework represents a significant improvement from previous approaches to simulate reproductive physiology in maize. We envisage opportunities for this predictive capacity to advance our understanding of maize reproductive biology by informing experimentation, supporting breeding and increasing productivity in maize.

玉米（Zea mays）在干旱条件下的繁殖失败是全球粮食系统不稳定的主要原因。虽然对玉米生殖生理学进行了广泛的研究，但还没有以数学形式形式形式化，从而能够研究和预测紧急表型、生理上位性和多效性。我们开发了一个定量综合，作为一个动态模型，用于沿耳朵的生殖结构的共聚，同时考虑碳和水的供需平衡。该模型可以模拟丝的起始、伸长、受精和籽粒生长的动力学，并可以产生众所周知的紧急表型，如植物生长、花吐丝间隔、籽粒数量和产量之间的关系，以及干旱条件下的穗表型（如尖粒败育）。在受控干旱条件下进行的田间试验模拟表明，预测很好地跟踪了观察到的产量和产量组成部分对缺水时间的反应。该框架代表了对以前模拟玉米生殖生理学方法的显著改进。我们设想这种预测能力有机会通过为实验提供信息、支持玉米育种和提高玉米生产力来提高我们对玉米生殖生物学的理解。

{"title":"On the dynamic determinants of reproductive failure under drought in maize","authors":"C. Messina, G. Hammer, G. McLean, M. Cooper, E. V. van Oosterom, F. Tardieu, S. Chapman, A. Doherty, C. Gho","doi":"10.1093/INSILICOPLANTS/DIZ003","DOIUrl":"https://doi.org/10.1093/INSILICOPLANTS/DIZ003","url":null,"abstract":"Reproductive failure under drought in maize (Zea mays) is a major cause of instability in global food systems. While there has been extensive research on maize reproductive physiology, it has not been formalized in mathematical form to enable the study and prediction of emergent phenotypes, physiological epistasis and pleiotropy. We developed a quantitative synthesis organized as a dynamical model for cohorting of reproductive structures along the ear while accounting for carbon and water supply and demand balances. The model can simulate the dynamics of silk initiation, elongation, fertilization and kernel growth, and can generate well-known emergent phenotypes such as the relationship between plant growth, anthesis-silking interval, kernel number and yield, as well as ear phenotypes under drought (e.g. tip kernel abortion). Simulation of field experiments with controlled drought conditions showed that predictions tracked well the observed response of yield and yield components to timing of water deficit. This framework represents a significant improvement from previous approaches to simulate reproductive physiology in maize. We envisage opportunities for this predictive capacity to advance our understanding of maize reproductive biology by informing experimentation, supporting breeding and increasing productivity in maize.","PeriodicalId":36138,"journal":{"name":"in silico Plants","volume":" ","pages":""},"PeriodicalIF":3.1,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/INSILICOPLANTS/DIZ003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43411079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 49

Systems models, phenomics and genomics: three pillars for developing high-yielding photosynthetically efficient crops. 系统模型、表型组学和基因组学:开发光合高效高产作物的三大支柱。

IF 3.1 Q1 AGRONOMY

in silico Plants

Pub Date : 2019-01-01 Epub Date: 2019-04-25 DOI: 10.1093/insilicoplants/diy003

Tian-Gen Chang, Shuoqi Chang, Qing-Feng Song, Shahnaz Perveen, Xin-Guang Zhu

Recent years witnessed a stagnation in yield enhancement in major staple crops, which leads plant biologists and breeders to focus on an urgent challenge to dramatically increase crop yield to meet the growing food demand. Systems models have started to show their capacity in guiding crops improvement for greater biomass and grain yield production. Here we argue that systems models, phenomics and genomics combined are three pillars for the future breeding for high-yielding photosynthetically efficient crops (HYPEC). Briefly, systems models can be used to guide identification of breeding targets for a particular cultivar and define optimal physiological and architectural parameters for a particular crop to achieve high yield under defined environments. Phenomics can support collection of architectural, physiological, biochemical and molecular parameters in a high-throughput manner, which can be used to support both model validation and model parameterization. Genomic techniques can be used to accelerate crop breeding by enabling more efficient mapping between genotypic and phenotypic variation, and guide genome engineering or editing for model-designed traits. In this paper, we elaborate on these roles and how they can work synergistically to support future HYPEC breeding.

近年来，主要主要作物的产量增长停滞不前，这导致植物生物学家和育种家将重点放在大幅提高作物产量以满足日益增长的粮食需求这一紧迫挑战上。系统模型已经开始显示出它们在指导作物改良以提高生物量和粮食产量方面的能力。在此，我们认为系统模型、表型组学和基因组学相结合是未来高产光合高效作物育种的三大支柱。简而言之，系统模型可用于指导特定品种的育种目标识别，并定义特定作物在特定环境下实现高产的最佳生理和结构参数。表型组学可以支持高通量的建筑、生理、生化和分子参数的收集，可用于支持模型验证和模型参数化。基因组技术可以通过更有效地绘制基因型和表型变异之间的图谱来加速作物育种，并指导模型设计性状的基因组工程或编辑。在本文中，我们详细阐述了这些作用以及它们如何协同工作以支持未来的HYPEC育种。

{"title":"Systems models, phenomics and genomics: three pillars for developing high-yielding photosynthetically efficient crops.","authors":"Tian-Gen Chang, Shuoqi Chang, Qing-Feng Song, Shahnaz Perveen, Xin-Guang Zhu","doi":"10.1093/insilicoplants/diy003","DOIUrl":"https://doi.org/10.1093/insilicoplants/diy003","url":null,"abstract":"Recent years witnessed a stagnation in yield enhancement in major staple crops, which leads plant biologists and breeders to focus on an urgent challenge to dramatically increase crop yield to meet the growing food demand. Systems models have started to show their capacity in guiding crops improvement for greater biomass and grain yield production. Here we argue that systems models, phenomics and genomics combined are three pillars for the future breeding for high-yielding photosynthetically efficient crops (HYPEC). Briefly, systems models can be used to guide identification of breeding targets for a particular cultivar and define optimal physiological and architectural parameters for a particular crop to achieve high yield under defined environments. Phenomics can support collection of architectural, physiological, biochemical and molecular parameters in a high-throughput manner, which can be used to support both model validation and model parameterization. Genomic techniques can be used to accelerate crop breeding by enabling more efficient mapping between genotypic and phenotypic variation, and guide genome engineering or editing for model-designed traits. In this paper, we elaborate on these roles and how they can work synergistically to support future HYPEC breeding.","PeriodicalId":36138,"journal":{"name":"in silico Plants","volume":"1 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2019-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1093/insilicoplants/diy003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39115099","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 18

yggdrasil: a Python package for integrating computational models across languages and scales yggdrasil:一个Python包，用于跨语言和规模集成计算模型

IF 3.1 Q1 AGRONOMY

in silico Plants

Pub Date : 2019-01-01 DOI: 10.1093/INSILICOPLANTS/DIZ001

Meagan Lang

Thousands of computational models have been created within both the plant biology community and broader scientific communities in the past two decades that have the potential to be combined into complex integration networks capable of capturing more complex biological processes than possible with isolated models. However, the technological barriers introduced by differences in language and data formats have slowed this progress. We present yggdrasil (previously cis_interface), a Python package for running integration networks with connections between models across languages and scales. yggdrasil coordinates parallel execution of models in Python, C, C++, and Matlab on Linux, Mac OS, and Windows operating systems, and handles communication in a number of data formats common to computational plant modelling. yggdrasil is designed to be user-friendly and can be accessed at https://github.com/cropsinsilico/yggdrasil. Although originally developed for plant models, yggdrasil can be used to connect computational models from any domain.

在过去的二十年里，在植物生物学社区和更广泛的科学社区中已经创建了成千上万的计算模型，这些模型有可能被组合成复杂的集成网络，能够捕获比孤立模型更复杂的生物过程。然而，语言和数据格式差异带来的技术障碍减缓了这一进展。我们介绍了yggdrasil(以前称为cis_interface)，这是一个Python包，用于运行集成网络，并在不同语言和规模的模型之间建立连接。yggdrasil在Linux、Mac OS和Windows操作系统上协调Python、C、c++和Matlab中模型的并行执行，并处理计算工厂建模常用的许多数据格式的通信。Yggdrasil的设计是用户友好的，可以访问https://github.com/cropsinsilico/yggdrasil。虽然最初是为植物模型开发的，但yggdrasil可以用于连接来自任何领域的计算模型。

引用次数: 16