Background: Wheat (Triticum aestivum L.) is one of the world's important grain crops, and its growth and development are seriously affected by saline-alkali stress at multiple stages, especially the seedling stage. Nondestructive detection of wheat seedlings under saline-alkali stress can therefore provide more comprehensive technical support for wheat breeding, cultivation and management.
Results: This research focused on moisture signal prediction and classification of saline-alkali stress in wheat seedlings using data fusion techniques. After collecting and analyzing transverse relaxation time and multispectral imaging (MSI) information of wheat seedlings, four regression models were used to predict the moisture signal. K-Nearest Neighbor (KNN) and Gaussian Naïve Bayes (GNB) models were combined with fivefold cross-validation to classify wheat seedling stress. The results showed that wheat seedlings increased their bound water content through a certain mechanism to enhance their tolerance to saline-alkali stress. Under the same Na concentration, the effect of alkali stress on the moisture, growth and spectrum of wheat seedlings was stronger than that of salt stress. The Gradient Boosting Decision Regression Tree model performed best in predicting wheat moisture signals, with a coefficient of determination (R2P) of 0.98 and a root mean square error of 109.60. It also had a short training time (1.48 s) and an efficient prediction speed (1300 obs/s). The KNN and GNB models showed significantly better predictive performance on the fused dataset than on either single dataset. In particular, the GNB model performed best on the fused dataset, with Precision, Recall, Accuracy, and F1-score of 90.30%, 88.89%, 88.90%, and 0.90, respectively.
Conclusions: Under the same Na concentration, the effects of alkali stress on the water content, spectrum, and growth of wheat were stronger than those of salt stress, making alkali stress more unfavorable to wheat growth. The fusion of low-field nuclear magnetic resonance and MSI technology can improve the classification of wheat stress and provides an effective technical method for rapid and accurate monitoring of wheat seedlings under saline-alkali stress.
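The fusion-and-classification workflow summarized in this abstract can be illustrated with a minimal sketch. It assumes feature-level fusion (plain concatenation of per-sample NMR and MSI feature vectors), which the abstract does not specify, and uses a plain-Python KNN with shuffled fivefold cross-validation. The function names (`fuse`, `knn_predict`, `kfold_accuracy`) and the default of three neighbors are hypothetical and are not taken from the study.

```python
import math
import random
from collections import Counter


def fuse(features_a, features_b):
    """Feature-level fusion (assumed): concatenate two per-sample feature vectors."""
    return [list(a) + list(b) for a, b in zip(features_a, features_b)]


def euclidean(p, q):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training samples closest to the query."""
    ranked = sorted(zip(train_x, train_y), key=lambda pair: euclidean(pair[0], query))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def kfold_accuracy(x, y, n_folds=5, k=3, seed=0):
    """Mean accuracy over n_folds shuffled, non-overlapping folds."""
    order = list(range(len(x)))
    random.Random(seed).shuffle(order)
    folds = [order[i::n_folds] for i in range(n_folds)]
    scores = []
    for held_out in folds:
        held = set(held_out)
        train_idx = [i for i in order if i not in held]
        train_x = [x[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        hits = sum(knn_predict(train_x, train_y, x[i], k) == y[i] for i in held_out)
        scores.append(hits / len(held_out))
    return sum(scores) / n_folds
```

Because KNN is distance-based, features from the two sources would normally be standardized to comparable scales before concatenation; that step is omitted here for brevity.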
The major drawback to the implementation of genomic selection in a breeding program lies in the long-term decrease in additive genetic variance, which is a trade-off for rapid genetic improvement in the short term. Balancing an increase in genetic gain with retention of additive genetic variance necessitates careful optimization of this trade-off. In this study, we proposed an integrated index selection approach within the genomic inferred cross-selection (GCS) framework to maximize genetic gain across multiple traits. With this method, we identified optimal crosses that simultaneously maximize progeny performance and maintain genetic variance for multiple traits. Using a stochastically simulated recurrent breeding program over a 40-year period, we evaluated different GCS methods along with other factors, such as the number of parents, crosses, and progeny per cross, that influence genetic gain in a pulse crop breeding program. Across all breeding scenarios, the posterior mean variance consistently enhanced genetic gain compared with other methods, such as the usefulness criterion, optimal haploid value, mean genomic estimated breeding value, and mean index selection value of the superior parents. In addition, we provide a detailed strategy to optimize the number of parents, crosses, and progeny per cross that can potentially maximize short- and long-term genetic gain in a public breeding program.
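To make the trade-off between progeny mean and progeny variance concrete, the sketch below ranks candidate crosses by the usefulness criterion in its classic form, UC = mean + i × h × SD, where i is the selection intensity under truncation selection of a normally distributed trait. This is one of the comparison methods named in the abstract, not the proposed posterior mean variance approach. The function names, the `accuracy` parameter standing in for h, and the example crosses are assumptions made for illustration.

```python
from statistics import NormalDist


def selection_intensity(selected_fraction):
    """Standardized selection differential for truncation selection on a normal trait."""
    if not 0.0 < selected_fraction < 1.0:
        raise ValueError("selected_fraction must lie strictly between 0 and 1")
    std_normal = NormalDist()
    threshold = std_normal.inv_cdf(1.0 - selected_fraction)
    return std_normal.pdf(threshold) / selected_fraction


def usefulness_criterion(progeny_mean, progeny_sd, selected_fraction, accuracy=1.0):
    """Expected mean of the selected progeny: mean + i * accuracy * sd."""
    return progeny_mean + selection_intensity(selected_fraction) * accuracy * progeny_sd


def rank_crosses(crosses, selected_fraction=0.1):
    """Sort cross names by usefulness criterion, best first.

    crosses maps a cross name to a (progeny_mean, progeny_sd) pair.
    """
    return sorted(
        crosses,
        key=lambda name: usefulness_criterion(*crosses[name], selected_fraction),
        reverse=True,
    )
```

With hypothetical inputs such as `{"A": (10.0, 1.0), "B": (9.5, 2.0)}` and a selected fraction of 0.1, cross B ranks first despite its lower mean, because its larger progeny standard deviation raises the expected value of the selected tail.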
Background: The proportion of nitrogen (N) derived from the atmosphere (Ndfa) is a fundamental component of the plant N demand in legume species. To estimate the N benefit of grain legumes for the subsequent crop in the rotation, a simplified N balance is frequently used. This balance is calculated as the difference between fixed N and removed N by grains. The Ndfa needed to achieve a neutral N balance (hereafter ) is usually estimated through a simple linear regression model between Ndfa and N balance. This quantity is routinely estimated without accounting for the uncertainty in the estimate, which is needed to perform formal statistical inference about . In this article, we utilized a global database to describe the development of a novel Bayesian framework to quantify the uncertainty of . This study aimed to (i) develop a Bayesian framework to quantify the uncertainty of , and (ii) contrast the use of this Bayesian framework with the widely used delta and bootstrapping methods under different data availability scenarios.
Results: The delta method, bootstrapping, and Bayesian inference provided nearly equivalent numerical values when the range of values for Ndfa was thoroughly explored during data collection (e.g., 6-91%), and the number of observations was relatively high (e.g., ). When the Ndfa tested was narrow and/or sample size was small, the delta method and bootstrapping provided confidence intervals containing biologically non-meaningful values (i.e. < 0% or > 100%). However, under a narrow Ndfa range and small sample size, the developed Bayesian inference framework obtained biologically meaningful values in the uncertainty estimation.
Conclusion: In this study, we showed that the developed Bayesian framework was preferable under limited data conditions (by using informative priors) and when uncertainty estimation had to be constrained (regularized) to obtain meaningful inference. The presented Bayesian framework lays the foundation not only to conduct formal comparisons or hypothesis testing involving , but also to learn about its expected value, variance, and higher moments such as skewness and kurtosis under different agroecological and crop management conditions. This framework can also be transferred to estimate balances for other nutrients and/or field crops to gain knowledge on global crop nutrient balances.
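The delta-method baseline mentioned in this abstract can be sketched as follows, assuming an ordinary least-squares fit of N balance on Ndfa. The Ndfa at neutral N balance is the x-intercept of the fitted line, -intercept/slope, and its variance is approximated by a first-order Taylor expansion. The function name and the variable `theta` are hypothetical labels, since the abstract's own symbol for this quantity did not survive extraction; this is not the authors' Bayesian framework.

```python
import math


def neutral_ndfa_delta(ndfa, n_balance):
    """Return (theta, standard_error) for the Ndfa at which the fitted N balance is zero.

    Fits n_balance = intercept + slope * ndfa by ordinary least squares, then
    applies the delta method to theta = -intercept / slope.
    """
    n = len(ndfa)
    x_bar = sum(ndfa) / n
    y_bar = sum(n_balance) / n
    sxx = sum((x - x_bar) ** 2 for x in ndfa)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(ndfa, n_balance))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar

    # Residual variance and the covariance matrix of (intercept, slope).
    rss = sum((y - intercept - slope * x) ** 2 for x, y in zip(ndfa, n_balance))
    s2 = rss / (n - 2)
    var_a = s2 * (1.0 / n + x_bar ** 2 / sxx)
    var_b = s2 / sxx
    cov_ab = -s2 * x_bar / sxx

    theta = -intercept / slope
    # Gradient of theta: d/d(intercept) = -1/slope, d/d(slope) = intercept/slope^2.
    var_theta = (
        var_a / slope ** 2
        + intercept ** 2 * var_b / slope ** 4
        - 2.0 * intercept * cov_ab / slope ** 3
    )
    return theta, math.sqrt(max(var_theta, 0.0))
```

A symmetric interval such as theta ± 1.96 × SE built from this output is not bounded to 0-100%, which is consistent with the abstract's observation that the delta method can yield biologically non-meaningful limits when the Ndfa range is narrow or the sample is small.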
Background: Dissection of complex plant cell wall structures demands a sensitive and quantitative method. FTIR is used regularly as a screening method to identify specific linkages in cell walls. However, quantification and the assignment of spectral bands to particular cell wall components remain a major challenge, particularly in crop species. In this study, we addressed these challenges using ATR-FTIR spectroscopy, as it is a high-throughput, cost-effective and non-destructive approach to understanding plant cell wall composition. This method was validated by analysing different varieties of mungbean, which is one of the most important legume crops grown widely in Asia.
Results: Using standards and extraction of specific cell wall components, we assigned the 1050-1060 cm-1 and 1390-1420 cm-1 wavenumber ranges, which can be widely used to quantify cellulose and lignin, respectively, in Arabidopsis, Populus, rice and mungbean. Also, using KBr as a diluent, we established a method that can relatively quantify the cellulose and lignin composition among different tissue types of the above species. We further used this method to quantify cellulose and lignin in field-grown mungbean genotypes. The ATR-FTIR-based study revealed that cellulose content ranged from 27.9% to 52.3% and lignin content from 13.7% to 31.6% among mungbean genotypes.
Conclusion: Multivariate analysis of FTIR data revealed differences in total cell wall (600-2000 cm-1), cellulose (1000-1100 cm-1) and lignin (1390-1420 cm-1) between the leaf and stem of four plant species. Overall, our data suggested that ATR-FTIR can be used for the relative quantification of lignin and cellulose in different plant species. This method was successfully applied for rapid screening of cell wall composition in mungbean stem, and similarly, it can be used for screening other crops or tree species.
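One common way to summarize a spectral band is its integrated area, and the sketch below shows how the band assignments reported above could be turned into relative values. The abstract does not state how band signals were converted to percentages, so the trapezoidal integration and the normalization by the 600-2000 cm-1 total cell wall region are assumptions, as are the function names; the output is a relative index, not a calibrated content.

```python
def band_area(wavenumbers, absorbance, low, high):
    """Trapezoidal area under the absorbance curve between low and high (cm^-1)."""
    points = sorted((w, a) for w, a in zip(wavenumbers, absorbance) if low <= w <= high)
    return sum(
        (w2 - w1) * (a1 + a2) / 2.0
        for (w1, a1), (w2, a2) in zip(points, points[1:])
    )


def relative_band_areas(wavenumbers, absorbance, bands, reference=(600.0, 2000.0)):
    """Area of each named band divided by the area of the reference region.

    bands maps a component name to its (low, high) wavenumber range.
    """
    total = band_area(wavenumbers, absorbance, *reference)
    return {
        name: band_area(wavenumbers, absorbance, low, high) / total
        for name, (low, high) in bands.items()
    }


# Band limits taken from the abstract; dictionary keys are illustrative labels.
CELL_WALL_BANDS = {
    "cellulose": (1050.0, 1060.0),
    "lignin": (1390.0, 1420.0),
}
```

Only sampled points that fall inside a band are used, so a band narrower than the spectrometer's sampling interval would return zero area; interpolation at the band edges would be needed in that case.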
Background: The use of 3D imaging techniques, such as X-ray CT, in root phenotyping has become more widespread in recent years. However, due to the complexity of the root structure, analyzing the resulting 3D volumes to obtain detailed architectural root traits remains a challenging computational problem. When it comes to image-based phenotyping of excavated maize root crowns, two types of root features that are notably missing from existing methods are the whorls and soil line. Whorls refer to the distinct areas located at the base of each stem node from which roots sprout in a circular pattern (Liu S, Barrow CS, Hanlon M, Lynch JP, Bucksch A. Dirt/3D: 3D root phenotyping for field-grown maize (zea mays). Plant Physiol. 2021;187(2):739-57. https://doi.org/10.1093/plphys/kiab311). The soil line is where the root stem meets the ground. Knowledge of these features would give biologists deeper insights into the root system architecture (RSA) and the below- and above-ground root properties.
Results: We developed TopoRoot+, a computational pipeline that produces architectural traits from 3D X-ray CT volumes of excavated maize root crowns. Building upon the TopoRoot software (Zeng D, Li M, Jiang N, Ju Y, Schreiber H, Chambers E, et al. Toporoot: A method for computing hierarchy and fine-grained traits of maize roots from 3D imaging. Plant Methods. 2021;17(1). https://doi.org/10.1186/s13007-021-00829-z) for computing fine-grained root traits, TopoRoot+ adds the capability to detect whorls, identify nodal roots at each whorl, and compute the soil line location. The new algorithms in TopoRoot+ offer an additional set of fine-grained traits beyond those provided by TopoRoot. The addition includes internode distances, root traits at every hierarchy level associated with a whorl, and root traits specific to above or below the ground. TopoRoot+ is validated on a diverse collection of field-grown maize root crowns consisting of nine genotypes and spanning three years. TopoRoot+ runs in minutes for a typical volume size of [Formula: see text] on a desktop workstation. Our software and test dataset are freely distributed on GitHub.
Conclusions: TopoRoot+ advances the state-of-the-art in image-based phenotyping of excavated maize root crowns by offering more detailed architectural traits related to whorls and soil lines. The efficiency of TopoRoot+ makes it well-suited for high-throughput image-based root phenotyping.
Fungal diseases are the main factors affecting the quality and production of vegetables. Rapid and accurate detection of pathogenic spores is of great practical significance for early prediction and prevention of diseases. However, microscopic images collected in the natural environment pose several problems, such as complex backgrounds, abundant interfering material, small spore size, and varied spore morphology. Therefore, this study proposed an improved detection method, GCS-YOLOv8 (Global context and CARAFE and Small detector-optimized YOLOv8), which effectively improves the detection accuracy of small-target pathogen spores in natural scenes. Firstly, a small-target detection layer is added to the network, which enhances the network's sensitivity to small targets and alleviates the problem of low detection accuracy for small targets. Secondly, Global Context attention is introduced in the Backbone to optimize the CSPDarknet53 to 2-Stage FPN (C2F) module and model global context information. At the same time, the feature up-sampling module Content-Aware Reassembly of Features (CARAFE) is introduced into the Neck to further enhance the ability of the network to extract spore features in natural scenes. Finally, we used an Explainable Artificial Intelligence (XAI) approach to interpret the model's predictions. The experimental results showed that the improved GCS-YOLOv8 model could detect the spores of the three fungi with an accuracy of 0.926 and a model size of 22.8 MB, which was significantly superior to the existing model, and showed good robustness under different brightness conditions. The test on microscopic images of the infection structure of cucumber downy mildew also showed that the model generalized well. Therefore, this study realized the accurate detection of pathogen spores in natural scenes and provided feasible technical support for the early prediction and prevention of fungal diseases.
Background: This study explores the use of Unmanned Aerial Vehicles (UAVs) for estimating wheat biomass, focusing on the impact of phenotyping and analytical protocols in the context of late-stage variety selection programs. It emphasizes the importance of variable selection, model specificity, and sampling location within the experimental plot in predicting biomass, aiming to refine UAV-based estimation techniques for enhanced selection accuracy and throughput in variety testing programs.
Results: The research found that integrating geometric and spectral traits increased prediction accuracy, whilst a recursive feature elimination (RFE) based variable selection workflow led to slight reductions in accuracy with the benefit of increased interpretability. Models tailored to specific experiments were more accurate than those modelling all experiments together, while models trained for broad growth stages did not significantly increase accuracy. The comparison between a permanent and a precise region of interest (ROI) within the plot showed negligible differences in biomass prediction accuracy, indicating the robustness of the approach across different sampling locations within the plot. Significant differences in the within-season repeatability (w2) of biomass predictions across different experiments highlighted the need for further investigation into the optimal timing of measurement for prediction.
Conclusions: The study highlights the promising potential of UAV technology in biomass prediction for wheat at a small plot scale. It suggests that the accuracy of biomass predictions can be significantly improved through optimizing analytical and modelling protocols (i.e., variable selection, algorithm selection, stage-specific model development). Future work should focus on exploring the applicability of these findings under a wider variety of conditions and from a more diverse set of genotypes.
Soybean seeds are susceptible to damage from Riptortus pedestris, which is a significant factor affecting soybean seed quality. Currently, manual screening of soybean seeds is limited to visual inspection, making it difficult to identify seeds that appear defect-free but have been punctured by stink bugs beneath the surface. To facilitate convenient and efficient identification of healthy soybean seeds, this paper proposes a soybean seed pest detection method based on spatial frequency domain imaging combined with RL-SVM. First, soybean optical data are obtained using the single integrating sphere technique, and the vigor index of soybean seeds is obtained through germination experiments. Then, the characteristic wavelengths of soybeans are identified from these two data sets using feature extraction algorithms (the successive projections algorithm and the competitive adaptive reweighted sampling algorithm). Subsequently, the spatial frequency domain imaging technique is used to obtain sub-surface images of soybean seeds in a forward manner, and optical coefficients such as the reduced scattering coefficient and absorption coefficient of soybean seeds are inverted. Finally, RL-MLR, RL-GRNN, and RL-SVM prediction models are established based on the ratio of the insect-damaged sub-surface area to the entire seed area, the soybean variety, and three wavelengths (502 nm, 813 nm, and 712 nm) to predict and identify the damage levels caused by the piercing-sucking pest in soybean seeds. The experimental results show that the spatial frequency domain imaging technique yields small errors in the optical coefficients of soybean seeds, with errors of less than 15% for and less than 10% for . After parameter adjustment through reinforcement learning, the Macro-Recall of each model improved by 10%-15%, and the RL-SVM model achieved a high Macro-Recall of 0.9635 for classifying the pest damage levels of soybean seeds.
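For readers unfamiliar with the Macro-Recall metric reported here, the following minimal sketch computes it as the unweighted mean of per-class recall, so that rare damage levels count as much as common ones. The function name and the example class labels in the usage note are hypothetical and are not taken from the study.

```python
from collections import defaultdict


def macro_recall(y_true, y_pred):
    """Unweighted mean of per-class recall over the classes present in y_true."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for truth, predicted in zip(y_true, y_pred):
        totals[truth] += 1
        hits[truth] += int(truth == predicted)
    return sum(hits[label] / totals[label] for label in totals) / len(totals)
```

For example, with true labels of four "none", two "mild" and two "severe" seeds, and per-class recalls of 3/4, 2/2 and 1/2, the macro-recall is (0.75 + 1.0 + 0.5) / 3 = 0.75, whereas plain accuracy on the same predictions would also be 0.75 only by coincidence of these counts.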
Background: As the genomes of many eukaryotic species, especially plants, are large and complex, their de novo sequencing and assembly is still a difficult task despite progress in sequencing technologies. An alternative to genome assembly is the assembly of the transcriptome, the set of RNA products of the expressed genes. While a number of de novo transcriptome assemblers exist, the challenges of transcriptomes (the existence of isoforms, the uneven expression levels across genes) complicate the generation of high-quality assemblies suitable for downstream analyses.
Results: We developed Trans2express - a web-based tool and a pipeline for de novo hybrid transcriptome assembly and postprocessing based on rnaSPAdes with a set of subsequent filtrations. The pipeline was tested on Arabidopsis thaliana cDNA sequencing data obtained using Illumina and Oxford Nanopore Technologies platforms and on three non-model plant species. The comparison of structural characteristics of the transcriptome assembly with the reference Arabidopsis genome revealed the high quality of the assembled transcriptome, with 86.1% of Arabidopsis expressed genes assembled as a single contig. We tested the applicability of the transcriptome assembly for gene expression analysis. For both Arabidopsis and the non-model species, the results showed high congruence of gene expression levels and sets of differentially expressed genes between analyses based on the genome and those based on the transcriptome assembly.
Conclusions: We present Trans2express - a protocol for de novo hybrid transcriptome assembly aimed at recovering a single transcript per gene. We expect this protocol to promote the characterization of transcriptomes and gene expression analysis in non-model plants, and the web-based tool to be of use to a wide range of plant biologists.