首页 > 最新文献

Plant Methods最新文献

英文 中文
Evaluation of one-image 3D reconstruction for plant model generation. 植物模型生成中一幅图像三维重建的评价。
IF 4.4 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-13 DOI: 10.1186/s13007-025-01482-6
Zihe Gao, Zane K J Hartley, Andrew P French

Generating accurate and visually realistic 3D models of plants from single-view images is crucial yet remains challenging due to plants' intricate geometry and frequent occlusions. This capability matters because it supplements current plant datasets and enables non-destructive, high-throughput phenotyping for crop breeding and precision agriculture. More broadly, 3D reconstruction is particularly important because plant morphology is inherently three-dimensional, while 2D representations miss occluded leaves, branching geometry, and volumetric traits. However, plants present unique challenges compared to common rigid objects, and most current generative methods have not been systematically tested in this domain, leaving a gap in understanding their reliability for realistic plant reconstruction. This study systematically evaluates six advanced generative techniques-Hunyuan3D 2.0, Trellis (Structured 3D Latents), One2345++, InstantMesh, Direct3D and Unique3D-using the existing PlantDreamer dataset. Specifically, this research reconstructs mesh models from images of Bean plants and quantitatively assesses each method's performance against ground-truth models using Chamfer Distance, Normal Consistency, F-Score, PSNR, LPIPS, and CLIP Score. The paper also presents qualitative results of Kale and Mint plants. The results indicate that Hunyuan3D 2.0 achieves superior performance overall, suggesting its effectiveness in capturing complex plant structures. This work provides valuable insights into strengths and limitations of contemporary 3D generative approaches, guiding future improvements in realistic plant digitisation.

从单视图图像中生成准确和视觉逼真的植物3D模型至关重要,但由于植物复杂的几何形状和频繁的遮挡,仍然具有挑战性。这种能力很重要,因为它补充了现有的植物数据集,并为作物育种和精准农业提供了非破坏性、高通量的表型分析。更广泛地说,3D重建尤其重要,因为植物形态本质上是三维的,而2D表示缺少闭塞的叶子,分支几何形状和体积特征。然而,与常见的刚性物体相比,植物呈现出独特的挑战,并且大多数当前的生成方法尚未在该领域进行系统测试,因此在了解其在现实植物重建中的可靠性方面存在差距。本研究使用现有的plantdream数据集系统地评估了六种先进的生成技术——hunyuan3d 2.0、Trellis (Structured 3D Latents)、one2345++、InstantMesh、Direct3D和unique3d。具体而言,本研究从豆类植物图像中重建网格模型,并使用倒角距离、正常一致性、F-Score、PSNR、LPIPS和CLIP Score定量评估每种方法相对于ground-truth模型的性能。本文还介绍了羽衣甘蓝和薄荷植物的定性结果。结果表明,浑源3d 2.0在捕获复杂植物结构方面具有较好的效果。这项工作为当代3D生成方法的优势和局限性提供了有价值的见解,指导未来现实植物数字化的改进。
{"title":"Evaluation of one-image 3D reconstruction for plant model generation.","authors":"Zihe Gao, Zane K J Hartley, Andrew P French","doi":"10.1186/s13007-025-01482-6","DOIUrl":"10.1186/s13007-025-01482-6","url":null,"abstract":"<p><p>Generating accurate and visually realistic 3D models of plants from single-view images is crucial yet remains challenging due to plants' intricate geometry and frequent occlusions. This capability matters because it supplements current plant datasets and enables non-destructive, high-throughput phenotyping for crop breeding and precision agriculture. More broadly, 3D reconstruction is particularly important because plant morphology is inherently three-dimensional, while 2D representations miss occluded leaves, branching geometry, and volumetric traits. However, plants present unique challenges compared to common rigid objects, and most current generative methods have not been systematically tested in this domain, leaving a gap in understanding their reliability for realistic plant reconstruction. This study systematically evaluates six advanced generative techniques-Hunyuan3D 2.0, Trellis (Structured 3D Latents), One2345++, InstantMesh, Direct3D and Unique3D-using the existing PlantDreamer dataset. Specifically, this research reconstructs mesh models from images of Bean plants and quantitatively assesses each method's performance against ground-truth models using Chamfer Distance, Normal Consistency, F-Score, PSNR, LPIPS, and CLIP Score. The paper also presents qualitative results of Kale and Mint plants. The results indicate that Hunyuan3D 2.0 achieves superior performance overall, suggesting its effectiveness in capturing complex plant structures. This work provides valuable insights into strengths and limitations of contemporary 3D generative approaches, guiding future improvements in realistic plant digitisation.</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":" ","pages":"18"},"PeriodicalIF":4.4,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145966691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MicroDeblurNet: high-fidelity restoration of spatially variant defocus in microscopic images for cucumber downy mildew. MicroDeblurNet:高保真黄瓜霜霉病显微图像空间变异离焦恢复。
IF 4.4 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-13 DOI: 10.1186/s13007-025-01495-1
Yiding Zhang, Bo Wang, Yuzhaobi Song, Zonghuan Han, Xiaoshuan Zhang, Lingxian Zhang

Microscopic imaging provides essential visual evidence for pathogen monitoring, but its shallow depth of field and the three-dimensional height variation of spores lead to pronounced defocus blur and structural degradation. Restoring such images is therefore crucial for reliable spore identification and downstream analysis. However, microscopic defocus is a spatially varying process that severely suppresses high-frequency structures, causing natural-image deblurring models to generalize poorly. In addition, optical constraints of microscopy make realistic sharp-blur pairs difficult to obtain, further limiting learning-based restoration. To address these challenges, we propose MicroDeblurNet, a single-image deblurring network specifically designed for microscopic defocus restoration. The model incorporates a convolutional block attention module to enhance spatial selectivity toward key pathogen structures, and employs depthwise over-parameterized convolutions to capture locally varying blur patterns more effectively, enabling spatially consistent and structurally coherent restoration. Furthermore, a spatial-frequency consistency loss is proposed to strengthen high-frequency detail recovery while maintaining color fidelity and morphological integrity. To support high-fidelity supervision, we propose a paired-data construction strategy based on Laplacian-pyramid fusion and construct a clear-blur microscopic dataset for cucumber downy mildew. The restored outputs of MicroDeblurNet are further applied to sporangia detection and semantic segmentation to evaluate their impact on high-level visual tasks. Finally, we build an integrated microscopic analysis platform that delivers standardized high-quality data and automated pathogen-structure recognition and analysis to support disease assessment and management. Experimental results demonstrate that MicroDeblurNet achieves an optimal balance across pixel-level, structure-level, and perception-level metrics, reaching a PSNR of 42.48 dB and SSIM of 0.9839, outperforming advanced state-of-the-art methods. In downstream tasks, MicroDeblurNet delivers higher detection recall and segmentation accuracy in challenging scenarios involving sporangia adhesion and background impurities, demonstrating its ability to enhance target discernibility, preserve structural completeness, and improve robustness under complex microscopic conditions.

显微成像为病原体监测提供了必要的视觉证据,但其浅景深和孢子的三维高度变化导致明显的离焦模糊和结构退化。因此,恢复这些图像对于可靠的孢子鉴定和下游分析至关重要。然而,微观离焦是一个空间变化的过程,严重抑制高频结构,导致自然图像去模糊模型泛化不良。此外,显微镜的光学限制使得难以获得真实的锐利模糊对,进一步限制了基于学习的恢复。为了应对这些挑战,我们提出了MicroDeblurNet,这是一个专为微观散焦恢复而设计的单图像去模糊网络。该模型采用卷积块注意模块来增强对关键病原体结构的空间选择性,并采用深度过参数化卷积来更有效地捕获局部变化的模糊模式,从而实现空间一致性和结构一致性的恢复。此外,提出了一种空间频率一致性损失来增强高频细节恢复,同时保持色彩保真度和形态完整性。为了支持高保真监控,提出了一种基于拉普拉斯-金字塔融合的配对数据构建策略,构建了清晰模糊的黄瓜霜霉病微观数据集。将MicroDeblurNet的恢复输出进一步应用于孢子囊检测和语义分割,以评估其对高级视觉任务的影响。最后,我们构建了一个集成的显微分析平台,提供标准化的高质量数据和自动化的病原体结构识别和分析,以支持疾病评估和管理。实验结果表明,MicroDeblurNet实现了像素级、结构级和感知级指标的最佳平衡,PSNR达到42.48 dB, SSIM为0.9839,优于先进的最先进的方法。在下游任务中,MicroDeblurNet在涉及孢子囊粘附和背景杂质的挑战性场景中提供了更高的检测召回率和分割精度,证明了其在复杂微观条件下增强目标可识别性、保持结构完整性和提高鲁棒性的能力。
{"title":"MicroDeblurNet: high-fidelity restoration of spatially variant defocus in microscopic images for cucumber downy mildew.","authors":"Yiding Zhang, Bo Wang, Yuzhaobi Song, Zonghuan Han, Xiaoshuan Zhang, Lingxian Zhang","doi":"10.1186/s13007-025-01495-1","DOIUrl":"10.1186/s13007-025-01495-1","url":null,"abstract":"<p><p>Microscopic imaging provides essential visual evidence for pathogen monitoring, but its shallow depth of field and the three-dimensional height variation of spores lead to pronounced defocus blur and structural degradation. Restoring such images is therefore crucial for reliable spore identification and downstream analysis. However, microscopic defocus is a spatially varying process that severely suppresses high-frequency structures, causing natural-image deblurring models to generalize poorly. In addition, optical constraints of microscopy make realistic sharp-blur pairs difficult to obtain, further limiting learning-based restoration. To address these challenges, we propose MicroDeblurNet, a single-image deblurring network specifically designed for microscopic defocus restoration. The model incorporates a convolutional block attention module to enhance spatial selectivity toward key pathogen structures, and employs depthwise over-parameterized convolutions to capture locally varying blur patterns more effectively, enabling spatially consistent and structurally coherent restoration. Furthermore, a spatial-frequency consistency loss is proposed to strengthen high-frequency detail recovery while maintaining color fidelity and morphological integrity. To support high-fidelity supervision, we propose a paired-data construction strategy based on Laplacian-pyramid fusion and construct a clear-blur microscopic dataset for cucumber downy mildew. The restored outputs of MicroDeblurNet are further applied to sporangia detection and semantic segmentation to evaluate their impact on high-level visual tasks. Finally, we build an integrated microscopic analysis platform that delivers standardized high-quality data and automated pathogen-structure recognition and analysis to support disease assessment and management. Experimental results demonstrate that MicroDeblurNet achieves an optimal balance across pixel-level, structure-level, and perception-level metrics, reaching a PSNR of 42.48 dB and SSIM of 0.9839, outperforming advanced state-of-the-art methods. In downstream tasks, MicroDeblurNet delivers higher detection recall and segmentation accuracy in challenging scenarios involving sporangia adhesion and background impurities, demonstrating its ability to enhance target discernibility, preserve structural completeness, and improve robustness under complex microscopic conditions.</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":" ","pages":"16"},"PeriodicalIF":4.4,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145960017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Crop phenotype prediction using SNP context and whole-genome feature embedding based on DNABERT-2. 基于DNABERT-2的SNP上下文和全基因组特征嵌入的作物表型预测。
IF 4.4 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-12 DOI: 10.1186/s13007-025-01492-4
Huan Li, Yunpeng Cui, Tan Sun, Ting Wang, Zhen Chen, Chao Wang, Wenbo Bian, Juan Liu, Mo Wang, Li Chen, Jinming Wu, Jie Huang
<p><strong>Background: </strong>Modern agriculture demands precise genomic prediction to accelerate elite crop breeding, yet traditional genomic prediction approaches, such as genomic best linear unbiased prediction (GBLUP) and Bayesian methods, focus primarily on the cumulative effect of individual SNPs, thus neglecting the concerted influence that the surrounding sequence context has on the phenotype.</p><p><strong>Methods: </strong>To overcome these limitations, we propose two novel feature embedding modes (SNP-context and whole-genome) based on DNABERT-2, a cross-species genomic foundation model that uses self-attention mechanisms and transfer learning to automatically identify conserved sequence features across diverse evolutionary lineages without prior biological assumptions. The whole-genome feature embedding aggregates genomic information at a global scale by pooling vectors from chunked sequences processed by DNABERT-2, whereas the context feature embedding captures local information by directly encoding variable-length (500-3000 bp) sequences centered on target SNPs. To reduce noise in the high-dimensional feature embeddings, we employed principal component analysis (PCA) and partial least squares (PLS) to project the features into a lower-dimensional space. We generated two kinds of feature embedding for three crop datasets (rice413, rice395, and maize301), investigated the impact of 500-3000 bp flanking SNP contexts on phenotypic prediction, and compared prediction accuracy variations across algorithms at 4-768 feature dimensions among the PCA, PLS, and no dimensionality reduction strategies.</p><p><strong>Results: </strong>The results demonstrate that machine learning (ML) algorithms operating under the SNP-context embedding mode achieve greater accuracy and lower mean absolute errors (MAEs) than traditional SNP features, with performance peaking at optimal context lengths that proved to be trait-dependent (e.g., 1000 bp to 3000 bp), particularly for traits with low-to-moderate heritability (H<sup>2</sup> ∈ (0.2, 0.7]). In contrast, using whole-genome embeddings as input for ML can further improve the prediction accuracy for highly heritable traits (H<sup>2</sup> ∈ (0.7, 1.0]), even outperforming state-of-the-art deep learning models (such as DNNGP and ResGS) that rely on SNP markers.</p><p><strong>Conclusions: </strong>The proposed feature embedding methods, which leverage DNABERT-2 to capture the contextual features of SNPs, effectively overcome the limitations of traditional prediction models. This study demonstrates that the SNP-context mode is superior for traits with low-to-moderate heritability, while the whole-genome embedding mode excels for highly heritable ones. Our work provides plant breeders with a flexible and powerful analytical framework, enabling them to select the most suitable phenotypic prediction method based on the complexity of the target trait, thereby accelerating genetic gain in the breeding of elite crop
背景:现代农业需要精确的基因组预测来加速优质作物育种,然而传统的基因组预测方法,如基因组最佳线性无偏预测(GBLUP)和贝叶斯方法,主要关注单个snp的累积效应,从而忽略了周围序列背景对表型的协同影响。方法:为了克服这些限制,我们提出了基于DNABERT-2的两种新的特征嵌入模式(SNP-context和全基因组),DNABERT-2是一种跨物种基因组基础模型,它使用自我注意机制和迁移学习来自动识别不同进化谱系中的保守序列特征,而无需事先的生物学假设。全基因组特征嵌入通过汇集DNABERT-2处理的分块序列中的向量,在全球范围内聚合基因组信息,而上下文特征嵌入通过直接编码以目标snp为中心的可变长度(500-3000 bp)序列来捕获局部信息。为了降低高维特征嵌入中的噪声,我们使用主成分分析(PCA)和偏最小二乘(PLS)将特征投影到低维空间中。我们为3个作物数据集(rice413、rice395和maize301)生成了两种特征嵌入,研究了500-3000 bp侧翼SNP上下文对表型预测的影响,并比较了PCA、PLS和无降维策略在4-768个特征维度上不同算法的预测精度差异。结果表明,与传统的SNP特征相比,在SNP-上下文嵌入模式下运行的机器学习(ML)算法具有更高的准确性和更低的平均绝对误差(MAEs),其性能在最佳上下文长度处达到峰值,该长度被证明是性状依赖的(例如,1000 bp至3000 bp),特别是对于具有低至中等遗传力的性状(H2∈(0.2,0.7))。相比之下,使用全基因组嵌入作为机器学习的输入可以进一步提高对高度可遗传性状(H2∈(0.7,1.0])的预测精度,甚至优于依赖SNP标记的最先进的深度学习模型(如DNNGP和ResGS)。结论:所提出的特征嵌入方法利用DNABERT-2捕获snp的上下文特征,有效克服了传统预测模型的局限性。本研究表明,SNP-context模式在中低遗传力性状中较为优越,而全基因组嵌入模式在高遗传力性状中较为优越。我们的工作为植物育种家提供了一个灵活而强大的分析框架,使他们能够根据目标性状的复杂性选择最合适的表型预测方法,从而加快作物优良品种育种的遗传增益。
{"title":"Crop phenotype prediction using SNP context and whole-genome feature embedding based on DNABERT-2.","authors":"Huan Li, Yunpeng Cui, Tan Sun, Ting Wang, Zhen Chen, Chao Wang, Wenbo Bian, Juan Liu, Mo Wang, Li Chen, Jinming Wu, Jie Huang","doi":"10.1186/s13007-025-01492-4","DOIUrl":"10.1186/s13007-025-01492-4","url":null,"abstract":"&lt;p&gt;&lt;strong&gt;Background: &lt;/strong&gt;Modern agriculture demands precise genomic prediction to accelerate elite crop breeding, yet traditional genomic prediction approaches, such as genomic best linear unbiased prediction (GBLUP) and Bayesian methods, focus primarily on the cumulative effect of individual SNPs, thus neglecting the concerted influence that the surrounding sequence context has on the phenotype.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Methods: &lt;/strong&gt;To overcome these limitations, we propose two novel feature embedding modes (SNP-context and whole-genome) based on DNABERT-2, a cross-species genomic foundation model that uses self-attention mechanisms and transfer learning to automatically identify conserved sequence features across diverse evolutionary lineages without prior biological assumptions. The whole-genome feature embedding aggregates genomic information at a global scale by pooling vectors from chunked sequences processed by DNABERT-2, whereas the context feature embedding captures local information by directly encoding variable-length (500-3000 bp) sequences centered on target SNPs. To reduce noise in the high-dimensional feature embeddings, we employed principal component analysis (PCA) and partial least squares (PLS) to project the features into a lower-dimensional space. We generated two kinds of feature embedding for three crop datasets (rice413, rice395, and maize301), investigated the impact of 500-3000 bp flanking SNP contexts on phenotypic prediction, and compared prediction accuracy variations across algorithms at 4-768 feature dimensions among the PCA, PLS, and no dimensionality reduction strategies.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Results: &lt;/strong&gt;The results demonstrate that machine learning (ML) algorithms operating under the SNP-context embedding mode achieve greater accuracy and lower mean absolute errors (MAEs) than traditional SNP features, with performance peaking at optimal context lengths that proved to be trait-dependent (e.g., 1000 bp to 3000 bp), particularly for traits with low-to-moderate heritability (H&lt;sup&gt;2&lt;/sup&gt; ∈ (0.2, 0.7]). In contrast, using whole-genome embeddings as input for ML can further improve the prediction accuracy for highly heritable traits (H&lt;sup&gt;2&lt;/sup&gt; ∈ (0.7, 1.0]), even outperforming state-of-the-art deep learning models (such as DNNGP and ResGS) that rely on SNP markers.&lt;/p&gt;&lt;p&gt;&lt;strong&gt;Conclusions: &lt;/strong&gt;The proposed feature embedding methods, which leverage DNABERT-2 to capture the contextual features of SNPs, effectively overcome the limitations of traditional prediction models. This study demonstrates that the SNP-context mode is superior for traits with low-to-moderate heritability, while the whole-genome embedding mode excels for highly heritable ones. Our work provides plant breeders with a flexible and powerful analytical framework, enabling them to select the most suitable phenotypic prediction method based on the complexity of the target trait, thereby accelerating genetic gain in the breeding of elite crop ","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":" ","pages":"15"},"PeriodicalIF":4.4,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145960071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assisting species differentiation and taxonomic classification by hyperspectral imaging: an example from the parasitic plant realm. 利用高光谱成像辅助物种分化和分类:寄生植物领域的一个例子。
IF 4.4 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-11 DOI: 10.1186/s13007-025-01498-y
Vasili A Balios, Samuel Ortega, Karsten Heia, Anna Avetisyan, Kirsten Krause
{"title":"Assisting species differentiation and taxonomic classification by hyperspectral imaging: an example from the parasitic plant realm.","authors":"Vasili A Balios, Samuel Ortega, Karsten Heia, Anna Avetisyan, Kirsten Krause","doi":"10.1186/s13007-025-01498-y","DOIUrl":"10.1186/s13007-025-01498-y","url":null,"abstract":"","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":" ","pages":"14"},"PeriodicalIF":4.4,"publicationDate":"2026-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12882463/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145952883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning-based classification of roses using 18 SNP markers for optimized genebank management. 利用18个SNP标记对玫瑰进行机器学习分类,优化基因库管理。
IF 4.4 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-06 DOI: 10.1186/s13007-025-01496-0
Laurine Patzer, Marcus Linde, Thomas Debener

Background: Reliable classification of rose cultivars is complicated by their long and complex breeding history, frequent hybridization, and the coexistence of traditional horticultural categories with genetically heterogeneous groups. While molecular marker sets such as SSRs have been applied to assess genetic relationships, studies across cultivated roses and species are rare. Using 18 SNP markers on 1,345 accessions in combination with machine learning now offers an opportunity to systematically evaluate how well horticultural classes align with underlying genomic structure and to provide robust tools for the management of large germplasm collections.

Results: Using a panel of SNP markers across 1,345 rose accessions from the Europa Rosarium Sangerhausen, multiple unsupervised (hierarchical, spectral, k-means, DBSCAN, HDBSCAN) and supervised (SVM, decision tree, naive Bayes, XGBoost) machine learning approaches were applied to identify genetic clusters and predict horticultural classifications. Across clustering methods, certain groups consistently emerged as genetically distinct, such as the alba and damask roses, which clustered together with low internal diversity, reflecting their shared historic origin. In contrast, tea, bengal, lutea, and remontant hybrids were repeatedly grouped together and predicted with high classification accuracies (up to 100%) but displayed high within-group diversity, which is consistent with complex breeding backgrounds. Miniature, kordesii, and rubiginosa hybrids also tended to cluster together, despite their differing horticultural labels. Overall, the labels obtained from unsupervised clustering were consistently confirmed by supervised models, which achieved balanced accuracies of up to 100%, highlighting the robustness of the observed groupings.

Conclusions: Our results demonstrate that machine learning applied to SNP marker data can robustly resolve genetic relationships among rose cultivars and provide novel insights into the alignment of horticultural classifications with genomic structure. The high predictive accuracies obtained suggest that marker-based classification can serve as a reliable complementary tool for genebank management, cultivar identification, and reassessment of traditional rose categories.

背景:玫瑰品种的可靠分类由于其漫长而复杂的育种历史、频繁的杂交以及传统园艺类别与遗传异质性群体的共存而变得复杂。虽然诸如SSRs之类的分子标记集已被用于评估遗传关系,但对栽培玫瑰和物种的研究很少。在1345份材料中使用18个SNP标记与机器学习相结合,现在提供了一个系统评估园艺类与潜在基因组结构匹配程度的机会,并为大型种质资源收藏的管理提供了强大的工具。结果:利用来自Europa Rosarium Sangerhausen的1345个玫瑰品种的SNP标记,应用了多种无监督(分层、光谱、k-means、DBSCAN、HDBSCAN)和监督(SVM、决策树、朴素贝叶斯、XGBoost)机器学习方法来识别遗传聚类并预测园艺分类。在聚类方法中,某些群体始终表现出遗传上的独特性,例如白玫瑰和大马士革玫瑰,它们聚集在一起,内部多样性较低,反映了它们共同的历史起源。相比之下,茶、孟加拉、黄茶和远缘杂交种反复归类,预测准确率高达100%,但类群内多样性较高,这与复杂的育种背景相一致。微型,kordesii,和rubiginosa杂交也倾向于聚集在一起,尽管他们不同的园艺标签。总体而言,从无监督聚类中获得的标签一致地由监督模型确认,其平衡精度高达100%,突出了观察到的分组的鲁棒性。结论:我们的研究结果表明,将机器学习应用于SNP标记数据可以可靠地解决玫瑰品种之间的遗传关系,并为园艺分类与基因组结构的一致性提供新的见解。这表明基于标记的分类可以作为一个可靠的补充工具,用于基因库管理、品种鉴定和传统玫瑰品类的重新评估。
{"title":"Machine learning-based classification of roses using 18 SNP markers for optimized genebank management.","authors":"Laurine Patzer, Marcus Linde, Thomas Debener","doi":"10.1186/s13007-025-01496-0","DOIUrl":"10.1186/s13007-025-01496-0","url":null,"abstract":"<p><strong>Background: </strong>Reliable classification of rose cultivars is complicated by their long and complex breeding history, frequent hybridization, and the coexistence of traditional horticultural categories with genetically heterogeneous groups. While molecular marker sets such as SSRs have been applied to assess genetic relationships, studies across cultivated roses and species are rare. Using 18 SNP markers on 1,345 accessions in combination with machine learning now offers an opportunity to systematically evaluate how well horticultural classes align with underlying genomic structure and to provide robust tools for the management of large germplasm collections.</p><p><strong>Results: </strong>Using a panel of SNP markers across 1,345 rose accessions from the Europa Rosarium Sangerhausen, multiple unsupervised (hierarchical, spectral, k-means, DBSCAN, HDBSCAN) and supervised (SVM, decision tree, naive Bayes, XGBoost) machine learning approaches were applied to identify genetic clusters and predict horticultural classifications. Across clustering methods, certain groups consistently emerged as genetically distinct, such as the alba and damask roses, which clustered together with low internal diversity, reflecting their shared historic origin. In contrast, tea, bengal, lutea, and remontant hybrids were repeatedly grouped together and predicted with high classification accuracies (up to 100%) but displayed high within-group diversity, which is consistent with complex breeding backgrounds. Miniature, kordesii, and rubiginosa hybrids also tended to cluster together, despite their differing horticultural labels. Overall, the labels obtained from unsupervised clustering were consistently confirmed by supervised models, which achieved balanced accuracies of up to 100%, highlighting the robustness of the observed groupings.</p><p><strong>Conclusions: </strong>Our results demonstrate that machine learning applied to SNP marker data can robustly resolve genetic relationships among rose cultivars and provide novel insights into the alignment of horticultural classifications with genomic structure. The high predictive accuracies obtained suggest that marker-based classification can serve as a reliable complementary tool for genebank management, cultivar identification, and reassessment of traditional rose categories.</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":" ","pages":"8"},"PeriodicalIF":4.4,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12849561/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145912722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction: PVP-40 mediated enhancement of mesophyll protoplast yield and viability for transient gene expression in black huckleberry. 修正:PVP-40介导的黑越莓叶肉原生质体产量和瞬时基因表达活力的增强。
IF 4.4 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-05 DOI: 10.1186/s13007-025-01488-0
Sweety Majumder, Abir U Igamberdiev, Samir C Debnath
{"title":"Correction: PVP-40 mediated enhancement of mesophyll protoplast yield and viability for transient gene expression in black huckleberry.","authors":"Sweety Majumder, Abir U Igamberdiev, Samir C Debnath","doi":"10.1186/s13007-025-01488-0","DOIUrl":"10.1186/s13007-025-01488-0","url":null,"abstract":"","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":"22 1","pages":"1"},"PeriodicalIF":4.4,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12772042/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145906318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Protoplast isolation and transient gene expression in Suaeda aralocaspica, a halophyte with single-cell C4 anatomy. 单细胞C4解剖盐生植物盐田原生质体分离及瞬时基因表达。
IF 4.4 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-05 DOI: 10.1186/s13007-025-01497-z
Jing Cao, Xingxin Liao, Rui Yu, Bolaqiake Asihatibieke, YanXia Liu, Haiyan Lan
{"title":"Protoplast isolation and transient gene expression in Suaeda aralocaspica, a halophyte with single-cell C<sub>4</sub> anatomy.","authors":"Jing Cao, Xingxin Liao, Rui Yu, Bolaqiake Asihatibieke, YanXia Liu, Haiyan Lan","doi":"10.1186/s13007-025-01497-z","DOIUrl":"10.1186/s13007-025-01497-z","url":null,"abstract":"","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":" ","pages":"13"},"PeriodicalIF":4.4,"publicationDate":"2026-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12870416/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145906341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Profiling DNA-protein interactions in Meloidogyne incognita using dCas9-based affinity purification. 利用基于dcas9的亲和纯化分析了神秘旋律细胞dna -蛋白质的相互作用。
IF 4.4 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-04 DOI: 10.1186/s13007-025-01475-5
Caroline Bournaud, Alwéna Tollec, Etienne G J Danchin, Yohann Couté, Sebastian Eves-van den Akker

Background: The root-knot nematode Meloidogyne incognita, is a highly destructive parasite that manipulates host plant processes through effector proteins, affecting agriculture globally. Despite advances in genomic and transcriptomic studies, the regulatory mechanisms controlling effector gene expression, especially at the chromatin level, are still poorly understood. Gene regulation studies in plant-parasitic nematodes (PPN) face several challenges, including the absence of transformation systems and technical barriers in chromatin preparation, particularly for transcription factors (TFs) expressed in secretory gland cells. Conventional methods like Chromatin Immunoprecipitation (ChIP) are limited in PPN due to low chromatin yields, the impermeability of nematode cuticles, and difficulties in producing antibodies for low-abundance TFs. These issues call for alternative approaches, such as dCas9-based CAPTURE (CRISPR Affinity Purification in siTU of Regulatory Elements) that allows studying chromatin interactions by using a catalytically inactive dCas9 protein to target specific genomic loci without relying on antibodies.

Results: This study presents an optimized in vitro dCas9-based CAPTURE for second stage juvenile (J2) M. incognita that addresses key challenges in chromatin extraction and stability. The protocol focuses on the promoter region of the 6F06 effector gene, a critical gene for parasitism. Several optimizations were made, including improvements in nematode disruption, chromatin extraction, and protein-DNA complex stability. This method successfully isolated chromatin-protein complexes and identified four putative chromatin-associated proteins, including BANF1, linked to chromatin remodelling complexes like SWI/SNF.

Conclusion: The optimized in vitro dCas9-based CAPTURE protocol offers a new tool for investigating chromatin dynamics and regulatory proteins in non-transformable nematodes. This method expands the scope of effector gene regulation research and provides new insights into M. incognita parasitism. Future research will aim to validate these regulatory proteins and extend the method to other effector loci, potentially guiding the development of novel nematode control strategies.

背景:根结线虫(Meloidogyne incognita)是一种极具破坏性的寄生虫,通过效应蛋白操纵寄主植物的过程,影响全球农业。尽管基因组学和转录组学研究取得了进展,但控制效应基因表达的调控机制,特别是在染色质水平上,仍然知之甚少。植物寄生线虫(PPN)的基因调控研究面临着一些挑战,包括缺乏转化系统和染色质制备的技术障碍,特别是在分泌腺细胞中表达的转录因子(TFs)。染色质免疫沉淀(ChIP)等传统方法在PPN中受到限制,因为染色质产量低,线虫表皮的不渗透性,以及难以产生低丰度tf的抗体。这些问题需要替代方法,例如基于dCas9的CAPTURE (CRISPR亲和纯化原位调控元件),它允许通过使用催化活性不高的dCas9蛋白靶向特定基因组位点来研究染色质相互作用,而不依赖于抗体。结果:本研究提出了一种优化的基于dcas9的体外捕获第二阶段幼年(J2) M. incognita,解决了染色质提取和稳定性的关键挑战。该方案侧重于6F06效应基因的启动子区域,该基因是寄生的关键基因。进行了一些优化,包括线虫破坏,染色质提取和蛋白质- dna复合物稳定性的改进。该方法成功分离了染色质-蛋白复合物,并鉴定了四种可能的染色质相关蛋白,包括BANF1,它们与染色质重塑复合物如SWI/SNF相关。结论:优化后的基于dcas9的体外捕获方案为研究不可转化线虫的染色质动力学和调控蛋白提供了新的工具。该方法扩大了效应基因调控的研究范围,为隐殖夜蛾寄生提供了新的认识。未来的研究将旨在验证这些调节蛋白,并将该方法扩展到其他效应位点,从而有可能指导新的线虫控制策略的发展。
{"title":"Profiling DNA-protein interactions in Meloidogyne incognita using dCas9-based affinity purification.","authors":"Caroline Bournaud, Alwéna Tollec, Etienne G J Danchin, Yohann Couté, Sebastian Eves-van den Akker","doi":"10.1186/s13007-025-01475-5","DOIUrl":"https://doi.org/10.1186/s13007-025-01475-5","url":null,"abstract":"<p><strong>Background: </strong>The root-knot nematode Meloidogyne incognita, is a highly destructive parasite that manipulates host plant processes through effector proteins, affecting agriculture globally. Despite advances in genomic and transcriptomic studies, the regulatory mechanisms controlling effector gene expression, especially at the chromatin level, are still poorly understood. Gene regulation studies in plant-parasitic nematodes (PPN) face several challenges, including the absence of transformation systems and technical barriers in chromatin preparation, particularly for transcription factors (TFs) expressed in secretory gland cells. Conventional methods like Chromatin Immunoprecipitation (ChIP) are limited in PPN due to low chromatin yields, the impermeability of nematode cuticles, and difficulties in producing antibodies for low-abundance TFs. These issues call for alternative approaches, such as dCas9-based CAPTURE (CRISPR Affinity Purification in siTU of Regulatory Elements) that allows studying chromatin interactions by using a catalytically inactive dCas9 protein to target specific genomic loci without relying on antibodies.</p><p><strong>Results: </strong>This study presents an optimized in vitro dCas9-based CAPTURE for second stage juvenile (J2) M. incognita that addresses key challenges in chromatin extraction and stability. The protocol focuses on the promoter region of the 6F06 effector gene, a critical gene for parasitism. Several optimizations were made, including improvements in nematode disruption, chromatin extraction, and protein-DNA complex stability. This method successfully isolated chromatin-protein complexes and identified four putative chromatin-associated proteins, including BANF1, linked to chromatin remodelling complexes like SWI/SNF.</p><p><strong>Conclusion: </strong>The optimized in vitro dCas9-based CAPTURE protocol offers a new tool for investigating chromatin dynamics and regulatory proteins in non-transformable nematodes. This method expands the scope of effector gene regulation research and provides new insights into M. incognita parasitism. Future research will aim to validate these regulatory proteins and extend the method to other effector loci, potentially guiding the development of novel nematode control strategies.</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":" ","pages":""},"PeriodicalIF":4.4,"publicationDate":"2026-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145900870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrated experimental and computational workflows for single-cell transcriptomics in plants. 植物单细胞转录组学的综合实验和计算工作流程。
IF 4.4 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2026-01-03 DOI: 10.1186/s13007-025-01490-6
Jing Wang, Shanqiao Zheng, Bojie Lu, Yuan Jiang, Yabing Zhu, Qun Liu, Song Gao, Peng Liu, Peng Yu, Sanjie Jiang, Liang Zong

Background: Single-cell transcriptomics is a powerful approach to resolve cellular heterogeneity, yet its application in plants is constrained by challenges in tissue preparation, nuclei isolation, and transcriptome quality. Optimized experimental and computational workflows are essential to achieve robust results in plant systems.

Results: We systematically benchmarked bulk and single-cell transcriptomic workflows in maize and established an integrated, optimized framework. First, we developed an improved bulk RNA-seq protocol, providing higher consistency and serving as a reference for single-cell datasets. Second, we compared three input types, protoplasts, fresh nuclei, and frozen nuclei, across tissues, demonstrating overall comparability of their transcriptomic profiles and offering guidance for studies with limited material. Third, by leveraging bulk RNA-seq as a reference, these complementary data provide additional biological context that helps to interpret and validate findings derived from single-cell transcriptomic analyses. A combination of these strategies resulted in high transcriptome integrity and clear clustering resolution in the final dataset, supporting robust identification of plant cell types. While all experimental data are derived from maize, the principles and strategies described here provide practical guidance and inspiration for single-cell studies in other plant species.

Conclusions: Our study establishes optimized experimental and computational workflows for plant single-cell transcriptomics. By validating input comparability and addressing the limitations of nuclear data, we provide methodological guidance that extends beyond maize and supports future single-cell investigations across diverse plant species.

背景:单细胞转录组学是解决细胞异质性的有力方法,但其在植物中的应用受到组织制备、细胞核分离和转录组质量等方面的挑战。优化的实验和计算工作流程对于在植物系统中获得稳健的结果至关重要。结果:我们系统地对玉米的批量和单细胞转录组工作流程进行了基准测试,并建立了一个集成的、优化的框架。首先,我们开发了一种改进的批量RNA-seq协议,提供更高的一致性,并作为单细胞数据集的参考。其次,我们比较了三种输入类型,原生质体,新鲜细胞核和冷冻细胞核,跨组织,展示了它们转录组谱的总体可比性,并为有限材料的研究提供了指导。第三,通过利用大量RNA-seq作为参考,这些补充数据提供了额外的生物学背景,有助于解释和验证单细胞转录组学分析得出的结果。这些策略的结合在最终数据集中产生了高转录组完整性和清晰的聚类分辨率,支持对植物细胞类型的可靠鉴定。虽然所有的实验数据都来自玉米,但本文所描述的原理和策略为其他植物物种的单细胞研究提供了实用的指导和启发。结论:本研究建立了植物单细胞转录组学的优化实验和计算工作流程。通过验证输入的可比性和解决核数据的局限性,我们提供了超越玉米的方法指导,并支持未来跨多种植物物种的单细胞研究。
{"title":"Integrated experimental and computational workflows for single-cell transcriptomics in plants.","authors":"Jing Wang, Shanqiao Zheng, Bojie Lu, Yuan Jiang, Yabing Zhu, Qun Liu, Song Gao, Peng Liu, Peng Yu, Sanjie Jiang, Liang Zong","doi":"10.1186/s13007-025-01490-6","DOIUrl":"10.1186/s13007-025-01490-6","url":null,"abstract":"<p><strong>Background: </strong>Single-cell transcriptomics is a powerful approach to resolve cellular heterogeneity, yet its application in plants is constrained by challenges in tissue preparation, nuclei isolation, and transcriptome quality. Optimized experimental and computational workflows are essential to achieve robust results in plant systems.</p><p><strong>Results: </strong>We systematically benchmarked bulk and single-cell transcriptomic workflows in maize and established an integrated, optimized framework. First, we developed an improved bulk RNA-seq protocol, providing higher consistency and serving as a reference for single-cell datasets. Second, we compared three input types, protoplasts, fresh nuclei, and frozen nuclei, across tissues, demonstrating overall comparability of their transcriptomic profiles and offering guidance for studies with limited material. Third, by leveraging bulk RNA-seq as a reference, these complementary data provide additional biological context that helps to interpret and validate findings derived from single-cell transcriptomic analyses. A combination of these strategies resulted in high transcriptome integrity and clear clustering resolution in the final dataset, supporting robust identification of plant cell types. While all experimental data are derived from maize, the principles and strategies described here provide practical guidance and inspiration for single-cell studies in other plant species.</p><p><strong>Conclusions: </strong>Our study establishes optimized experimental and computational workflows for plant single-cell transcriptomics. By validating input comparability and addressing the limitations of nuclear data, we provide methodological guidance that extends beyond maize and supports future single-cell investigations across diverse plant species.</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":" ","pages":"12"},"PeriodicalIF":4.4,"publicationDate":"2026-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12866480/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145896653","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A lightweight hybrid transformer approach for hyperspectral imaging-based drought tolerance evaluation in tea plants. 基于高光谱成像的茶树耐旱性评价的轻型混合变压器方法。
IF 4.4 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Pub Date : 2025-12-31 DOI: 10.1186/s13007-025-01487-1
Yuchen Li, Yi Zhang, Yu Wang, Hao Chen, Xiao Han, Yilin Mao, Litao Sun, Jiazhi Shen, Zhaotang Ding

Background: In Shandong Province of China, where annual precipitation is below 800 mm, tea plants face persistent drought stress exacerbated by global warming. Breeding drought-tolerant tea cultivars is one of the effective ways to cope with this challenge. However, traditional breeding approaches are still limited by prolonged cycles, low efficiency, and subjective evaluation. To overcome these limitations, the development of rapid and objective germplasm evaluation methods has become critical‌.

Results: In this study, hyperspectral images of leaves from 12 widely cultivated 'Lucha series' tea cultivars in Shandong Province during different drought periods were collected, and the drought-related physiological indicators were measured simultaneously. Then, a tea drought tolerance index (TDTI) with enhanced accuracy was established by integrating the rate of change of indicators with temporal weights and indicator weights. Subsequently, we developed a novel lightweight Transformer-based hybrid integrated architecture to establish prediction models for the physiological indicators and TDTI. The Transformer-based models synergistically combined a Transformer encoder with XGBoost and LightGBM within a lightweight framework that leverages ensemble learning, data augmentation, and regularization to ensure robustness on limited datasets. Finally, we compared the performance of Transformer-based models against traditional machine learning models.​ The optimal models for MRP, MDA, Pro, SS, ChlT and TDTI were identified as 1D-CARS-TF, 2D-UVE-SVM, 2D-UVE-BRR, 2D-CARS-SVM, 1D-UVE-TF-CNN, and 2D-UVE-TF, respectively, achieving determination coefficient (R²) of 0.8992, 0.8307, 0.8929, 0.8373, 0.7894, and 0.7614, on an independent test set.​ The results demonstrated that the lightweight Transformer-based models equipped with multi-head self-attention mechanism exhibited outstanding capabilities in processing indicators requiring multi-band correlation mining. ​​Simultaneously, feature selection algorithms and overfitting-mitigation optimization strategies played a critical role in enhancing both the accuracy and stability of the Transformer-based models.​.

Conclusions: This study established a robust technical foundation for rapid, accurate, and non-destructive comprehensive evaluation of drought tolerance for tea plant germplasm resources. However, it should be noted that they were based on a specific set of greenhouse-cultivated samples, and further validation under field conditions with expanded germplasm resources would strengthen generalizability. Anyway, the demonstrated potential of the Transformer-based model in our study advances phenomics of tea plants toward greater intelligence and efficiency.

背景:中国山东省年降水量低于800毫米,全球变暖加剧了茶树的持续干旱胁迫。培育耐旱茶叶品种是应对这一挑战的有效途径之一。然而,传统的育种方法仍然存在周期长、效率低、评价主观等缺点。为了克服这些限制,开发快速、客观的种质资源评价方法已成为关键。结果:本研究采集了山东省12个广泛栽培的“茶系”茶品种在不同干旱时期的叶片高光谱图像,并对其干旱相关生理指标进行了测定。然后,将指标变化率与时间权重和指标权重相结合,建立了准确度较高的茶叶耐旱性指数。随后,我们开发了一种新型的基于轻型变压器的混合集成架构,以建立生理指标和TDTI的预测模型。基于Transformer的模型将Transformer编码器与XGBoost和LightGBM协同结合在一个轻量级框架内,利用集成学习、数据增强和正则化来确保有限数据集的鲁棒性。最后,我们比较了基于transformer的模型与传统机器学习模型的性能。MRP、MDA、Pro、SS、ChlT和TDTI的最优模型分别为1D-CARS-TF、2D-UVE-SVM、2D-UVE-BRR、2D-CARS-SVM、1D-UVE-TF-CNN和2D-UVE-TF,在独立测试集上的决定系数(R²)分别为0.8992、0.8307、0.8929、0.8373、0.7894和0.7614。结果表明,采用多头自关注机制的轻量化变压器模型在处理需要多波段关联挖掘的指标方面表现出较强的能力。同时,特征选择算法和缓解过拟合优化策略对提高基于变压器的模型的精度和稳定性起着至关重要的作用。结论:本研究为茶树种质资源抗旱性的快速、准确、无损综合评价奠定了坚实的技术基础。但值得注意的是,这些结论是基于一组特定的温室栽培样品,在扩大种质资源的条件下进行进一步的田间验证将加强其普遍性。无论如何,在我们的研究中,基于变形金刚的模型所展示的潜力将茶树的表型组学推向更高的智能和效率。
{"title":"A lightweight hybrid transformer approach for hyperspectral imaging-based drought tolerance evaluation in tea plants.","authors":"Yuchen Li, Yi Zhang, Yu Wang, Hao Chen, Xiao Han, Yilin Mao, Litao Sun, Jiazhi Shen, Zhaotang Ding","doi":"10.1186/s13007-025-01487-1","DOIUrl":"10.1186/s13007-025-01487-1","url":null,"abstract":"<p><strong>Background: </strong>In Shandong Province of China, where annual precipitation is below 800 mm, tea plants face persistent drought stress exacerbated by global warming. Breeding drought-tolerant tea cultivars is one of the effective ways to cope with this challenge. However, traditional breeding approaches are still limited by prolonged cycles, low efficiency, and subjective evaluation. To overcome these limitations, the development of rapid and objective germplasm evaluation methods has become critical‌.</p><p><strong>Results: </strong>In this study, hyperspectral images of leaves from 12 widely cultivated 'Lucha series' tea cultivars in Shandong Province during different drought periods were collected, and the drought-related physiological indicators were measured simultaneously. Then, a tea drought tolerance index (TDTI) with enhanced accuracy was established by integrating the rate of change of indicators with temporal weights and indicator weights. Subsequently, we developed a novel lightweight Transformer-based hybrid integrated architecture to establish prediction models for the physiological indicators and TDTI. The Transformer-based models synergistically combined a Transformer encoder with XGBoost and LightGBM within a lightweight framework that leverages ensemble learning, data augmentation, and regularization to ensure robustness on limited datasets. Finally, we compared the performance of Transformer-based models against traditional machine learning models.​ The optimal models for MRP, MDA, Pro, SS, ChlT and TDTI were identified as 1D-CARS-TF, 2D-UVE-SVM, 2D-UVE-BRR, 2D-CARS-SVM, 1D-UVE-TF-CNN, and 2D-UVE-TF, respectively, achieving determination coefficient (R²) of 0.8992, 0.8307, 0.8929, 0.8373, 0.7894, and 0.7614, on an independent test set.​ The results demonstrated that the lightweight Transformer-based models equipped with multi-head self-attention mechanism exhibited outstanding capabilities in processing indicators requiring multi-band correlation mining. ​​Simultaneously, feature selection algorithms and overfitting-mitigation optimization strategies played a critical role in enhancing both the accuracy and stability of the Transformer-based models.​.</p><p><strong>Conclusions: </strong>This study established a robust technical foundation for rapid, accurate, and non-destructive comprehensive evaluation of drought tolerance for tea plant germplasm resources. However, it should be noted that they were based on a specific set of greenhouse-cultivated samples, and further validation under field conditions with expanded germplasm resources would strengthen generalizability. Anyway, the demonstrated potential of the Transformer-based model in our study advances phenomics of tea plants toward greater intelligence and efficiency.</p>","PeriodicalId":20100,"journal":{"name":"Plant Methods","volume":" ","pages":"11"},"PeriodicalIF":4.4,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12866587/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145878833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Plant Methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1