中国农业科学

• • 上一篇    

最新录用:桃果实单果重及可溶性固形物含量的全基因组选择分析

曹珂,陈昌文,杨选文,别航灵,王力荣   

  1. 中国农业科学院郑州果树研究所,郑州 450009
  • 发布日期:2022-10-12

Genomic Selection for Fruit Weight and Soluble Solid Contents in Fruit of Peach

CAO Ke, CHEN ChangWen, YANG XuanWen, BIE HangLing, WANG LiRong   

  1. Zhengzhou Fruit Research Institute, Chinese Academy of Agricultural Sciences, Zhengzhou 450009, Henan
  • Online:2022-10-12

摘要: 【目的】桃单果重和可溶性固形物含量(SSC)是育种家关注的两个重要的数量性状,受到多个微效基因的控制,难以通过单个标记进行早期筛选。全基因组选择作为一种新颖的、数量性状早期预测工具,在果树上已经有了初步应用,但其在桃上的应用效果以及影响预测准确性的因素仍需要深入探讨。【方法】为建立桃单果重和SSC的全基因组选择技术体系,本研究以520株训练自然群体为试材,通过重测序筛选出的48 398SNPs进行分型,在11个全基因组预测模型中分别筛选出两个数量性状适宜的模型,进而在56株自然群体和1145株杂交群体上进行应用。【结果】三类群体的平均测序数据量在1.953.52 Gb,测序深度为5.2910.79×。训练自然群体经与参考基因组比对,共得到5 065 726SNPs,去除缺失率较高>20%、最小等位基因频率过低<0.05的位点后,随机挑选基因组上48 398SNPs用于训练群体的全基因组选择模型构建。单果重预测精度最高的模型是BayesASSC预测精度最高的模型为randomforest。分别利用两个数量性状最适的模型进行预测,发现在自然群体中,单果重的预测精度为0.47670.6141,高于SSC0.32200.4329;而在杂交群体中,单果重的预测精度为0.23190.4870,同样高于SSC0.02000.2793;该结果也表明利用训练自然群体构建的预测模型在预测自然群体上应用的精度高于杂交群体。本研究进而以单果重为例,发现当育种目标是大果时,全基因组选择仅需保留17.78%的单株,效率明显高于单标记和双标记筛选。研究同时探讨了群体离散程度、遗传力和群体结构等对预测精度的影响,发现预测精度可能受到上述因子的综合影响。【结论】本研究筛选了桃果实单果重和SSC适宜全基因组选择模型,表明该方法的选择效率明显高于单标记筛选,研究结果为两个数量性状的高效分子辅助育种奠定理论和技术支撑。


关键词: 桃, 单果重, 可溶性固形物含量, 全基因组选择, 早期预测

Abstract: 【ObjectiveFruit weight and soluble solid content (SSC) are two important quantitative traits in peach which are concerned by breeders and controlled by multiple minor genes. Therefore, it is difficult to perform early prediction by a single marker. As a novel genome-wide tool, genomic selection has been applied in fruit crops and expected to enhance breeding efficiency of those quantitative traits. However, its application effect in peach and influencing factors still need to be further explored.MethodThe objectives of this study were to assess the accuracy of prediction in nature and cross populations of peach for fruit weight and soluble solid content (SSC) by using genomic selection. In this study, the training population comprised 520 individuals were selected as materials. Using the genotypic data for 48,398 SNPs obtained from the resequencing results of above training population, a total of 11 genome-wide prediction models were built to select the optimum model for fruit weight and SSC. Then, we calculated the genomic breeding values of a small panel of nature population comprised 56 individuals and 29 cross populations comprising 1145 seedlings.ResultThe average sequencing data of each variety of the three groups was 1.95~3.52 Gb, and the sequencing depth was 5.29~10.79×. The sequencing data of the training natural population was algned with the reference genome, a total of 5,065,726 single nucleotide polymorphism (SNPs) were obtained. After removing the SNPs with high missing rate more than 20% and minor allele frequency lower than 0.05, a total of 48,398 SNPs on the genome were randomly selected for constructing whole-genome selection models for the training population. The model with the highest prediction accuracy for fruit weight is BayesA, and the model with the highest prediction accuracy for SSC is randomforest. Using the above two models, we found that the goodness of fit between the predicted breeding values and observed phenotype of fruit weight was 0.4767~0.6141, higher than that of SSC (0.3220~0.4329) in nature populations. And in cross populations, the prediction accuracy of fruit weight was 0.2319~0.4870, also showing higher than that of SSC (0.0200~0.2793).. The results also showed that the prediction model constructed by training natural populations was more accurate in predicting natural populations but of cross populations. Taking fruit weight as an example, we also found that only 17.78% of the seedlings need to be retained by genomic selection when targeted large fruit. Its efficiency is significantly higher than that of single and double marker selection. Finally, the effects of population dispersion, heritability and population structure on prediction accuracy also were discussed. The results indicated that prediction accuracy may be mixed and affected by a combination of several factors.ConclusionIn this study, a suitable genomic selection model for peach fruit weight and SSC was screened and confirmed that the prediction efficiency of genomic selection was significantly higher than that of single marker selection. The results underline the potential of genomic prediction to accelerate breeding progress of these two quantitative traits in peach.


Key words: Peach, fruit weight, soluble solid contents, genomic selection, early prediction