中国农业科学 ›› 2023, Vol. 56 ›› Issue (22): 4490-4505.doi: 10.3864/j.issn.0578-1752.2023.22.011

• 园艺 • 上一篇    下一篇

基于转录组数据的芦笋EST-SSR标记的开发及通用性分析

仪泽会(), 赵婧, 毛丽萍   

  1. 山西农业大学园艺学院/蔬菜花卉种质资源创新与利用山西省重点实验室,太原 030031
  • 收稿日期:2023-04-10 接受日期:2023-08-04 出版日期:2023-11-16 发布日期:2023-11-17
  • 通信作者: 仪泽会,E-mail:yizehui2008bj@163.com
  • 基金资助:
    山西省高等学校科技创新项目(2022L104); 山西农业大学农业科技创新研究课题(YCX2020YQ07); 山西农业大学园艺学院“双一流”学科建设科研补助项目

Development and Transferability of EST-SSR Markers Based on Transcriptome Data from Asparagus officinalis

YI ZeHui(), ZHAO Jing, MAO LiPing   

  1. College of Agriculture, Shanxi Agricultural University/Shanxi Provincial Key Laboratory of Innovation and Utilization of Vegetable and Flower Germplasm Resources, Taiyuan 030031
  • Received:2023-04-10 Accepted:2023-08-04 Published:2023-11-16 Online:2023-11-17

摘要:

【目的】 明确芦笋转录组中SSR位点的分布规律,开发高信息量的EST-SSR标记并分析通用性,为天门冬属植物系统进化分析、功能基因挖掘及分子标记辅助育种提供工具。【方法】以笔者课题组前期获得的15个芦笋根系RNA-seq数据为基础,采用MISA软件检索SSR位点、Primer 3.0软件批量设计引物及TB-tools软件批量e-PCR与Primer-blast程序逐一e-PCR相结合的方法剔除非有效引物。通过与基因组序列比对,获取标记所属基因ID、物理位置、潜在功能等信息。随机合成引物50对,以9份芦笋、7份天门冬、5份文竹和3份蓬莱松种质为材料,检测引物有效性、多态性和通用性。【结果】共检索出SSR位点36 590个,分布于30 229条Unigene,发生频率为4.78%,平均间距为9.17 kb。SSR位点非均匀分布于10条染色体,7号染色体数量最多(4 642个),3号染色体密度最高(37.86 SSR/Mb)。SRR位点从单核苷酸至六核苷酸均有分布,以三核苷酸(46.92%)类型最丰富,AG/CT(16.58%)基序最具优势。成功设计EST-SSR引物19 695对,经e-PCR检测共剔除非有效引物15 147对,其中,3 085对无扩增产物、10 102对扩增产物条带大小与预期严重不符、1 289对物理位置未知、402对存在与目的条带大小相似的其他扩增产物、269对扩增产物无SSR序列。以位于基因区域的2 517个EST-SSR标记为基础,构建染色体密度分布图,总覆盖长度1 125.51 Mb,平均图距447.16 kb,潜在功能涉及产量、品质及抗逆性等多方面。随机合成的50对引物均可扩增出清晰目标条带,其中36对具有多态性,平均多态性信息含量为0.330,在天门冬、文竹和蓬莱松中的通用性分别达到100.00%、92.00%和88.00%。基于EST-SSR鉴定结果的聚类分析,可将24份材料分为芦笋、天门冬、文竹和蓬莱松4类,与植物学分类完全一致。【结论】成功开发出高信息量的芦笋EST-SSR标记2 517个,有效扩增率100%,总覆盖长度1 125.51 Mb,平均图距447.16 kb,可用于芦笋及其近缘物种的系统进化分析。同时,可为其他物种的EST-SSR标记开发提供借鉴。

关键词: 转录组, 芦笋, EST-SSR, 染色体密度分布图, 功能注释

Abstract:

【Objective】The aim of this study was to clarify the distribution law of SSR loci across the transcriptome of Asparagus officinalis, to develop highly informative EST-SSR markers and to analyze their transferability, so as to provide the tools for phylogenetic analysis, functional gene mining and molecular marker-assisted breeding of asparagus plants.【Method】Based on the RNA-seq data of 15 asparagus roots obtained from the previous stage by our research group, MISA software was used to retrieve SSR loci, and Primer 3.0 software was employed to design primers in batches. Then, the ineffective primers were eliminated by performing batch e-PCR with TB-tools software and one-to-one e-PCR with the Primer-blast programme. The information of EST-SSR markers (such as gene id, physical location, and potential function) was obtained by comparison with the genome of asparagus officinalis. The DNA of 9 A. officinalis varieties, 7 A. setaceus varieties, 5 A. cochinchinensis varieties, and 3 A. umbellatus varieties were used as templates, and 50 pairs of randomly synthesized primers were used as primers to detect the effectiveness, polymorphism and transferability of the primers developed.【Result】A total of 36 590 simple sequence repeats (SSRs) loci distributed in 30 229 unigenes with a frequency of 4.78% and an average distance of 9.17 kb were identified based on data from 15 root transcriptomes of A. officinalis. The SSRs were unevenly distributed in the 10 chromosomes, with the highest number in chromosome 7 (4 642) and the highest density in chromosome 3 (37.86 SSRs/Mb). The SRRs were distributed from di- to hexa-, with tri- (46.92%) and AG/CT (16.58%) as the most abundant repeat type and predominant repeat motif, respectively. A total of 19 695 pairs of EST-SSR primers were successfully designed, and 15 147 pairs ineffective primers were eliminated by e-PCR. Among them, 3 085 pairs ineffective primers didn’t produce any amplification products, 10 102 pairs produced severely inconsistent amplification products in terms of fragment size, 1 289 pairs had unknown physical positions in the genome, 402 pairs gave other amplification products of similar size to the target fragments, and 269 pairs generated amplification products without SSRs. Based on 2 517 EST-SSR markers located in the gene region developed in this study, the chromosome density distribution map was constructed, with a total coverage length of 1 125.51 Mb and an average distance of 447.16 kb. The potential functions of these markers were involved in many aspects, such as yield, quality, stress resistance, and so on. All 50 pairs of randomly synthesized primers could amplify target bands clearly, of which 36 pairs were polymorphic, and the average polymorphic information content was 0.330. These markers could be used in three other species of Asparagus: the transferability to A. cochinchinensis, A. setaceus, and A. umbellatus were 100%, 92%, and 88%, respectively. Cluster analysis based on the EST-SSR alleles grouped the 24 accessions into four clusters that corresponded to the species of A. officinalis, A. setaceus, A. cochinchinensis, and A. umbellatus.【Conclusion】In this study, 2 517 highly informative EST-SSR markers of asparagus were successfully developed, and the effective amplification rate was 100%. The total coverage length of the physical map was 1 125.51 Mb, and the average distance was 447.16 kb, which could be used for phylogenetic analysis of asparagus and related species. Moreover, it provided a reference for the development of EST-SSR markers in other species.

Key words: transcriptome, asparagus, EST-SSR, chromosome density distribution map, functional annotation