Journal of Integrative Agriculture ›› 2025, Vol. 24 ›› Issue (3): 1140-1151. DOI: 10.1016/j.jia.2023.11.026

TEGR: A comprehensive Ericaceae genome resource database

Xiaojing Wang1*, Yunfeng Wei1*, Zhuo Liu2*, Tong Yu2*, Yanhong Fu2, Xiaoming Song2#    

  1 Institute of Agro-Bioengineering/The Key Laboratory of Plant Resources Conservation and Germplasm Innovation in the Mountainous Region (Ministry of Education)/College of Life Sciences, Guizhou University, Guiyang 550025, China

  2 School of Life Sciences, North China University of Science and Technology, Tangshan 063210, China

  • Received: 2023-09-24 Accepted: 2023-10-13 Online: 2025-03-20 Published: 2025-02-28
  • About author: #Correspondence: Xiaoming Song, Tel/Fax: +86-315-8805607, E-mail: songxm@ncst.edu.cn. *These authors contributed equally to this study.
  • Supported by:
    This work was supported by the National Natural Science Foundation of China (32260097) and the National Guidance of Local Science and Technology Development Fund of China ([2023]009).

Abstract:

Ericaceae is a diverse family of flowering plants distributed nearly worldwide, comprising 126 genera and more than 4,000 species. In the present study, we developed The Ericaceae Genome Resource (TEGR, http://www.tegr.com.cn), a comprehensive, user-friendly, web-based functional genomics database built on the published genomes of 16 Ericaceae species. The TEGR database contains information on many important functional genes, including 763 auxin genes, 2,407 flowering genes, 20,432 resistance genes, 617 anthocyanin-related genes, and 470 N6-methyladenosine (m6A) modification genes. We also identified a total of 599,174 specific CRISPR guide sequences, which are included in the TEGR database. Gene duplication events, synteny, and orthology were analyzed for the 16 Ericaceae species in the TEGR database. The TEGR database contains 614,821 functional genes annotated through the GO, Nr, Pfam, TrEMBL, and Swiss-Prot databases. It provides the Primer Design, Hmmsearch, Synteny, BLAST, and JBrowse tools to help users perform comprehensive comparative genome analyses. All the high-quality reference genome sequences, genomic features, gene annotations, and bioinformatics results can be downloaded from the TEGR database. In the future, we will continue to update the TEGR database with new genome data sets as they become available, providing a useful resource that facilitates comparative genomic studies.


Key words: TEGR, Ericaceae, gene functional annotation, m6A, CRISPR, bioinformatic tools
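
The abstract reports 599,174 specific CRISPR guide sequences but does not describe how they were identified. As a rough illustration of what guide-candidate discovery can look like, the following minimal Python sketch scans both strands of a DNA sequence for 20-nt protospacers followed by an NGG PAM. The SpCas9-style NGG PAM, the 20-nt guide length, the function names, and the exact-duplicate filter are all assumptions made for this example; they are not taken from the TEGR pipeline.

```python
import re
from collections import Counter

# Assumption (not from the paper): SpCas9-style sites, a 20-nt protospacer + NGG PAM.
GUIDE_LEN = 20
_COMPLEMENT = str.maketrans("ACGT", "TGCA")
# A lookahead is used so that overlapping candidate sites are all reported.
_SITE = re.compile(r"(?=([ACGT]{%d})([ACGT]GG))" % GUIDE_LEN)


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_guide_candidates(seq: str):
    """List (strand, start, protospacer, pam) for every candidate site.

    `start` is 0-based and measured on the strand that was scanned
    (the reverse complement for strand "-").
    """
    seq = seq.upper()
    hits = []
    for strand, strand_seq in (("+", seq), ("-", reverse_complement(seq))):
        for match in _SITE.finditer(strand_seq):
            hits.append((strand, match.start(), match.group(1), match.group(2)))
    return hits


def unique_guides(hits):
    """Keep only candidates whose protospacer occurs exactly once in `hits`.

    This is a crude stand-in for specificity: real off-target screening
    also considers near matches with mismatches, which is not done here.
    """
    counts = Counter(protospacer for _, _, protospacer, _ in hits)
    return [hit for hit in hits if counts[hit[2]] == 1]
```

For example, `find_guide_candidates("A" * 20 + "TGG")` yields a single forward-strand candidate starting at position 0. A genome-scale analysis would typically rely on a dedicated guide-design tool with mismatch-aware off-target scoring rather than a regular expression.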