Scientia Agricultura Sinica, 2023, Vol. 56, Issue (23): 4565-4584. doi: 10.3864/j.issn.0578-1752.2023.23.002

• Special Topic: Cotton Fiber Development •

Identification and Expression of Long Non-Coding RNAs Associated with Fuzz Fiber Development in Gossypium arboreum

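WANG XiaoYang1, PENG Zhen1,2, XING AiShuang1, ZHAO YingRui1, MA XinLi1, LIU Fang1,2, DU XiongMing1,2, HE ShouPu1,2

  1 Institute of Cotton Research, Chinese Academy of Agricultural Sciences/National Key Laboratory of Cotton Biological Breeding and Comprehensive Utilization, Anyang 455000, Henan
  2 School of Agricultural Sciences, Zhengzhou University/Zhengzhou Research Base, National Key Laboratory of Cotton Biological Breeding and Comprehensive Utilization, Zhengzhou 450001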
  • Received: 2022-12-30 Accepted: 2023-03-16 Published: 2023-12-04 Online: 2023-12-04
  • Corresponding authors: HE ShouPu, DU XiongMing, LIU Fang
  • Contact: WANG XiaoYang, E-mail: wangxiaoyang198806@126.com
  • Funding: National Natural Science Foundation of China (32201875)

Identification and Expression Analysis of Fuzz Fiber Development Related Long Noncoding RNAs in Gossypium arboreum

WANG XiaoYang1, PENG Zhen1,2, XING AiShuang1, ZHAO YingRui1, MA XinLi1, LIU Fang1,2, DU XiongMing1,2, HE ShouPu1,2

  1 Institute of Cotton Research of Chinese Academy of Agricultural Sciences/National Key Laboratory of Cotton Biological Breeding and Comprehensive Utilization, Anyang 455000, Henan
  2 School of Agricultural Sciences, Zhengzhou University/Zhengzhou Research Base, National Key Laboratory of Cotton Biological Breeding and Comprehensive Utilization, Zhengzhou 450001
  • Received:2022-12-30 Accepted:2023-03-16 Published:2023-12-04 Online:2023-12-04

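Abstract (Chinese version):

【Objective】Long non-coding RNAs (lncRNAs) are RNAs longer than 200 nt that lack protein-coding capacity but participate in the regulation of many important biological processes. By analyzing transcriptome data from the early stages of fiber development in a fuzzless mutant (GA0149) and the wild type (GA0146) of Gossypium arboreum, this study aimed to identify lncRNAs that regulate fuzz fiber development and to clarify their regulatory network, laying a foundation for further dissection of the mechanism of cotton fiber development. 【Method】Ovules and fibers of GA0146 and GA0149 collected on the day of anthesis (0 DPA) and at 3, 5 and 8 days post anthesis (3 DPA, 5 DPA and 8 DPA) were used for transcriptome sequencing. lncRNAs were identified and their target genes predicted; differential expression analysis of mRNAs and lncRNAs was used to compare the two lines at different stages of fiber development. KOBAS software was then used to perform enrichment analysis on the target genes of the differentially expressed lncRNAs and to predict the biological processes in which they are involved. Finally, the transcriptome data for 25 differentially expressed lncRNAs were validated by real-time quantitative PCR (RT-qPCR). 【Result】A total of 15 339 lncRNAs were identified, comprising 11 595 intergenic lncRNAs, 2 428 antisense lncRNAs, 350 intronic lncRNAs and 966 sense lncRNAs. There were 1 932 differentially expressed lncRNAs (DE-lncRNAs); of their 8 134 target genes, 788 were differentially expressed genes (DE-mRNAs). KEGG pathway enrichment analysis showed that the DE-mRNAs were mainly involved in plant hormone signal transduction and protein processing in endoplasmic reticulum. Co-expression regulatory network analysis showed that a markedly differentially expressed lncRNA (MSTRG.454250.3) had the same expression trend as its target genes and was specifically expressed only in ovules of the wild type (GA0146) at the early stage of fuzz fiber development, whereas lncRNA MSTRG.454261.4 showed an expression trend opposite to that of its target genes, with significantly higher expression in the mutant (GA0149) than in the wild type. The RT-qPCR results confirmed the reliability of the transcriptome data. 【Conclusion】Twenty-six lncRNAs associated with fuzz fiber development in G. arboreum were identified; they affect fuzz fiber development by regulating the expression of the indole-3-acetic acid-amido synthetase gene (Ga03G2421) and the auxin-responsive protein gene (Ga05G1344), both of which belong to the plant hormone signal transduction pathway.

Key words (Chinese version): Asian cotton, fuzz fiber mutant, long non-coding RNA, regulatory network, fluorescence quantitative PCR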

Abstract:

【Objective】Long non-coding RNAs (lncRNAs) are RNA molecules longer than 200 nt that have no protein-coding capacity but are involved in various biological regulatory processes. In this study, we analyzed RNA-sequencing data from two Gossypium arboreum isogenic lines, a fuzzless mutant (GA0149) and its wild type (GA0146), to identify lncRNAs involved in early fuzz fiber development and to provide a foundation for investigating the mechanism of fiber development. 【Method】Ovules at 0, 3 and 5 days post anthesis (DPA) and ovules with fibers at 8 DPA were collected from the G. arboreum fuzzless mutant GA0149 and its isogenic line GA0146, which has normal fuzz and lint fibers, and were used for RNA-seq to identify lncRNAs and predict their target genes. Differentially expressed mRNAs (DE-mRNAs) and lncRNAs (DE-lncRNAs) between the samples were identified. KOBAS software was used to predict the KEGG pathways enriched among the targets of the DE-lncRNAs. To validate the high-throughput sequencing data, 25 DE-lncRNAs were selected for RT-qPCR detection. 【Result】We identified 15 339 lncRNA transcripts, of which 11 595 were located in intergenic regions, 2 428 were classified as antisense lncRNAs, 350 as intronic lncRNAs and 966 as sense lncRNAs. Compared with mRNAs, lncRNAs in Asian cotton had shorter exons and lower GC content. Most lncRNAs had cis-regulatory effects on their neighboring mRNAs. We identified 1 932 DE-lncRNAs with 8 134 predicted target genes, of which 788 were differentially expressed (DE-mRNAs) across the four fiber development stages. KEGG pathway enrichment analysis showed that the DE-target-mRNAs were mainly enriched in plant hormone signal transduction and protein processing in endoplasmic reticulum. Co-expression network analysis revealed that lncRNA MSTRG.454250.3 and its associated target genes showed the same expression trend across the four fuzz fiber development stages, whereas lncRNA MSTRG.454261.4 and its associated target genes showed opposite expression trends, with MSTRG.454261.4 expressed at dramatically higher levels in fuzzless GA0149 than in wild-type GA0146. The RT-qPCR results confirmed the reliability of the RNA-seq data. 【Conclusion】A total of 26 specifically expressed lncRNAs related to cotton fuzz fiber development were identified. These lncRNAs affect fuzz fiber development by regulating the expression of indole-3-acetic acid-amido synthetase (Ga03G2421) and auxin-responsive protein (Ga05G1344) in the plant hormone signal transduction pathway.

Key words: Gossypium arboreum, fuzzless mutant, long non-coding RNAs, regulation network, RT-qPCR