中国农业科学

• • 上一篇    

最新录用:亚洲棉短纤维发育相关长链非编码RNA的鉴定及表达

王晓阳1,彭振1,2,邢爱双1,赵盈睿1,马欣丽1,刘方1,2*,杜雄明1,2*,何守朴1,2*
  

  1. 1中国农业科学院棉花研究所/棉花生物学国家重点实验室河南安阳 4550002郑州大学农学院/棉花生物学国家重点实验室郑州基地,郑州 450001
  • 出版日期:2023-05-20 发布日期:2023-05-20

Identification and Expression Analysis of Fuzz Fiber Development Related Long Noncoding RNAs in Gossypium arboreum

WANG XiaoYang1, PENG Zhen1,2, XING AiShuang1, ZHAO YingRui1, MA XinLi1, LIU Fang1,2*, DU XiongMing1,2*, HE ShouPu1,2*   

  1. 1Institute of Cotton Research, Chinese Academy of Agricultural Sciences/State Key Laboratory of Cotton Biology, Anyang 455000, Henan; 2School of Agricultural Sciences, Zhengzhou University/Zhengzhou Research Base, State Key Laboratory of Cotton Biology, Zhengzhou 450001
  • Published:2023-05-20 Online:2023-05-20

摘要: 【目的】长链非编码RNA(Long non-coding RNAlncRNA)是一类蛋白质编码能力,但参与许多重要生命活动调控过程长度大于200 ntRNA通过对亚洲棉无短绒突变体GA0149)和野生型(GA0146)短绒发育早期的转录组数据进行分析,挖掘调控短绒发育lncRNA,并明确其调控网络,为进一步解析棉花纤维发育机制奠定基础。【方法】选择GA0146GA0149 2个材料在开花后当天(0 DPA及花后3 d3 DPA)d5 DPA)d8 DPA)的胚珠和纤维为材料进行转录组测序。鉴定lncRNA并预测其调控的靶基因通过mRNAlncRNA的差异表达分析比较2材料在不同纤维发育时期的差异。进一步利用KOBAS软件预测对差异lncRNA靶基因进行富集分析预测其参与的生物过程最后通过实时荧光定量(RT-qPCR)技术25差异表达的lncRNAs转录组数据进行验证【结果】共鉴定获得15 339lncRNAs,其中11 595lncRNAs位于基因间区,包括2 428反义lncRNAs350内含子lncRNAs以及966正义lncRNAs共有1 932差异表达lncRNAsDE-lncRNAs),它们所对应的8 134靶基因中有788个为差异表达基因(DE-mRNA)。KEGG代谢通路富集表明,DE-mRNAs主要参与植物激素信号转导(plant hormone signal transduction)和内质网中蛋白质加工过程(protein processing in endoplasmic reticulum)。共表达调控网络分析显示,表达量差异比较显著的lncRNAMSTRG.454250.3其所调控的靶基因表达趋势一致,仅在野生型GA01464短绒发育早期胚珠中特异表达;而lncRNA(MSTRG.454261.4)与其调控的靶基因表达趋势相反在突变体GA0149)中的表达量显著高于野生型。RT-qPCR结果证实了转录组数据的真实性【结论】鉴定了26与短绒发育相关的lncRNA,其通过调控植物激素信号转导途径相关的吲哚乙酸合成酶基因Ga03G2421)和生长素响应蛋白基因Ga05G1344)的表达而影响短纤维的发育


关键词: 亚洲棉, 短绒突变体, 长链非编码RNA, 调控网络, 荧光定量PCR

Abstract: 【ObjectiveLong non-coding RNAs(lncRNAs) are a group of RNA molecules longer than 200 bp with no protein coding capacity, which are involved in various biological regulatory processes. In this study, we aim to analyze the RNA-sequencing data of two Gossypium arboreum isogenic lines, a fuzzless mutant and its wildtype, to identifying the lncRNA involved in early fuzz fiber development, providing a foundation for investigation the mechanism of fiber development. MethodWe collected 0 DPA, 3 DPA and 5 DPA ovule and 8 DPA ovule and fiber from the G. arboreum fuzzless mutant GA0149 and its isogenic line GA0146 with normal fuzz and lint fibers, were used for RNA-seq to identify lncRNA and predict their target genes. Differentially expressed mRNADE-mRNA and lncRNA(DE-lncRNAs) between the samples were identified. The KOBAS software was used to prediction the KEGG enrichment pathways which DE-lncRNAs targets were involved in. To ensure the quality of high-through sequencing, 25 DE- lncRNAs were selected for RT-qPCR detection. ResultWe identified 15339 lncRNA-encoding transcripts that 11595 lncRNAs were located to intergenic regions, 2428 lncRNAs were classified as antisense lncRNAs, 350 were categorized as intronic lncRNAs and 966 were belonged to sense lncRNAs. Compared to mRNAs, lncRNAs in Asian cotton showed shorter exons and lower GC content. Most of lncRNAs had cis-regulatory effects on their neighboring mRNAs. We identified 1932 differentially expressed (DE) lncRNAs, with 8134 predicted DE-lncRNA target genes. Further analysis showed that 788 genes (mRNA) were differentially expressed (DE-genes) during four fiber development stages. KEGG enrichment pathways analysis showed that DE-target-mRNAs were mainly enriched in Plant hormone signal transduction and Protein processing in endoplasmic reticulum. Co-expression network analysis revealed that lncRNA (MSTRG.454250.3) and its associated target genes showed identical expression trends during four fuzz fiber development stages, while lncRNAs (MSTRG.454261.4) and its associated target genes showed contrary expression tendency, exhibiting dramatic higher expression in fuzzless GA0149 compared to wildtype GA0146. The results of RT-qPCR analysis confirmed the authenticity of our RNA-seq data.【ConclusionA total of 26 specifically expressed lncRNAs were identified which related to cotton fuzz fiber development process. We further confirmed that these lncRNAs affected the fuzz fiber development by regulating the expression of indole-3-acetic acid-amido synthetase (Ga03G2421) and Auxin-responsive protein (Ga05G1344) in the plant hormone signal transduction pathway.


Key words: Gossypium arboreum, fuzzless mutant, long non-coding RNAs, regulation network, RT-qPCR