中国农业科学 ›› 2014, Vol. 47 ›› Issue (13): 2690-2698.doi: 10.3864/j.issn.0578-1752.2014.13.021

• 研究简报 • 上一篇    

牦牛CYGB基因CDS区克隆与生物信息学分析

 孙雪婧1, 3, 杜晓华1, 3, 杨孝朴1, 罗玉柱3, 刘霞2, 3   

  1. 1、甘肃农业大学动物医学院,兰州 730070;
    2、甘肃农业大学生命科学技术学院,兰州 730070;
    3、甘肃农业大学动物科学技术学院/甘肃省草食动物生物技术重点实验室,兰州 730070
  • 收稿日期:2013-11-12 出版日期:2014-07-01 发布日期:2014-04-03
  • 通讯作者: 刘霞,E-mail:liux@gsau.edu.cn
  • 作者简介:孙雪婧,E-mail:544879429@qq.com
  • 基金资助:

    甘肃省教育厅研究生导师科研项目计划(1102-04)、甘肃省高等学校基本科研业务费资助项目(2013)

Cloning and Bioinformatics Analysis on CDS of CYGB Gene in Yak

 SUN  Xue-Jing-1, 3 , DU  Xiao-Hua-1, 3 , YANG  Xiao-Pu-1, LUO  Yu-Zhu-3, LIU  Xia-2, 3   

  1. 1、College of Veterinary Medicine, Gansu Agricultural University, Lanzhou 730070;
    2、College of Life Science and Technology, Gansu Agricultural University, Lanzhou 730070;
    3、College of Animal Science and Technology, Gansu Agricultural University/Gansu Key Laboratory of Herbivorous Animal Biotechnology, Lanzhou 730070
  • Received:2013-11-12 Online:2014-07-01 Published:2014-04-03

摘要: 【目的】丰富牦牛CYGB基因研究的基础数据,对牦牛CYGB基因的CDS区进行克隆和生物信息学分析。【方法】提取牦牛大脑海马区组织的总RNA并运用RT-PCR技术反转录为cDNA,并根据GenBank中普通牛CYGB基因cDNA序列(GenBank登录号:DV874786.1),使用Primer3.0在线软件设计特异性引物,运用PCR扩增技术、TA克隆技术和核酸测序技术获得CYGB基因的完整CDS区序列及部分5′端和3′端UTR区,并使用ProtParam、PredictProtein、SWISS-MODEL等在线分析软件与Lasergene7.1软件包分析CYGB的一级结构、二级结构、三级结构与理化性质,并进行同源性分析及构建系统进化树;利用PyMol软件修饰并输出三维结构;使用在线亚细胞定位工具PSORT II Prediction预测蛋白质的亚细胞定位;使用Protfun软件对蛋白质的功能进行预测分析。【结果】克隆获得牦牛CYGB基因650 bp,包括CDS区573 bp(GenBank登录号:KF669898),碱基组成为A 20.59%、T 16.40%、G 33.33%、C 29.67%,编码190个氨基酸残基组成的蛋白质。与普通牛比对,牦牛CYGB基因在CDS区存在4个碱基突变,同源性为99.3%,这个突变未导致氨基酸序列的改变,4个突变均属同义突变。牦牛CYGB基因编码蛋白的分子式为C964H1513N263O278S7,分子量约为21.5 kD,理论等电点(pI)为6.32,消光系数为24075,不稳定系数为48.43,疏水指数为83.63,平均亲水性为-0.301,属不稳定可溶性酸性蛋白质,在哺乳动物网织红细胞内的半衰期为30 h。二级结构以α-螺旋和无规卷曲为主,其中α-螺旋占64.21%,无规卷曲占35.79%,属全α类蛋白质。三级结构是一个呈“three-over-three”三明治夹心型的α-螺旋折叠结构。亚细胞定位CYGB分布在细胞质(65.2%)、细胞核(17.4%)、线粒体(13.0%)、分泌系统的囊泡(4.3%)中,主要在细胞质,推测可能在能量代谢和辅因子的生物合成过程中发挥信号转导和转录因子调控的作用。牦牛CYGB氨基酸序列与普通牛、绵羊、家犬、小鼠、褐家鼠、原鸡、猴、黑猩猩、人的CYGB氨基酸序列的同源性分别为100%、98.9%、97.8%、95.3%、93.7%、78.8%、98.4%、95.8%和96.8%,物种之间同源性较高,系统进化情况与其亲缘关系远近一致,说明CYGB基因编码区在进化过程中比较保守。【结论】通过RT-PCR与TA克隆技术及核酸测序技术获得了牦牛CYGB基因全长573 bp的CDS区,并对其核苷酸序列和编码蛋白氨基酸序列及其蛋白结构和功能进行了分析,得知牦牛的CYGB是一个由190个氨基酸残基构成的可溶酸性蛋白质,在能量代谢和辅因子生物合成过程中发挥重要作用。CYGB基因编码区在长期生物进化过程中具有较强的保守性。该基因的成功克隆及分析为揭示牦牛CYGB基因的遗传特性提供了理论依据。

关键词: 牦牛 , 胞红蛋白 , 基因克隆 , 生物信息学

Abstract: 【Objective】In order to enrich basic data in yak CYGB gene, CDS region of yak CYGB gene was cloned and analyzed by bioinformatics method. 【Method】 Total RNA of yak hippocampus tissue was extracted and reverse transcribed into cDNA by RT-PCR technology. Specific primers were designed according to cDNA sequence of cattle CYGB gene in the GenBank (GenBank accession No.:DV874786.1) by online software Primer 3.0. The CDS region and part of 5′ UTR and 3′ UTR in yak CYGB gene were cloned from yak hippocampus total RNA by PCR amplification, TA cloning and nucleic acid sequencing technology. The primary structure, secondary structure, tertiary structure, physicochemical properties, homology were analyzed and phylogenetic tree of CYGB was constructed by online software like ProtParam, PredictProtein, SWISS-MODEL and Lasergene7.1 software package. The three-dimensional structure was modified and output by PyMol software. The protein subcellular localization was predicted by online subcellular localization tool PSORT II Prediction, and the protein function was predicted by Protfun software.【Result】The 650 bp length CYGB gene in yak was got by cloning, including the 573 bp length CDS region (GenBank accession No.:KF669898), and the bases composition were 20.59%A, 16.40%T, 33.33%G, 29.67%C, encoding 190 amino acids. Alignment with CDS and amino acid sequence of cattle CYGB gene, four base mutations were found and amino acid was not mutated, four mutations are synonymous mutations. The formula of protein encoded by CYGB gene in yak was C964H1513N263O278S7, and the molecular weight was 21.5 kD, the theory isoelectric point was 6.32, the extinction coefficient was 24075, the instability index was 48.43, the aliphatic index was 83.63, and the grand average of hydropathicity was -0.301. It was an unstable and soluble protein. Its estimated half-life is 30 hours in mammal reticulocyte. The secondary structure of CYGB was mainly α-helices and random coil, α-helices was 64.21% and random coil was 35.79%, CYGB was an α class protein. The tertiary structure of CYGB was a “three-over-three” α-helical sandwich structure. Subcellular localization of CYGB was in the cytoplasm (65.2%), nucleus (17.4%), mitochondria (13.0%) and vesicles of secretory system (4.3%), mainly in the cytoplasm, which CYGB may play a role of signal transducer or transcription regulation in the energy metabolism and cofactor biosynthesis. The amino acid homology of CYGB compared yak to cattle, sheep, dog, mouse, rat, chicken, monkey, chimpanzee and human were 100%,98.9%,97.8%,95.3%,93.7%,78.8%,98.4%,95.8% and 96.8%, respectively. There was a high homology among different species, and phyletic evolution was the same as their genetic relationship. The research indicated that the CYGB gene coding region was conservative in the course of evolution.【Conclusion】The 573 bp length CDS region of yak CYGB gene was got by RT-PCR, TA cloning and nucleic acid sequencing technology for the first time, and the nucleotide sequence, the encoded amino acid sequence and protein structure and function were analyzed. CYGB in yak was a soluble and acidic protein that 190 amino acid residues composed, which plays an important role in the energy metabolism and cofactor biosynthesis. CYGB gene coding region was conservative in the course of long-term biological evolution. The gene was cloned and analyzed, thus providing a theoretical basis for revealing the genetic characteristics of yak CYGB gene.

Key words: yak , cytoglobin , gene cloning , bioinformatics