Please wait a minute...
Journal of Integrative Agriculture  2026, Vol. 25 Issue (4): 0-    DOI: 10.1016/j.jia.2025.06.026
Advanced Online Publication | Current Issue | Archive | Adv Search |
Identification of CsAK as a critical caffeine-related upstream gene in tea accessions through genome-wide association study

Kaixin Rao1, Yuting OuYang1, Yanjun Chen1, Xiaojing Wang1,2, Ting Liu3, Qinfei Song1, Shaojuan Zhang1,2, Biao Xiong1, Suzhen Niu1,2#


1 College of Tea Science/Institute of Agro-Bioengineering, Guizhou University, Guiyang 550025, China

2 The Key Laboratory of Plant Resources Conservation and Germplasm Innovation in the Mountainous Region (Ministry of Education), Xueshi Road, Guiyang 550025, China

3 Zhengzhou Foreign Language School, Longhai west road, Zhengzhou 450006, China

 Highlights 

GWAS identified 19 SNPs and a key gene (CsAK) that are significantly associated with CAF content in tea accessions.

CsAK gene involved in CAF biosynthesis by catalyzing L-aspartate to generate the key intermediate-L-methionine.

Download:  PDF in ScienceDirect  
Export:  BibTeX | EndNote (RIS)      
摘要  

咖啡碱(CAF)作为茶叶主要风味成分,是茶饮料广受欢迎的核心元素之一。作为茶树重要的次级代谢产物,不同茶树种质资源间CAF含量呈现显著差异。然而,CAF生物合成的遗传调控机制尚未明晰。在这项研究中,基于贵州高原359份茶树种质资源开展全基因组关联研究(GWAS)以确定与CAF含量相关的遗传变异。共鉴定出19个显著关联的单核苷酸多态性(SNPs)及CAF生物合成关键基因(CsAK)。亚细胞定位分析表明CsAK-GFP融合蛋白位于细胞膜上。通过反义寡核苷酸(AsODN)靶向沉默茶树芽叶CsAK基因,结果显示AsODN处理的茶树中CsAK基因的表达水平显著下调,且CAF含量降低。在真核细胞中过表达CsAK基因可促进CAF生物合成过程中关键中间产物(L-蛋氨酸)的积累。该研究为后续培育具有高或低CAF水平的优良种质资源的茶树育种计划提供了理论依据。



Abstract  

The caffeine (CAF), a primary flavor component in tea, is one of the most popular reasons for tea beverage. As the important secondary metabolite in tea plant, the CAF content varied greatly among different tea accessions. However, the genetic mechanisms underlying the CAF biosynthesis was still unclear. In this study, we performed a genome-wide association study (GWAS) on 359 tea accessions in Guizhou plateau to identify genetic variation associated with CAF content. A total of 19 significant single nucleotide polymorphisms (SNPs) and key gene (CsAK) involved in CAF biosynthesis were identified. Subcellular localization revealed that the CsAK-GFP fusion protein was located on cell membrane. Antisense oligodeoxynucleotide (AsODN) targeting the CsAK gene to the buds and leaves revealed that the expression levels of the CsAK gene was significantly reduced, and the corresponding CAF contents were also decreased in AsODN-treated tea plants. Overexpression of CsAK gene in eukaryotic cell resulted in the accumulation of key intermediate product (L-methionine) during CAF biosynthesis process. These findings offered a theoretical foundation for future tea breeding programs aimed at cultivating of excellent germplasm with high or low levels of CAF.

Keywords:  tea accessions       caffeine (CAF)       single nucleotide polymorphism (SNP)       genome-wide association study (GWAS)       Guizhou Plateau  
Online: 30 June 2025  
Fund: 

This study was supported by China's National Science Foundation [32060700]; National Guidance Foundation for Local Science and Technology Development of China [ [2023] 009]; Guizhou Provincial Basic Research Program Youth Guidance Project, Qiankehe Basic - [2024] Youth 157; Project on Guiyang Science and Technology Plan [Construction Technology Contract [2023] 48-21]; Wangmo Eight Step Tea: Integration and Demonstration of Quality and Efficiency Enhancement Technologies [2021YFD1100307]; Guizhou Provincial Key Technology R&D Program (No. QKHZC [2023] General 481); Qianxinan Prefecture Science and Technology Plan Project [2022-1-52]; Youth Science and Technology Talent Project of Education Department of Guizhou Province (KY 2022-153); Basic Research Project of Guizhou University (No. 2023-38).

About author:  Kaixin Rao, E-mail: 1778424029@qq.com; #Correspondence Suzhen Niu, Tel: 18984167365, E-mail: niusuzhen@163.com

Cite this article: 

Kaixin Rao, Yuting OuYang, Yanjun Chen, Xiaojing Wang, Ting Liu, Qinfei Song, Shaojuan Zhang, Biao Xiong, Suzhen Niu. 2026. Identification of CsAK as a critical caffeine-related upstream gene in tea accessions through genome-wide association study. Journal of Integrative Agriculture, 25(4): 0-.

Ahammed G J, Li X. 2022. Hormonal regulation of health-promoting compounds in tea (Camellia sinensis L.). Plant Physiology and Biochemistry, 185, 390-400.

Bradbury P J, Zhang Z, Kroon D E, Casstevens T M, Ramdoss Y, Buckler ES. 2007. TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics, 23, 2633-2635.

Chen W, Hou L, Zhang Z Y, Pang X M, Li Y Y. 2017. Genetic diversity, population structure, and linkage disequilibrium of a core collection of ziziphus jujuba assessed with genome-wide SNPs developed by genotyping-by-sequencing and SSR markers. Frontiers in Plant Science, 8, 575.

Chen Y F, Jiang C J, Yin S X, Zhuang J H, Zhao Y, Zhang L J, Jiang X L, Liu Y J, Gao L P, Xia T. 2023. New insights into the function of plant tannase with promiscuous acyltransferase activity. The Plant Journal, 113, 576-594.

Chen Y J, Niu S Z, Deng X Y, Song Q F, He L M, Bai D C, He Y Q. 2023. Genome-wide association study of leaf-related traits in tea plant in Guizhou based on genotyping-by-sequencing. BMC Plant Biology, 23, 196.

Danecek P, Auton A, Abecasis G, Albers C A, Banks E, DePristo M A, Handsaker R E, Lunter G, Marth G T, Sherry S T, McVean G, Durbin R; 1000 Genomes Project Analysis Group. 2011. The variant call format and VCFtools. Bioinformatics, 27, 2156-8.

Deng X L, Huang D J, Wang Y H, An H W, Bai D C, Wang X J, Niu S Z, Song X M. 2025. Genome-wide association study of salicylic acid provides genetic insights for tea plant selective breeding. Horticulture Research12, 362.

Devitt G, Thomas M, Klibanov A M, Pfeiffer T, Bosch V. 2007. Optimized protocol for the large scale production of HIV pseudovirions by transient transfection of HEK293T cells with linear fully deacylated polyethylenimine. Journal of Virological Methods, 146, 298-304.

Fang K X, Xia Z Q, Li H J, Jiang X H, Qin D D, Wang Q H, Wang Q, Pan C D, Li B, Wu H L. 2021. Genome-wide association analysis identified molecular markers associated with important tea flavor-related metabolites. Horticulture Research, 8, 42.

Gao Y N, Han C J, Liu C L, Wang J, Zhao L, Fang L, Min W H. 2019. Enzymatic characterization and molecular mechanism of a novel aspartokinase mutant M372I/T379W from Corynebacterium pekinense. Royal Society of Chemistry Advances, 9, 21344-21354.

Gong A D, Lian S B, Wu N N, Zhou Y J, Zhao S Q, Zhang L M, Cheng L, Yuan H Y. 2020. Integrated transcriptomics and metabolomics analysis of catechins, caffeine and theanine biosynthesis in tea plant (Camellia sinensis) over the course of seasons. BMC Plant Biology, 20, 294.

Grover A, Sharma P C. 2016. Development and use of molecular markers: past and present. Critical Reviews in Biotechnology, 36, 290-302.

He L M, Luo J, Niu S Z, Bai D C, Chen Y J. 2023. Population structure analysis to explore genetic diversity and geographical distribution characteristics of wild tea plant in Guizhou Plateau. BMC Plant Biology, 23, 255.

He J Y, Chen Y H, Hao Y R, Lai D L, Jahan T, Shi Y L, Lin H, He Y Q, Huda M, Cheng J P, Zhang K X, Li J B, Ruan J J, Zhou M L. 2024. Combining GWAS and RNA-Seq approaches identifies the FtADH1 gene for drought resistance in Tartary buckwheat. Journal of Integrative Agriculture, doi: 10.1016/j.jia.2024.11.009.

Hacham Y, Matityahu I, Schuster G, Amir R. 2008. Overexpression of mutated forms of aspartate kinase and cystathionine gamma-synthase in tobacco leaves resulted in the high accumulation of methionine and threonine. The Plant Journal, 54, 260-71.

Helyar S J, Hemmer-Hansen J, Bekkevold D, Taylor M I, Ogden R, Limborg M T, Cariani A, Maes G E, Diopere E, Carvalho G R, Nielsen E E. 2011. Application of SNPs for population genetics of nonmodel organisms: new opportunities and challenges. Molecular Ecology Resources, 1, 123-36.

Jander G, Joshi V. 2010. Recent progress in deciphering the biosynthesis of aspartate-derived amino acids in plants. Molecular Plant, 3, 54-65.

Jiang P, Ren L J, Zhi L, Yu Z, Lv F X, Xu F L, Peng W, Bai X Y, Cheng K L, Quan L, Zhang X Q, Wang X H, Zhang Y, Yang D, Hu X L, Xiao R P. 2021. Negative regulation of AMPK signaling by high glucose via E3 ubiquitin ligase MG53. Molecular Cell, 81, 629-637.

Jin J Q, Chai Y F, Liu Y F, Zhang J, Yao M Z, Chen L. 2018. Hongyacha, a Naturally Caffeine-Free Tea Plant from Fujian, China. Journal of Agricultural and Food Chemistry, 66, 11311-11319.

Jin J Q, Yao M Z, Ma C L, Ma J Q, Chen L. 2016. Association mapping of caffeine content with TCS1 in tea plant and its related species. Plant Physiology and Biochemistry, 105, 251-259.

Jin Q F, Wang Z, Sandhu D, Chen L, Shao C Y, Shang F H Z, Xie S Y, Huang F Y, Chen Z Y, Zhang X Q, Hu J Y, Liu G Z, Su Q, Huang M D, Liu Z H, Huang J A, Tian N, Liu S Q. 2023. mRNA-miRNA analyses reveal the involvement of CsbHLH1 and miR1446a in the regulation of caffeine biosynthesis in Camellia sinensis. Horticulture Research, 11, 282.

Kim H, Kang S H, Kim S H, Kim S H, Hwang J, Kim J G, Han K, Kim J B. 2021. Drinking coffee enhances neurocognitive function by reorganizing brain functional connectivity. Scientific Reports, 11, 14381.

Koshiishi C, Kato A, Yama S, Crozier A, Ashihara H. 2001. A new caffeine biosynthetic pathway in tea leaves: utilisation of adenosine released from the S-adenosyl-L-methionine cycle. FEBS Letters, 499, 50-54.

Kotaka M, Ren J S, Lockyer M, Hawkins A R, Stammers D K. 2006. Structures of R- and T-state Escherichia coli Aspartokinase III. MECHANISMS OF THE ALLOSTERIC TRANSITION AND INHIBITION BY LYSINE. Journal Of Biological Chemistry, 281, 31544-31552.

Li F, Deng X Y, Huang Z, Zhao Z F, Li C Y, Song Q F, He Y Q, Niu S N. 2023. Integrated transcriptome and metabolome provide insights into flavonoid biosynthesis in “P113”, a new purple tea of Camellia tachangensis. Beverage Plant Research, 3, 3.

Li G F, Tan M, Ma J J, Cheng F, Li K, Liu X J, Zhao C P, Zhang D, Xing L B, Ren X L, Han M Y, An N. 2021. Molecular mechanism of MdWUS2-MdTCP12 interaction in mediating cytokinin signaling to control axillary bud outgrowth. Journal of Experimental Botany, 72, 4822-4838.

Lu Q, Zhang M C, Niu X J, Wang C H, Xu Q, Feng Y, Wang S, Yuan X P, Yu H Y, Wang Y P, Wei X H. 2016. Uncovering novel loci for mesocotyl elongation and shoot length in indica rice through genome-wide association mapping. Planta, 243, 645-657.

McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo M A. 2010. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Research, 20, 1297-1303.

Mi X Z, Yang C, Qiao D H, Tang M S, Guo Y, Liang S H, Li Y, Chen Z W, Chen J. 2023. De novo full length transcriptome analysis of a naturally caffeine-free tea plant reveals specificity in secondary metabolic regulation. Scientific Reports, 13, 6015.

Misako K, Kouichi M. 2004. Caffeine synthase and related methyltransferases in plants. Frontiers in Bioscience-Landmark, 9, 1833-42.

Mohanpuria P, Kumar V, Yadav S K. 2010. Tea caffeine: Metabolism, functions, and reduction strategies. Food Science and Biotechnology, 19, 275-287.

Niu S N, Koiwa H, Song Q F, Qiao D H, Chen J, Zhao D G, Chen Z W, Wang Y, Zhang T Y. 2020. Development of core-collections for Guizhou tea genetic resources and GWAS of leaf size using SNP developed by genotyping-by-sequencing. PeerJ, 8, 8572.

Niu S N, Song Q F, Koiwa H, Qiao D H, Zhao D G, Chen Z W, Liu X, Wen X P. 2019. Genetic diversity, linkage disequilibrium, and population structure analysis of the tea plant (Camellia sinensis) from an origin center, Guizhou plateau, using genome-wide SNPs developed by genotyping-by-sequencing. BMC Plant Biology, 19, 328.

Qiao D H, Yang C, Chen J, Guo Y, Li Y, Niu S Z, Cao K M, Chen Z W. 2019. Comprehensive identification of the full-length transcripts and alternative splicing related to the secondary metabolism pathways in the tea plant (Camellia sinensis). Scientific Reports, 9, 2709.

Pandey J, Scheuring D C, Koym J W, Coombs J, Novy R G, Thompson A L, Holm D G, Douches DS, Miller J C Jr, Vales M I. 2021. Genetic diversity and population structure of advanced clones selected over forty years by a potato breeding program in the USA. Scientific Reports, 11, 8344.

Ravanel S, Gakière B, Job D, Douce R. 1998. The specific features of methionine biosynthesis and metabolism in plants. Proceedings of the National Academy of Sciences of the United States of America, 95, 7805-7812.

Reddy V S, Shiva S, Manikantan S, Ramakrishna S. 2024. Pharmacology of caffeine and its effects on the human body. European Journal of Medicinal Chemistry Reports, 10, 100138.

Shi R, Cao Y, Yang T H, Wang Y P, Xiang Y N, Chen F, Zhang W, Zhou X Y, Sun C M, Fu S X, Hu M L, Zhang J F, Wang X D. 2024. Genome-Wide Association Study Reveals the Genetic Basis of Crude Fiber Components in Brassica napus L. Shoots at Stem Elongation Stage. Journal of Agricultural and Food Chemistry, 72, 16530-16540.

Srdjenovic B, Djordjevic-Milic V, Grujic N, Injac R, Lepojevic Z. 2008. Simultaneous HPLC determination of caffeine, theobromine, and theophylline in food, drinks, and herbal products. Journal of Chromatographic Science, 46, 144-149.

Stecher G, Tamura K, Kumar S, 2020. Molecular evolutionary genetics analysis (MEGA) for macOS. Molecular Biology and Evolution, 37, 1237-1239.

Suzuki T. 1972. The participation of S-adenosylmethionine in the biosynthesis of caffeine in the tea plant. FEBS Letters, 24, 18-20.

Tang W J, Wu T T, Ye J, Sun J, Jiang Y, Yu J, Tang J P, Chen G M, Wang C M, Wan J M. 2016. SNP-based analysis of genetic diversity reveals important alleles associated with seed size in rice. BMC Plant Biology, 16, 93.

Teng J, Yan C Y, Zeng W, Zhang Y Q, Zeng Z, Huang Y H. 2020. Purification and characterization of theobromine synthase in a Theobromine-Enriched wild tea plant (Camellia gymnogyna Chang) from Dayao Mountain, China. Food Chemistry, 311, 125875.

Uffelmann E, Huang Q Q, Munung N S, Vries J D, Okada Y, Martin A R, Martin H C, Lappalainen T, Posthuma D. 2021. Genome-wide association studies. Nature Reviews Methods Primers, 1, 1-21.

Van Dam R M, Hu F B. 2022. Caffeine consumption and cardiovascular health. Nature Reviews Cardiology, 19, 429-430.

Wang J D, Zhang Z L, Gong Z L, Liang Y J, Ai X T, Sang Z W, Guo J P, Li X Y, Zheng J Y. 2022. Analysis of the genetic structure and diversity of upland cotton groups in different planting areas based on SNP markers. Gene, 809, 146042.

Wang R J, Gao X F, Yang J, Kong X R. 2019. Genome-Wide Association Study to Identify Favorable SNP Allelic Variations and Candidate Genes That Control the Timing of Spring Bud Flush of Tea (Camellia sinensis) Using SLAF-seq. Journal of Agricultural and Food Chemistry, 67, 10380-10391.

Wang W Y, Xu M Y, Wang G P, Galili G. 2018. New insights into the metabolism of aspartate-family amino acids in plant seeds. Plant Reproduction, 31, 203-211.

Wang Z H, Huang R, Moon D G, Ercisli S, Chen L. 2023. Achievements and prospects of QTL mapping and beneficial genes and alleles mining for important quality and agronomic traits in tea plant (Camellia sinensis). Beverage Plant Research, 3, 22.

Xia E H, Tong W, Hou Y, An Y L, Chen L B, Wu Q, Liu Y L, Yu J, Li F D, Li R P, Li P H, Zhao H J, Ge R H, Huang J, Mallano A L, Zhang Y R, Liu S R, Deng W W, Song C K, Zhang Z L, Zhao J, Wei S, Zhang Z Z, Xia T, Wei C L, Wan X C. 2020. The Reference Genome of Tea Plant and Resequencing of 81 Diverse Accessions Provide Insights into Its Genome Evolution and Adaptation. Molecular Plant, 13, 1013-1026.

Xia E H, Tong W, Wu Q, Wei S, Zhao J, Zhang Z Z, Wei CL, Wan X C.  2020. Tea plant genomics: achievements, challenges and perspectives. Horticulture Research, 7, 7.

Xia E H, Zhang H B, Sheng J, Li K, Zhang Q J, Kim C, Zhang Y, Liu Y, Zhu T, Li W, Huang H, Tong Y, Nan H, Shi C, Shi C, Jiang J J, Mao SY, Jiao J Y, Zhang D, Zhao Y, Zhao Y J, Zhang L P, Liu Y L, Liu B Y, Yu Y, Shao S F, Ni D J, Eichler E E, Gao L Z. 2017. The Tea Tree Genome Provides Insights into Tea Flavor and Independent Evolution of Caffeine Biosynthesis. Molecular Plant, 10, 866-877.

Xu J Z, Han M, Ren X D, Zhang W G. 2016. Modification of aspartokinase III and dihydrodipicolinate synthetase increases the production of L-lysine in Escherichia coli. Biochemical Engineering Journal, 114, 79-86.

Yao X Z, Chen H F, Ai A T, Wang F, Lian S S, Tang H, Jiang Y H, Jiao Y J, He Y M, Li T, Lu L T. 2023. The transcription factor CsS40 negatively regulates TCS1 expression and caffeine biosynthesis in connection to leaf senescence in Camellia sinensis. Horticulture Research, 10, 162.

Yoshida H, Okada S, Wang F M, Shiota S, Mori M, Kawamura M, Zhao X, Wang Y Q, Nishigaki N, Kobayashi A, Miura K, Yoshida S, Ikegami M, Ito A, Huang L T, Caroline Hsing Y I, Yamagata Y, Morinaka Y, Yamasaki M, Kotake T, Yamamoto E, Sun J, Hirano K, Matsuoka M. 2023. Integrated genome-wide differentiation and association analyses identify causal genes underlying breeding-selected grain quality traits in japonica rice. Molecular Plant, 16, 1460-1477.

Yu S W, Li P H, Zhao X C, Tan M M, Ahmad M Z, Xu Y J, Tadege M, Zhao J. 2021. CsTCPs regulate shoot tip development and catechin biosynthesis in tea plant (Camellia sinensis). Horticulture Research, 8, 104.

Zhang C, Dong S S, Xu J Y, He W M, Yang T L. 2019. PopLDdecay: a fast and effective tool for linkage disequilibrium decay analysis based on variant call format files. Bioinformatics, 35, 1786-1788.

Zhang W Y, Zhang Y J, Qiu H J, Guo Y F, Wan H L, Zhang X L, Scossa F, Alseekh S, Zhang Q H, Wang P, Xu L, Schmidt M H-W, Jia X X, Li D L, Zhu A T, Guo F, Chen W, Ni D J, Usadel B, Fernie A R, Wen W W. 2020. Genome assembly of wild tea tree DASZ reveals pedigree and selection history of tea varieties. Nature Communications, 11, 3719.

Zhang S R, Jin J Q, Chen J D, Ercisli S, Chen L. 2022. Purine alkaloids in tea plants: component, biosynthetic mechanism and genetic variation. Beverage Plant Research, 2, 13.

Zhang Y R, Fu J M, Zhou Q Y, Li F D, Shen Y H, Ye Z L, Tang D K, Chi N, Li L Q, Ma S Y, Inayat M A, Guo T Y, Zhao J, Li P H. 2022. Metabolite Profiling and Transcriptome Analysis Revealed the Conserved Transcriptional Regulation Mechanism of Caffeine Biosynthesis in Tea and Coffee Plants. Journal of Agricultural and Food Chemistry, 70, 3239-3251.

Zheng Y X, Yuan F, Huang Y Q, Zhao Y F, Jia X Y, Zhu L Y, Guo J J. 2021. Genome-wide association studies of grain quality traits in maize. Scientific Reports, 11, 9797.

Zhao Z F, Song Q F, Bai D C, Niu S Z, He Y Q, Qiao D H, Chen Z W, Li C Y, Luo J, Li F. 2022. Population structure analysis to explore genetic diversity and geographical distribution characteristics of cultivated-type tea plant in Guizhou Plateau. BMC Plant Biology, 22, 55.

Zhao M Y, Jin J Y, Wang J M, Gao T, Luo Y, Jing T T, Hu Y T, Pan Y T, Lu M Q, Schwab W, Song C K. 2022. Eugenol functions as a signal mediating cold and drought tolerance via UGT71A59-mediated glucosylation in tea plants. The Plant Journal, 109, 1489-1506.

Zhou M Z, Yan C Y, Zeng Z, Luo L, Zeng W, Huang Y H. 2020. N-Methyltransferases of Caffeine Biosynthetic Pathway in Plants. Journal of Agricultural and Food Chemistry, 68, 15359-15372.

Zou J N, Yu H, Yu Q, Jin X J, Cao L, Wang M Y, Wang M X, Ren C Y, Zhang Y X. 2021. Physiological and UPLC-MS/MS widely targeted metabolites mechanisms of alleviation of drought stress-induced soybean growth inhibition by melatonin. Industrial Crops and Products, 163, 0926-6690.

[1] ZHAO Bing-ru, FU Xue-feng, TIAN Ke-chuan, HUANG Xi-xia, DI Jiang, BAI Yan, XU Xin-ming, TIAN Yue-zhen, WU Wei-wei, ABLAT Sulayman, ZENG Wei-dan, HANIKEZI Tulafu. Identification of SNPs and expression patterns of FZD3 gene and its effect on wool traits in Chinese Merino sheep (Xinjiang Type)[J]. >Journal of Integrative Agriculture, 2019, 18(10): 2351-2360.
[2] LIU Xin, WANG Li-gang, LIANG Jing, YAN Hua, ZHAO Ke-bin, LI Na, ZHANG Long-chao, WANGLi-xian . Genome-Wide Association Study for Certain Carcass Traits and Organ Weights in a Large White×Minzhu Intercross Porcine Population[J]. >Journal of Integrative Agriculture, 2014, 13(12): 2721-2730.
No Suggested Reading articles found!