Scientia Agricultura Sinica ›› 2018, Vol. 51 ›› Issue (6): 1013-1019.doi: 10.3864/j.issn.0578-1752.2018.06.001

• CROP GENETICS & BREEDING·GERMPLASM RESOURCES·MOLECULAR GENETICS •     Next Articles

Utilizing Probability Estimation Improves the Accuracy of Plant Variety Identification by Molecular Markers

ZHOU Junfei1, CUI Yehan2, TANG Hao2, LI Lun1, CHEN Hong2, WEN wen2, HAN Ruixi2, HUANG Sisi2, FANG Zhiwei1, PENG Hai1   

  1. 1Institute for Systems Biology, Jianghan University, Wuhan 430056; 2Development Center of Science and Technology of Ministry of Agriculture, Beijing 100122
  • Received:2017-10-29 Online:2018-03-16 Published:2018-03-16

Abstract: 【Objective】 The current standards for plant variety identification only examine a small number of markers on the genome, which may lead to sampling errors, therefore identification conclusions are often questionable. The goal of this study is to estimate the sampling errors in plant variety identification procedure and evaluate the reliability of the conclusions, and eventually provide the scientific basis for the applications of molecular identification standards. 【Method】 Based on the number of observed differential loci between two varieties, a conditional probability model was established based on the Bayes’ theorem to estimate the true number of the different loci. Given that the observed number of differential loci between two varieties follows the binomial distribution, and the prior distribution of true number is an approximate uniform distribution, the conditional probability model was finally computed. Based on the confidence levels provided by the probabilistic model, the relationship between plant varieties is divided into the red, green or yellow zones, corresponding to the same or similar, different and undetermined varieties, respectively. To validate this probabilistic model, the genotyping data of 3 205 SSR molecular markers for each of the 8 rice varieties were used. For each pair of varieties, 10 000 sets of molecular markers were simulated, and each set is composed of 48 random SSR markers. For each simulation, the relationship between the varieties was estimated based on the probability computed by the model. And then the estimated relationship was compared with the real one to evaluate the accuracy of the probabilistic model. Finally, the probabilistic model was applied to provide probabilistic support for the conclusion of a recent watermelon variety infringement case. 【Result】The validation results showed that each pair of varieties was determined as different varieties in 4 295-10 000 simulations at a confidence level of 95%. Compared with the true relationship between varieties, the probabilistic model had an accuracy of 100% in determination of rice variety relationships. Finally, the court decision about the infringement dispute of watermelon varieties also was supported by the probabilistic model at a confidence level of 95%, indicating that the losing party's doubt on the limited number of the sampling loci is not sufficient. 【Conclusion】In this study, a probabilistic model was built to evaluation the reliability of the conclusion of the variety relationships, which provides confidence levels for the molecular identification conclusion of the relationship among varieties, and thus improves the accuracy, and finally avoids the controversies caused by the insufficient number of testing markers.

Key words: variety identification, molecular marker, sampling error

[1]    BUSCHIAZZO E, GEMMELL N J. The rise, fall and renaissance of microsatellites in eukaryotic genomes. Bioessays, 2006, 28(10): 1040-1050.
[2]    MOORE S, SARGEANT L, KING T, MATTICK J, GEORGES M, HETZEL D. The conservation of dinucleotide microsatellites among mammalian genomes allows the use of heterologous PCR primer pairs in closely related species. Genomics, 1991, 10(3): 654-660.
[3]    MOODLEY Y, BAUMGARTEN I, HARLEY E. Horse microsatellites and their amenability to comparative equid genetics. Animal genetics, 2006, 37(3): 258-261.
[4]    DAWSON D A, HORSBURGH G J, KÜPPER C, STEWART I R, BALL A D, DURRANT K L, HANSSON B, BACON I, BIRD S, KLEIN A. New methods to identify conserved microsatellite loci and develop primer sets of high cross species utility–as demonstrated for birds. Molecular Ecology Resources, 2010, 10(3): 475-494.
[5]    MOODLEY Y, MASELLO J F, COLE T L, CALDERON L, MUNIMANDA G K, THALI M R, ALDERMAN R, CUTHBERT R J, MARIN M, MASSARO M, NAVARRO J, PHILLIPS R A, RYAN P G, SUAZO C G, CHEREL Y, WEIMERSKIRCH H, QUILLFELDT P. Evolutionary factors affecting the cross-species utility of newly developed microsatellite markers in seabirds. Molecular Ecology Resources, 2015, 15(5): 1046-1058.
[6]    周青利, 王蕊, 张春宵, 周海涛, 易红梅, 王凤华, 李晓辉, 田红丽, 葛建镕, 席章营, 王凤格. 玉米SSR-DNA指纹库构建方案在高粱中的通用性. 玉米科学, 2017 http://kns.cnki.net/kcms/detail/46. 1068.S.20170406.1210.020.html.
ZHOU Q L, WANG R, ZHANG C X, ZHOU H T, YI H M, WANG F H, LI X H, TIAN H L, GE J R, XI Z Y, WANG F G. A study on universal application of maize SSR-DNA fingerprint database in Sorghum. Journal of Maize Science, 2017: http://kns.cnki.net/kcms/ detail/46.1068.S.20170406.1210.020.html. (in Chinese)
[7]    魏兴华, 韩斌, 徐群, 黄学辉, 张新明, 龚浩, 冯跃, 堵苑苑, 余汉勇. NY/T 2745-2015,水稻品种鉴定 SNP标记法. 中华人民共和国农业部, 2015.
WEI X H, HAN B, XU Q, HUANG X H, ZHANG X M, GONG H, FENG Y, DU Y Y, YU, H Y. NY/T 2745-2015, Protocol for identification of rice varieties--SNP marker method. The Ministry of Agriculture of the People's Republic of China, 2015. (in Chinese)
[8]    徐群, 魏兴华, 庄杰云, 吕波, 袁筱平, 刘平, 张新明, 余汉勇, 堵苑苑. NY/T 1433-2014,水稻品种鉴定 SSR标记法. 中华人民共和国农业部, 2014.
XU Q, WEI X H, ZHUANG J Y, LU B, YUAN Y P, LIU P, ZHANG X M, YU, H Y, DU, Y Y. NY/T 1433-2014, Protocol for identification of rice varieties--SSR marker method. The Ministry of Agriculture of the People's Republic of China, 2014. (in Chinese)
[9]    LI L, FANG Z, ZHOU J, CHEN H, HU Z, GAO L, CHEN L, REN S, MA H, LU L, ZHANG W, PENG H. An accurate and efficient method for large-scale SSR genotyping and applications. Nucleic acids research, 2017, 45(10): e88.
[10]   马艳明, 许勇, 张海英, 张勋基, 陈果, 郭绍贵, 宫国义, 刘志勇, 足木热木, 肖菁, 颜国荣. NY/T 2472-2013, 西瓜品种鉴定技术规程 SSR分子标记法, 中华人民共和国农业部, 2013.
MA Y M, XU Y, ZHANG H Y, ZHANG X J, CHEN G, GUO S G, GONG G Y, LIU Z Y, ZHU M R M, XIAO J, YAN G Y. NY/T 2472-2013, Identification of watermelon varieties--SSR marker method. The Ministry of Agriculture of the People's Republic of China, 2013. (in Chinese)
[11]   UNTERSEER S, BAUER E, HABERER G, SEIDEL M, KNAAK C, OUZUNOVA M, MEITINGER T, STROM T M, FRIES R, PAUSCH H. A powerful tool for genome analysis in maize: development and evaluation of the high density 600K SNP genotyping array. BMC Genomics, 2014, 15: 823.
[12]   XU C, REN Y, JIAN Y, GUO Z, ZHANG Y, XIE C, FU J, WANG H, WANG G, XU Y. Development of a maize 55K SNP array with improved genome coverage for molecular breeding. Molecular Breeding, 2017, 37(3): 20.
[13]   ELLEGREN H. Microsatellites: simple sequences with complex evolution. Nature Review Genetics, 2004, 5(6): 435-445.
[14]   WEBSTER M T, HAGBERG J. Is there evidence for convergent evolution around human microsatellites? Molecular biology and evolution, 2007, 24(5): 1097-1100.
[15]   BRANDSTR M M, BAGSHAW A T, GEMMELL N J, ELLEGREN H. The relationship between microsatellite polymorphism and recombination hot spots in the human genome. Molecular biology and evolution, 2008, 25(12): 2579-2587.
[16]   KELKAR Y D, TYEKUCHEVA S, CHIAROMONTE F, MAKOVA K D. The genome-wide determinants of human and chimpanzee microsatellite evolution. Genome Research, 2008, 18(1): 30-38.
[17]   FUNGTAMMASAN A, ANANDA G, HILE S E, SU M S, SUN C, HARRIS R, MEDVEDEV P, ECKERT K, MAKOVA K D. Accurate typing of short tandem repeats from genome-wide sequencing data and its applications. Genome Research, 2015, 25(5): 736-749.
[18]   ABDULOVIC A L, HILE S E, KUNKEL T A, ECKERT K A. The in vitro fidelity of yeast DNA polymerase δ and polymerase ? holoenzymes during dinucleotide microsatellite DNA synthesis. DNA repair, 2011, 10(5): 497-505.
[19]   BAPTISTE B A, ECKERT K A. DNA polymerase kappa microsatellite synthesis: Two distinct mechanisms of slippage- mediated errors. Environmental and Molecular Mutagenesis, 2012, 53(9): 787-796.
[20]   YANG S, WANG L, HUANG J, ZHANG X, YUAN Y, CHEN J-Q, HURST L D, TIAN D. Parent-progeny sequencing indicates higher mutation rates in heterozygotes. Nature, 2015, 523(7561): 463-467.
[1] WANG MengRui, LIU ShuMei, HOU LiXia, WANG ShiHui, LÜ HongJun, SU XiaoMei. Development of Artificial Inoculation Methodology for Evaluation of Resistance to Fusarium Crown and Root Rot and Screening of Resistance Sources in Tomato [J]. Scientia Agricultura Sinica, 2022, 55(4): 707-718.
[2] ZHAO ChunFang,ZHAO QingYong,LÜ YuanDa,CHEN Tao,YAO Shu,ZHAO Ling,ZHOU LiHui,LIANG WenHua,ZHU Zhen,WANG CaiLin,ZHANG YaDong. Screening of Core Markers and Construction of DNA Fingerprints of Semi-Waxy Japonica Rice Varieties [J]. Scientia Agricultura Sinica, 2022, 55(23): 4567-4582.
[3] LinHan ZOU,XinYing ZHOU,ZeYuan ZHANG,Rui YU,Meng YUAN,XiaoPeng SONG,JunTao JIAN,ChuanLiang ZHANG,DeJun HAN,QuanHao SONG. QTL Mapping of Thousand-Grain-Weight and Its Related Traits in Zhou 8425B × Xiaoyan 81 Population and Haplotype Analysis [J]. Scientia Agricultura Sinica, 2022, 55(18): 3473-3483.
[4] WANG LuWei,SHEN ZhiJun,LI HeHuan,PAN Lei,NIU Liang,CUI GuoChao,ZENG WenFang,WANG ZhiQiang,LU ZhenHua. Analysis of Genetic Diversity of 79 Cultivars Based on SSR Fluorescence Markers for Peach [J]. Scientia Agricultura Sinica, 2022, 55(15): 3002-3017.
[5] DUAN YaRu,GAO MeiLing,GUO Yu,LIANG XiaoXue,LIU XiuJie,XU HongGuo,LIU JiXiu,GAO Yue,LUAN Feishi. Map-Based Cloning and Molecular Marker Development of Watermelon Fruit Shape Gene [J]. Scientia Agricultura Sinica, 2022, 55(14): 2812-2824.
[6] MENG Yu,WEN PengFei,DING ZhiQiang,TIAN WenZhong,ZHANG XuePin,HE Li,DUAN JianZhao,LIU WanDai,FENG Wei. Identification and Evaluation of Drought Resistance of Wheat Varieties Based on Thermal Infrared Image [J]. Scientia Agricultura Sinica, 2022, 55(13): 2538-2551.
[7] FANG TaoHong,ZHANG Min,MA ChunHua,ZHENG XiaoChen,TAN WenJing,TIAN Ran,YAN Qiong,ZHOU XinLi,LI Xin,YANG SuiZhuang,HUANG KeBing,WANG JianFeng,HAN DeJun,WANG XiaoJie,KANG ZhenSheng. Application of Yr52 Gene in Wheat Improvement for Stripe Rust Resistance [J]. Scientia Agricultura Sinica, 2022, 55(11): 2077-2091.
[8] PeiPei ZHU,YiJia LUO,Wen XIANG,MingLei ZHANG,JianXia ZHANG. Rescue and Molecular Marker Assisted-Selection of the Cold-Resistant Seedless Grape Hybrid Embryo [J]. Scientia Agricultura Sinica, 2021, 54(6): 1218-1228.
[9] XI Ling, WANG YuQi, YANG Xiu, ZHU Wei, CHEN GuoYue, WANG Yi, QIN Peng, ZHOU YongHong, KANG HouYang. Evaluation of Resistance to Stripe Rust and Molecular Detection of Resistance Gene(s) in 243 Common Wheat Landraces from the Yunnan Province [J]. Scientia Agricultura Sinica, 2021, 54(4): 684-695.
[10] CHEN DouDou, GUAN LiPing, HE LiangLiang, SONG YinHua, ZHANG Peng, LIU SanJun. Commonality Identification of Molecular Markers Linked to Seedless Genes in Grape [J]. Scientia Agricultura Sinica, 2021, 54(22): 4880-4893.
[11] HAN GuangJie,LIU Qin,LI ChuanMing,QI JianHang,XU Bin,LU YuRong,XU Jian. The Persistent Infection and Detection of Cnaphalocrocis medinalis Granulovirus in Cnaphalocrocis medinalis [J]. Scientia Agricultura Sinica, 2020, 53(19): 3988-3995.
[12] NIU Hao,PING JunAi,WANG YuBin,ZHANG FuYao,LÜ Xin,LI HuiMing,CHU JianQiang. Molecular Aided Breeding System of Photosensitive Forage Sorghum Based on SSR [J]. Scientia Agricultura Sinica, 2020, 53(14): 2795-2803.
[13] WANG FangQuan,CHEN ZhiHui,XU Yang,WANG Jun,LI WenQi,FAN FangJun,CHEN LiQin,TAO YaJun,ZHONG WeiGong,YANG Jie. Development and Application of the Functional Marker for the Broad-Spectrum Blast Resistance Gene PigmR in Rice [J]. Scientia Agricultura Sinica, 2019, 52(6): 955-967.
[14] WU BingChao, TONG Lei, DU ZhaoChang, HU JiaLing, ZHANG Huan, CHEN Yi, LIU Wei, ZHANG XinQuan, HUANG LinKai. Mutagenic Effects of 60Co-γ-rays on Two Species of Pennisetum Genus Forages [J]. Scientia Agricultura Sinica, 2019, 52(3): 414-427.
[15] WANG Jia, ZENG ZhaoQiong, LIANG JianQiu, YU XiaoBo, WU HaiYing, ZHANG MingRong. Development New Molecular Markers for Quantitative Trait Locus (QTL) Analysis of the Seed Protein Content Based on Whole Genome Re-Sequencing in Soybean [J]. Scientia Agricultura Sinica, 2019, 52(16): 2743-2757.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!