Scientia Agricultura Sinica ›› 2014, Vol. 47 ›› Issue (22): 4495-4505.doi: 10.3864/j.issn.0578-1752.2014.22.015

• ANIMAL SCIENCE·VETERINARY SCIENCERE • Previous Articles     Next Articles

The Strategy of Parameter Optimization of Bayesian Methods for Genomic Selection in Livestock

ZHU Bo1,2, WANG Yan-hui1, NIU Hong1, CHEN Yan1, ZHANG Lu-pei1, GAO Hui-jiang1, GAO Xue1, LI Jun-ya1, SUN Shao-hua2   

  1. 1Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193
    2College of Animal Science and Technology, Agricultural University of Hebei, Baoding 071000, Hebei
  • Received:2013-10-23 Revised:2014-07-11 Online:2014-11-16 Published:2014-11-16

Abstract: Variety selection in livestock breeding occupies an important position. Genomic selection, as a novel technology in livestock breeding, has raised considerable concern. It can shorten the generation interval, speed up the genetic progress, and it can select the candidate individuals as breeding stock without phenotypic data. In 2001, Meuviwisen proposed the concept of genomic selection, which was first applied in dairy cattle. Until August 2014, there were 34 member countries of Interbull organization that had applicated genomic selection in their national dairy cattle breeding group. With the popularization and continuous promotion of genomic selection, some problems of the accuracy of genomic estimated breeding value need to be solved. Various methods of genomic selection have been proposed and more efficient models are being developed. So it has great practical significance to exploit better models and algorithm to improve the accuracy of genomic estimated breeding value. So far, there were 17 Bayesian methods that have been successively proposed. This thesis briefly introduced the classical BayesA and BayesB methods for genomic selection. BayesA assumed that all loca have effect, while BayesB supposed that a small part of locus have effect, and the percentage was extremely small. Therefore, BayesA and BayesB had different models and algorithms. After Meuviwisen proposed classic Bayesian methods, other methods were like mushrooms springing up. New Bayesian methods were based on the classical Bayesian methods, which was optimized by improving the hypothetical model and algorithm. For example, BayesC method, which was based on BayesB, optimized the π value in the model. BayesCπ and BayesDπ were the improvement of BayesC, and these two approaches assumed that marker effect variance of each locus had the same value, whereas BayesC assumed that its marker effect variance of each locus was different. BayesDπ, which was based on BayesCπ, optimized the scale parameter of inverse chi-square distribution. Bayes Lasso had the same idea with BayesA. However, its marker effects were assumed to be another distribution for Laplace, so its posterior distributions of marker effects were also changed. BayesRS method assumed that the variances of marker effect were allocated in different percentage of total genetic variance. In order to find proper hypothesis model and parameters, other Bayesian methods were also based on predecessors' research through changing the prior assumption and improving parameters of the model. At present, commonly used methods for genomic selection are classic Bayesian methods and BayesCπ, which have stabile calculation results and high accuracy of genomic estimated breeding value. In the three Bayesian algorithms, the accuracy is generally arranged into BayesB > BayesCπ > BayesA, but accuracy of genomic estimated breeding value of some traits is not the case. Compared with classical Bayesian method, parameter optimization can improve the accuracy of genome estimated breeding value to some extent. In a word, on the basis of the classical Bayesian method,for the purpose of improving the accuracy of genomic estimated breeding value, the extension of bayesian methods and its parameters optimization strategy seeked for the optimal model and parameters optimization through biological genetic algorithm combined with actual population situation. They enriched and expanded the genomic selection algorithm, and can make the genomic breeding value more reference significance. As the animal breeding process is far from the foreign breeding process in China, genomic selection can cultivate new breed, enrich the genetic resources of China and accelerate the pace of livestock and poultry breeding process. Meanwhile, the algorithm study of genomic slection and its application in China was introduced. In face of the advantages of genomic selection, whole genomic selection breeding technology is imperative. Furthermore, the main problems in current researches and the key points in future studies were also proposed, in hope of providing reference for obtaining more reliable and faster algorithm of genomic selection.

Key words: genomic selection, Bayesian method, parameter optimization

[1]    Hayes B J, Bowman P J, Chamberlain A J, Goddard M E. Invited review: Genomic selection in dairy cattle: Progress and challenges. Journal of Dairy Science, 2009, 92(2):433-443.
[2]    Goddard M E, Hayes B J. Mapping genes for complex traits in domestic animals and their use in breeding programmes. Nature Reviews Genetics, 2009, 10(6):381-391.
[3]    Cole J B, Null D J. Visualization of the transmission of direct genomic values for paternal and maternal chromosomes for 15 traits in US Brown Swiss, Holstein, and Jersey cattle . Journal of Dairy Science, 2013, 96(4):2713-2726.
[4]    Spelman R J, Hayes B J, Berry D P. Use of molecular technologies for the advancement of animal breeding: genomic selection in dairy cattle populations in Australia, Ireland and New Zealand. Animal Production Science, 2013, 53(9): 869-875.
[5]    Su G, Guldbrandtsen B, Gregersen V R, Lund M S. Preliminary investigation on reliability of genomic estimated breeding values in the Danish Holstein population. Journal of Dairy Science, 2010, 93(3):1175-1183.
[6]    Vanraden P M. Efficient methods to compute genomic predictions.  Journal of Dairy Science, 2008, 91:4414 - 4423.
[7]    Meuwissen T H E, Hayes B J, Goddard M E. Prediction of total genetic value using genome wide dense marker maps. Genetics, 2001, 157:1819 - 1829.
[8]    Habier D, Fernando R L, Kizilkaya K, Garrick D J. Extension of the bayesian alphabet for genomic selection. BMC Bioinformatics, 2011, 12:186.
[9]    Clark S A, Hickey J M, Werf J H V D. Different models of genetic variation and their effect on genomic evaluation. Genetics Selection Evolution, 2011, 43:18.
[10]   Xu S Z. Estimating polygenic effects using markers of the entire genome. Genetics, 2003, 163(2):789-801.
[11]   Yi N, Xu S. Bayesian LASSO for quantitative trait loci mapping. Genetics, 2008, 179:1045 - 1055.
[12]   Meuwissen T H, Solberg T R, Shepherd R, Woolliams J A. A fast algorithm for BayesB type of prediction of genome-wide estimates of genetic value. Genetics Selection Evolution, 2009, 41:2.
[13]   Gustavo D L C, Naya H, Gianola D, Crossa J, Legarra A, Manfredi E, Weigel K, Cotes J M. Predicting Quantitative Traits With Regression Models for Dense Molecular Markers and Pedigree. Genetics, 2009, 182(1):375-385.
[14]   Verbyla K L, Hayes B J, Bowman P J, Goddard M E. Accuracy of genomic selection using stochastic search variable selection in Australian Holstein Friesian dairy cattle. Genetics Research (Camb), 2009, 91(5):307-311.
[15]   Xu S. An expectation-maximization algorithm for the Lasso estimation of quantitative trait locus effects. Heredity (Edinb), 2010, 105(5):483-94.
[16]   Hayashi T, Iwata H. EM algorithm for Bayesian estimation of genomic breeding values. BMC Genetics, 2010, 11:3.
[17]   Sun W, Ibrahim J G, Zou F. Genomewide multiple-loci mapping in experimental crosses by iterative adaptive penalized regression. Genetics, 2010, 185(1):349-359.
[18]   Ross S, Meuwissen T H E, Woolliams J. Genomic selection and complex trait prediction using a fast EM algorithm applied to genome-wide markers. BMC Bioinformatics, 2010, 11(1):529.
[19]   Mutshinda C M, Mikko J S. Extended Bayesian LASSO for Multiple Quantitative Trait Loci Mapping and Unobserved Phenotype Prediction. Genetics, 2010, 186(3):1067-1075.
[20]   Rasmus B, Su G S, Lund M, Bowman P, Goddard M, Hayes B. Genome position specific priors for genomic prediction. BMC Genomics, 2012, 13(1):543.
[21]   Wang C L, Ding X D, Wang J Y, Liu J F, Fu W X, Zhang Z, Yin Z J, Zhang Q. Bayesian methods for estimating GEBVs of threshold traits. Heredity (Edinb), 2013, 110(3):213-219.
[22]   Gianola D, Gustavo D L C, William G H, Eduardo M, Rohan F. Additive Genetic Variability and the Bayesian Alphabet. Genetics, 2009, 183(1):347-363.
[23]   Gianola D. Priors in whole-genome regression: the bayesian alphabet returns. Genetics, 2013, 194(3):573-96.
[24]   Jia Y, Jannink J- L. Multiple - t rait genomic selection m e-thods increase genetic value prediction accuracy. Genetics, 2012, 192(4): 1513-1522.
[25]   Geweke J. Bayesian treatment of the independent student-t linear model. Journal of Applied Econometrics, 1993, 8(S1):S19-S40.
[26]   Erbe M, Hayes B J, Matukumalli L K, Goswami S, Bowman P J, Reich C M, Mason B A, Goddard M E. Improving accuracy of genomic predictions within and between dairy cattle breeds with imputed high-density single nucleotide polymorphism panels. Journal of Dairy Science, 2012, 95(7):4114-4129.
[27]   Park T, George C. The Bayesian Lasso. Journal of the American Statistical Association, 2008, 103(482):681-686.
[28]   Pascale. L R, Olivier F, Demeure O, Elsen J.M. Comparison of analyses of the XVth QTLMAS common dataset III: Genomic estimations of breeding values. BMC Proceedings, 2012, 6(Suppl. 2): S3.
[29]   尼桂琰, 张哲, 姜力马裴裴, ,丁向东. 利用全基因组连锁不平衡估计中国荷斯坦牛有效群体大小. 遗传, 2012, 34(1): 50-58.
Ni G Y, Zhang Z, Jiang L, Ma P P, Zhang Q, Ding X D. Chinese Holstein Cattle effective population size estimated from whole genome linkage disequilibrium. Hereditas (Beijing), 2012, 34(1): 50-58. (in Chinese)
[30]   李聪, 孙东晓, 姜力, 刘剑锋, 张勤, 张沅, 张胜利. 奶牛重要经济性状全基因组关联分析研究进展. 遗传, 2012, 34(5): 545-550.
Li C, Sun D X, Jiang L, Liu J F, Zhang Q, Zhang Y, Zhang S L. Advances on genome-wide association study for economically im-portant traits in dairy cattle. Hereditas (Beijing), 2012, 34(5): 545-550. (in Chinese)
[31]   王晓, 解小莉, 王胜, 钱梦樱, 马裴裴, 张沅, 孙东晓, 刘剑锋, 丁向东, 姜力, 王雅春, 张毅, 张胜利, 张勤, 俞英.  中国荷斯坦牛乳房炎易感性及抗性的全基因组关联分析畜牧兽医学报,2013, 44(12):1907-1912.
Wang X, Xie X L, Wang S, Qiang M Y, Ma P P, Zhang Y, Sun D X, Liu J F, Ding X D, Jiang L, Wang Y C, Zhang Y, Zhang S L, Zhang Q, Yu Y. Genome-wide association study for mastitis susceptibility and resistance in chinese holsteins. Acta Veterinaria et Zootechnica Sinica, 2013, 44(12):1907-1912. (in Chinese)
[32]   郭刚, 周磊, 刘林, 李东, 张胜利, 刘剑锋, 丁向东, 张毅, 王雅春, 张勤, 张沅.  利用SNP标记进行北京地区中国荷斯坦牛亲子推断的研究畜牧兽医学报, 2012,43(1):44-49.
Guo G, Zhou L, Liu L, Li D, Zhang S L, Liu J F, Ding X D, Zhang Y, Wang Y C, Zhang Q, Zhang Y. Parentage inference with sinle nucleotide polymorphism markers in the Chinese Holstein in Beijing. Acta Veterinaria et Zootechnica Sinica, 2012, 43(1): 44-49. (in Chinese)
[33]   张立敏, 张猛, 周正奎, 刘利, 刘喜冬, 陈翠, 陈晓杰, 朱淼, 袁峥嵘, 李姣, 许尚忠, 高会江, 高雪, 张路培, 李俊雅.  基于高密度SNP芯片技术对中国西门塔尔牛SCD1基因与肉质性状的相关性分畜牧兽医学报, 2012, 43(6): 878-886.
Zhang L M, Zhang M, Zhou Z K, Liu X D, Cheng C, Cheng X J, Zhu M, Yuan Z R, Li J, Xu S Z, Gao H J, Gao X, Zhang L P, Li J Y. Association of SCD1 gene with meat quality traits in Chinese Simmental based on high-density SNP chip. Acta Veterinaria et Zootechnica Sinica, 2012, 43(6): 878-886. (in Chinese)
[34]   郭立平, 徐丽, 朱淼, 张路培, 高会江, 李俊雅, 许尚忠, 高雪. 西门塔尔牛微卫星亲子鉴定体系的优化畜牧兽医学报, 2013, 44(6): 871-879.
Guo L P, Xu L, Zhu M, Zhang L P, Gao H J, Li J Y, Xu S Z, Gao X. Optimization of microsatellite-based paternity testing system for simmental. Acta Veterinaria et Zootechnica Sinica, 2013, 44(6): 871-879. (in Chinese)
[35]   朱波, 吴洋, 齐欣, 牛红, 张静静, 王延晖, 陈燕, 张路培, 高会江, 高雪, 李俊雅, 孙少华. 利用混合线性模型和BayesCPi进行QTL-MAS2011公共数据集的全基因组关联分析. 畜牧兽医学报, 2014, 45(5):692-698.
Zhu B, Wu Y, Qi X, Niu H, Zhang J J, Wang Y H, Cheng Y, Zhang L P, Gao H J, Gao X, Li J Y, Sun S H. Genome-wide association analyses of the 15th QTL-MAS workshop data using mixed linear model and Bayes CPi method. Acta Veterinaria et Zootechnica Sinica, 2014, 45(5):692-698. (in Chinese)
[36]   Zhang Z, Liu J, Ding X, Bijma P, De Koning D J, Zhang Q. Best linear unbiased prediction of genomic breeding values using a trait-specific marker-derived relationship matrix. PLoS ONE, 2010, 5(9): e12648.
[37]   Zhang Z, X Ding D, Liu J F, Zhang Q, De Koning D J. Accuracy of genomic prediction using low density marker panels. Journal of Dairy Science, 2011, doi:10.3186/jds.2010-3917.
[38]   Zhang Z, Ding X D, Liu J F, De Koning D J, Zhang Q. Genomic selection for QTL-MAS data using a trait-sepcific relation matrix. BMC Proceedings, 2011, 5(S3):15.
[39]   Zhang Z, X D. Ding, J F. Liu, D. J. De Koning ,Q. Zhang. TA-BLUP: a new genetic evalution method for genomic selection. The proceedings of the 9th world congress on genetics applied to livestock production(WCGALP) , Leizig, Germany. 2010.
[40]   Zhu B, Wang Y H, Sun S H, Li J Y. Prioritization of BayesB and BayesCPi methods in genomic selection with simulation data. 第十七次全国遗传育种学术讨论会论文集. 2013.10
[41]   Petersen E W. Crossbreeding of dairy cattle: The Science and the Impact.pdf. 4th Biennial W E Petersen Symposium, 2007.
[42]   Berglund B. Genetic Improvement of Dairy Cow Reproductive Performance. Reproduction in Domestic Animals, 2008, 43:89-95.
[43]   Ding X D, Zhang Z, Li X., Wang S , Wu X., Sun D, Yu Y, Liu J , Wang Y , Zhang Y, Zhang S , Zhang Y , Zhang Q. Accuracy of genomic prediction for milk production traits in the Chinese Holstein population using a reference population consisting of cows. Journal of Dairy Science, 96(8): 5315-5323.
[44]   Zhu M, Zhu B, Wang Y H, Wu Y, Xu L, Guo L P, Yuan Z R, Zhang L P, Gao X., Gao H J, Xu S Z, Li J Y. Linkage Disequilibrium estimation of Chinese beef Simmental Cattle using high-density SNP panels. Asian Australas Journal of Animal Sciences, 2013, 26(6): 772-779.
[45]   Gao H, Christensen O F, Madsen P, Nielsen U S, Zhang Y, Lund M S, Su G. Comparison on genomic predictions using three GBLUP methods and two single-step blending methods in the Nordic Holstein population. Genetics Selection Evolution, 2012, 44:8.
[46]   Chen J, Wang Y C, Zhan Y g, Sun D X, Zhang S L, Zhang Y. Evaluation of breeding programs combining genomic information in Chinese Holstein. Agricultural Sciences in China, 2011, 10(12): 1949-1957.
[47]   Jiang J, Jiang L, Zhou B, Fu W, Liu J F, Zhang Q. Snat: a SNP annotation tool for bovine by integrating various sources of genomic information. BMC Genet, 2011, 12:85.
[48]   Fang M, Liu J, Sun D, Zhang Y, Zhang Q, Zhang S. QTL mapping in outbred half-sib families using Bayesian model selection. Heredity (Edinb), 2011, 107(3):265-276.
[49]   Li J, Liu J F , Sun D X , Ma P P, Ding X D, Yu Y, Zhang  Q. Genome wide association studies for milk production traits in Chinese Holstein population. PLoS ONE, 5(10): e13661. doi:10.1371/journal.pone. 0013661
[50] Li J, Jiang J C, Yang J, Liu X, Wang J Y, Wang H F, Ding X D, Liu J F, Zhang Q. Genome wide association studies for body conformation traits in the Chinese Holstein cattle population. BMC Genomics,2013, 14:897.
[51]   Gao H, Wu Y, Li J, Li H, Yang R. Forward LASSO analysis for high-order interactions in genome-wide association study. Brief Bioinform, 2013, 7:1093-1103.
[52]   邸超, 宋迟. 全基因组选择将为畜禽育种带来技术革命——中国农业大学胡晓湘教授专访. 中国家禽,2011, 33(10): 61-63.
Di C, Song C. Genomic selection will bring technology revolution for livestock and poultry breeding, personal interview professor Hu Xiaoxiang of China agricultural university. Chinese poultry Journal, 2011, 33(10): 61-63. (in Chinese)
[53]   Jane Silverthorne. Big Data in Agriculture A Basic Perspective. Internet2 Global Summit April 9, 2014.
[54]   Cole JB, Newman S, Foertter F, Aguilar I, Coffey M. Breeding and Genetics Symposium: really big data: processing and analysis of very large data sets. Journal of Animal Science, 2012,90(3):723-733.
[1] ZHOU Jun,LIN Qing,SHAO BaoQuan,REN DuanYang,LI JiaQi,ZHANG Zhe,ZHANG Hao. Evaluating the Application Effect of Single-Step Genomic Selection in Pig Populations [J]. Scientia Agricultura Sinica, 2022, 55(15): 3042-3049.
[2] ZHU Mo,ZHENG MaiQing,CUI HuanXian,ZHAO GuiPing,LIU Yang. Comparison of Genomic Prediction Accuracy for Meat Type Chicken Carcass Traits Based on GBLUP and BayesB Method [J]. Scientia Agricultura Sinica, 2021, 54(23): 5125-5131.
[3] TANG ZhenShuang,YIN Dong,YIN LiLin,MA YunLong,XIANG Tao,ZHU MengJin,YU Mei,LIU XiaoLei,LI XinYun,QIU XiaoTian,ZHAO ShuHong. To Evaluate the “Two-Step” Genomic Selection Strategy in Pig by Simulation [J]. Scientia Agricultura Sinica, 2021, 54(21): 4677-4684.
[4] NIE ZhiGang,LI Guang,WANG Jun,MA WeiWei,LUO CuiPing,DONG LiXia,LU YuLan. Parameter Optimization for the Simulation of Leaf Area Index of Dryland Wheat with the APSIM Model [J]. Scientia Agricultura Sinica, 2019, 52(12): 2056-2068.
[5] ZHANG JinXin, TANG ShaoQing, SONG HaiLiang, GAO Hong, JIANG Yao, JIANG YiFan, MI ShiRong, MENG QingLi, YU Fan, XIAO Wei, YUN Peng, ZHANG Qing, DING XiangDong. Joint Genomic Selection of Yorkshire in Beijing [J]. Scientia Agricultura Sinica, 2019, 52(12): 2161-2170.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!