Hybrid peeling for fast and accurate calling, phasing, and imputation with sequence data of any coverage in pedigrees
MetadataShow full item record
Background: In this paper, we extend multi-locus iterative peeling to provide a computationally efficient method for calling, phasing, and imputing sequence data of any coverage in small or large pedigrees. Our method, called hybrid peeling, uses multi-locus iterative peeling to estimate shared chromosome segments between parents and their offspring at a subset of loci, and then uses single-locus iterative peeling to aggregate genomic information across multiple generations at the remaining loci. Results: Using a synthetic dataset, we first analysed the performance of hybrid peeling for calling and phasing geno- types in disconnected families, which contained only a focal individual and its parents and grandparents. Second, we analysed the performance of hybrid peeling for calling and phasing genotypes in the context of a full general pedigree. Third, we analysed the performance of hybrid peeling for imputing whole-genome sequence data to non- sequenced individuals in the population. We found that hybrid peeling substantially increased the number of called and phased genotypes by leveraging sequence information on related individuals. The calling rate and accuracy increased when the full pedigree was used compared to a reduced pedigree of just parents and grandparents. Finally, hybrid peeling imputed accurately whole-genome sequence to non-sequenced individuals. Conclusions: We believe that this algorithm will enable the generation of low cost and high accuracy whole- genome sequence data in many pedigreed populations. We make this algorithm available as a standalone program called AlphaPeel.
Is part ofGenetics Selection Evolution, 2018, vol. 50, article number 67
European research projects
The following license files are associated with this item:
Except where otherwise noted, this item's license is described as cc-by (c) Whalen, Andrew et al., 2018
Showing items related by title, author, creator and subject.
Ros Freixedes, Roger; Whalen, Andrew; Gorjanc, Gregor; Mileham, Alan J.; Hickey, John M. (BMC (part of Springer Nature), 2020-04-06)Background: For assembling large whole-genome sequence datasets for routine use in research and breeding, the sequencing strategy should be adapted to the methods that will be used later for variant discovery and imputation. ...
Accuracy of whole-genome sequence imputation using hybrid peeling in large pedigreed livestock populations Ros Freixedes, Roger; Whalen, Andrew; Chen, Ching-Yi; Gorjanc, Gregor; Herring, William O.; Mileham, Alan J.; Hickey, John M. (BMC (part of Springer Nature), 2020-04-06)Background: The coupling of appropriate sequencing strategies and imputation methods is critical for assembling large whole-genome sequence datasets from livestock populations for research and breeding. In this paper, we ...
Whalen, Andrew; Gorjanc, Gregor; Ros Freixedes, Roger; Hickey, John M. (BMC (part of Springer Nature), 2018-09-17)Background: In this paper, we review the performance of various hidden Markov model‐based imputation methods in animal breeding populations. Traditionally, pedigree and heuristic‐based imputation methods have been used for ...