Accounting for long-range correlations in genome-wide simulations of large cohorts
Authors: Dominic Nelson [1]; Jerome Kelleher [2]; Aaron P. Ragsdale [1]; Claudia Moreau [3]; Gil McVean [2]; Simon Gravel [1]
Affiliations: [1] McGill University and Genome Québec Innovation Centre, McGill University, Montréal, Québec, Canada; [2] Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom; [3] Centre Intersectoriel en Santé Durable, Université du Québec à Chicoutimi, Saguenay, Québec, Canada
Published in: Accounting for long-range correlations in genome-wide simulations of large cohorts. PLoS Genet 16(5): e1008619. doi:10.1371/journal.pgen.1008619
Category: Research Article
doi: https://doi.org/10.1371/journal.pgen.1008619
Abstract
Coalescent simulations are widely used to examine the effects of evolution and demographic history on the genetic makeup of populations. Thanks to recent progress in algorithms and data structures, simulators such as the widely used msprime now provide genome-wide simulations for millions of individuals. However, this software relies on classic coalescent theory and its assumptions that sample sizes are small and that the region being simulated is short. Here we show that coalescent simulations of long regions of the genome exhibit large biases in identity-by-descent (IBD), long-range linkage disequilibrium (LD), and ancestry patterns, particularly when the sample size is large. We present a Wright-Fisher extension to msprime and show that it produces more realistic distributions of IBD, LD, and ancestry proportions, while also addressing more subtle biases of the coalescent. Further, these extensions are more computationally efficient than state-of-the-art coalescent simulations when simulating long regions, including whole-genome data. For shorter regions, efficiency can be maintained via a hybrid model, which simulates the recent past under the Wright-Fisher model and uses coalescent simulations in the distant past.
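The hybrid idea described in the abstract can be illustrated with a toy sketch. This is not the msprime implementation: it assumes a haploid population of constant size, a single non-recombining locus, and tracks only the number of ancestral lineages. The names `wf_step` and `hybrid_tmrca` are hypothetical and chosen for illustration. The recent past is simulated generation by generation under a Wright-Fisher model, where several lineages may merge in the same generation, and the remaining lineages are then handed to a standard coalescent that draws exponential waiting times between pairwise mergers.

```python
import random


def wf_step(num_lineages, pop_size, rng):
    """One backward-in-time Wright-Fisher generation (haploid, single locus).

    Every lineage picks a parent uniformly at random among pop_size gene
    copies; lineages that pick the same parent merge. Several mergers, and
    mergers of more than two lineages, can occur in the same generation.
    """
    parents = {rng.randrange(pop_size) for _ in range(num_lineages)}
    return len(parents)


def hybrid_tmrca(sample_size, pop_size, wf_generations, seed=None):
    """Return the time (in generations) to the most recent common ancestor.

    Toy hybrid scheme: Wright-Fisher for the first wf_generations
    generations, then a standard (Kingman) coalescent for whatever
    lineages remain.
    """
    rng = random.Random(seed)
    k = sample_size  # number of ancestral lineages still being tracked
    t = 0.0

    # Phase 1: discrete generations under the Wright-Fisher model.
    while k > 1 and t < wf_generations:
        k = wf_step(k, pop_size, rng)
        t += 1.0

    # Phase 2: continuous-time coalescent. With k lineages in a haploid
    # population of size pop_size, pairs merge at total rate
    # k * (k - 1) / (2 * pop_size) per generation, one pair at a time.
    while k > 1:
        rate = k * (k - 1) / (2.0 * pop_size)
        t += rng.expovariate(rate)
        k -= 1

    return t


if __name__ == "__main__":
    # A sample that is large relative to the population: the case in which
    # the coalescent's one-merger-at-a-time assumption is least accurate.
    print(hybrid_tmrca(sample_size=500, pop_size=1000, wf_generations=100, seed=42))
```

Setting `wf_generations=0` reduces the sketch to a pure coalescent, and a very large value to a pure Wright-Fisher run, so the two regimes can be compared with the same function.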
Keywords:
DNA recombination – Effective population size – Genetic polymorphism – Genome evolution – Linkage disequilibrium – Population genetics – Population size – Simulation and modeling
This article appeared in the journal PLOS Genetics, 2020, issue 5.