Abstract
Both polygenicity (many small genetic effects) and confounding biases, such as cryptic relatedness and population stratification, can yield an inflated distribution of test statistics in genome-wide association studies (GWAS). However, current methods cannot distinguish between inflation from a true polygenic signal and bias. We have developed an approach, LD Score regression, that quantifies the contribution of each by examining the relationship between test statistics and linkage disequilibrium (LD). The LD Score regression intercept can be used to estimate a more powerful and accurate correction factor than genomic control. We find strong evidence that polygenicity accounts for the majority of the inflation in test statistics in many GWAS of large sample size.
Original language | English |
---|---|
Pages (from-to) | 291-5 |
Number of pages | 5 |
Journal | Nature Genetics |
Volume | 47 |
Issue number | 3 |
Early online date | 02 Feb 2015 |
DOIs | |
Publication status | Published - Mar 2015 |
Keywords
- Computer Simulation
- Genome, Human
- Genome-Wide Association Study
- Humans
- Linkage Disequilibrium
- Polymorphism, Single Nucleotide
- Regression Analysis
- Sample Size