Title | On the substructure controls in rare variant analysis: Principal components or variance components? |
Publication Type | Journal Article |
Year of Publication | 2018 |
Authors | Luo, Yiwen, Arnab Maity, Michael C. Wu, Chris Smith, Qing Duan, Yun Li, and Jung-Ying Tzeng |
Journal | Genet Epidemiol |
Volume | 42 |
Issue | 3 |
Pagination | 276-287 |
Date Published | 2018 Apr |
ISSN | 1098-2272 |
Keywords | Computer Simulation, Confounding Factors, Epidemiologic, Genetic Association Studies, Genetic Variation, Humans, Models, Genetic, Principal Component Analysis |
Abstract | Recent studies showed that population substructure (PS) can have more complex impact on rare variant tests and that similarity-based collapsing tests (e.g., SKAT) may suffer more severely by PS than burden-based tests. In this work, we evaluate the performance of SKAT coupling with principal components (PC) or variance components (VC) based PS correction methods. We consider confounding effects caused by PS including stratified populations, admixed populations, and spatially distributed nongenetic risk; we investigate which types of variants (e.g., common, less frequent, rare, or all variants) should be used to effectively control for confounding effects. We found that (i) PC-based methods can account for confounding effects in most scenarios except for admixture, although the number of sufficient PCs depends on the PS complexity and the type of variants used. (ii) PCs based on all variants (i.e., common + less frequent + rare) tend to require equal or fewer sufficient PCs and often achieve higher power than PCs based on other variant types. (iii) VC-based methods can effectively adjust for confounding in all scenarios (even for admixture), though the type of variants should be used to construct VC may vary. (iv) VC based on all variants works consistently in all scenarios, though its power may be sometimes lower than VC based on other variant types. Given that the best-performed method and which variants to use depend on the underlying unknown confounding mechanisms, a robust strategy is to perform SKAT analyses using VC-based methods based on all variants. |
DOI | 10.1002/gepi.22102 |
Alternate Journal | Genet Epidemiol |
Original Publication | On the substructure controls in rare variant analysis: Principal components or variance components? |
PubMed ID | 29280188 |
PubMed Central ID | PMC5851819 |
Grant List | P01 CA142538 / CA / NCI NIH HHS / United States R01 HG006292 / HG / NHGRI NIH HHS / United States R01 HG006703 / HG / NHGRI NIH HHS / United States R01 HL129132 / HL / NHLBI NIH HHS / United States |
On the substructure controls in rare variant analysis: Principal components or variance components?
Project: