omics

Genetic Ancestry

Who is this for?

Researchers; no programming experience is needed.

How do you parallelize? 

  • Ancestry jobs are distributed across nodes
  • The ancestry calculation itself is multithreaded
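As a rough illustration of the first point, the sketch below splits a sample list into batches and runs one job per batch in a worker pool. It is not the tool's actual scheduler: `run_ancestry_job`, `chunk`, and `run_all` are hypothetical names, and a local thread pool stands in for separate compute nodes.

```python
from concurrent.futures import ThreadPoolExecutor


def chunk(items, size):
    """Split a list into consecutive batches of at most `size` items."""
    return [items[i:i + size] for i in range(0, len(items), size)]


def run_ancestry_job(sample_ids):
    """Hypothetical stand-in for one ancestry job.

    A real job would run the ancestry calculation on these samples;
    here every sample is simply marked as unassigned.
    """
    return {sample_id: "unassigned" for sample_id in sample_ids}


def run_all(sample_ids, batch_size, max_workers=4):
    """Run one job per batch in parallel and merge the per-batch results."""
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map returns results in batch order, whichever job finishes first.
        for partial in pool.map(run_ancestry_job, chunk(sample_ids, batch_size)):
            results.update(partial)
    return results


labels = run_all(["s1", "s2", "s3", "s4", "s5"], batch_size=2)
```

Because each batch is independent, the same pattern carries over to a cluster scheduler, where each batch would be submitted as its own job.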

Tool category: Genetic & Proteomic Analysis Platform

Function: Ancestry

Tool type: Omics workflow

Key capabilities:

  • Unsupervised ancestry prediction that is highly robust to missing data
    • 99% accurate with 80% missingness
    • 90% accurate with 99% missingness
  • This robustness matters for GWAS arrays, targeted sequencing, and subsetted data
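To see where a dataset falls relative to the missingness levels quoted above, the per-sample missingness rate can be computed directly. The sketch below is only an illustration: it assumes genotype calls are held in a plain list with `None` marking a missing call, which is not necessarily how the tool represents its input.

```python
def missingness(genotypes):
    """Return the fraction of missing calls (None) in one sample's genotypes."""
    if not genotypes:
        raise ValueError("genotype list is empty")
    return sum(call is None for call in genotypes) / len(genotypes)


# Toy sample: 3 of 5 calls are missing.
sample = [0, None, 1, None, None]
rate = missingness(sample)
```

Here `rate` is 0.6, that is, 60% missingness for this toy sample.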

Works with:

  • Large genetic datasets

Potential use cases:

  • Correct for population stratification in GWAS, rare-variant analyses, protein abundance analyses, and regression and ML models
  • Improve sample QC by checking predicted ancestry against stated ethnicity