Who is this for?
Researchers; no programming experience is needed.
How do you parallelize?
- Ancestry jobs distributed across nodes
- Well-threaded for ancestry calculation
Tool category: Genetic & Proteomic Analysis Platform
Function: Ancestry
Tool type: Omics workflow
Key capabilities:
- Unsupervised ancestry prediction that is very robust to missingness
- 99% accurate with 80% missingness
- 90% accurate with 99% missingness
- Matters for GWAS arrays, targeted sequencing, or when you’re subsetting
Works with:
- Genetic data of large size
Potential use cases:
- Correct for population stratification in GWAS, rare variant, protein abundance analysis, regression and ML models
- Improve sample QC by matching ancestry to stated ethnicity