Who is this for?
Researchers; no programming experience is needed.
How do you parallelize?
- Tasks are distributed to an autoscaling cluster
- In-house scheduler for batch processing + Ray for fine-grained parallelism
Tool category: Genetic & Proteomic Analysis Platform
Function: Search (NLP & Filters)
Tool type: Omics workflow
Key capabilities:
- Rapid natural language and structured variant/sample filtering of annotated VCF data
- Filter millions of mutations in a fraction of a second by any or every annotation field at once
Works with:
- Bystro genetic annotations
- SomaScan, TMT (through Python)
Potential use cases:
- GWAS, burden tests, rare variant analysis
- Cohort browser subset cohorts
- Filter and join proteomics datasets with genetic covariants