Glow
Glow is an open-source toolkit for working with genomic data at biobank-scale and beyond. The toolkit is natively built on Apache Spark, the leading unified engine for big data processing and machine learning, enabling genomics workflows to scale to population levels.
- Introduction to Glow
- Getting Started
- GWAS Tutorial
- Customizing Glow
- Variant Data Manipulation
- Tertiary Analysis
- Troubleshooting
- Contributing
- Blog Posts
- [Jul. 2020] Introducing GloWGR: An industrial-scale, ultra-fast and sensitive method for genetic association studies
- [Jun. 2020] Glow 0.4 Enables Integration of Genomic Variant and Annotation Data
- [Mar. 2020] Glow 0.3.0 Introduces Several New Large-Scale Genomic Analysis Features
- [Nov. 2019] Streamlining Variant Normalization on Large Genomic Datasets
- Additional Resources
- Python API