Variant Data Manipulation
Glow offers functionalities to extract, transform and load (ETL) genomic variant data into Spark DataFrames, enabling manipulation, filtering, quality control and transformation between file formats.
- Data Simulation
- Read and Write VCF, Plink, and BGEN with Spark
- Read Genome Annotations (GFF3) as a Spark DataFrame
- Create a Genomics Delta Lake
- Variant Quality Control
- Sample Quality Control
- Liftover
- Variant Normalization
- Split Multiallelic Variants
- Merging Variant Datasets
- Hail Interoperation
- Utility Functions