Why it mattered
This project turned genomics from abstract fascination into direct technical contact with the scale, messiness, and infrastructure burden of real sequencing data.
Personal bioinformatics
Processed 131GB of whole-genome sequencing data, identified 4.99 million variants, and built a full pipeline for ancestry and pharmacogenomics analysis.
A deliberately end-to-end project that treated personal omics data as both a scientific and product problem.
This project turned genomics from abstract fascination into direct technical contact with the scale, messiness, and infrastructure burden of real sequencing data.
An analysis pipeline that handled mapping, variant identification, ancestry characterization, and pharmacogenomic interpretation, deployed on cloud infrastructure sized for the workload.
Biological data is rarely impressive because of one model. It becomes impressive when the pipeline, interpretation, and presentation all hold together.
More work