Personal bioinformatics

Personal Genome Analysis

Processed 131GB of whole-genome sequencing data, identified 4.99 million variants, and built a full pipeline for ancestry and pharmacogenomics analysis.

A deliberately end-to-end project that treated personal omics data as both a scientific and product problem.

Back to projects GitHub Write-up

Why it mattered

This project turned genomics from abstract fascination into direct technical contact with the scale, messiness, and infrastructure burden of real sequencing data.

Personal Genome Analysis

What I built

An analysis pipeline that handled mapping, variant identification, ancestry characterization, and pharmacogenomic interpretation, deployed on cloud infrastructure sized for the workload.

Personal Genome Analysis

What I learned

Biological data is rarely impressive because of one model. It becomes impressive when the pipeline, interpretation, and presentation all hold together.

More work

More projects.

Enso Atlas TrialScope AI AgentHLE CNV Cancer RNA-seq Analysis