Skip to content

Latest commit

 

History

History
7 lines (4 loc) · 1.05 KB

intro.md

File metadata and controls

7 lines (4 loc) · 1.05 KB

Genetic Ancestry Visualization

The figure below displays samples from the 1000 Genomes Project. What you are seeing are the samples' genotypes projected into a three-dimensional feature-space through dimensionality reduction techniques. Data points are colored according to a sample's reported genetic ancestry.

The genotypes were filtered to include only a small subset of the genome called ancestry-informative single nucleotide polymorphisms (AISNPs). Then, the genotypes were one-hot encoded. Finally, dimensionality reduction was performed to facilitate visualization.

You can also visualize your direct-to-consumer genetic results (e.g. 23andMe). A k-nearest neighbors classifier will predict which of the populations you are most closely related to based on the AISNP genotypes.