Ancestry is a company providing Geneology and Direct-to-Consumer (DTC) Autosomal Genetic Testing services.

Ancestry offers two major genealogical services based on a saliva test: identity-by-descent analysis (see community detection) and genetic ancestry. ethnicity

non-sex chromosomes.

Sequencing Technology

Genetic ancestry

Community detection

Challenges of community detection curtis2017estimation

  1. Clustering is computationally intensive and is not feasible to perform regularly.
  2. Clustering algorithms typically assign nodes to only the single most suitable cluster.
    • To solve this, Ancestry built a classification algorithm for each community.
  3. While clustering is consistent across an entire database, individual assignments may change across repetitions.



[ethnicity] Noto, Wang, Song, Turissini, Sedghifar, Garrigan, Starr, Byrnes, Hong, Ball & others, Ethnicity Estimate 2018 White Paper, , .

[han2017clustering] Han, Carbonetto, Curtis, Wang, Granka, Byrnes, Noto, Kermany, Myres, Barber & others, Clustering of 770,000 genomes reveals post-colonial population structure of North America, Nature communications, 8, 14238 (2017).

[curtis2017estimation] Curtis & Girshick, Estimation of Recent Ancestral Origins of Individuals on a Large Scale, 1417-1425, in in: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, edited by (2017)

[identity] Erlich, Shor, Peer & Carmi, Identity inference of genomic data using long-range familial searches, Science, 362(6415), 690-694 (2018). link. doi.