Using modern DNA
variation to learn about (pre)history and demographics.
Motivation: This ties into the foundations from Autumn 2020: infinite
alleles model, FST, effective population size, coalescent theory,
etc. Past SGJC attendance is not required. The statistical tests reviewed often
build on each other and/or are used in the application-focused papers. Email Liz Blue
(em27@uw.edu) for more information, or the course website at https://canvas.uw.edu/courses/1448357
Week 1 (March 30). Everyone. Overview of the
general topic, and some background.
Come prepared to
provide at least one comment about what questions you have or things you know
about topics related to human population genetics.
Week 2 (April 6). Leader: Hanley and Jake. The
STRUCTURE paper.
Paper: Pritchard JK, Stephens M, Donnelly P (2000) Inference of
Population Structure Using Multilocus Genotype Data.
GENETICS 155(2): 945-959.
PubMed link: https://pubmed.ncbi.nlm.nih.gov/10835412/
Summary: "We describe a model-based clustering method for
using multilocus genotype data to infer population
structure and assign individuals to populations...Applications of our method
include demonstrating the presence of population structure, assigning
individuals to populations, studying hybrid zones, and identifying migrants and
admixed individuals."
Week 3 (April 13). Leader: Nandana.
The ADMIXTURE update paper.
Paper: Alexander DH, Lange K (2011) Enhancements to the ADMIXTURE
algorithm for individual ancestry estimation. BMC
Bioinformatics 12: 246.
PubMed link: https://pubmed.ncbi.nlm.nih.gov/21682921/
*note: you might also want the original
ADMIXTURE paper, here.
Summary: "...enhancements to ADMIXTURE, a high-performance
tool for estimating individual ancestries and population allele frequencies
from SNP (single nucleotide polymorphism) data. First, ADMIXTURE can be used to
estimate the number of underlying populations through cross-validation. Second,
individuals of known ancestry can be exploited in supervised learning to yield
more precise ancestry estimates."
Week 4 (April 20). Leader: Hassan and Seth.
The pairwise sequentially Markovian coalescent model (PSMC).
Paper: Li H, Durbin R. (2011) Inference of human population
history from individual whole-genome sequences. Nature 475, 493�496.
PubMed link: https://pubmed.ncbi.nlm.nih.gov/21753753/
Summary: "... we present a more detailed history of human
population sizes between approximately ten thousand and a million years ago,
using the pairwise sequentially Markovian coalescent model applied to the
complete diploid genome sequences of a Chinese male (YH)6, a Korean
male (SJK)7, three European individuals (J. C. Venter8,
NA12891 and NA128789) and two Yoruba males (NA1850710)
and NA19239)."
Week 5 (April 27).
Leader: Aditya and Amanda. The ADMIXTOOLS paper.
Paper: Patterson N, Moorjani P, Luo Y, Mallick S, Rohland N, Zhan Y, Genschoreck T, Webster T, Reich D (2012) Ancient Admixture
in Human History. Genetics 192(3): 1065�1093.
PubMed link: https://pubmed.ncbi.nlm.nih.gov/22960212/
Summary: "We present a suite of methods for learning about
population mixtures, implemented in a software package called ADMIXTOOLS, that
support formal tests for whether mixture occurred and make it possible to infer
proportions and dates of mixture. We also describe the development of a new
single nucleotide polymorphism (SNP) array consisting of 629,433 sites with
clearly documented ascertainment that was specifically designed for population
genetic analyses..."
Week 6 (May 4).
Leader: Yujia. The fastsimcoal2 paper.
Paper: Excoffier L, Dupanloup
I, Huerta-S�nchez E, Sousa VC, Foll M (2013) Robust
Demographic Inference from Genomic and SNP Data. PLoS
Genetics 9(10): e1003905.
PubMed link: https://pubmed.ncbi.nlm.nih.gov/24204310/
Summary: "We introduce a flexible and robust simulation-based
framework to infer demographic parameters from the site frequency spectrum
(SFS) computed on large genomic datasets. We show that our composite-likelihood
approach allows one to study evolutionary models of arbitrary complexity, which
cannot be tackled by other current likelihood-based methods."
Week 7 (May 11).
Leader: Ruoyi and Yunqi.
The multiple sequentially Markovian coalescent (MSMC) paper.
Paper: Schiffels, S, Durbin R (2014)
Inferring human population size and separation history from multiple genome
sequences. Nature Genetics 46: 919�925.
PubMed link: https://pubmed.ncbi.nlm.nih.gov/24952747/
Summary: "So far, application of these methods to evolutionary
history more recent than 20,000-30,000 years ago and to population separations
has been limited. Here we present a new method that overcomes these
shortcomings. The multiple sequentially Markovian coalescent (MSMC) analyzes
the observed pattern of mutations in multiple individuals, focusing on the
first coalescence between any two individuals."
Week 8 (May 18).
Leader: Sanne. The paper on Aboriginal Australia.
Paper: Malaspinas AS., Westaway M, Muller C, �, Excoffier
L, Lambert DM, Willerslev E (2016) A genomic history
of Aboriginal Australia. Nature 538: 207�214.
PubMed link: https://pubmed.ncbi.nlm.nih.gov/27654914/
Summary: "The population history of Aboriginal Australians
remains largely uncharacterized. Here we generate high-coverage genomes for 83
Aboriginal Australians (speakers of Pama-Nyungan languages) and 25 Papuans from
the New Guinea Highlands." ... and then apply a bunch of population
genetics tools to learn about prehistory.
Week 9 (May 25).
Leader: Boya and Zorian.
The MSMC-IM paper.
Paper: Wang K, Mathieson I, O�Connell J, Schiffels
S (2020) Tracking human population structure through time from whole genome
sequences. PLoS Genet 16(3): e1008552.
PubMed link: https://pubmed.ncbi.nlm.nih.gov/32150539/
Summary: "Our approach, called MSMC-IM, uses an improved
implementation of the MSMC (MSMC2) to estimate coalescence rates within and
across pairs of populations, and then fits a continuous Isolation-Migration
model to these rates to obtain a time-dependent estimate of gene flow. We show,
using simulations, that our method can identify complex demographic scenarios
involving post-split admixture or archaic introgression."
Week 10 (June 1).
Leader: Kelsey. The Native American gene flow paper.
Paper: Ioannidis AG, Blanco-Portillo J, Sandoval K, �, Verdugo
RA, Moraga M, Mentzer AJ, Bustamante CD,
Moreno-Estrada A (2020) Native American gene flow into Polynesia predating
Easter Island settlement. Nature 583: 572�577.
PubMed link: https://pubmed.ncbi.nlm.nih.gov/32641827/
Summary: "Several limited molecular genetic studies have
reached opposing conclusions, and the possibility continues to be as hotly
contested today as it was when first suggested8-12. Here we analyse genome-wide variation in individuals from islands
across Polynesia for signs of Native American admixture, analysing
807 individuals from 17 island populations and 15 Pacific coast Native American
groups." ... and then apply a bunch of population genetics tools to learn
about prehistory.