Article

Likelihood-based inference of population history from low-coverage de novo genome assemblies

Details

Citation

Hearn J, Stone GN, Bunnefeld L, Nicholls JA, Barton NH & Lohse K (2014) Likelihood-based inference of population history from low-coverage de novo genome assemblies. Molecular Ecology, 23 (1), pp. 198-211. https://doi.org/10.1111/mec.12578

Abstract
Short-read sequencing technologies have in principle made it feasible to draw detailed inferences about the recent history of any organism. In practice, however, this remains challenging due to the difficulty of genome assembly in most organisms and the lack of statistical methods powerful enough to discriminate between recent, nonequilibrium histories. We address both the assembly and inference challenges. We develop a bioinformatic pipeline for generating outgroup-rooted alignments of orthologous sequence blocks from de novo low-coverage short-read data for a small number of genomes, and show how such sequence blocks can be used to fit explicit models of population divergence and admixture in a likelihood framework. To illustrate our approach, we reconstruct the Pleistocene history of an oak-feeding insect (the oak gallwasp Biorhiza pallida), which, in common with many other taxa, was restricted during Pleistocene ice ages to a longitudinal series of southern refugia spanning the Western Palaearctic. Our analysis of sequence blocks sampled from a single genome from each of three major glacial refugia reveals support for an unexpected history dominated by recent admixture. Despite the fact that 80% of the genome is affected by admixture during the last glacial cycle, we are able to infer the deeper divergence history of these populations. These inferences are robust to variation in block length, mutation model and the sampling location of individual genomes within refugia. This combination of de novo assembly and numerical likelihood calculation provides a powerful framework for estimating recent population history that can be applied to any organism without the need for prior genetic resources.

Keywords
admixture; maximum likelihood; population divergence; statistical phylogeography

Journal
Molecular Ecology: Volume 23, Issue 1

StatusPublished
Publication date31/01/2014
Publication date online04/11/2013
Date accepted by journal20/09/2013
URLhttp://hdl.handle.net/1893/23732
PublisherWiley-Blackwell
ISSN0962-1083