Toggle Main Menu Toggle Search

Open Access padlockePrints

A Maximum-Likelihood Method to Correct for Allelic Dropout in Microsatellite Data with No Replicate Genotypes

Lookup NU author(s): Dr Kari Schroeder

Downloads

Full text for this publication is not currently held within this repository. Alternative links are provided below where available.


Abstract

Allelic dropout is a commonly observed source of missing data in microsatellite genotypes, in which one or both allelic copies at a locus fail to be amplified by the polymerase chain reaction. Especially for samples with poor DNA quality, this problem causes a downward bias in estimates of observed heterozygosity and an upward bias in estimates of inbreeding, owing to mistaken classifications of heterozygotes as homozygotes when one of the two copies drops out. One general approach for avoiding allelic dropout involves repeated genotyping of homozygous loci to minimize the effects of experimental error. Existing computational alternatives often require replicate genotyping as well. These approaches, however, are costly and are suitable only when enough DNA is available for repeated genotyping. In this study, we propose a maximum-likelihood approach together with an expectation-maximization algorithm to jointly estimate allelic dropout rates and allele frequencies when only one set of nonreplicated genotypes is available. Our method considers estimates of allelic dropout caused by both sample-specific factors and locus-specific factors, and it allows for deviation from Hardy-Weinberg equilibrium owing to inbreeding. Using the estimated parameters, we correct the bias in the estimation of observed heterozygosity through the use of multiple imputations of alleles in cases where dropout might have occurred. With simulated data, we show that our method can (1) effectively reproduce patterns of missing data and heterozygosity observed in real data; (2) correctly estimate model parameters, including sample-specific dropout rates, locus-specific dropout rates, and the inbreeding coefficient; and (3) successfully correct the downward bias in estimating the observed heterozygosity. We find that our method is fairly robust to violations of model assumptions caused by population structure and by genotyping errors from sources other than allelic dropout. Because the data sets imputed under our model can be investigated in additional subsequent analyses, our method will be useful for preparing data for applications in diverse contexts in population genetics and molecular ecology.


Publication metadata

Author(s): Wang CL, Schroeder KB, RosenbergO NA

Publication type: Article

Publication status: Published

Journal: Genetics

Year: 2012

Volume: 192

Issue: 2

Pages: 651-669

Print publication date: 30/07/2012

ISSN (print): 0016-6731

ISSN (electronic):

Publisher: Genetics Society of America

URL: http://dx.doi.org/10.1534/genetics.112.139519

DOI: 10.1534/genetics.112.139519


Altmetrics

Altmetrics provided by Altmetric


Funding

Funder referenceFunder name
Burroughs Wellcome Fund
Howard Hughes Medical Institute International Student Research Fellowship
R01-GM081441National Institutes of Health
R01-HG005855National Institutes of Health

Share