Abstract

Abstract Summary: Bayesian statistical methods based on simulation techniques have recently been shown to provide powerful tools for the analysis of genetic population structure. We have previously developed a Markov chain Monte Carlo (MCMC) algorithm for characterizing genetically divergent groups based on molecular markers and geographical sampling design of the dataset. However, for large-scale datasets such algorithms may get stuck to local maxima in the parameter space. Therefore, we have modified our earlier algorithm to support multiple parallel MCMC chains, with enhanced features that enable considerably faster and more reliable estimation compared to the earlier version of the algorithm. We consider also a hierarchical tree representation, from which a Bayesian model-averaged structure estimate can be extracted. The algorithm is implemented in a computer program that features a user-friendly interface and built-in graphics. The enhanced features are illustrated by analyses of simulated data and an extensive human molecular dataset. Availability: Freely available at http://www.rni.helsinki.fi/~jic/bapspage.html

Keywords

Markov chain Monte CarloComputer scienceBayesian probabilityPopulationMarkov chainSampling (signal processing)Genetic algorithmGibbs samplingData miningInterface (matter)Reversible-jump Markov chain Monte CarloComputer graphicsRepresentation (politics)GraphicsTree (set theory)AlgorithmMachine learningArtificial intelligenceMathematicsComputer graphics (images)

Affiliated Institutions

Related Publications

Publication Info

Year
2004
Type
article
Volume
20
Issue
15
Pages
2363-2369
Citations
473
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

473
OpenAlex

Cite This

Jukka Corander, Patrik Waldmann, Pekka Marttinen et al. (2004). BAPS 2: enhanced possibilities for the analysis of genetic population structure. Bioinformatics , 20 (15) , 2363-2369. https://doi.org/10.1093/bioinformatics/bth250

Identifiers

DOI
10.1093/bioinformatics/bth250