Assessing Combinability of Phylogenomic Data Using Bayes Factors.

Title: Assessing Combinability of Phylogenomic Data Using Bayes Factors.
Publication Type: Journal Article
Year of Publication: 2019
Authors: Neupane, Suman, Karolina Fučíková, Louise A. Lewis, Lynn Kuo, Ming-Hui Chen, and Paul O. Lewis
Journal: Syst Biol
Volume: 68
Issue: 5
Pagination: 744-754
Date Published: 2019 Sep 01
ISSN: 1076-836X
Keywords: Bayes Theorem, Chlorophyta, Classification, Phylogeny
Abstract

With the rapid reduction in sequencing costs of high-throughput genomic data, it has become commonplace to use hundreds of genes to infer phylogeny of any study system. While sampling a large number of genes has given us a tremendous opportunity to uncover previously unknown relationships and improve phylogenetic resolution, it also presents us with new challenges when the phylogenetic signal is confused by differences in the evolutionary histories of sampled genes. Given the incorporation of accurate marginal likelihood estimation methods into popular Bayesian software programs, it is natural to consider using the Bayes Factor (BF) to compare different partition models in which genes within any given partition subset share both tree topology and edge lengths. We explore using marginal likelihood to assess data subset combinability when data subsets have varying levels of phylogenetic discordance due to deep coalescence events among genes (simulated within a species tree), and compare the results with our recently described phylogenetic informational dissonance index (D) estimated for each data set. BF effectively detects phylogenetic incongruence and provides a way to assess the statistical significance of D values. We use BFs to assess data combinability using an empirical data set comprising 56 plastid genes from the green algal order Volvocales. We also discuss the potential need for calibrating BFs and demonstrate that BFs used in this study are correctly calibrated.
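
As a rough illustration of the comparison described in the abstract, the sketch below computes a log Bayes factor between two partition models from their log marginal likelihoods: one in which every gene has its own tree topology and edge lengths, and one in which all genes share a single tree. It assumes the subsets are independent under the separate model, so that its log marginal likelihood is the sum of the per-gene values. The function name and all numbers are hypothetical and are not taken from the paper.

```python
def log_bayes_factor(log_ml_per_gene, log_ml_concatenated):
    """Log Bayes factor favoring the separate model over the concatenated one.

    log_ml_per_gene: log marginal likelihoods, one per gene, each estimated
        with that gene analyzed on its own tree (hypothetical inputs).
    log_ml_concatenated: log marginal likelihood with all genes forced to
        share one topology and one set of edge lengths.

    Assumes independent subsets under the separate model, so the log
    marginal likelihoods of the individual genes simply add up.
    """
    log_ml_separate = sum(log_ml_per_gene)
    return log_ml_separate - log_ml_concatenated


# Made-up values for two genes, for illustration only.
log_bf = log_bayes_factor([-1000.0, -2000.0], -3010.0)

# A positive value favors separate trees, which would argue against
# combining the genes; a negative value favors concatenation.
favors_separate = log_bf > 0
```

In practice the marginal likelihoods themselves would come from an estimation method in Bayesian phylogenetic software; this sketch only covers the final arithmetic.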

DOI: 10.1093/sysbio/syz007
PubMed ID: 30726954
PubMed Central ID: PMC7967903
Grant List: P01 CA142538 / CA / NCI NIH HHS / United States
R01 GM070335 / GM / NIGMS NIH HHS / United States