Understanding The Meaning and Measure of Concordance Factors in Phylogenomics

The article "The Meaning and Measure of Concordance Factors in Phylogenomics" by Robert Lanfear and Matthew W. Hahn explores the importance of concordance factors (CFs) in phylogenetic studies, particularly in the context of large, complex datasets. The authors highlight that while larger datasets can provide more accurate measurements of mean signal, they often result in uniformly high measures of branch support, such as bootstrap and posterior probabilities, which can limit their utility. They argue that CFs are more informative about the predictive power of the species tree and the underlying biological variation among loci. CFs are described as descriptors of topological variation rather than measures of statistical support. They quantify the degree of biological variation in the topologies of individual loci compared to the species tree. The authors introduce the concept of the concordance vector, which summarizes the CF and discordance factors (DFs) for a given branch of the species tree. They discuss various methods for estimating CFs, including gene-based CFs (gCFs), quartet-based CFs (qCFs), and site-based CFs (sCFs), and compare their strengths and limitations. The article emphasizes that CFs are not measures of statistical support but rather biological parameters that describe the proportion of the genome for which a given clade is true. It also highlights the practical implications of distinguishing between CFs and statistical support measures, such as the choice of outgroups for phylogenetic analyses. The authors provide a detailed framework for understanding and interpreting CFs, using examples from bird genomes to illustrate the calculations and interpretations of these measures. Overall, the article underscores the importance of CFs in understanding the evolutionary histories of genes and traits, particularly when the species tree is not a perfect predictor of these histories. It provides a comprehensive review of the methods and applications of CFs in phylogenomics, offering insights into how these measures can enhance our understanding of evolutionary processes and the reliability of phylogenetic inferences.The article "The Meaning and Measure of Concordance Factors in Phylogenomics" by Robert Lanfear and Matthew W. Hahn explores the importance of concordance factors (CFs) in phylogenetic studies, particularly in the context of large, complex datasets. The authors highlight that while larger datasets can provide more accurate measurements of mean signal, they often result in uniformly high measures of branch support, such as bootstrap and posterior probabilities, which can limit their utility. They argue that CFs are more informative about the predictive power of the species tree and the underlying biological variation among loci. CFs are described as descriptors of topological variation rather than measures of statistical support. They quantify the degree of biological variation in the topologies of individual loci compared to the species tree. The authors introduce the concept of the concordance vector, which summarizes the CF and discordance factors (DFs) for a given branch of the species tree. They discuss various methods for estimating CFs, including gene-based CFs (gCFs), quartet-based CFs (qCFs), and site-based CFs (sCFs), and compare their strengths and limitations. The article emphasizes that CFs are not measures of statistical support but rather biological parameters that describe the proportion of the genome for which a given clade is true. It also highlights the practical implications of distinguishing between CFs and statistical support measures, such as the choice of outgroups for phylogenetic analyses. The authors provide a detailed framework for understanding and interpreting CFs, using examples from bird genomes to illustrate the calculations and interpretations of these measures. Overall, the article underscores the importance of CFs in understanding the evolutionary histories of genes and traits, particularly when the species tree is not a perfect predictor of these histories. It provides a comprehensive review of the methods and applications of CFs in phylogenomics, offering insights into how these measures can enhance our understanding of evolutionary processes and the reliability of phylogenetic inferences.

The Meaning and Measure of Concordance Factors in Phylogenomics

October 17, 2024 | Robert Lanfear, Matthew W. Hahn