With the advent of molecular information (now increasingly phylogenomic data), phylogenetic inferences based simply on morphological matrices conduct keep larn obsolete when studying extant (i.e. living) groups of organisms. In the era of ‘Big Data’, nosotros conduct keep access to information sets providing us amongst fully resolved trees, where each branch has (nearly) unambiguous support. But non thus for the many groups of extinct animals together with plants that lived millions of years ago, where nosotros conduct keep naught to a greater extent than than an impression/compression of an organ/individual, or a laid of bones. Or, when nosotros showtime to study relationships betwixt (even within) species or the relationships betwixt extant together with extinct species. As I pointed out inwards several recent posts for David Morrison’s weblog The Genealogical World of Phylogenetic Networks, inferring simply a tree may non live an optimal alternative to analyse morphological matrices including (or limited) to extinct taxa; live it because of the to a greater extent than oft than non tree-unlike signal [spermatophytes][dinosaurs], convergent evolution, change over time, or ancestor-descendant relationships inwards full general (upcoming post, 3rd Oct 2017). Here are 2 papers that I constitute inspiring, together with which I believe should conduct keep been read past times whatever researcher who ever tried (successfully or not) to infer trees based on morphological information matrices (or similarly complex data).
Character together with split upward back upward values (Zander 2004)
The newspaper of Zander published inwards the 2nd number of a magazine called PhyloInformatics (not sure enough it yet exists) opens interesting viewpoints on how to facial expression at branch back upward values (sometimes called ‘node support’ inwards phylogenetic literature, which is normally a incorrect term). I constitute this paper, when I was (desperately) searching for studies dealing amongst ambiguous bootstrap back upward patterns that I could utilisation to back upward my ideas. Back then, I simply got into contact amongst the consensus networks (Holland & Moulton 2003) because my boss, Vera Hemleben (a geneticists), brought me together amongst a novel computer science professor inwards Tübingen, Daniel Huson, who simply happened to conduct keep finished simply about other update of a software named SplitsTree (Huson 1998; Huson & Bryant 2006); together with nosotros had enough of information on beech trees together with maples amongst complex split upward patterns for diverse reasons (Denk et al. 2002; Grimm 2003; Denk, Grimm & Hemleben 2005; Grimm et al. 2006).Zander performed branch back upward analyses on “invented” binary matrices of iv taxa A, B, C together with D. One of the taxa, the outgroup D, is an all-zero taxon. The relationships of the 3 ingroup taxa A, B, together with C were partly ambiguous: a varying proportion of characters supported AB vs. air-conditioning vs. BC. Zander showed that split upward back upward values (such every bit parsimony bootstrap, BS, together with Bayesian probabilities, PP) are to live expected inwards such situations, together with that applying a threshold of e.g. BS back upward > lxx makes trivial sense. When e.g. 50% of the characters supported AB together with 50% supported air-conditioning (no back upward for BC) together with then both parsimony BS back upward together with Bayesian PP converged to 50/0.5. As presently every bit ane alternative had to a greater extent than grapheme back upward than the other parsimony BS increased for that alternative (disproportionately; run across also this post), whereas PP readily converged to 1.0 (e.g. when 50 characters supported AB together with 42 supported air-conditioning together with no BC, or the ratio was 50:40:10 for AB:AC:BC). Zander also pointed the finger to a yet standing employment inwards phylogenetic literature: “Identifying imbalance [conflicting branch support]… in published papers is impossible (without recalculation from the actual information set)” (see e.g. my re-analysis of the Loranthaceae subset of Su et al. 2015). The take-home message of Zander's newspaper (which is non slow to read) tin live set inwards a political analogy:
PP is what yous get, when your parliament is manned using a winner-takes-the-seat voting procedure, BS is what yous get, when your physical care for uses simply about assort of partly proportional vote (always starting from information that supports to a greater extent than than a unmarried alternative).
Although Zander didn’t verbalize over it, his results should live kept inwards heed when analysing morpho-matrices of extinct organisms. When 2 branching alternatives (e.g. A+B together with A+C) have ample, albeit depression back upward this tin live a straight indication that H5N1 is an ancestor of B together with C, or that H5N1 is a hybrid or mix of B together with C. Consequently, if H5N1 is a potential ancestor of 3 taxa, B, C, together with D, all splits involving H5N1 would ideally have BS or JK back upward of 33, together with – inwards a perfect basis – a PP of 0.33. Conversely, BS > 50 (but << 100) together with PP 1.0 may simply reverberate the signal of a bulk (≥ 50%) of characters, together with it remains unclear whether the minority (<50%) prefers ane alternative, many, or none.
H5N1 phylogeny of artificial manuscripts (Spencer et al. 2004)
Again, my contact amongst this newspaper was co-incidental. We had to re-but (once again) nasty anonymous peer comments why nosotros showed networks together with non trees to set forrad our hypotheses (Denk & Grimm 2009), together with I was searching for studies dealing amongst actual ancestor-descendant relationships. Fortunately, ane of my colleagues inwards Tübingen, Markus Göker, a mycologist amongst skillful cognition virtually distance-based analysis (and many other things), pointed me to the newspaper of Spencer et al., published inwards the Journal of Theoretical Biology.Spencer et al. had several scribes duplicating an master copy text that they could non sympathise (or non fully, lacking the needed linguistic communication skill). As consequence, errors were produced during the copying process, the text evolved amongst each copy. Some of the copies were given to other scribes to live copied again. The authors together with then coded the differences inwards the text copies together with applied a attain of phylogenetic inferences (trees together with networks). The outcome of the inferences was together with then compared to the ‘true tree’, i.e. the actual evolutionary tree recognising the master copy re-create every bit the mutual ancestor of all copies, together with simply about copies every bit the ancestors of others (the copies beingness the straight descendants of the template). They constitute that the neighbour-net, a 2-dimensional (planar) meta-phylogenetic graph, best depicted the actual ancestor-descendant relationships (see also this post for an instance of development inwards a 2-dimensional infinite together with this post dealing amongst the development of flamenco music).
Really, halt relying (exclusively) on trees
When yous read the papers past times Spencer et al. (2004) together with Zander (2004), but also the newspaper of The Netherlands & Moulton (2003) introducing the consensus network approach, yous may enquire yourself why to a greater extent than than a decade after, most phylogenetic publications (morphology- or molecular-based) yet simply demo an incomprehensive tree (or strict consensus tree), fifty-fifty when many branches lack unambiguous support; together with why branch back upward below a sure enough threshold (currently: BS < 70, PP < 0.95) is non reported, but dropped, together with the reasons for moderate or depression back upward rarely investigated (but run across postscriptum).Why practice nosotros cling to trees, fifty-fifty if they lack support, are indecisive, or incomprehensive regarding the actual signal inwards our data? Well, trees may live simplistic, but are also uncomplicated to grasp together with fifty-fifty simpler to interpret. Showing the comprehensive final result of the analyses may warning peers to distrust your information together with study (happened nearly inwards all papers, nosotros eventually published betwixt 2005–2015), together with necessitates to verbalize over the emerging alternatives (making the newspaper besides long). But the signal – particularly, inwards morphological matrices – is non trivial, but complex. This complexity may live puzzling, but is ever interesting, together with may dot to real interesting aspects of evolution. And the tools are long available to facial expression into these puzzling, complex patterns. So, at that topographic point is no excuse for simply showing a tree.
----------------------
PS Another argue for seeing trees amongst branches of unknown back upward is that sure enough reviewers, together with occasionally editors, comfortably hiding behind the Impermeable Fog (i.e. confidential review process), insist on dropping these values, regarding them every bit meaningless. And many authors comply rather than protestation (like me). For instance, Mike Pirie, editor of Taxon, stated inwards his 2nd determination missive of the alphabet to our newspaper on Cynanchum (Khanum et al. 2016): “I don't run across that it makes whatever feel to fifty-fifty include the depression together with moderate values for the PP values. I would retrieve it would live to a greater extent than meaningful to simply consider PP ≥ ninety every bit supported together with anything below that every bit non supported.” Which is, of course, mathematically already nonsense, a PP of 0.89 indicates (ideally) a probability of 0.89 that a branch is correct, which is insignificantly (0.01) less back upward than a PP of 0.9. I naturally objected this together with farther unqualified statements past times Pirie together with the primary editor of Taxon regarding phylogenetic analyses, together with retreated every bit an writer to protestation Taxon’s editorial policies (second fourth dimension they tried to rid a newspaper of my dearest networks); but my co-authors were live granted showing the meaningless values together with superfluous reconstructions, i.e. BS together with PP back upward networks. Which Pirie together with his editorial colleagues considered a “detriment” to the study, together with defendant of helping to enshroud information problems “under the carpet”. Check out the newspaper to create upward one's heed for yourself, together with compare it to the criterion publications inwards Taxon, e.g. Su et al. 2015 together with my (network-based) re-analysis of role of Su et al's information (Grimm 2017).
References
Denk T, Grimm G, Stögerer K, Langer M, Hemleben V. 2002. The evolutionary history of Fagus in western Eurasia: Evidence from genes, morphology together with the fossil record. Plant Systematics together with Evolution 232:213-236.
Denk T, Grimm GW. 2005. Phylogeny together with biogeography of Zelkova (Ulmaceae sensu stricto) every bit inferred from leafage morphology, ITS sequence information together with the fossil record. Botanical Journal of the Linnéan Society 147:129-157.
Denk T, Grimm GW. 2009. The biogeographic history of beech trees. Review of Palaeobotany together with Palynology 158:83–100.
Denk T, Grimm GW, Hemleben V. 2005. Patterns of molecular together with morphological differentiation inwards Fagus: implications for phylogeny. American Journal of Botany 92:1006-1016
Grimm GW. 2003. Tracing the vogue together with speed of intrageneric development - a instance study of genus Acer L. together with Fagus L. D.Sc. thesis. Eberhard-Karls University. http://w210.ub.uni-tuebingen.de/dbt/volltexte/2005/1574/
Grimm GW, Renner SS, Stamatakis A, Hemleben V. 2006. H5N1 nuclear ribosomal deoxyribonucleic acid phylogeny of Acer inferred amongst maximum likelihood, splits graphs, together with motif analyses of 606 sequences. Evolutionary Bioinformatics 2:279–294. http://la-press.com/journals.php?pa=abstract&content_id=132
Grimm 2017. ‘Quick-and-dirty’ re-analysis of the Su et al. (2015) information of Loranthaceae together with their sis groups. File S6 inwards the online supporting archive to: Grímsson F, Grimm GW, Zetter R. 2017. Evolution of pollen morphology inwards Loranthaceae. Grana DOI:10.1080/00173134.2016.1261939.
Holland B, Moulton V. 2003. Consensus networks: H5N1 method for visualising incompatibilities inwards collections of trees. In: Benson G, together with Page R, eds. Algorithms inwards Bioinformatics: Third International Workshop, WABI, Budapest, Republic of Hungary Proceedings. Berlin, Heidelberg, Stuttgart: Springer Verlag, p. 165–176.
Huson DH. 1998. SplitsTree: analyzing together with visualizing evolutionary data. Bioinformatics 14:68–73. https://doi.org/10.1093/bioinformatics/14.1.68
Huson DH, Bryant D. 2006. Application of phylogenetic networks inwards evolutionary studies. Molecular Biology together with Evolution 23:254–267. https://doi.org/10.1093/molbev/msj030
Khanum R, Surveswaran S, Meve U, Liede-Schumann S. 2016. Cynanchum (Apocynaceae: Asclepiadoideae): H5N1 pantropical Asclepiadoid genus revisited. Taxon 65:467–486.
Spencer M, Davidson EA, Barbrook AC, Howe CJ. 2004. Phylogenetics of artificial manuscripts. Journal of Theoretical Biology 227:503-511.
Su H-J, Hu J-M, Anderson FE, Der JP, Nickrent DL. 2015. Phylogenetic relationships of Santalales amongst insights into the origins of holoparasitic Balanophoraceae. Taxon 64:491–506.
Zander RH. 2004. Minimal values of reliability of Bootstrap together with Jackknife proportions, Decay index, together with Bayesian posterior probability. PhyloInformatics 2:1-13.
0 Response to "Two Papers You Lot May Desire To Read Earlier Inferring Trees From Morphological (Or Other) Data"