TY - JOUR
T1 - A new method for inferring timetrees from temporally sampled molecular sequences
AU - Miura, Sayaka
AU - Tamura, Koichiro
AU - Tao, Qiqing
AU - Huuki, Louise A.
AU - Kosakovsky Pond, Sergei L.
AU - Priest, Jessica
AU - Deng, Jiamin
AU - Kumar, Sudhir
N1 - Publisher Copyright:
© 2020 Miura et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2020
Y1 - 2020
N2 - Pathogen timetrees are phylogenies scaled to time. They reveal the temporal history of a pathogen spread through the populations as captured in the evolutionary history of strains. These timetrees are inferred by using molecular sequences of pathogenic strains sampled at different times. That is, temporally sampled sequences enable the inference of sequence divergence times. Here, we present a new approach (RelTime with Dated Tips [RTDT]) to estimating pathogen timetrees based on a relative rate framework underlying the RelTime approach that is algebraic in nature and distinct from all other current methods. RTDT does not require many of the priors demanded by Bayesian approaches, and it has light computing requirements. In analyses of an extensive collection of computer-simulated datasets, we found the accuracy of RTDT time estimates and the coverage probabilities of their confidence intervals (CIs) to be excellent. In analyses of empirical datasets, RTDT produced dates that were similar to those reported in the literature. In comparative benchmarking with Bayesian and non-Bayesian methods (LSD, TreeTime, and treedater), we found that no method performed the best in every scenario. So, we provide a brief guideline for users to select the most appropriate method in empirical data analysis. RTDT is implemented for use via a graphical user interface and in high-throughput settings in the newest release of cross-platform MEGA X software, freely available from http://www.megasoftware.net.
AB - Pathogen timetrees are phylogenies scaled to time. They reveal the temporal history of a pathogen spread through the populations as captured in the evolutionary history of strains. These timetrees are inferred by using molecular sequences of pathogenic strains sampled at different times. That is, temporally sampled sequences enable the inference of sequence divergence times. Here, we present a new approach (RelTime with Dated Tips [RTDT]) to estimating pathogen timetrees based on a relative rate framework underlying the RelTime approach that is algebraic in nature and distinct from all other current methods. RTDT does not require many of the priors demanded by Bayesian approaches, and it has light computing requirements. In analyses of an extensive collection of computer-simulated datasets, we found the accuracy of RTDT time estimates and the coverage probabilities of their confidence intervals (CIs) to be excellent. In analyses of empirical datasets, RTDT produced dates that were similar to those reported in the literature. In comparative benchmarking with Bayesian and non-Bayesian methods (LSD, TreeTime, and treedater), we found that no method performed the best in every scenario. So, we provide a brief guideline for users to select the most appropriate method in empirical data analysis. RTDT is implemented for use via a graphical user interface and in high-throughput settings in the newest release of cross-platform MEGA X software, freely available from http://www.megasoftware.net.
KW - Algorithms
KW - Animals
KW - Sequence Alignment/methods
KW - Humans
KW - Virus Diseases/virology
KW - Software
KW - Viruses/classification
KW - Phylogeny
KW - Computational Biology/methods
KW - Evolution, Molecular
KW - Sequence Analysis, DNA/methods
UR - http://www.scopus.com/inward/record.url?scp=85078904036&partnerID=8YFLogxK
U2 - 10.1371/journal.pcbi.1007046
DO - 10.1371/journal.pcbi.1007046
M3 - Article
C2 - 31951607
AN - SCOPUS:85078904036
SN - 1553-734X
VL - 16
SP - e1007046
JO - PLoS Computational Biology
JF - PLoS Computational Biology
IS - 1
M1 - e1007046
ER -