TY - JOUR
T1 - Predicting clone genotypes from tumor bulk sequencing of multiple samples
AU - Miura, Sayaka
AU - Gomez, Karen
AU - Murillo, Oscar
AU - Huuki, Louise A.
AU - Vu, Tracy
AU - Buturla, Tiffany
AU - Kumar, Sudhir
N1 - Publisher Copyright:
© The Author(s) 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: [email protected]
PY - 2018/12/1
Y1 - 2018/12/1
N2 - Motivation: Analyses of data generated from bulk sequencing of tumors have revealed extensive genomic heterogeneity within patients. Many computational methods have been developed to enable the inference of genotypes of tumor cell populations (clones) from bulk sequencing data. However, the relative and absolute accuracy of available computational methods in estimating clone counts and clone genotypes is not yet known. Results: We have assessed the performance of nine methods, including eight previously-published and one new method (CloneFinder), by analyzing computer simulated datasets. CloneFinder, LICHeE, CITUP and cloneHD inferred clone genotypes with low error (<5% per clone) for a majority of datasets in which the tumor samples contained evolutionarily-related clones. Computational methods did not perform well for datasets in which tumor samples contained mixtures of clones from different clonal lineages. Generally, the number of clones was underestimated by cloneHD and overestimated by PhyloWGS, and BayClone2, Canopy and Clomial required prior information regarding the number of clones. AncesTree and Canopy did not produce results for a large number of datasets. Overall, the deconvolution of clone genotypes from single nucleotide variant (SNV) frequency differences among tumor samples remains challenging, so there is a need to develop more accurate computational methods and robust software for clone genotype inference. Availability and implementation: CloneFinder is implemented in Python and is available from https://github.com/gstecher/CloneFinderAPI.
AB - Motivation: Analyses of data generated from bulk sequencing of tumors have revealed extensive genomic heterogeneity within patients. Many computational methods have been developed to enable the inference of genotypes of tumor cell populations (clones) from bulk sequencing data. However, the relative and absolute accuracy of available computational methods in estimating clone counts and clone genotypes is not yet known. Results: We have assessed the performance of nine methods, including eight previously-published and one new method (CloneFinder), by analyzing computer simulated datasets. CloneFinder, LICHeE, CITUP and cloneHD inferred clone genotypes with low error (<5% per clone) for a majority of datasets in which the tumor samples contained evolutionarily-related clones. Computational methods did not perform well for datasets in which tumor samples contained mixtures of clones from different clonal lineages. Generally, the number of clones was underestimated by cloneHD and overestimated by PhyloWGS, and BayClone2, Canopy and Clomial required prior information regarding the number of clones. AncesTree and Canopy did not produce results for a large number of datasets. Overall, the deconvolution of clone genotypes from single nucleotide variant (SNV) frequency differences among tumor samples remains challenging, so there is a need to develop more accurate computational methods and robust software for clone genotype inference. Availability and implementation: CloneFinder is implemented in Python and is available from https://github.com/gstecher/CloneFinderAPI.
UR - https://www.scopus.com/pages/publications/85057208515
U2 - 10.1093/bioinformatics/bty469
DO - 10.1093/bioinformatics/bty469
M3 - Article
C2 - 29931046
AN - SCOPUS:85057208515
SN - 1367-4803
VL - 34
SP - 4017
EP - 4026
JO - Bioinformatics
JF - Bioinformatics
IS - 23
ER -