Skip to Main content Skip to Navigation
Journal articles

Comparing Phylogenetic Approaches to Reconstructing Cell Lineage from Microsatellites with Missing Data

Abstract : Due to the imperfect fidelity of DNA replication, somatic cells acquire DNA mutations at each division which record their lineage history. Microsatellites, tandem repeats of DNA nucleotide motifs, mutate more frequently than other genomic regions and by observing microsatellite lengths in single cells and implementing suitable inference procedures, the cell lineage tree of an organism can be reconstructed. Due to recent advances in single cell Next Generation Sequencing (NGS) and the phylogenetic methods used to infer lineage trees, this work investigates which computational approaches best exploit the lineage information found in single cell NGS data. We simulated trees representing cell division with mutating microsatellites, and tested a range of available phylogenetic algorithms to reconstruct cell lineage. We found that distance-based approaches are fast and accurate with fully observed data. However, Maximum Parsimony and the computationally intensive probabilistic methods are more robust to missing data and therefore better suited to reconstructing cell lineage from NGS datasets. We also investigated how robust reconstruction algorithms are to different tree topologies and mutation generation models. Our results show that the flexibility of Maximum Parsimony and the probabilistic approaches mean they can be adapted to allow good reconstruction across a range of biologically relevant scenarios.
Document type :
Journal articles
Complete list of metadata

https://hal.archives-ouvertes.fr/hal-02869464
Contributor : Anne-Marie Lyne <>
Submitted on : Friday, May 7, 2021 - 11:23:14 AM
Last modification on : Sunday, May 9, 2021 - 3:27:22 AM

File

reconstruct_cell_lineage_AML_f...
Files produced by the author(s)

Identifiers

Citation

Anne-Marie Lyne, Leïla Perié. Comparing Phylogenetic Approaches to Reconstructing Cell Lineage from Microsatellites with Missing Data. IEEE/ACM Transactions on Computational Biology and Bioinformatics, Institute of Electrical and Electronics Engineers, In press, ⟨10.1109/TCBB.2020.2992813⟩. ⟨hal-02869464v2⟩

Share

Metrics

Record views

24

Files downloads

32