Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Hubness reduction improves clustering and trajectory inference in single-cell transcriptomic data

Abstract : Background. Single-cell RNA-seq datasets are characterized by large ambient dimensionality, and their analyses can be affected by various manifestations of the dimensionality curse. One of these manifestations is the hubness phenomenon, i.e. existence of data points with surprisingly large incoming connectivity degree in the neighbourhood graph. Conventional approach to dampen the unwanted effects of high dimension consists in applying drastic dimensionality reduction. It remains unexplored if this step can be avoided thus retaining more information than contained in the low-dimensional projections, by correcting directly hubness. Results. We investigate the phenomenon of hubness in scRNA-seq data in spaces of increasing dimensionality. We also link increased hubness to increased levels of dropout in sequencing data. We show that hub cells do not represent any visible technical or biological bias. The effect of various hubness reduction methods is investigated with respect to the visualization, clustering and trajectory inference tasks in scRNA-seq datasets. We show that hubness reduction generates neighbourhood graphs with properties more suitable for applying machine learning methods; and that it outperforms other state-of-the-art methods for improving neighbourhood graphs. As a consequence, clustering, trajectory inference and visualisation perform better, especially for datasets characterized by large intrinsic dimensionality. Conclusion. Hubness is an important phenomenon in sequencing data. Reducing hubness can be beneficial for the analysis of scRNA-seq data with large intrinsic dimensionality in which case it can be an alternative to drastic dimensionality reduction.
Document type :
Preprints, Working Papers, ...
Complete list of metadata
Contributor : Andrei Zinovyev <>
Submitted on : Friday, May 14, 2021 - 11:18:09 PM
Last modification on : Wednesday, May 19, 2021 - 3:37:12 AM


Files produced by the author(s)




Elise Amblard, Jonathan Bac, Alexander Chervov, Vassili Soumelis, Andrei Zinovyev. Hubness reduction improves clustering and trajectory inference in single-cell transcriptomic data. 2021. ⟨hal-03226626⟩



Record views


Files downloads