Reconstructing Cell Cycle Pseudo Time-Series Via Single-Cell Transcriptome Data

Single-cell mRNA sequencing, which permits whole transcriptional profiling of individual cells, has been widely applied to study growth and development of tissues and tumors. Resolving cell cycle for such groups of cells is significant, but may not be adequately achieved by commonly used approaches. Here we develop a traveling salesman problem and hidden Markov model-based computational method named reCAT, to recover cell cycle along time for unsynchronized single-cell transcriptome data. We independently test reCAT for accuracy and reliability using several data sets. We find that cell cycle genes cluster into two major waves of expression, which correspond to the two well-known checkpoints, G1 and G2. Moreover, we leverage reCAT to exhibit methylation variation along the recovered cell cycle. Thus, reCAT shows the potential to elucidate diverse profiles of cell cycle, as well as other cyclic or circadian processes (e.g., in liver), on single-cell resolution.

Includes supplementary material
Gene Expression, Sequence Analysis, RNA, DNA Methylation, Stem Cells, Genetic Heterogeneity, Pluripotent Stem Cells, Data Mining, Cell Division
National Science Foundation of China [61673241, 61561146396], National Basic Research Program of China [2012CB316504, 2012CB316503]; Hi-tech Research and Development Program of China [2012AA020401]; NSFC [61305066, 91010016, 91519326, 31361163004]; NIH/NHGRI [5U01HG006531-03; 4R01HG006465]
CC BY 4.0 (Attribution), ©2017 The Authors