This lesson is still being designed and assembled (Pre-Alpha version)

scRNA-seq Workflow

Overview

Teaching: 15 min
Exercises: 5 min
Questions
  • What is the standard workflow of scRNA-seq?

Objectives
  • Understand the main steps in scRNA-seq experiments.

  • Understand the main steps in scRNA-seq data analysis.

  • Understand the challanges in scRNA-seq data analysis.

scRNA-Seq Experimental Workflow

Single cell RNA sequencing (scRNA-seq) is a novel technique for extracting more detailed information from genome. Technically, scRNA-seq includes two parts of experimental design and data analysis. As it was mentioned before, scRNA-seq is useful in case of heterogeniety of cells and developmental studies. For example, imagine brain tissue with tens of cell types with tens of different expression profiles. For extracting transcriptome of each cell, it is necessary to isolate each cell from the tissue properly.

Note

A challange in experiment design of scRNA-seq is to isolate cells and prepare a comprehensive library.

Several methods have been developed to overcome this challange from which STRT-seq, SMART-seq, CEL-seq, MARS-seq, Fluidigm C1, CytoSeq, Drop-seq, inDrop, 10x Genomics, SPLiT-seq, sci-RNA-seq, DronC-seq, and Seq-Well can be mentioned. The developmental process of sequencing techniques has been mentioned by Svensson and colleagues. Fig1

There is a specific terminology for scRNA-seq which we should know for analysis of data. Lets check them out!

scRNA-seq dictionary

  • Unique Molecular Identifiers (UMI): Short random molecular tags which are added to DNA fragments in library preparation process before PCR amplification. UMIs are used to identify input DNA molecules.

  • External RNA Controls Consortium (ERCC) spike-in: A short RNA sequence which is used to calibrate measurements of RNA hybridization assays. ERCC spike-ins bind to DNA molecules with matching as control probes.

  • Cell Barcode: unique nucleic acid sequences, termed barcodes, which are used to label individual cells, so that they can be tracked through space and time in scRNA-seq.

  • Flourescence-activated cell sorting (FACS): A flowcytometry method for cell isolation in which fluorescent tags are used to detect each cell.

  • laser capture microdissection (LCM): Cell sorting method under direct microscopic visualization.

Here, you can see the order and construction of reads in scRNA-seq: biasedreads2

Cell barcodes are used to determine each read belongs to which cell and UMI is used for identification of each RNA molecule and enales counting the frequency of reads.

scRNA-Seq Computational Workflow

scRNA-seq raw data includes reads with cell barcode and UMIs. Before alignment of reads to genome, reads can be grouped using cell barcodes and the frequency of each read is estimated per cell per gene using UMIs. After alignment and frequency calculations, we have a gene expression table containing cells represented in the columns and genes represented in the rows. This is what we analyze in scRNA-seq comlutational workflow.

Here You can see the bioinformatics workflow for analysis of scRNA-seq data.

Workflow

Key Points

  • scRNA-seq requires much pre-processing before analysis can be performed.