Welcome to the GitHub repository associated with our recent publication
Single-cell transcriptomic analysis of mouse neocortical development
Loo, Simon et al., Nature Communications, 2019: https://www.nature.com/articles/s41467-018-08079-9
The development of the mammalian cerebral cortex depends on careful orchestration of proliferation, maturation, and migration events, ultimately giving rise to a wide variety of neuronal and non-neuronal cell types. To better understand cellular and molecular processes that unfold during late corticogenesis, we perform single-cell RNA-seq on the mouse cerebral cortex at a progenitor driven phase (embryonic day 14.5) and at birth—after neurons from all six cortical layers are born. We identify numerous classes of neurons, progenitors, and glia, their proliferative, migratory, and activation states, and their relatedness within and across age. Using the cell-type-specific expression patterns of genes mutated in neurological and psychiatric diseases, we identify putative disease subtypes that associate with clinical phenotypes. Our study reveals the cellular template of a complex neurodevelopmental process, and provides a window into the cellular origins of brain diseases.
class.R
: Additions and modifications to Shekhar et al.'sclass.R
file inititally published hereE14_processing.R
: A detailed, fully documented summary of the processing steps performed for the E14.5 cortex.- Requires
class.R
to access the functions necessary for data analysis - All processing steps and parameters for analysis of the P0 cortex were identical to those illustrated in this template
- Requires
E14_combined_matrix.txt.gz
: The main gene expression matrix for E14.5 cortex.- Rows are genes, columns are cells.
- Each biological replicate is represented here.
E14_tSNE.py
: Code for visualizing the data usingt-SNE
, in PythonP0_combined_matrix.txt.gz
: The main gene expression matrix for P0 cortex.- Rows are genes, columns are cells.
- Each biological replicate is represented here.
E14_combined_matrix_ClusterAnnotations.txt
andP0_combined_matrix_ClusterAnnotations.txt
: Lookup tables with cell IDs and final cluster labels
We also created web-based tools for visualizing our data, these can be found at: http://zylkalab.org/data
- genefilter
- sva
- igraph
- ggplot2
- Matrix
- gmodels
- RANN
- reshape
- cluster
- SLICER
- gplots
- scikit-learn
- matplotlib
- pandas
- Added several new slots to S4 object, including
pca.eigenvalues
,sils
, andnumclust
- Added new function called
perform_refined_clustering
, which iteratively runsdoGraph_clustering
to optimize the number of nearest neighbors, then iteratively runsdoGraph_clustering
again to refine the cluster assignments using computed Silhouette widths - Computes eigenvalues as part of
doPCA
and stores these values for the permutation test - Adds a legend to t-SNE plots
- Modifies the
binomcount.test
function to compute log-fold-change as(x+1)/(y+1)
to avoid NA/Inf values - Adds checks to the
binomcount.test
to ensure thateffect.size
isn't NA - Modifies cluster naming on
dot.plot
- Prevents the usage of "Dingbats" font class in
ggplot
calls
Please direct any questions or issues with the code published here to jeremy [underscore] simon (at) med [dot] unc [dot] edu