Skip to content

Normalization and difference calling for Next Generation Sequencing (NGS) data via joint multinomial modeling.

Notifications You must be signed in to change notification settings

your-highness/normR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

#normR - normR obeys regime mixture rules

Normalization and Difference Calling for Next Generation Sequencing (NGS)

Experiments via Joint Multinomial Modeling

Two NGS tracks are modeled simultaneously by fitting a binomial mixture model on mapped read counts. In the first counting process, a desired smoothing kernel (bin size) and read characteristic threshold (quality, SAMFLAG) can be specified. In a second step a binomial mixture model with a user-specified number of components is fit to the data. The fit yields different enrichment regimes in the supplied NGS tracks. Log-space computation is done in C/C++ where OpenMP enables for fast parallel computation.

Release Version

The master branch is always in sync with the normR Bioconductor release and the normR github Bioconductor mirror. A R 3.2 compliant version can be found in the normR R3.2 tree.

Installation

To install normR from the release repository, easiest way is to use Bioconductor or devtools:

#install dependencies
if (!requireNamespace("BiocManager", quietly=TRUE))
    install.packages("BiocManager")
BiocManager::install("bamsignals", suppressUpdates=T)
#fetch current normR version from github
install.packages("devtools")
require(devtools)
devtools::install_github("your-highness/normr")

Usage

See the vignette for a toy example on normR usage. The documentation of routines can be accessed from with R with ?.

Use cases

  • ChIP-seq normalization / enrichment calling with an Input experiment (Whole Cell Extract, H3/IgG ChIP-seq)

  • ChIP-seq differential enrichment calling for two different antigens in same sample population

  • ChIP-seq identification of enrichment regimes to investigate on sample heterogeneity

  • RNA-seq differential expression calling

  • ChIP-seq differential enrichment calling in two different samples (be aware of CNVs!)

  • CNV identification

Useful links

Be sure to check out the following amazing github projects for your upcoming NGS magic:

bamsignals - Efficient Counting in Indexed Bam Files for Single End and Paired End NGS Data

EpicSeg - Chromatin Segmentation Based on a Probabilistic Multinomial Model for Read Counts

kfoots - Fit Multivariate Discrete Probability Distributions to Count Data

deepTools - User-Friendly Tools for Normalization and Visualization of Deep-Sequencing Data

About

Normalization and difference calling for Next Generation Sequencing (NGS) data via joint multinomial modeling.

Resources

Stars

Watchers

Forks

Packages

No packages published