Skip to content

Latest commit

 

History

History
17 lines (11 loc) · 661 Bytes

README.md

File metadata and controls

17 lines (11 loc) · 661 Bytes

Ribo-seq metadata standardization for RiboCrypt and Riboseq.org

The script does 3 things:

  1. Given a Entrez fetch table ~ 700 columns from SRA (not included in the scripts)
  2. Standardize column names (CELL_LINE, CELL LINE, celllines are all the same)
  3. Standardize column values: (Ribo-seq, Riboseq, RIBOSEQ are all the same)
  4. Semi manual annotation (HeLA is female cell line, HEK is male etc)

Finally upload this file to google drive with statistics of how much could be standardized.

About

The procedure is packaged into 3 scripts ran from: metadata_main_script.R