Skip to content

Any NCBI RefSeq table from UCSC with output format "all fields from selected table" to a bed with only exons with correct exon number and number of extra bases around them with genename and NM numbver included.

Notifications You must be signed in to change notification settings

zezzipa/UCSCtable2bed

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 

Repository files navigation

UCSCtable2bed

When getting a bed from UCSC, we don't always get what we want. It often has exons with the number 0, not to mention numbers in the wrong order when on a negative strand, and includes UTRs. For us, only coding exons is of interest. This script takes any NCBI RefSeq tables and makes it to the bed we want.

On the UCSC homepage, under Tools, you can find Table Browser (https://genome.ucsc.edu/cgi-bin/hgTables)

group: Genes and Gene Predictions

track: NCBI RefSeq

table: any

output format: all fields from selected table

The script is run with the following:

python ucsc2bed.py <ucsc_table.infile> <extra_bases>

ucsc_table.infile is the name of the table extra_bases is the number of bases outside of the coding exon you want to include

About

Any NCBI RefSeq table from UCSC with output format "all fields from selected table" to a bed with only exons with correct exon number and number of extra bases around them with genename and NM numbver included.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages