Skip to content

Reference based alignment for mapping short reads to reference genome.

License

Notifications You must be signed in to change notification settings

Avramis/AlignmentByNumbers

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Alignment_by_numbers

Description

Alignment_by_numbers is a prototype reference-based nucleotide sequence aligner. The tool uses signal processing techniques to align short nucleotide sequences to a reference genome.

Installation (Linux, Mac OS X):

i.e for a Linux  or a Mac executable with GCC compiler 
First step make the script executable using
chmod u+x ./GCC_Installation

  and then invoke the executable

./GCC_Installation

i.e. for for a Linux  or a Mac executable with Intel compiler 
First step make the script executable using
chmod u+x ./INTEK_Installation

  and then invoke the executable

./INTEL_Installation

Examples

Usage:
    Parameters:
        -fa [~/reference_genome.fasta]    -- (Required parameter) fasta file directory and file name
        -fq [~/reads_file.fastq]          -- (Required parameter) fastq file directory and file name
        -o [~/outputdirectory]            -- (Optional parameter) output folder directory for sam file. If not provided the sam file is save in the same directory as the fastq file.
        -kmer [x]                         -- (Optional parameter) kmer size for the analysis [default: 100]
        -rep [Representation method]      -- (Optional parameter) Representation method for converting nucleotide sequences to numerical sequences [default: Voss_indicators]
        Options:
            01) Atomic_Numbers
            02) Complex_Numbers
            03) Dna_Walk
            04) EIIP_numbers
            05) Integer_numbers
            06) Pair_numbers
            07) Real_numbers
            08) Tetrahedron
            09) Voss_indicators   --   default option
            10) Z_curve
        -tra [Transformation method]      -- (Optional parameter) Transformation method for compressing data [default: DFT]
        Options:
            01) DFT   --   default option
            02) DWT
            03) PAA
        -clvl [y]                         -- (Optional parameter) Compression level to apply [default: 2]
        -knn [kn]                         -- (Optional parameter)the number of kn neighbor data to use for the KNN search [default: 100]
	-s [true/false]                   -- (Optional parameter) run a sensitive search [default: fasle]
		-bs [x]                           -- (Optional parameter) set the size of the tree [default: 100000]
	-rd [~/report output directory]   -- (Optional parameter) output directory to save the timing report file. If not provided the report text file is save in the same directory as the fastq file.
       -h                                -- Print Help Options


    Example:
        01) Alignment_by_numbers -fa ~/genome.fasta -fq ~/reads.fastq
        02) Alignment_by_numbers -fa ~/genome.fasta -fq ~/reads.fastq -kmer 300 -rep Tetrahedron -tra DWT -clvl 4 -knn 30
        03) Alignment_by_numbers -fa ~/genome.fasta -fq ~/reads.fastq -kmer 300 -rep Tetrahedron -tra DWT -o ~/samfiledirectory -clvl 4 -knn 30 -s true -bs 10000 -rd ~/reportsfile

About

Reference based alignment for mapping short reads to reference genome.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published