GenomeTools¶
GenomeTools are a collection of tools to analyse genome sequences.
GenomeTools are available as a module on Apocrita.
Usage¶
To run the default installed version of GenomeTools, simply load the
genometools module:
$ module load genometools
$ gt -help
Usage: gt [option ...] [tool | script] [argument ...]
The GenomeTools genome analysis system.
-i enter interactive mode after executing 'tool' or 'script'
-q suppress warnings
-test perform unit tests and exit
-seed set seed for random number generator manually.
0 generates a seed from current time and process id
-version display version information and exit
To display the list of available tools, execute gt -help.
Example jobs¶
Serial jobs¶
Here is an example job running on 1 core and 1GB of memory to calculate statistics for the given fasta file(s):
#!/bin/bash
#SBATCH -n 1 # (or --ntasks=1) Request 1 core
#SBATCH --mem-per-cpu=1G # Request 1GB RAM per core
#SBATCH -t 1:0:0 # Request 1 hour runtime
module load genometools
gt seqstat raw_data.fq.gz
Here is an example job running on 1 core and 1GB of memory to extract sequences
from given sequence file(s)
or fastaindex into the
following files: .des, .esq, .md5, .ois, .sds and .ssp.
#!/bin/bash
#SBATCH -n 1 # (or --ntasks=1) Request 1 core
#SBATCH --mem-per-cpu=1G # Request 1GB RAM per core
#SBATCH -t 1:0:0 # Request 1 hour runtime
module load genometools
gt extractseq raw_data.fq.gz