Installation of external software

TRAL is dependend on different external software which need to be installed before TRAL can be used.

Here, we explain how to install external software packages, such as de novo tandem repeat detectors. _Important_: Once a software is installed, the path to either the binary or the executable shell script needs to be indicated in the TRAL configuration file config.ini.

For each of the software an installation script can be found for :ref: easy setup <easy_setup>.

Sequence profile model generation

At current, only support for HMMER is integrated published in Bioinformatics (2008) (Installation instructions). TRAL searches for HMMER’s hmmbuild in the system path by default (modify in config.ini):

[hmm]
    hmmbuild = hmmbuild

If hmmbuild is not in your system path, set the absolute path:

[hmm]
    hmmbuild = path/to/hmmbuild

Alignment of tandem repeat units

Currently, MAFFT is the default tool for (re-)alignment of the tandem repeat units to each other (Installation instructions). TRAL searches for MAFFT’s ginsi in the system path by default (modify in config.ini):

[repeat]
    ginsi = ginsi

If ginsi is not in your system path, set the absolute path:

[repeat]
    ginsi = path/to/ginsi

Realignment of tandem repeat units with indel aware proPIP algorithm

The alignment module of (Castor) can be used to realign tandem repeat units with the indel aware algorithm proPIP (Installation instructions). TRAL searches for Castor in the system path by default (modify in config.ini):

[repeat]
    Castor = Castor

If Castor is not in your system path, set the absolute path:

[repeat]
    Castor = path/to/Castor

Currently integrated detectors

Software

License

HHrepID

CC-BY-NC-2.0

PHOBOS

Non-commercial

TRED

Non-commercial

T-REKS

Non-commercial?

TRF

Unlimited use

TRUST

?

XSTREAM

Non-commercial

HHrepID

HHrepID is a profile self alignment de novo amino acid tandem repeat detection software published e.g. in PLOS Computational Biology (2011) (Installation instructions). TRAL searches for the executable binary hhrepid64 in the system path by default (modify in config.ini):

[sequence]
    [[repeat_detector_path]]
        HHrepID = hhrepid_64

If the executable hhrepid is not in your system path, set the absolute path:

[sequence]
    [[repeat_detector_path]]
        HHrepID = path/to/hhrepid

Also, you need to supply a null hmm file for using HHrepID. A dummy file is located in your home/.tral. Supply the path to your null hmm file of choice:

[sequence]
    [[repeat_detector_path]]
        HHrepID_dummyhmm = /path/to/home/.tral/data/dummyHMM.hmm

PHOBOS

PHOBOS is an unpublished k-mer based de novo DNA tandem repeat detection software. TRAL searches for the executable phobos in the system path by default (modify in config.ini):

[sequence]
    [[repeat_detector_path]]
        PHOBOS = phobos

If phobos is not in your system path, set the absolute path:

[sequence]
    [[repeat_detector_path]]
        PHOBOS = path/to/phobos

TRED

TRED is a sequence self alignment de novo amino acid tandem repeat detection software published in Bioinformatics (2007) (The software is available on request). TRAL searches for the executable tred in the system path by default (modify in config.ini):

[sequence]
    [[repeat_detector_path]]
        TRED = tred

If phobos is not in your system path, set the absolute path:

[sequence]
    [[repeat_detector_path]]
        TRED = path/to/tred

T-REKS

T-REKS is a k-mer based de novo DNA/AA tandem repeat detection software published in Bioinformatics (2009) (Installation instructions). Create an executable text file T-REKS with the following content:

#!/bin/sh
java -jar /my/path/to/T-Reks.jar "$@"

If you place this text file in your systems path, TRAL finds it by default (modify in config.ini):

[sequence]
    [[repeat_detector_path]]
        T-REKS = T-REKS

If you did not place T-REKS in your system path or named it differently, set the absolute path:

[sequence]
    [[repeat_detector_path]]
        T-REKS = path/to/T-REKS

TRF

TRF is a k-mer based self alignment de novo DNA tandem repeat detection software published in Nucleic Acids Research (1999) (Installation instructions). TRAL searches for the executable trf in the system path by default (modify in config.ini):

[sequence]
    [[repeat_detector_path]]
        TRF = trf

If trf is not in your system path, set the absolute path:

[sequence]
    [[repeat_detector_path]]
        TRF = path/to/trf

TRUST

TRUST is a sequence self alignment de novo amino acid tandem repeat detection software published in Bioinformatics (2004) (Installation instructions).

Create an executable text file TRUST with the following content (you can amend the java memory consumption restrictions):

#!/bin/sh
java -Xmx30G -cp /my/path/to/TRUST/1.0.0/Align nl.vu.cs.align.SelfSimilarity "$@"

If you place this text file in your systems path, TRAL finds it by default (modify in config.ini):

[sequence]
    [[repeat_detector_path]]
        TRUST = TRUST

If you did not place TRUST in your system path or named it differently, set the absolute path:

[sequence]
    [[repeat_detector_path]]
        TRUST = path/to/TRUST

Also, you need to supply a substitution matrix for using TRUST (it ships with several substitution matrices). Supply the path of your favourite substitution matrix:

[sequence]
    [[repeat_detector_path]]
        TRUST_substitutionmatrix = /path/to/TRUST/Align/BLOSUM50

XSTREAM

XSTREAM is a k-mer based de novo DNA/AA tandem repeat detection software published in BMC Bioinformatics (2007) (Installation instructions).

Create an executable text file XSTREAM with the following content:

#!/bin/sh
java -jar /my/path/to/xstream.jar "$@"

If you place this text file in your systems path, TRAL finds it by default (modify in config.ini):

[sequence]
    [[repeat_detector_path]]
        XSTREAM = XSTREAM

If you did not place XSTREAM in your system path or named it differently, set the absolute path:

[sequence]
    [[repeat_detector_path]]
        XSTREAM = path/to/XSTREAM

Simulation of evolution in tandem repeats

For simulating evolved sequences ALF can be used which simulates a root genome into a number of related genomes. It was published in Molecular Biology and Evolution (2012) and can be downloaded on alfsim.org. TRAL searches for ALFs executable alfsim in the system path by default (modify in config.ini):

[repeat]
    alfsim = alfsim

If alfsim is not in your system path, set the absolute path:

[repeat]
    alfsim = path/to/alfsim

Not yet integrated software

There is a large number of tandem repeat detection software for which TRAL does not provide parsers. However, theses parsers are easily manually added to sequence.repeat_detection_io. Please file an issue on the tracker.