Summary

Tag

repeats

Owner

SGN

Input

the vector-screened sequences, from the seq analysis

External Data

MIPS repeat collection, SGN UniRepeats collection

Output

repeat-masked sequences

The repeats analysis repeat-masks each sequence in the batch using RepeatMasker. It makes hard and soft-masked versions of the output of two different RepeatMasker runs.

One run, the "regular" run, uses the ftp://ftp.sgn.cornell.edu/tomato_genome/repeats/mipsREdat_8.8_solanaceae_TE.tfa.gz repeat library, and includes the -nolow option to disable masking of low-complexity repeats. The other run, called "stringent", uses as its repeat library ftp://ftp.sgn.cornell.edu/tomato_genome/repeats/mipsREdat_8.8_eudico_TEs.tfa.gz and does not include the -nolow option.

It was suggested by Stephane that the less-stringently-masked "regular" output would be more appropriate for input to gene finders.

Input

Filenames

masked:fasta

softmasked:fasta

masked_stringent:fasta

softmasked_stringent:fasta

Requirements

Processing

Description

Parameters

normal: run as RepeatMasker -q -nolow -gff -lib mips_repeat_set.fasta -xsmall -parallel 2 bac_seq_file stringent: run as RepeatMasker -q -gff -lib mips_and_uni_combined.fasta -xsmall -parallel 2 bac_seq_file

Output

Filenames

Requirements

Comments


CategoryAnalysisDescription CategoryAnSgnRepmask (XXX = toolname minus the number)

AnSgnRepmask000 (last edited 2010-06-28 20:06:08 by RobertBuels)