NAME

Bio::Tools::Run::Mdust - Perl extension for Mdust nucleotide filtering

SYNOPSIS

  use Bio::Tools::Run::Mdust;
  my $mdust = Bio::Tools::Run::Mdust->new();

  $mdust->run($bio_seq_object);

DESCRIPTION

Perl wrapper for mdust, the nucleic acid complexity filtering program available from TIGR at http://www.tigr.org/tdb/tgi/software/. It takes a Bio::PrimarySeq object with a DNA alphabet as input. By default it returns a Bio::Seq object with the low-complexity regions replaced by Ns. If coords is set, it instead returns a Bio::Seq::RichSeq object in which each low-complexity region is marked as a Bio::SeqFeature::Generic feature with primary tag 'Excluded', and the sequence itself is left unchanged.

This module uses the environment variable MDUSTDIR to find the mdust program. Set MDUSTDIR to the directory containing the mdust binary. For example, if mdust is installed as /usr/local/bin/mdust, set MDUSTDIR to /usr/local/bin.
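
As a minimal sketch, MDUSTDIR can also be set from inside a script and the expected binary location checked up front. The helper mdust_binary_path is hypothetical (it is not part of this module), and the sketch assumes the binary is named mdust, as in the example above. Setting the variable in a BEGIN block before the module is loaded is a cautious assumption, in case the module reads MDUSTDIR at load time.

```perl
use strict;
use warnings;
use File::Spec;

# Set MDUSTDIR early, unless the calling environment already provides it.
BEGIN { $ENV{MDUSTDIR} //= '/usr/local/bin' }

# use Bio::Tools::Run::Mdust;   # load the wrapper only after MDUSTDIR is set

# Hypothetical helper, not part of Bio::Tools::Run::Mdust:
# build the path where the mdust binary is expected for a given directory.
sub mdust_binary_path {
    my ($dir) = @_;
    return File::Spec->catfile($dir, 'mdust');
}

my $binary = mdust_binary_path($ENV{MDUSTDIR});
if (-x $binary) {
    print "Using $binary\n";
}
else {
    print "mdust not found or not executable at $binary\n";
}
```

Checking the path before running gives a clearer message than a failure deep inside the wrapper.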

SEE ALSO

mdust, Bio::PrimarySeq, Bio::Seq::RichSeq, Bio::SeqFeature::Generic

FEEDBACK

Mailing Lists

User feedback is an integral part of the evolution of this and other Bioperl modules. Please send your comments and suggestions to the Bioperl mailing list. Your participation is much appreciated.

  bioperl-l@bioperl.org                  - General discussion
  http://bioperl.org/wiki/Mailing_lists  - About the mailing lists

Reporting Bugs

Report bugs to the Bioperl bug tracking system to help us keep track of the bugs and their resolution. Bug reports can be submitted via the web:

  http://bugzilla.open-bio.org/

AUTHOR

Donald Jackson (donald.jackson@bms.com)

APPENDIX

The rest of the documentation details each of the object methods. Internal methods are usually prefixed with an underscore (_).

new

  Title         : new
  Usage         : my $mdust = Bio::Tools::Run::Mdust->new( -target => $target_bioseq)
  Purpose       : Create a new mdust object
  Returns       : A Bio::Tools::Run::Mdust object
  Args          : target - Bio::Seq object for masking - alphabet MUST be DNA.
                  wsize - word size for masking (default = 3)
                  cutoff - cutoff score for masking (default = 28)
                  maskchar - character for replacing masked regions (default = N)
                  coords - boolean - indicate low-complexity regions as 
                           Bio::SeqFeature::Generic 
                           objects with primary tag 'Excluded', 
                           do not change sequence (default 0)
                  tmpdir - directory for storing temporary files
                  debug - boolean - toggle debugging output, 
                          do not remove temporary files
  Note          : All of the arguments can also be read or set through their own accessors, such as:
                  my $wsize = $mdust->wsize();
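
The following sketch shows how the constructor arguments and accessors listed above might be combined. Writing every option with a leading dash is an assumption extrapolated from -target in the Usage line. The construction is wrapped in eval so the sketch does nothing harmful where the wrapper is not installed.

```perl
use strict;
use warnings;

# Constructor options with their documented defaults.
# ASSUMPTION: each option takes a leading dash, like -target in Usage.
my %options = (
    -wsize    => 3,      # word size for masking
    -cutoff   => 28,     # cutoff score for masking
    -maskchar => 'N',    # character that replaces masked regions
    -coords   => 0,      # 0: mask the sequence; 1: annotate features instead
);

# Guarded: requires BioPerl-Run; skipped with a message if it is missing.
my $ok = eval {
    require Bio::Tools::Run::Mdust;
    my $mdust = Bio::Tools::Run::Mdust->new(%options);
    $mdust->cutoff(20);                               # accessor used as a setter
    print "cutoff is now ", $mdust->cutoff(), "\n";   # accessor used as a getter
    1;
};
print "Skipped: Bio::Tools::Run::Mdust could not be used\n" unless $ok;
```

Keeping the options in one hash makes it easy to reuse the same settings for several mdust objects.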

run

  Title         : run
  Usage         : $mdust->run();  or  $mdust->run($bio_seq);
  Purpose       : Run mdust on the target sequence
  Args          : target (optional) - Bio::Seq object of alphabet DNA for masking
  Returns       : Bio::Seq object (see 'new' for details)
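
As a sketch of working with the result when coords is true, the code below collects the coordinates of the features tagged 'Excluded'. The helper excluded_regions is hypothetical and not part of this module, the example sequence is made up, and the -coords spelling is an assumption based on the -target form shown under 'new'. The run is guarded because it needs BioPerl-Run and a working mdust binary.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of the module): return [start, end] pairs
# for every feature on the sequence whose primary tag is 'Excluded'.
sub excluded_regions {
    my ($seq) = @_;
    return map  { [ $_->start, $_->end ] }
           grep { $_->primary_tag eq 'Excluded' }
           $seq->get_SeqFeatures;
}

# Guarded: needs BioPerl, BioPerl-Run, and the mdust binary.
my $ok = eval {
    require Bio::Seq;
    require Bio::Tools::Run::Mdust;

    # Made-up DNA sequence with a poly-A stretch in the middle.
    my $target = Bio::Seq->new(
        -id       => 'example',
        -alphabet => 'dna',
        -seq      => 'ACGTACGGTCA' . ('A' x 40) . 'GTCAGCTAGCTAGGAT',
    );

    my $mdust  = Bio::Tools::Run::Mdust->new(-coords => 1);
    my $result = $mdust->run($target);

    printf "low-complexity region: %d-%d\n", @$_ for excluded_regions($result);
    1;
};
print "Skipped: mdust run was not possible here\n" unless $ok;
```

With coords left at its default of 0, the same call would instead return a sequence whose low-complexity regions are replaced by the mask character.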

target

  Title         : target
  Usage         : $mdust->target($bio_seq)
  Purpose       : Set/get the target (sequence to be filtered).  
  Returns       : Target Bio::Seq object
  Args          : Bio::Seq object using the DNA alphabet (optional)