NAME

Bio::DOOP::ClusterSubset - One subset of a cluster

VERSION

Version 0.13

SYNOPSIS

  @cluster_subsets = @{$cluster->get_all_subsets};

DESCRIPTION

This object represents one subset of a cluster. A subset is a set of homologous sequences, hopefully monophyletic, grouped by evolutionary distance from the reference species (Arabidopsis or human).

AUTHORS

Tibor Nagy, Godollo, Hungary and Endre Sebestyen, Martonvasar, Hungary

METHODS

new

Creates a new subset object from the subset primary id. You usually won't need this, as you will create the subsets from a Bio::DOOP::Cluster object, based on the subset type.

Return type: Bio::DOOP::ClusterSubset object

  $cluster_subset = Bio::DOOP::ClusterSubset->new($db,"123");

get_id

Returns the subset primary ID. This is the internal ID from the MySQL database.

Return type: string

  print $cluster_subset->get_id;

get_type

Returns the subset type.

Return type: string

  print $cluster_subset->get_type;

get_seqno

Returns the number of sequences in the subset.

Return type: string

  # @seq is assumed to hold the subset's sequences, e.g. from get_all_seqs
  for($i = 0; $i < $cluster_subset->get_seqno; $i++){
      print $seq[$i];
  }

get_featno

Returns the total number of features (motifs, TSSs and other) in the subset.

Return type: string

  if ($cluster_subset->get_featno > 4){
      print "We have lots of features!!!\n";
  }

get_motifno

Returns the number of motifs in the subset.

Return type: string

  $motifs = $cluster_subset->get_motifno;
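
The three counters above can be combined into a short summary. The following is only a sketch: My::FakeSubset is a hypothetical stand-in class, not part of Bio::DOOP, that mimics the documented accessors so the example runs without a database, and the counts are made-up values. It also assumes, based on the get_featno description, that the feature total includes the motifs.

```perl
use strict;
use warnings;

# Hypothetical stand-in for Bio::DOOP::ClusterSubset (not part of Bio::DOOP).
# It mimics only the counters documented above.
package My::FakeSubset;
sub new         { my ($class, %args) = @_; return bless {%args}, $class }
sub get_seqno   { $_[0]{seqno} }
sub get_featno  { $_[0]{featno} }
sub get_motifno { $_[0]{motifno} }

package main;

# Made-up counts, stored as strings to match the documented return type.
my $cluster_subset = My::FakeSubset->new(
    seqno   => '4',
    featno  => '12',
    motifno => '5',
);

# Perl converts the strings to numbers in numeric context.
# Assumption: get_featno counts motifs too, so the difference is "other".
my $other = $cluster_subset->get_featno - $cluster_subset->get_motifno;

my $summary = sprintf "%d sequences, %d features (%d motifs, %d other)",
    $cluster_subset->get_seqno,
    $cluster_subset->get_featno,
    $cluster_subset->get_motifno,
    $other;

print "$summary\n";
```

With a real subset object, only the construction step changes; the accessor calls stay the same.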

get_orig

Returns 'y' if the subset is the same as the original cluster, 'n' if not.

Return type: string ('y' or 'n')

  if ($cluster_subset->get_orig eq "y") {
      print "This is the original cluster!\n";
  }
  elsif ($cluster_subset->get_orig eq "n"){
      print "This is some smaller subset!\n";
  }
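
As a further illustration, get_orig can be used to keep only the subsets that differ from the original cluster. This is a minimal sketch: My::FakeSubset is a hypothetical stand-in class, not part of Bio::DOOP, and the IDs are made up. In real code the list would come from get_all_subsets, as in the SYNOPSIS.

```perl
use strict;
use warnings;

# Hypothetical stand-in for Bio::DOOP::ClusterSubset (not part of Bio::DOOP).
# It mimics only get_id and get_orig as documented above.
package My::FakeSubset;
sub new      { my ($class, %args) = @_; return bless {%args}, $class }
sub get_id   { $_[0]{id} }
sub get_orig { $_[0]{orig} }

package main;

# Made-up data; real code would use @{ $cluster->get_all_subsets }.
my @cluster_subsets = (
    My::FakeSubset->new(id => '101', orig => 'n'),
    My::FakeSubset->new(id => '102', orig => 'y'),
    My::FakeSubset->new(id => '103', orig => 'n'),
);

# Keep the subsets that are smaller than the original cluster.
my @smaller     = grep { $_->get_orig eq 'n' } @cluster_subsets;
my @smaller_ids = map  { $_->get_id } @smaller;

print "Smaller subsets: @smaller_ids\n";
```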

get_cluster

Returns the ID of the cluster from which the subset originates.

Return type: string

  $cluster_id = $cluster_subset->get_cluster;

get_dialign

Returns the alignment of the subset in dialign format.

Return type: string

  print $cluster_subset->get_dialign;

get_fasta_align

Returns the alignment of the subset in fasta format.

Return type: string

  print $cluster_subset->get_fasta_align;

get_all_motifs

Returns an arrayref of all motifs associated with the subset.

Return type: arrayref, the array containing Bio::DOOP::Motif objects

  @motifs = @{$cluster_subset->get_all_motifs};

get_all_seqs

Returns a sorted arrayref of all sequences associated with the subset.

The sequences are sorted by the following criteria: the first sequence is always the reference species (Arabidopsis/human). All other sequences are sorted first by taxon_class (B E M V in plants and P R E H M N T F V C in chordates) and then alphabetically.

Return type: arrayref, the array containing Bio::DOOP::Sequence objects

  @seq = @{$cluster_subset->get_all_seqs};
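
The sort order described above can be sketched with a multi-key sort. This is not the module's own implementation, only an illustration of the three criteria. The records are plain hashes standing in for Bio::DOOP::Sequence objects; the field names (name, taxon_class, is_reference) and the placeholder taxon names are assumptions, as is the choice of the name as the key for the alphabetical step.

```perl
use strict;
use warnings;

# taxon_class order for plants, as given in the text above.
my @class_order = qw(B E M V);
my %rank;
@rank{@class_order} = (0 .. $#class_order);

# Hypothetical records; field names and values are made up for illustration.
my @seqs = (
    { name => 'taxon_d',   taxon_class => 'E', is_reference => 0 },
    { name => 'taxon_a',   taxon_class => 'M', is_reference => 0 },
    { name => 'reference', taxon_class => 'B', is_reference => 1 },
    { name => 'taxon_c',   taxon_class => 'B', is_reference => 0 },
    { name => 'taxon_b',   taxon_class => 'E', is_reference => 0 },
);

my @sorted = sort {
       $b->{is_reference} <=> $a->{is_reference}                  # reference first
    || $rank{ $a->{taxon_class} } <=> $rank{ $b->{taxon_class} }  # then class order
    || $a->{name} cmp $b->{name}                                  # then alphabetical
} @seqs;

my @sorted_names = map { $_->{name} } @sorted;
print "$_\n" for @sorted_names;
```

For chordates, the same sketch would apply with @class_order set to the chordate list (P R E H M N T F V C).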