Bio::DOOP::ClusterSubset - One subset of a cluster
Version 0.13
@cluster_subsets = @{$cluster->get_all_subsets};
This object represents one subset of a cluster. A subset is a set of homologous sequences, hopefully monophyletic, grouped by evolutionary distance from the reference species (Arabidopsis or human).
Tibor Nagy, Godollo, Hungary and Endre Sebestyen, Martonvasar, Hungary
Creates a new subset object from the subset primary id. You usually won't need this, as you will create the subsets from a Bio::DOOP::Cluster object, based on the subset type.
Return type: Bio::DOOP::ClusterSubset object
$cluster_subset = Bio::DOOP::ClusterSubset->new($db,"123");
Prints out the subset primary id. This is the internal ID from the MySQL database.
Return type: string
print $cluster_subset->get_id;
Prints out the subset type.
print $cluster_subset->get_type;
Returns the number of sequences in the subset.
for(i = 0; i < $cluster_subset->get_seqno; i++){ print $seq[$i]; }
Returns the total number of features (motifs, TSSs and other) in the subset.
if ($cluster_subset->get_featno > 4){ print "We have lots of features!!!\n"; }
Returns the number of motifs in the subset.
$motifs = $cluster_subset->get_motifno;
Returns 'y' if the subset is the same as the original cluster, 'n' if not.
Return type: string ('y' or 'n')
if ($cluster_subset->get_orig eq "y") { print "This is the original cluster!\n"; } elsif ($cluster_subset->get_orig eq "n"){ print "This is some smaller subset!\n"; }
Returns the ID of the cluster, from which the subset originates.
$cluster_id = $cluster_subset->get_cluster;
Prints out the dialign format alignment of the subset.
print $cluster_subset->get_dialign;
Prints out the fasta format alignment of the subset.
print $cluster_subset->get_fasta_align;
Returns the arrayref of all motifs associated with the subset.
Return type: arrayref, the array containig Bio::DOOP::Motif objects
@motifs = @{$cluster_subset->get_all_motifs};
Returns a sorted arrayref of all sequences associated with the subset.
Sorting the sequences by the following criteria: The first sequence is always the reference species (Arabidopsis/Human). All other sequences are sorted first by the taxon_class (B E M V in the plants and P R E H M N T F V C in the chordates ) and then by the alphabetical order.
Return type: arrayref, the array containig Bio::DOOP::Sequence objects
@seq = @{$cluster_subset->get_all_seqs};
To install Bio::DOOP::DOOP, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Bio::DOOP::DOOP
CPAN shell
perl -MCPAN -e shell install Bio::DOOP::DOOP
For more information on module installation, please visit the detailed CPAN module installation guide.