The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

BioX::SeqHash - get one or more sequences from a FASTA file quickly.

VERSION

version 0.1.1

SYNOPSIS

  # use it in object-oriented way;
  
  use Bio::SeqHash;
  my $obj = Bio::SeqHash->new(file => "test.fa");
  my $hs = $obj->fa2hs; # get a HashRef coverted from the test.fa
  my $seq = $obj->get_seq("seq_id"); # get the sequence of "seq_id"(only the sequence)
  my $seq_fa = $obj->get_id_seq("seq_id"); # get the sequence of "seq_id"(in the FASTA format)
  $obj->get_seqs_batch('id_list.txt', 'output.fa');  # extract specified sequence to output file

  # use it in Common mode;

  use Bio::SeqHash "fa2hs";
  my $hs = fa2hs("in.fa");

DESCRIPTION

  Currently, there do have some modules that can operate the FASTA file such as Bio::SeqIO, 
  But it only provide some basic operation to obtain the information about sequence. In my daily work,
  I still have to write some repetitive code. So this module is write to perform a deeper wrapper for operating FASTA file
  Notice: this module is not suitable for the FASTA file that is extremble big.

METHODS

fa2hs

  Convert a fasta file into a HashRef variable. If the C<file> parameter have been passed
  into during the process of new, here needs no parameter, otherwise you have to provide
  the path of a fasta file.

get_id_seq

  get a sequence by a id in FASTA format

get_seq

  get a sequence by a id

get_seqs_batch

  extract specified gene list from input file

AUTHOR

Yan Xueqing <yanxueqing621@163.com>

COPYRIGHT AND LICENSE

This software is copyright (c) 2017 by Yan Xueqing.

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.