NAME
Bio::SeqHash - get one or more sequences from a FASTA file quickly.
VERSION
version 0.1.4
SYNOPSIS
# use it in object-oriented way;
use
Bio::SeqHash;
my
$obj
= Bio::SeqHash->new(
file
=>
"test.fa"
);
my
$hs
=
$obj
->fa2hs;
# get a HashRef coverted from the test.fa
my
$seq
=
$obj
->get_seq(
"seq_id"
);
# get the sequence of "seq_id"(only the sequence)
my
$seq_fa
=
$obj
->get_id_seq(
"seq_id"
);
# get the sequence of "seq_id"(in the FASTA format)
$obj
->get_seqs_batch(
'id_list.txt'
,
'output.fa'
);
# extract specified sequence to output file
# use it in Common mode;
my
$hs
= fa2hs(
"in.fa"
);
DESCRIPTION
Currently, there do have some modules that can operate the FASTA file such as Bio::SeqIO, But it only provide some basic operation to obtain the information about sequence. In my daily work, I still have to write some repetitive code. So this module is write to perform a deeper wrapper for operating FASTA file Notice: this module is not suitable for the FASTA file that is extremble big.
METHODS
fa2hs
Convert a fasta file into a HashRef variable. If the C<file> parameter have been passed
into during the process of new, here needs
no
parameter, otherwise you have to provide
the path of a fasta file.
get_id_seq
get a sequence by a id in FASTA
format
get_seq
get a sequence by a id
get_seqs_batch
extract specified gene list from input file
AUTHOR
Yan Xueqing <yanxueqing621@163.com>
COPYRIGHT AND LICENSE
This software is copyright (c) 2017 by Yan Xueqing.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.