Chemistry::Harmonia - Decision of simple and difficult chemical puzzles.
use Chemistry::Harmonia qw(:redox); use Data::Dumper; for my $formula ('Fe3O4', '[Cr(CO(NH2)2)6]4[Cr(CN)6]3'){ my $ose = oxidation_state( $formula ); print Dumper $ose; }
Will print something like:
$VAR1 = { 'O' => { 'num' => [ 4 ], 'OS' => [ [ -2 ] ] }, 'Fe' => { 'num' => [ 3 ], 'OS' => [ [ 2, 3, 3 ] ] } }; $VAR1 = { 'H' => { 'num' => [ 96 ], 'OS' => [ [ 1 ] ] }, 'O' => { 'num' => [ 24 ], 'OS' => [ [ -2 ] ] }, 'N' => { 'num' => [ 48, 18 ], 'OS' => [ [ -3 ], [ -3 ] ] }, 'C' => { 'num' => [ 24, 18 ], 'OS' => [ [ 4 ], [ 2 ] ] }, 'Cr' => { 'num' => [ 4, 3 ], 'OS' => [ [ 3 ], [ 2 ] ] } };
Transformation of the chemical mix in reagent and product arrays:
my $chemical_equation = 'KMnO4 + NH3 --> N2 + MnO2 + KOH + H2O'; print Dumper parse_chem_mix( $chemical_equation );
Will print:
$VAR1 = [ ['KMnO4', 'NH3'], ['N2', 'MnO2', 'KOH','H2O'] ];
Preparation of the chemical mix (equation) from reagent and product arrays:
my $ce = [ [ 'K', 'O2'], [ 'K2O', 'Na2O2', 'K2O2', 'KO2' ] ]; my $k = { 'K2O' => 1, 'Na2O2' => 0, 'K2O2' => 2, 'KO2' => 3 }; print prepare_mix( $ce, { 'coefficients' => $k } ),"\n";
Will output:
K + O2 == 1 K2O + 0 Na2O2 + 2 K2O2 + 3 KO2
'Synthesis' of the good :) chemical formula(s):
my $abracadabra = 'ggg[crr(cog(nhz2)2)6]4[qcr(cn)5j]3qqq'; print Dumper good_formula( $abracadabra );
$VAR1 = [ '[Cr(CO(NH2)2)6]4[Cr(CN)5I]3', '[Cr(Co(NH2)2)6]4[Cr(CN)5I]3' ];
Calculation CLASS-CIR and brutto (gross) formulas of substances for reaction. See example:
my $mix = '2 KMnO4 + 5 H2O2 + 3 H2SO4 --> 1 K2SO4 + 2 MnSO4 + 8 H2O + 5 O2'; my %cf; my $ce = parse_chem_mix( $mix, \%cf ); print Dumper class_cir_brutto( $ce, \%cf );
$VAR1 = [ 'HKMnOS', 1504979632, { 'O2' => 'O2', 'MnSO4' => 'Mn1O4S1', 'KMnO4' => 'K1Mn1O4', 'K2SO4' => 'K2O4S1', 'H2SO4' => 'H2O4S1', 'H2O2' => 'H2O2', 'H2O' => 'H2O1' } ];
TTC reaction. Proceeding example above:
print Dumper ttc_reaction( $ce );
$VAR1 = { 'r' => 5, 'a' => 5, 's' => 7 };
The module provides the necessary subroutines to solve some puzzles of the general inorganic and physical chemistry. The methods implemented in this module, are all oriented to known rules and laws of general and physical chemistry.
Chemistry::Harmonia provides these subroutines:
oxidation_state( $formula_of_substance ) parse_chem_mix( $mix_of_substances [, \%coefficients ] ) good_formula( $abracadabra [, { 'zero2oxi' => 1 } ] ) prepare_mix( \@reactants_and_products [, \%facultative_parameters ] ) class_cir_brutto( \@reactants_and_products [, \%coefficients ] ) ttc_reaction( \@reactants_and_products )
All of them are context-sensitive.
This subroutine returns a hierarchical hash-reference of hash integer oxidation state (key 'OS') and hash with the number of atoms for each element (key 'num') for the inorganic $formula_of_substance.
$formula_of_substance
Always use the upper case for the first character in the element name and the lower case for the second character from Periodic Table. Examples: Na, Ag, Co, Ba, C, O, N, F, etc. Compare: Co - cobalt and CO - carbon monoxide.
For very difficult mysterious formula (usually organic) returns undef. It will be good if to set, for example, 'Pb3C2O7' and 'Pt2Cl6' as '{PbCO3}2{PbO}' and '{PtCl2}{PtCl4}'.
undef
If you doesn't know formulas of chemical elements and/or Periodic Table use subroutine good_formula(). I insist to do it always anyway :)
good_formula()
Now oxidation_state() is checked over 6200 unique inorganic substances.
oxidation_state()
A chemical equation consists of the chemical formulas of the reactants and products. This subroutine parses $mix_of_substances (usually chemical equation) to arrays of the reactants (initial substances) and products (substances formed in the chemical reaction). It is the most simple and low-cost way to carry out reaction without reactants :).
$mix_of_substances
Separator of the reactants from products can be sequence '=', '-' together or without one or some '>'. For example: =, ==, =>, ==>, ==>>, -, --, ->, -->, ->>> etc. Spaces round a separator are not essential. If the separator is not set, last substance of a mix will be a product only.
Each individual substance's chemical formula is separated from others by a plus sign ('+'), comma (','), semicolon (';') and/or space. Valid examples:
print Dumper parse_chem_mix( 'KNO3 + S ; K2SO4 , NO SO2' );
$VAR1 = [ [ 'KNO3','S','K2SO4','NO' ], [ 'SO2' ] ];
If in $mix_of_substances is stoichiometric coefficients they collect in ref to hash \%coefficients. Next example:
\%coefficients
my %coef; my $chem_eq = 'BaS + 2 H2O = Ba(OH)2 + 1 Ba(SH)2'; my $out_ce = parse_chem_mix( $chem_eq, \%coef ); print Dumper( $out_ce, \%coef );
$VAR1 = [ [ 'BaS', 'H2O'], [ 'Ba(OH)2', 'Ba(SH)2'] ]; $VAR2 = { 'Ba(SH)2' => '1', 'H2O' => '2' };
By zero (0) coefficients it is possible to eliminate substances from the mix. Next example:
my $chem_eq = '2Al O2 = Al2O3 0 CaO*Al2O3';
Will output like:
$VAR1 = [ [ 'Al', 'O2' ], [ 'Al2O3' ] ]; $VAR2 = { 'Al' => '2' };
However:
$chem_eq = '2Al O2 Al2O3 0 CaO*Al2O3';
$VAR1 = [ [ 'Al', 'O2', 'Al2O3', 'O' ], [ 'CaO*Al2O3' ] ]; $VAR2 = { 'Al' => '2' };
As without a separator ('=' or others similar) the last substance will be a product.
If in $mix_of_substances is zero (0) similar oxygen, they are replaced oxygen. Certainly, oxygen is life. I love oxygen :) Some more examples:
$chem_eq = '2Al 02 Ca CaO*Al2O3';
$VAR1 = [ [ 'Al', 'O2', 'Ca' ], [ 'CaO*Al2O3' ] ]; $VAR2 = { 'Al' => '2' };
Input:
$chem_eq = '2Al 102 Ca CaO*Al2O3';
Output:
$VAR1 = [ [ 'Al', 'Ca' ], [ 'CaO*Al2O3' ] ]; $VAR2 = { 'Al' => '2', 'Ca' => '102' };
$chem_eq = '2Al --> 0 0Al2O3 0';
$VAR1 = [ [ 'Al' ], [ 'O' ] ]; $VAR2 = { 'Al' => '2' };
$chem_eq = 'Al O2 = 1 0Al2O3';
$VAR1 = [ [ 'Al', 'O2' ], [ 'OAl2O3' ] ]; $VAR2 = { 'OAl2O3' => '1' };
The forced conversion of single zero to oxygen is set by parameter 'zero2oxi'. It add in \%coefficients. Next example:
'zero2oxi'
$coef{ 'zero2oxi' } = 1; $chem_eq = 'Al CaO = 0 Al2O3';
$VAR1 = [ ['Al', 'CaO'], ['O', 'Al2O3'] ];
Without 'zero2oxi' output:
$VAR1 = [ ['Al'], ['CaO'] ];
Actually the subroutine recognizes more many difficult situations. Here some examples:
'2Al 1 02 Ca Al2O3' to-> [ ['Al', 'O2', 'Ca'], ['Al2O3'] ], {'Al' => 2, 'O2' => 1} '2Al 102 Ca Al2O3' to-> [ ['Al', 'Ca'], ['Al2O3'] ], {'Al' => 2, 'Ca' => 102} '2Al 1 02 4O2 = 1 0Al2O3' to-> [ ['Al', 'O2'], ['OAl2O3'] ], {'Al' => 2, 'O2' => 4, 'OAl2O3' => 1} '2Al = 00 Al2O3' to-> [ ['Al'], ['O0', 'Al2O3'] ], {'Al' => 2} '2Al O 2 = Al2O3' to-> [ ['Al', 'O2'], ['Al2O3'] ], {'Al' => 2} '2Al O = ' to-> [ ['Al', 'O'], ['='] ], {'Al' => 2} ' = 2Al O' to-> [ ['=','Al'], ['O'] ], {'Al' => 2} '0Al = O2 Al2O3' to-> [ ['O2'], ['Al2O3'] ] '2Al 1 2 3 4 Ca 5 6 Al2O3 7 8 9' to-> [ ['Al', 'Ca'], ['Al2O3'] ], {'Al2O3' => 56, 'Al' => 2, 'Ca' => 1234} '2Al 1 2 3 4 Ca 5 6 Al2O3' to-> [ ['Al', 'Ca'], ['Al2O3'] ], {'Al2O3' => 56, 'Al' => 2, 'Ca' => 1234} '2Al 1 2 3 4 Ca 5 6 = Al2O3' to-> [ ['Al', 'Ca56'], ['Al2O3'] ], {'Al' => 2, 'Ca56' => 1234} '2Al 1 2 3 4 Ca 5 6 = Al2O3 CaO 9' to-> [ ['Al', 'Ca56'], ['Al2O3', 'CaO'] ], {'Al' => 2, 'Ca56' => 1234} 'Al O + 2 = Al2O3' to-> [ ['Al', 'O'], ['Al2O3'] ], {'Al2O3' => 2} 'Cr( OH ) 3 + NaOH = Na3[ Cr( OH ) 6 ]' to-> [ ['Cr(OH)3', 'NaOH'], ['Na3[Cr(OH)6]'] ]
This subroutine parses $abracadabra to array reference of "good" chemical formula(s). The "good" formula it does NOT mean chemically correct. The subroutine oxidation_state() will help with a choice chemically correct formula.
$abracadabra
Algorithm basis is the robust sense and chemical experience.
'Co' to-> 'Co' 'Cc' to-> 'CC' 'co' to-> 'CO', 'Co' 'CO2' to-> 'CO2' 'Co2' to-> 'Co2', 'CO2' 'mo2' to-> 'Mo2'
The good formula(s) there are chemical elements, brackets ()[]{} and digits only. good_formula() love oxygen. Fraction will be scaled in the integer.
Fragments A*B, xC*yD are transformed to {A}{B}, {C}x{D}y (here A, B, C, D - groups of chemical elements, digits and brackets ()[]{}; x, y - digits only). Next examples:
'0.3al2o3*1.5sio2' to-> '{Al2O3}{SIO2}5', '{Al2O3}{SiO2}5' 'al2(so4)3*10h20' to-> '{Al2(SO4)3}{H20}10' '..,,..mg0,,,,.*si0...s..,..' to-> '{MgO}{SIOS}', '{MgO}{SiOS}'
Superfluous brackets won't be:
'Irj(){}[]' to-> 'IrI' '[{(na)}]' to-> 'Na'
'{[[([[CaO]])*((SiO2))]]}' to-> '{([[CaO]])}{((SiO2))}'
If in $abracadabra is zero (0) similar oxygen, they are replaced oxygen. I love the oxygen is still :) Next examples:
'00O02' to-> 'OOOO2' 'h02' to-> 'Ho2', 'HO2'
'h20' to-> 'H20'
The forced conversion of zero to oxygen is set by parameter 'zero2oxi':
my $chem_formulas = good_formula( 'h20', { 'zero2oxi' => 1 } );
Output @$chem_formulas:
@$chem_formulas
'H20', 'H2O'
If mode of paranoiac is necessary, then transform $abracadabra to low case as:
lc $abracadabra
Beware use very long $abracadabra!
This subroutine simple but useful. It forms the chemical mix (equation) from ref to array of arrays \@reactants_and_products, i.e. is parse_chem_mix antipode.
\@reactants_and_products
parse_chem_mix
The following can be \%facultative_parameters: 'substances' - ref to array of real (required) substances, 'coefficients' - ref to hash stoichiometry coefficients for substances. Full examples:
\%facultative_parameters
'substances'
'coefficients'
my $ce = [ [ 'O2', 'K' ], [ 'K2O', 'Na2O2', 'K2O2', 'KO2' ] ]; my $k = { 'K' => 2, 'K2O2' => 1, 'KO2' => 0 }; my $mix = prepare_mix( $ce, { 'coefficients' => $k } );
Will output $mix:
$mix
O2 + 2 K == K2O + Na2O2 + 1 K2O2 + 0 KO2
For real substances:
my $real = [ 'K', 'O2', 'K2O2', 'KO2' ]; print prepare_mix( $ce, { 'coefficients' => $k, 'substances' => $real } );
O2 + 2 K == 1 K2O2 + 0 KO2
This subroutine calculates Unique Common Identifier of Reaction \@reactants_and_products with stoichiometry \%coefficients and brutto (gross) formulas of substances, i.e ref to array: 0th - alphabetic CLASS, 1th - Chemical Integer Reaction Identifier (CIR), 2th - hash brutto substances.
my $reaction = '1 H2O + 1 CO2 --> 1 H2CO3'; my %cf; my $ce = parse_chem_mix( $reaction, \%cf ); print Dumper class_cir_brutto( $ce, \%cf );
Will print
$VAR1 = [ 'CHO', 1334303561, { 'H2CO3' => 'C1H2O3', 'CO2' => 'C1O2', 'H2O' => 'H2O1' } ];
CIR is a 32 bit CRC of normalized chemical equation, generating the same CRC value as the POSIX GNU cksum program. The returned CIR will always be a non-negative integer in the range 0..2^32-1, i.e. 0..4,294,967,295.
cksum
The nature is diversiform, but we search simple decisions :)
The class_cir_brutto() protesting CLASS-CIR over 22,100 unique inorganic reactions. Yes, to me it was hard to make it.
class_cir_brutto()
This subroutine calculates Tactico-Technical characteristics (TTC) of reaction \@reactants_and_products, sorry military slang :), i.e. quantity SAR: (s)ubstance, (a)toms and (r)ank of reaction. Proceeding example above:
$VAR1 = { 'r' => 2, 'a' => 3, 's' => 3 };
Chemistry::Harmonia exports nothing by default. Each of the subroutines can be exported on demand, as in
use Chemistry::Harmonia qw( oxidation_state );
the tag redox exports the subroutines oxidation_state, parse_chem_mix and prepare_mix:
redox
oxidation_state
prepare_mix
use Chemistry::Harmonia qw(:redox);
the tag equation exports the subroutines good_formula, parse_chem_mix, prepare_mix, class_cir_brutto and ttc_reaction:
equation
good_formula
class_cir_brutto
ttc_reaction
use Chemistry::Harmonia qw(:equation);
and the tag all exports them all:
all
use Chemistry::Harmonia qw(:all);
Chemistry::Harmonia is known to run under perl 5.8.8 on Linux. The distribution uses Chemistry::File::Formula, Algorithm::Combinatorics, Regexp::Common, Math::BigInt, Math::BigRat, Math::Assistant, String::CRC::Cksum, Inline::Files and Carp.
Greenwood, Norman N.; Earnshaw, Alan. (1997), Chemistry of the Elements (2nd ed.), Oxford: Butterworth-Heinemann
Irving Langmuir. The arrangement of electrons in atoms and molecules. J. Am. Chem. Soc. 1919, 41, 868-934.
Chemistry-Elements, Chemistry::Mol, Chemistry::File and Chemistry::MolecularMass.
Alessandro Gorohovski, <angel@feht.dgtu.donetsk.ua>
Copyright (C) 2011 by A. N. Gorohovski
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.8 or, at your option, any later version of Perl 5 you may have available.
To install Chemistry::Harmonia, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Chemistry::Harmonia
CPAN shell
perl -MCPAN -e shell install Chemistry::Harmonia
For more information on module installation, please visit the detailed CPAN module installation guide.