Lingua::AR::Tashkeel - Subroutines for handling Arabic Vowels and Vowel marks
    use Lingua::AR::Tashkeel qw(strip prune fix);

    # Strip all short vowels
    strip('مَكَرُونَة'); # => مكرونة

    # Heuristic for removing short vowels without causing ambiguity
    prune('فَتَّة'); # => فتّة

    # Heuristic for fixing mixed up short and long vowels
    fix('ماحشي'); # => مَحشي
Subroutines for working with Arabic long vowels (حروف علة) and short vowels (حركات تشكيل).
strip removes all Arabic short vowels (Tashkeel) from the given string.
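To illustrate what removing Tashkeel amounts to at the character level, here is a minimal sketch. It is not the module's implementation: `strip_sketch` is a hypothetical helper, and it assumes that "short vowels" means the combining marks in the range U+064B to U+0652 (the tanween forms, fatha, damma, kasra, shadda, and sukun).

```perl
use strict;
use warnings;
use utf8;

# Hypothetical helper, NOT part of Lingua::AR::Tashkeel.
# Assumption: Tashkeel is the combining-mark range U+064B..U+0652
# (fathatan, dammatan, kasratan, fatha, damma, kasra, shadda, sukun).
sub strip_sketch {
    my ($text) = @_;
    $text =~ s/[\x{064B}-\x{0652}]//g;   # delete every mark in the range
    return $text;
}

binmode STDOUT, ':encoding(UTF-8)';
print strip_sketch('مَكَرُونَة'), "\n";
```

The input must be a decoded character string (hence `use utf8` for the literal); on raw UTF-8 bytes the character class would not match.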
prune is a heuristic that removes only the short vowels a native speaker wouldn't write, because leaving them out introduces no ambiguity.
This is often preferable to strip, because Shaddas, and Dammas that indicate a passive verb, are useful clues that one might want to keep.
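The difference from plain stripping can be sketched with a deliberately simplified rule: drop every mark except the Shadda. This is only an approximation for illustration. `prune_sketch` is a hypothetical helper, and unlike the heuristic described above it makes no attempt to keep a Damma that marks a passive verb.

```perl
use strict;
use warnings;
use utf8;

# Hypothetical helper, NOT the module's prune heuristic.
# Simplifying assumption: keep Shadda (U+0651), remove all other
# marks in U+064B..U+0652. Passive-verb Dammas are not detected.
sub prune_sketch {
    my ($text) = @_;
    $text =~ s/[\x{064B}-\x{0650}\x{0652}]//g;   # range skips U+0651
    return $text;
}

binmode STDOUT, ':encoding(UTF-8)';
print prune_sketch('فَتَّة'), "\n";
```

Keeping the Shadda preserves consonant doubling, which is the kind of cue that is lost when every mark is stripped.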
Transliterating from a romanized representation of Arabic into Arabic script often confuses short and long vowels.
fix implements a heuristic for correcting such mix-ups.
Copyright (C) 2016 Ahmad Fatoum
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.