regularize it
Perl
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
author
lib/Lingua/JA/Regular
t
xt
.gitignore
.travis.yml
Build.PL
Changes
LICENSE
META.json
README.md
cpanfile
minil.toml

README.md

Build Status MetaCPAN Release

NAME

Lingua::JA::Regular::Unicode - convert japanese chars.

SYNOPSIS

use Lingua::JA::Regular::Unicode qw/alnum_z2h hiragana2katakana space_z2h/;
alnum_z2h("A1");                                        # => "A1"
hiragana2katakana("ほげ");                                # => "ホゲ"
space_z2h("\x{0300}");                                    # => 半角スペース

DESCRIPTION

Lingua::JA::Regular::Unicode is regularizer.

  • alnum_z2h

    Convert alphabet, numbers and symbols ZENKAKU to HANKAKU.

    Symbols contains >, <.

    Yes, it's bit strange. But so, this behaviour is needed by historical reason.

  • alnum_h2z

    Convert alphabet, numbers and symbols HANKAKU to ZENKAKU.

  • space_z2h

    convert spaces ZENKAKU to HANKAKU.

  • space_h2z

    convert spaces HANKAKU to ZENKAKU.

  • katakana_z2h

    convert katakanas ZENKAKU to HANKAKU.

  • katakana_h2z

    convert katakanas HANKAKU to ZENKAKU.

  • katakana2hiragana

    convert KATAKANA to HIRAGANA.

    This method ignores following chars:

      KATAKANA LETTER VA
      KATAKANA LETTER SMALL RE
      KATAKANA LETTER SMALL HU
      KATAKANA LETTER SMALL HI
      KATAKANA LETTER SMALL HE
      KATAKANA DIGRAPH KOTO
      KATAKANA LETTER SMALL SU
      KATAKANA LETTER SMALL HO
      KATAKANA LETTER SMALL SI
      KATAKANA LETTER SMALL RI
      KATAKANA LETTER VE
      KATAKANA LETTER SMALL TO
      KATAKANA LETTER SMALL KU
      KATAKANA LETTER VO
      KATAKANA LETTER SMALL RO
      KATAKANA LETTER SMALL RA
      KATAKANA LETTER SMALL MU
      KATAKANA LETTER SMALL HA
      KATAKANA LETTER VI
      KATAKANA LETTER SMALL RU
      KATAKANA LETTER SMALL NU
      KATAKANA MIDDLE DOT
      HALFWIDTH KATAKANA SEMI-VOICED SOUND MARK
      HALFWIDTH KATAKANA VOICED SOUND MARK
      HALFWIDTH KATAKANA MIDDLE DOT
    
  • hiragana2katakana

    convert HIRAGANA to KATAKANA.

    This method ignores following chars:

      HIRAGANA DIGRAPH YORI
    

AUTHOR

Tokuhiro Matsuno <tokuhirom AAJKLFJEF@ GMAIL COM>

THANKS To

takefumi kimura - the author of L<Lingua::JA::Regular>
dankogai

SEE ALSO

Lingua::JA::Regular

LICENSE

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.