@decodedmrq
Forked from terrancesnyder/regex-japanese.txt
Created August 16, 2018 03:38

Revisions

  1. @terrancesnyder terrancesnyder created this gist Nov 7, 2011.
    Regex for matching Japanese common & uncommon Kanji (4e00 – 9faf) ~ The Big Kahuna!
    ([一-龯])
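
A minimal Python sketch of applying the kanji character class, assuming Python 3's built-in `re` module. The name `KANJI` and the sample string are illustrative and not part of the original list.

```python
import re

# Character class copied from the entry above.
KANJI = re.compile(r"[一-龯]")

sample = "漢字とカタカナ"  # hypothetical sample text

# findall returns every single-character match, in order of appearance.
print(KANJI.findall(sample))  # ['漢', '字']
```

Characters outside this range are not matched. That includes the CJK extension blocks and the iteration mark 々 (U+3005), so "時々" yields only 時.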

    Regex for matching Hiragana or Katakana
    ([ぁ-んァ-ン])

    Regex for matching any character that is neither Hiragana nor Katakana
    ([^ぁ-んァ-ン])
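
One way the negated class might be used is to check that a string consists only of kana. The sketch below assumes Python 3's `re` module; `is_all_kana` is a hypothetical helper name.

```python
import re

# Negated class from the entry above: any character outside ぁ-ん and ァ-ン.
NON_KANA = re.compile(r"[^ぁ-んァ-ン]")

def is_all_kana(text: str) -> bool:
    """Return True when text is non-empty and has no character outside the kana ranges."""
    return bool(text) and NON_KANA.search(text) is None

print(is_all_kana("ひらがな"))  # True
print(is_all_kana("カタカナ"))  # True
print(is_all_kana("漢字かな"))  # False
print(is_all_kana("コーヒー"))  # False
```

The last call returns False because the prolonged sound mark ー (U+30FC) lies after ン (U+30F3), outside the ァ-ン range. Add it to the class if such words should count as kana.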

    Regex for matching Hiragana or Katakana or basic punctuation (、。’)
    ([ぁ-んァ-ン、。’])

    Regex for matching Hiragana or Katakana plus a few full-width symbols (!:/)
    ([ぁ-んァ-ン!:/])

    Regex for matching Hiragana
    ([ぁ-ん])

    Regex for matching full-width Katakana (zenkaku 全角)
    ([ァ-ン])

    Regex for matching half-width Katakana (hankaku 半角)
    ([ァ-ン゙゚])

    Regex for matching full-width Numbers (zenkaku 全角)
    ([0-9])
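
A short Python sketch of converting full-width digits to ASCII once they are matched. It assumes Python 3's `re` module and writes the class with `\uXXXX` escapes (U+FF10 to U+FF19) instead of literal characters. The helper name `to_ascii_digits` is hypothetical.

```python
import re

# Full-width digits 0-9 occupy U+FF10 through U+FF19.
FULLWIDTH_DIGITS = re.compile("[\uff10-\uff19]")

# Each full-width digit sits exactly 0xFEE0 above its ASCII counterpart.
OFFSET = 0xFEE0

def to_ascii_digits(text: str) -> str:
    """Replace every full-width digit with the matching ASCII digit."""
    return FULLWIDTH_DIGITS.sub(lambda m: chr(ord(m.group()) - OFFSET), text)

print(to_ascii_digits("\uff12\uff10\uff12\uff14年"))  # 2024年
```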

    Regex for matching full-width Letters (zenkaku 全角)
    ([A-Za-z])
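
A single range running from full-width A to full-width z also covers the six full-width symbols that sit between Z and a (U+FF3B to U+FF40), the same pitfall as ASCII `[A-z]`. The Python sketch below compares that single range with two explicit ranges. The code points are written as escapes, and the variable names are illustrative.

```python
import re

# One range: full-width A (U+FF21) through full-width z (U+FF5A).
SINGLE_RANGE = re.compile("[\uff21-\uff5a]")

# Two ranges: full-width A-Z (U+FF21-U+FF3A) and a-z (U+FF41-U+FF5A).
TWO_RANGES = re.compile("[\uff21-\uff3a\uff41-\uff5a]")

underscore = "\uff3f"  # full-width low line, located between Z and a

print(bool(SINGLE_RANGE.fullmatch(underscore)))  # True
print(bool(TWO_RANGES.fullmatch(underscore)))    # False
print(bool(TWO_RANGES.fullmatch("\uff21")))      # True
```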

    Regex for matching Hiragana codespace characters (includes non-phonetic characters)
    ([ぁ-ゞ])

    Regex for matching full-width (zenkaku) Katakana codespace characters (includes non-phonetic characters)
    ([ァ-ヶ])

    Regex for matching half-width (hankaku) Katakana codespace characters (this is a legacy character set, so its ordering differs from the Hiragana block)
    ([ヲ-゚])
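
A common follow-up to detecting half-width Katakana is converting it to full-width. The sketch below is one possible approach, not part of the original list: it assumes Python 3 and uses `unicodedata.normalize` with the NFKC form, applied only to runs matched by the half-width class so that other characters stay untouched. `widen_katakana` is a hypothetical helper name.

```python
import re
import unicodedata

# Runs of half-width Katakana, U+FF66 (ヲ) through U+FF9F (゚).
HALFWIDTH_RUN = re.compile("[\uff66-\uff9f]+")

def widen_katakana(text: str) -> str:
    """Normalize each half-width Katakana run to its full-width NFKC form."""
    return HALFWIDTH_RUN.sub(
        lambda m: unicodedata.normalize("NFKC", m.group()), text
    )

# Half-width ｶ + voiced mark ﾞ compose into the single full-width character ガ.
print(widen_katakana("ｶﾞｲﾄﾞ 123"))  # ガイド 123
```

Normalizing run by run keeps a base character and its trailing voiced or semi-voiced mark together, which is what lets NFKC compose them into one character.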

    Regex for matching Japanese Post Codes
    /^\d{3}\-\d{4}$/
    /^\d{3}-\d{4}$|^\d{3}-\d{2}$|^\d{3}$/
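
A minimal Python sketch of the 7-digit post code check, assuming the escapes are written with a backslash (`\d`), which is the form Python's `re` module expects. `is_postal_code` is a hypothetical helper name, and the `re.ASCII` flag is an added assumption about the desired behavior.

```python
import re

# 7-digit form NNN-NNNN, equivalent to the first pattern above.
# re.ASCII limits \d to 0-9; without it, Python's \d also matches
# other Unicode digits such as full-width digits.
POSTAL_CODE = re.compile(r"^\d{3}-\d{4}$", re.ASCII)

def is_postal_code(text: str) -> bool:
    """Return True when the whole string is a hyphenated 7-digit post code."""
    return POSTAL_CODE.fullmatch(text) is not None

print(is_postal_code("100-0001"))  # True
print(is_postal_code("1000001"))   # False, the hyphen is required
```

The second pattern in the entry additionally accepts the shorter `NNN-NN` and `NNN` forms.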

    Regex for matching Japanese mobile phone numbers (keitai bangou)
    /^\d{3}-\d{4}-\d{4}$|^\d{11}$/
    /^0\d0-\d{4}-\d{4}$/
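
The two mobile patterns differ in strictness. The Python sketch below runs both against a few made-up numbers, assuming backslash escapes and Python 3's `re` module. The constant names are illustrative.

```python
import re

# Looser pattern: any 3-4-4 digit grouping, or 11 digits with no hyphens.
MOBILE_LOOSE = re.compile(r"^\d{3}-\d{4}-\d{4}$|^\d{11}$", re.ASCII)

# Stricter pattern: prefix must look like 0X0 (for example 070, 080, 090),
# and the hyphens are mandatory.
MOBILE_0X0 = re.compile(r"^0\d0-\d{4}-\d{4}$", re.ASCII)

samples = ["090-1234-5678", "09012345678", "123-4567-8901"]  # made-up numbers
for number in samples:
    loose = MOBILE_LOOSE.fullmatch(number) is not None
    strict = MOBILE_0X0.fullmatch(number) is not None
    print(number, loose, strict)
```

All three samples pass the looser pattern, while only `090-1234-5678` passes the stricter one.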

    Regex for matching Japanese fixed line phone numbers
    /^[0-9-]{6,9}$|^[0-9-]{12}$/
    /^\d{1,4}-\d{4}$|^\d{2,5}-\d{1,4}-\d{4}$/
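
Neither fixed-line pattern is a complete validator, as the Python sketch below illustrates with made-up inputs. It assumes backslash escapes and Python 3's `re` module, and the constant names are illustrative.

```python
import re

# Looser pattern: 6-9 or exactly 12 characters drawn from digits and hyphens.
FIXED_LOOSE = re.compile(r"^[0-9-]{6,9}$|^[0-9-]{12}$")

# Grouped pattern: local number, or area code + exchange + 4-digit subscriber number.
FIXED_GROUPED = re.compile(r"^\d{1,4}-\d{4}$|^\d{2,5}-\d{1,4}-\d{4}$", re.ASCII)

# Made-up inputs, including a mobile-style number and a string of nine hyphens.
candidates = ["03-1234-5678", "0123-45-6789", "090-1234-5678", "-" * 9]
for number in candidates:
    loose = FIXED_LOOSE.fullmatch(number) is not None
    grouped = FIXED_GROUPED.fullmatch(number) is not None
    print(number, loose, grouped)
```

Both patterns accept the first two inputs. The grouped pattern also accepts the mobile-style `090-1234-5678`, and the looser one accepts a string made only of hyphens, so additional checks may be needed depending on how strict the validation should be.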