Actions
Bug #5685
closedOniguruma does not recognize U+30FC as Katakana
Description
The character U+30FC KATAKANA-HIRAGANA PROLONGED SOUND MARK (Japanese choonpu) belongs to the Unicode Katakana block (U+30A0-30FF), but it is not matched by /\p{Katakana}/. Demonstration:
"私のホバークラフトは鰻でいっぱいです".gsub(/(\p{Katakana}|\p{Hiragana}|\p{Han})+/, 'X')
=> "XーX"
In other words, all kana and kanji in that string except U+30FC are matched. And it really is 30FC/12540:
"私のホバークラフトは鰻でいっぱいです".gsub(/(\p{Katakana}|\p{Hiragana}|\p{Han})+/, '').unpack("U*")
=> [12540]
Also occurs in Ruby 1.8 with the Oniguruma library.
Actions
Like0
Like0