Project

General

Profile

Actions

Feature #3989

closed

Update Oniguruma for Unicode 6

Added by runpaint (Run Paint Run Run) over 13 years ago. Updated almost 13 years ago.

Status:
Closed
Assignee:
-
Target version:
-

Description

=begin
Unicode 6.0 has been released, so it would be preferable to update Oniguruma for Ruby 1.9.3. A patch is attached which updates enc/unicode/unicode/name2ctype.kwd, and fixes the instructions in tool/enc-unicode.rb. Given the scope of this update, I'd prefer this merged sooner rather than later so we have time to test it.

A tentative test follows:

U+20B9, INDIAN RUPEE SIGN, is a new codepoint in 6.0. It has the general category Sc.

/\p{sc}/u =~ "\u{20B9}" #=> 0

U+0B72, ORIYA FRACTION ONE QUARTER, is a new codepoint in the Oriya script.

/\p{oriya}/u =~ "\u{0b72}" #=> 0

U+0B77, ORIYA FRACTION THREE SIXTEENTHS, is the last codepoint in the Oriya script block.

/\p{oriya}/u =~ "\u{0b78}" #=> nil

U+FBC0, TAMIL VOWEL SIGN II, is a new codepoint in the Arabic Presentation Forms-A block.

/\p{arabic}/u =~ "\u{fbc0}" #=> 0

U+1F130, SQUARED LATIN CAPITAL LETTER A, is a new codepoint with a general category of So.

/\p{so}/u =~ "\u{1f130}" #=> 0

U+0847, MANDAIC LETTER IT, is a new codepoint in the Mandaic script. The Mandaic script is new.

/\p{mandaic}/u =~ "\u{0847}" #=> 0

(When this is complete, NEWS needs to be updated, too).
=end


Files

onig-u6.patch (731 KB) onig-u6.patch runpaint (Run Paint Run Run), 10/27/2010 09:20 AM
Actions

Also available in: Atom PDF

Like0
Like0Like0