Feature #10002: String swapcase - Ruby - Ruby Issue Tracking System

The current implementation of case conversion methods in String class only understands ASCII characters.
We'd like to enhance it when possible. But we have to know how each character should be converted.
For example, how should we convert "ß" (eszett)?

Matz.

Actions

Copy link

#3 [ruby-core:63480]

Updated by naruse (Yui NARUSE) almost 11 years ago

At this time, ffi-icu or twitter-text-rb is useful.

Actions

Copy link

#4 [ruby-core:63481]

Updated by davispuh (Dāvis Mosāns) almost 11 years ago

It have been already figured out by Unicode Standard, so just have to implement it. Look at Default Case Algorithms in section 3.13 and Case Mappings in section 5.18. Mappings can be viewed in SpecialCasing.txt (and UnicodeData.txt) also CaseFolding.txt could be useful.

From there "ß" (LATIN SMALL LETTER SHARP S) in uppercase would be "SS" (LATIN CAPITAL LETTER S) and it's user's responsibility to know that generally they are not reversible.

Also useful to read Character Properties, Case Mappings & Names FAQ

Actions

Copy link

#5 [ruby-core:63483]

Updated by zzak (zzak _) almost 11 years ago

We should delegate to @emboss everytime we need to convert ß...

Actions

Copy link

#6 [ruby-core:63517]

Updated by shyouhei (Shyouhei Urabe) almost 11 years ago

We are talking about swapcase, not folding. The "generally they are not reversible" you say is the difficulty we are facing here. Also as you cited CaseFolding.txt, you should have been aware of type T folding, which is impossible without locale information.

If you think you can implement it, please show us.

Dāvis Mosāns wrote:

It have been already figured out by Unicode Standard, so just have to implement it. Look at Default Case Algorithms in section 3.13 and Case Mappings in section 5.18. Mappings can be viewed in SpecialCasing.txt (and UnicodeData.txt) also CaseFolding.txt could be useful.

From there "ß" (LATIN SMALL LETTER SHARP S) in uppercase would be "SS" (LATIN CAPITAL LETTER S) and it's user's responsibility to know that generally they are not reversible.

Also useful to read Character Properties, Case Mappings & Names FAQ

Actions

Copy link

#7 [ruby-core:63970]

Updated by duerst (Martin Dürst) almost 11 years ago

Related to Feature #10085: Add non-ASCII case conversion to String#upcase/downcase/swapcase/capitalize added

Actions

Copy link

#8 [ruby-core:81773]

Updated by duerst (Martin Dürst) about 8 years ago

Status changed from Open to Closed

This has actually been implemented by Feature #10085, so it can be closed.

Actions

Copy link

Also available in: Atom PDF

Like0

Like0Like0Like0Like0Like0Like0Like0Like0

Project

General

Profile

Ruby

Tags

Custom queries

Feature #10002

String swapcase

Updated by nobu (Nobuyoshi Nakada) almost 11 years ago

Updated by matz (Yukihiro Matsumoto) almost 11 years ago

Updated by naruse (Yui NARUSE) almost 11 years ago

Updated by davispuh (Dāvis Mosāns) almost 11 years ago

Updated by zzak (zzak _) almost 11 years ago

Updated by shyouhei (Shyouhei Urabe) almost 11 years ago

Updated by duerst (Martin Dürst) almost 11 years ago

Updated by duerst (Martin Dürst) about 8 years ago