Feature #5877

closed

Poor performance of initial and final UTF-8 substrings

Added by nex3 (Natalie Weizenbaum) almost 13 years ago. Updated about 12 years ago.

Status:
Rejected
Assignee:
-
Target version:
-
[ruby-core:42049]

Description

Currently, taking the substring of a UTF-8 string always takes time proportional to the length of the string. While this is necessary in the general case, there are many common cases where it could be much faster, in particular when the substring is near the beginning or the end of the superstring. In those cases, the operation should take time proportional only to the length of the substring.

Gist with reproduction and more detailed analysis forthcoming.
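
The promised gist was never posted in this thread, so the following is only a minimal sketch of the kind of measurement the description implies, not the reporter's reproduction. The helper names `multibyte_string` and `time_slices`, the string sizes, and the iteration count are all assumptions chosen for illustration. It times character-indexed substrings at the beginning, middle, and end of UTF-8 strings of two different lengths; whether the timings actually grow with string length depends on the Ruby version and is not asserted here.

```ruby
require 'benchmark'

# Hypothetical helper (not from the report): a UTF-8 string made of
# `chars` two-byte characters, so character offsets differ from byte offsets.
def multibyte_string(chars)
  "\u00e9" * chars
end

# Hypothetical helper: seconds spent taking the same character-indexed
# substring `iterations` times.
def time_slices(str, start, length, iterations = 200)
  Benchmark.realtime do
    iterations.times { str[start, length] }
  end
end

# Sizes are arbitrary; compare how each column changes between the two rows.
[1_000, 100_000].each do |size|
  str    = multibyte_string(size)
  head   = time_slices(str, 0, 10)         # substring at the beginning
  middle = time_slices(str, size / 2, 10)  # substring in the middle
  tail   = time_slices(str, -10, 10)       # substring at the end
  puts format('%7d chars: head=%.6fs middle=%.6fs tail=%.6fs',
              size, head, middle, tail)
end
```

If the report's claim holds on a given interpreter, the `head` and `tail` columns would grow with the string size even though the substring length stays fixed at 10 characters.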

Updated by naruse (Yui NARUSE) almost 13 years ago

  • Status changed from Open to Feedback

Updated by kosaki (Motohiro KOSAKI) almost 13 years ago

You didn't include the gist URL, so I haven't caught your point.

Updated by cjheath (Clifford Heath) almost 13 years ago

On 20/01/2012, at 1:08 PM, Motohiro KOSAKI wrote:

Bug #5877: Poor performance of initial and final UTF-8 substrings
https://bugs.ruby-lang.org/issues/5877

Author: Nathan Weizenbaum
Status: Feedback
Priority: Normal
Assignee:
Category:
Target version:
ruby -v: ruby 1.9.3p0 (2011-10-30 revision 33570) [x86_64-linux]

Currently, taking the substring of a UTF-8 string always takes time proportional to the length of the string. While this is necessary in the general case, there are many common cases where it could be much faster, in particular when the substring is near the beginning or the end of the superstring. In those cases, the operation should take time proportional only to the length of the substring.

This discussion from ten years ago is relevant to performance workarounds for UTF-8:
http://blade.nagaokaut.ac.jp/cgi-bin/vframe.rb/ruby/ruby-talk/45935?45702-47288

Clifford Heath.

Updated by mame (Yusuke Endoh) about 12 years ago

  • Tracker changed from Bug to Feature
  • Status changed from Feedback to Rejected
  • Priority changed from Normal to 3

Not a bug. "forthcoming" did not come.
Please try to make your proposal self-contained.

--
Yusuke Endoh
