Feature #8848: Syntax for binary strings - Ruby - Ruby Issue Tracking System

Actions

Copy link

Feature #8848

open

Syntax for binary strings

Feature #8848: Syntax for binary strings

Added by duerst (Martin Dürst) almost 13 years ago. Updated over 4 years ago.

Status:

Open

Assignee:

Target version:

[ruby-core:56924]

Description

In commit 37486, Yui (Naruse) added a String#b method as proposed in http://bugs.ruby-lang.org/issues/6767.

String#b was added to allow easy generation of binary strings; this became necessary in particular after the source file encoding was changed to UTF-8.

However, as also recognized in http://bugs.ruby-lang.org/issues/6767, in the long term (ideally starting with Ruby 2.1) it would be better to make binary strings available as part of Ruby syntax.

One reason for this efficiency. String#b creates a duplicate object, which is not at all necessary for the frequent use case of String literals.

Another reason is encoding validity. To be able to e.g. create a "\xFF" binary string, with String#b in an UTF-8 source context, it is necessary to allow "\xFF" (temporarily at least) as an (actually invalid) UTF-8 string. This may be difficult for some implementations, and isn't desirable in general.

Regarding syntax, there are mainly two solutions:

a '%b' prefix
a 'b' suffix

The preferable syntax depends on the overall future approach of Ruby to String literal suffixes (see https://bugs.ruby-lang.org/issues/8579).

Related issues 1 (1 open — 0 closed)

Updated by duerst (Martin Dürst) over 11 years ago Actions
Copy link
#1 [ruby-core:65924]

Related to Feature #10391: Provide %eISO-8859-1'string \xAA literal' string literals with explicit encoding added

Updated by normalperson (Eric Wong) over 11 years ago Actions
Copy link
#2 [ruby-core:65927]

duerst@it.aoyama.ac.jp wrote:

One reason for this efficiency. String#b creates a duplicate object,
which is not at all necessary for the frequent use case of String
literals.

Avoiding one allocation is easy to add to [Feature #10423]
(which avoids string literal allocations for many methods)

Another reason is encoding validity. To be able to e.g. create a
"\xFF" binary string, with String#b in an UTF-8 source context, it is
necessary to allow "\xFF" (temporarily at least) as an (actually
invalid) UTF-8 string. This may be difficult for some implementations,
and isn't desirable in general.

We can even go farther than #10423 and move the evaluation of
"string literal".{b,encode,force_encoding} to compile time.

The downside is compatibility with people who wish to override one of
those methods, but doubt anybody overrides those...
There's no new (and strange looking, IMHO) syntax to learn,
it looks like a normal method call, and the optimization would be
usable with existing code.

Updated by duerst (Martin Dürst) over 11 years ago Actions
Copy link
#3 [ruby-core:65938]

Eric Wong wrote:

We can even go farther than #10423 and move the evaluation of
"string literal".{b,encode,force_encoding} to compile time.

The downside is compatibility with people who wish to override one of
those methods, but doubt anybody overrides those...

Even if nobody overrides String#encode, they may configure it in various ways.

There's no new (and strange looking, IMHO) syntax to learn,
it looks like a normal method call, and the optimization would be
usable with existing code.

It's not enough to move evaluation to compile time. We may want to know the desired encoding before we start to parse the string. That by definition doesn't work when the method (or whatever) comes after the end of the literal.

Updated by hsbt (Hiroshi SHIBATA) over 4 years ago Actions
Copy link
#4

Project changed from 14 to Ruby

Actions

Copy link

Also available in: PDF Atom

Project

General

Profile

Ruby

Custom queries

Feature #8848

Syntax for binary strings

Updated by duerst (Martin Dürst) over 11 years ago Actions
Copy link
#1 [ruby-core:65924]

Updated by normalperson (Eric Wong) over 11 years ago Actions
Copy link
#2 [ruby-core:65927]

Updated by duerst (Martin Dürst) over 11 years ago Actions
Copy link
#3 [ruby-core:65938]

Updated by hsbt (Hiroshi SHIBATA) over 4 years ago Actions
Copy link
#4

Project

General

Profile

Ruby

Custom queries

Feature #8848

Syntax for binary strings

Updated by duerst (Martin Dürst) over 11 years ago ActionsCopy link #1 [ruby-core:65924]

Updated by normalperson (Eric Wong) over 11 years ago ActionsCopy link #2 [ruby-core:65927]

Updated by duerst (Martin Dürst) over 11 years ago ActionsCopy link #3 [ruby-core:65938]

Updated by hsbt (Hiroshi SHIBATA) over 4 years ago ActionsCopy link #4

Updated by duerst (Martin Dürst) over 11 years ago Actions
Copy link
#1 [ruby-core:65924]

Updated by normalperson (Eric Wong) over 11 years ago Actions
Copy link
#2 [ruby-core:65927]

Updated by duerst (Martin Dürst) over 11 years ago Actions
Copy link
#3 [ruby-core:65938]

Updated by hsbt (Hiroshi SHIBATA) over 4 years ago Actions
Copy link
#4