Feature #20878: A new C API to create a String by adopting a pointer: `rb_enc_str_adopt(const char *ptr, long len, long capa, rb_encoding *enc)` - Ruby - Ruby Issue Tracking System

Added by byroot (Jean Boussier) over 1 year ago. Updated over 1 year ago.

Status: Open

[ruby-core:119801]

Description

Context

A common use case when writing C extensions is to generate text or bytes into a buffer, and then return it
wrapped in a Ruby String. Examples include JSON.generate(obj) -> String and other format serializers,
compression libraries such as Zlib.deflate, and methods such as Time#strftime.

Current Solution

Work in a buffer and copy the result

The most common solution is to generate into a manually managed native buffer and, once generation is done,
call rb_str_new* to copy the result into memory managed by Ruby.

It works, but isn't very efficient because it causes an extra copy and an extra free().

In ruby/json macro-benchmarks, this represents around 5% of the time spent in JSON.generate.

static void fbuffer_free(FBuffer *fb)
{
    if (fb->ptr && fb->type == FBUFFER_HEAP_ALLOCATED) {
        ruby_xfree(fb->ptr);
    }
}

static VALUE fbuffer_to_s(FBuffer *fb)
{
    VALUE result = rb_utf8_str_new(FBUFFER_PTR(fb), FBUFFER_LEN(fb));
    fbuffer_free(fb);
    return result;
}

Work inside RString allocated memory

Another current approach is to allocate an RString using rb_str_buf_new
and write into it with functions such as rb_str_catf,
or to write past RString.len through RSTRING_PTR and then resize it with rb_str_set_len.

The downside of this approach is that it is inefficient in several ways: rb_str_set_len performs
numerous safety checks, computes the coderange, and writes the string terminator on every invocation.

Another major inefficiency is that this API makes it hard to control buffer
growth, so it can result in many more realloc() calls than managing the buffer manually.
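To make the point about growth control concrete, here is a minimal sketch of a manually managed buffer that doubles its capacity. It is plain C and does not touch the Ruby C API; `GrowBuf`, `growbuf_reserve`, `growbuf_append`, the 64-byte starting capacity and the doubling policy are all hypothetical choices made for illustration, and the `reallocs` counter exists only to make the cost visible.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical native buffer, loosely modeled on the FBuffer idea above.
 * Plain C only: nothing here calls into the Ruby C API. */
typedef struct {
    char *ptr;        /* heap storage, or NULL before the first append */
    size_t len;       /* bytes written so far */
    size_t capa;      /* bytes allocated */
    size_t reallocs;  /* number of realloc() calls, kept for illustration */
} GrowBuf;

/* Make room for `extra` more bytes, doubling the capacity when needed.
 * Returns 0 on success, -1 if the allocation failed.
 * Overflow checks are omitted to keep the sketch short. */
static int growbuf_reserve(GrowBuf *b, size_t extra)
{
    size_t needed = b->len + extra;
    if (needed <= b->capa) return 0;

    size_t capa = b->capa ? b->capa : 64;  /* assumed starting capacity */
    while (capa < needed) capa *= 2;

    char *p = realloc(b->ptr, capa);       /* realloc(NULL, n) acts like malloc(n) */
    if (!p) return -1;
    b->ptr = p;
    b->capa = capa;
    b->reallocs++;
    return 0;
}

/* Append `n` bytes from `src`. Returns 0 on success, -1 on allocation failure. */
static int growbuf_append(GrowBuf *b, const char *src, size_t n)
{
    assert(src != NULL || n == 0);
    if (n == 0) return 0;
    if (growbuf_reserve(b, n) != 0) return -1;
    memcpy(b->ptr + b->len, src, n);
    b->len += n;
    return 0;
}
```

With this policy the number of realloc() calls grows logarithmically with the final size, and the extension author is free to pick a different starting capacity or growth factor for a given workload.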

This method is used by Kernel#sprintf, Time#strftime, etc. When I attempted to improve Time#strftime
performance, this problem showed up as the biggest bottleneck.

Proposed API

I think a more efficient way to do this would be to work with a native buffer, and then build an RString
that "adopts" the memory region.

Technically, you can currently do this by reaching directly into RString members, but I don't think it's clean,
and a dedicated API would be preferable:

/**
 * Similar to rb_str_new(), but it adopts the pointer instead of copying.
 *
 * @param[in]  ptr             A memory region of `capa` bytes length. MUST have been allocated with `ruby_xmalloc`
 * @param[in]  len             Length  of the string,  in bytes,  not including  the
 *                             terminating NUL character, not including extra capacity.
 * @param[in]  capa            The usable length of `ptr`, in bytes,  including  the
 *                             terminating NUL character.
 * @param[in]  enc             Encoding of `ptr`.
 * @exception  rb_eArgError    `len` is negative.
 * @return     An instance  of ::rb_cString,  of `len`  bytes length, `capa - 1` bytes capacity,
 *             and of `enc` encoding.
 * @pre        At  least  `capa` bytes  of  continuous  memory region  shall  be
 *             accessible via `ptr`.
 * @pre        `ptr` MUST have been allocated with `ruby_xmalloc`.
 * @pre        `ptr` MUST NOT be manually freed after `rb_enc_str_adopt` has been called.
 * @note       `enc` can be a  null pointer.  It can also be  seen as a routine
 *             identical to rb_usascii_str_new() then.
 */
VALUE rb_enc_str_adopt(const char *ptr, long len, long capa, rb_encoding *enc);

An alternative to the term "adopt" could be "move".
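The ownership contract in the proposed doc comment can be modeled in plain C to show how adopting differs from copying. This is only a sketch under stated assumptions: `ModelStr` and the `model_str_*` functions are hypothetical stand-ins, not Ruby's RString or any existing API, and plain `malloc`/`free` stand in for `ruby_xmalloc`/`ruby_xfree`. The model writes the NUL terminator itself; whether the proposed function would do so is not stated in the proposal.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a heap-backed string object. This is NOT
 * Ruby's RString; it only models who owns the bytes. */
typedef struct {
    char *ptr;   /* owned storage, NUL-terminated */
    long len;    /* string length in bytes, excluding the NUL */
    long capa;   /* usable capacity in bytes, excluding the NUL */
} ModelStr;

/* Copying constructor, analogous in spirit to rb_str_new(): the caller
 * keeps ownership of `src` and still has to free it afterwards. */
static int model_str_new_copy(ModelStr *out, const char *src, long len)
{
    assert(src != NULL);
    if (len < 0) return -1;
    char *p = malloc((size_t)len + 1);
    if (!p) return -1;
    memcpy(p, src, (size_t)len);
    p[len] = '\0';
    out->ptr = p;
    out->len = len;
    out->capa = len;
    return 0;
}

/* Adopting constructor, following the proposed contract: `capa` counts
 * the terminating NUL, so `len` must be strictly smaller than `capa`.
 * On success the model owns `ptr` and the caller must not free it. */
static int model_str_adopt(ModelStr *out, char *ptr, long len, long capa)
{
    assert(ptr != NULL);
    if (len < 0 || capa <= len) return -1;
    ptr[len] = '\0';          /* assumption: the terminator is written here */
    out->ptr = ptr;           /* no allocation, no copy */
    out->len = len;
    out->capa = capa - 1;     /* mirrors "`capa - 1` bytes capacity" */
    return 0;
}

/* Release the storage owned by the model string. */
static void model_str_free(ModelStr *s)
{
    free(s->ptr);
    s->ptr = NULL;
    s->len = 0;
    s->capa = 0;
}
```

The copying path costs one allocation, one memcpy and, later, one extra free of the source buffer; the adopting path costs none of these, which is the saving the proposal is after.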

Files

Capture d’écran 2024-12-11 à 11.03.08.png (250 KB)

byroot (Jean Boussier), 12/11/2024 10:03 AM

Related issues 1 (1 open — 0 closed)

Updated by Eregon (Benoit Daloze) over 1 year ago #1 [ruby-core:119813]
Updated by byroot (Jean Boussier) over 1 year ago #2 [ruby-core:119815]
Updated by nobu (Nobuyoshi Nakada) over 1 year ago #3 [ruby-core:119816]
Updated by byroot (Jean Boussier) over 1 year ago #4 [ruby-core:119819]
Updated by shyouhei (Shyouhei Urabe) over 1 year ago #5 [ruby-core:119828]
Updated by nobu (Nobuyoshi Nakada) over 1 year ago #6 [ruby-core:119830]
Updated by byroot (Jean Boussier) over 1 year ago #7 [ruby-core:119834]
Updated by shyouhei (Shyouhei Urabe) over 1 year ago #8 [ruby-core:119835]
Updated by rhenium (Kazuki Yamaguchi) over 1 year ago #9 [ruby-core:119836]
Updated by byroot (Jean Boussier) over 1 year ago #10 [ruby-core:119840]
Updated by kddnewton (Kevin Newton) over 1 year ago #11 [ruby-core:119847]
Updated by mdalessio (Mike Dalessio) over 1 year ago #12 [ruby-core:119848]
Updated by byroot (Jean Boussier) over 1 year ago #13 [ruby-core:119982]
Updated by nobu (Nobuyoshi Nakada) over 1 year ago #14 [ruby-core:119989]
Updated by byroot (Jean Boussier) over 1 year ago #15 [ruby-core:119990]
Updated by nobu (Nobuyoshi Nakada) over 1 year ago #16 [ruby-core:120148]
Updated by byroot (Jean Boussier) over 1 year ago #17 [ruby-core:120152]
Updated by nobu (Nobuyoshi Nakada) over 1 year ago #18 [ruby-core:120170]
Updated by byroot (Jean Boussier) over 1 year ago · Edited #19 [ruby-core:120175]
Updated by nobu (Nobuyoshi Nakada) over 1 year ago #20 [ruby-core:120197]
Updated by byroot (Jean Boussier) over 1 year ago #21 [ruby-core:120202]
Updated by mame (Yusuke Endoh) over 1 year ago #22 [ruby-core:120206]
Updated by byroot (Jean Boussier) over 1 year ago #23 [ruby-core:120208]
Updated by byroot (Jean Boussier) over 1 year ago #24 [ruby-core:120216]
Updated by byroot (Jean Boussier) over 1 year ago #25 [ruby-core:120220]
Updated by rhenium (Kazuki Yamaguchi) over 1 year ago #26 [ruby-core:120228]
Updated by byroot (Jean Boussier) over 1 year ago #27 [ruby-core:120229]
Updated by shyouhei (Shyouhei Urabe) over 1 year ago #28 [ruby-core:120292]
Updated by mame (Yusuke Endoh) over 1 year ago #29 [ruby-core:120516]
Updated by byroot (Jean Boussier) over 1 year ago #30 [ruby-core:120518]
Updated by mdalessio (Mike Dalessio) over 1 year ago · Edited #31 [ruby-core:120522]
Updated by byroot (Jean Boussier) 10 days ago #32
