Bug #13343: Improve Hash#merge performance - Ruby - Ruby Issue Tracking System

Actions

Copy link

Bug #13343

closed

Improve Hash#merge performance

Bug #13343: Improve Hash#merge performance

Added by watson1978 (Shizuo Fujita) almost 9 years ago. Updated over 8 years ago.

Status:

Closed

Assignee:

Target version:

ruby -v:

Backport:

2.2: UNKNOWN, 2.3: UNKNOWN, 2.4: UNKNOWN

[ruby-dev:50026]

Description

Hash#merge will be faster around 60%.

Before¶

                 user     system      total        real
Hash#merge   0.160000   0.020000   0.180000 (  0.182357)

After¶

                 user     system      total        real
Hash#merge   0.110000   0.010000   0.120000 (  0.114404)

Test code¶

require 'benchmark'

Benchmark.bmbm do |x|
  hash1 = {}
  100.times { |i| hash1[i.to_s] = i }
  hash2 = {}
  100.times { |i| hash2[(i*2).to_s] = i*2 }

  x.report "Hash#merge" do
    10000.times do
      hash1.merge(hash2)
    end
  end
end

Patch¶

The patch is in https://github.com/ruby/ruby/pull/1533

Updated by normalperson (Eric Wong) almost 9 years ago Actions
Copy link
#1 [ruby-dev:50046]

watson1978@gmail.com wrote:

https://bugs.ruby-lang.org/issues/13343 ¶

Hash#merge will be faster around 60%.

+Cc ruby-core, since your post was English (and I don't read Japanese)

This is promising!

The patch is in https://github.com/ruby/ruby/pull/1533

We need to check for redefinition of initialize_dup and
initialize_copy methods in Hash for this to be correct.

Unfortunately for people optimizing Ruby, corner-case
redefinition checks are probably necessary :<

Also, I wonder if we can improve rb_funcall to better support
inline caching. rb_funcall API is also bad since it cannot use
inline cache for method lookup. Maybe a better C API can be
introduced for faster function calls from C.

Note: I checked commit c5d74afdb4cfea2a4c9ff432d9da82f0649a1e67
by having a "fetch = +refs/pull/:refs/remotes/ruby/pull/"
line in a "remote" section of my .git/config. I did not
use any proprietary API or JavaScript to view your changes.

Updated by normalperson (Eric Wong) almost 9 years ago Actions
Copy link
#2 [ruby-core:80367]

watson1978@gmail.com wrote:

https://bugs.ruby-lang.org/issues/13343 ¶

Hash#merge will be faster around 60%.

+Cc ruby-core, since your post was English (and I don't read Japanese)

This is promising!

The patch is in https://github.com/ruby/ruby/pull/1533

We need to check for redefinition of initialize_dup and
initialize_copy methods in Hash for this to be correct.

Unfortunately for people optimizing Ruby, corner-case
redefinition checks are probably necessary :<

Updated by watson1978 (Shizuo Fujita) almost 9 years ago Actions
Copy link
#3 [ruby-dev:50047]

I followed the behavior of Array's methods such as

VALUE
rb_ary_sort(VALUE ary)
{
    ary = rb_ary_dup(ary);

It does not check whether initialize_dup/initialize_copy were overridden.

Updated by watson1978 (Shizuo Fujita) over 8 years ago Actions
Copy link
#4

Status changed from Open to Closed

Applied in changeset trunk|r58811.

Improve Hash#merge performance

hash.c (rb_hash_merge): use rb_hash_dup() instead of rb_obj_dup() to duplicate
Hash object. rb_hash_dup() is faster duplicating function for Hash object
which got rid of Hash#initialize_dup method calling.

Hash#merge will be faster around 60%.
[ruby-dev:50026] [Bug #13343] [Fix GH-1533]

Before¶

             user     system      total        real

Hash#merge 0.160000 0.020000 0.180000 ( 0.182357)

After¶

             user     system      total        real

Hash#merge 0.110000 0.010000 0.120000 ( 0.114404)

Test code¶

require 'benchmark'

Benchmark.bmbm do |x|
hash1 = {}
100.times { |i| hash1[i.to_s] = i }
hash2 = {}
100.times { |i| hash2[(i2).to_s] = i2 }

x.report "Hash#merge" do
10000.times do
hash1.merge(hash2)
end
end
end

Actions

Copy link

Also available in: PDF Atom

Project

General

Profile

Ruby

Custom queries

Bug #13343

Improve Hash#merge performance

Before¶

After¶

Test code¶

Patch¶

Updated by normalperson (Eric Wong) almost 9 years ago Actions
Copy link
#1 [ruby-dev:50046]

https://bugs.ruby-lang.org/issues/13343 ¶

Updated by normalperson (Eric Wong) almost 9 years ago Actions
Copy link
#2 [ruby-core:80367]

https://bugs.ruby-lang.org/issues/13343 ¶

Updated by watson1978 (Shizuo Fujita) almost 9 years ago Actions
Copy link
#3 [ruby-dev:50047]

Updated by watson1978 (Shizuo Fujita) over 8 years ago Actions
Copy link
#4

Before¶

After¶

Test code¶

Project

General

Profile

Ruby

Custom queries

Bug #13343

Improve Hash#merge performance

Before¶

After¶

Test code¶

Patch¶

Updated by normalperson (Eric Wong) almost 9 years ago ActionsCopy link #1 [ruby-dev:50046]

https://bugs.ruby-lang.org/issues/13343¶

Updated by normalperson (Eric Wong) almost 9 years ago ActionsCopy link #2 [ruby-core:80367]

https://bugs.ruby-lang.org/issues/13343¶

Updated by watson1978 (Shizuo Fujita) almost 9 years ago ActionsCopy link #3 [ruby-dev:50047]

Updated by watson1978 (Shizuo Fujita) over 8 years ago ActionsCopy link #4

Before¶

After¶

Test code¶

Updated by normalperson (Eric Wong) almost 9 years ago Actions
Copy link
#1 [ruby-dev:50046]

https://bugs.ruby-lang.org/issues/13343 ¶

Updated by normalperson (Eric Wong) almost 9 years ago Actions
Copy link
#2 [ruby-core:80367]

https://bugs.ruby-lang.org/issues/13343 ¶

Updated by watson1978 (Shizuo Fujita) almost 9 years ago Actions
Copy link
#3 [ruby-dev:50047]

Updated by watson1978 (Shizuo Fujita) over 8 years ago Actions
Copy link
#4