Bug #20462
openNative threads are no longer reused
Description
Ruby used to reuse native threads in order to amortize the cost of making a pthread.
For example this program:
ntids = 1000.times.map {
Thread.new {
Thread.current.native_thread_id
}.value
}
p ntids.uniq.length
With Ruby 3.2.0, this would return 1. With Ruby 3.3.x, it returns 1000. It means we cannot amortize the cost of a pthread for short lived threads.
I was able to bisect this to commit be1bbd5b7d40ad863ab35097765d3754726bbd54. But the change is big so I don't know how to fix it.
Updated by jhawthorn (John Hawthorn) 9 months ago
In a benchmark of thread creation we see that 3.3 is slower than 3.2
jhawthorn@zergling:~ [ruby 3.2.2]
$ time ruby --disable-gems -e '100_000.times { Thread.new{}.join }'
ruby --disable-gems -e '100_000.times { Thread.new{}.join }' 0.73s user 1.16s system 107% cpu 1.751 total
jhawthorn@zergling:~ [ruby 3.3.1]
$ time ruby --disable-gems -e '100_000.times { Thread.new{}.join }'
ruby --disable-gems -e '100_000.times { Thread.new{}.join }' 0.55s user 1.67s system 70% cpu 3.126 total
Updated by maciej.mensfeld (Maciej Mensfeld) 7 months ago
I also believe it can multiple any potential issues with resources leakage. Not sure yet what is causing this but working with a Rails user that sees this:
I wrote a little script that creates 100k threads and can report 3.3.3 allocates way more memory than 3.2.x
3.2.2 - Memory allocated: 49603264 bytes
3.3.3 - Memory allocated: 104926404080 bytes
Updated by byroot (Jean Boussier) 7 months ago ยท Edited
- Backport changed from 3.1: UNKNOWN, 3.2: UNKNOWN, 3.3: UNKNOWN to 3.1: DONTNEED, 3.2: DONTNEED, 3.3: UNKNOWN
be1bbd5b7d40ad863ab35097765d3754726bbd54 very explicitly removes the cache. What's unclear is whether it was intended, or if the intent was to reimplement it differently and it was forgotten.
@ko1 (Koichi Sasada) could you let us know what the intent was?
Updated by maciej.mensfeld (Maciej Mensfeld) 7 months ago
Not sure if related but here's a benchmark of the impact (written by Gordon Chan):
#!/usr/bin/env ruby
# Disable GC during the benchmark
GC.disable
# Get the current memory usage
mem_before = GC.stat[:malloc_increase_bytes]
obj_before = GC.stat[:total_allocated_objects]
# Perform the operation you want to benchmark
result = 100_000.times { Thread.new{}.join }
# Get the new memory usage and object count
mem_after = GC.stat[:malloc_increase_bytes]
obj_after = GC.stat[:total_allocated_objects]
# Re-enable GC
GC.enable
GC.stress
sleep(5)
# Calculate the memory and object allocation deltas
mem_delta = mem_after - mem_before
obj_delta = obj_after - obj_before
puts "Memory allocated: #{mem_delta} bytes"
puts "Objects allocated: #{obj_delta}"
and the difference:
# 3.1.3
Memory allocated: 46402080 bytes
Objects allocated: 700009
# 3.2.3
Memory allocated: 49603216 bytes
Objects allocated: 700009
# 3.3.3
Memory allocated: 104926404048 bytes
Objects allocated: 700010
3.1 and 3.2 are consistent, while 3.3 is as you can see above (2000x).
Updated by byroot (Jean Boussier) 7 months ago
Not sure if related
It is. The thread cache was to reuse short lived thread and avoid all these allocations etc. Note however that it wasn't enabled on all platforms, so relying on it never really was a good idea.
Updated by maciej.mensfeld (Maciej Mensfeld) 7 months ago
byroot (Jean Boussier) wrote in #note-5:
Not sure if related
It is. The thread cache was to reuse short lived thread and avoid all these allocations etc. Note however that it wasn't enabled on all platforms, so relying on it never really was a good idea.
I myself do not rely on this explicitly. It came up during a deeper investigation related to high memory usage / potential memory leaks of 3.3.x. We've been postponing the deployment of 3.3 due to much higher memory usage than 3.2 and this is one of the things that came up. As of now I am trying to identify short-lived thread usage amongst other Ruby OSS gems.
Updated by byroot (Jean Boussier) 7 months ago
- Assignee set to ko1 (Koichi Sasada)
@ko1 (Koichi Sasada) told me it was removed intentionally, but I don't have much more details, so I'll defer to him on what to do about this issue.
Updated by hsbt (Hiroshi SHIBATA) 7 months ago
- Status changed from Open to Assigned