Bug #5258
closed
SizedQueueにBug #5195と同様のバグ
Added by Glass_saga (Masaki Matsushita) over 13 years ago.
Updated over 12 years ago.
Description
=begin
[Bug #5195][ruby-dev:44400]と同様ですが、SizedQueue#pushでsleepしているthreadをwakeupさせると、SizedQueueの@queue_waitにそのthreadがpushされてしまいます。
require 'thread'
sq = SizedQueue.new(1)
sq.push(0)
t1 = Thread.start { sq.push(1) ; sleep }
nil until t1.stop?
t1.wakeup
nil until t1.stop?
t2 = Thread.start { sq.push(2) }
nil until t1.stop? && t2.stop?
p t1, t2
sq.instance_eval{ p @queue_wait }
3.times{ sq.pop }
t2.join
上記のコードを実行すると、
#<Thread:0x00000000b1a198 sleep>
#<Thread:0x00000000b1a120 sleep>
[#<Thread:0x00000000b1a198 sleep>, #<Thread:0x00000000b1a198 sleep>, #<Thread:0x00000000b1a120 sleep>]
/usr/local/lib/ruby/1.9.1/thread.rb:185:in sleep': deadlock detected (fatal) from /usr/local/lib/ruby/1.9.1/thread.rb:185:in
block in pop'
from internal:prelude:10:in synchronize' from /usr/local/lib/ruby/1.9.1/thread.rb:180:in
pop'
from /usr/local/lib/ruby/1.9.1/thread.rb:324:in pop' from sized_queue.rb:19:in
block in '
from sized_queue.rb:19:in times' from sized_queue.rb:19:in
'
となります。
場当たり的なものですが[Bug #5195][ruby-dev:44400]の時と同様なpatchを添付します。適用後もtest/thread/test_queue.rbをパスします。
=end
Files
- Status changed from Open to Assigned
- Assignee set to kosaki (Motohiro KOSAKI)
別件の修正中にパッチを書いていたのですが、このチケットと同じ問題だと気がつきました。
#5195 のぶんも同じですがパッチを添付します。
wakeup や例外で抜けた時に @waiting に残ってしまうと次の Queue#push の時に意図せず Thread#wakeup が呼ばれてしまったり、実際に待っている Thread に wakeup が呼ばれなくなったりするので、残さないようにしたほうが良いと思います。
This is also present in 1.9.3 and there is an even simpler test case:
$ ruby19 -r thread -e 'q=SizedQueue.new 10;1_000_000.times {|i| p i;q.enq i}'
0
1
2
3
4
5
6
7
8
9
10
/opt/lib/ruby/1.9.1/thread.rb:301:in sleep': deadlock detected (fatal) from /opt/lib/ruby/1.9.1/thread.rb:301:in
block in push'
from internal:prelude:10:in synchronize' from /opt/lib/ruby/1.9.1/thread.rb:297:in
push'
from -e:1:in block in <main>' from -e:1:in
times'
from -e:1:in `'
rklemme@padrklemme2 /cygdrive/c/SCMws/RKlemme/JavaProducts_oslee_ngcp_dev_R3.3_be4rb
$ ruby19 -v
ruby 1.9.3p125 (2012-02-16) [i386-cygwin]
It seems natural because the only thread is about to sleep.
nobu (Nobuyoshi Nakada) wrote:
It seems natural because the only thread is about to sleep.
I would expect the thread to block indefinitely. Signalling an error here seems to try to be too smart. Even in absence of other threads I can imagine conditions under which data is read from the queue (for example a signal handler).
But even if an error is signaled here, it is certainly not a deadlock - for that you need at least two threads. This is rather something like "only blocking thread is suspended (with no chance to wake up)". Maybe that error should be controllable via a switch, e.g.
Thread.single_thread_block_is_error = true # default false
Hello,
2012/4/11 rklemme (Robert Klemme) shortcutter@googlemail.com:
nobu (Nobuyoshi Nakada) wrote:
It seems natural because the only thread is about to sleep.
I would expect the thread to block indefinitely. Signalling an error here seems to try to be too smart. Even in absence of other threads I can imagine conditions under which data is read from the queue (for example a signal handler).
Interesting. The error is indeed "false positive." I did not noticed
signal handler.
But I'm not enthusiastic for your proposal. It is too conservative.
It will increase too many "false negative."
But even if an error is signaled here, it is certainly not a deadlock - for that you need at least two threads. This is rather something like "only blocking thread is suspended (with no chance to wake up)". Maybe that error should be controllable via a switch, e.g.
Thread.single_thread_block_is_error = true # default false
Do you have any practical case where you get bothered by the "false positive"?
If so, please open a feature request ticket with the use case.
--
Yusuke Endoh mame@tsg.ne.jp
Hi again,
mame (Yusuke Endoh) wrote:
2012/4/11 rklemme (Robert Klemme) shortcutter@googlemail.com:
nobu (Nobuyoshi Nakada) wrote:
It seems natural because the only thread is about to sleep.
I would expect the thread to block indefinitely. Signalling an error here seems to try to be too smart. Even in absence of other threads I can imagine conditions under which data is read from the queue (for example a signal handler).
Interesting. The error is indeed "false positive." I did not noticed
signal handler.
I was lucky because I was just sensitized by Eric's recent article about signal handling in Ruby. :-)
But I'm not enthusiastic for your proposal. It is too conservative.
It will increase too many "false negative."
You mean processes with no unblocked threads would go unnoticed? What about the effort for JRuby and others to mimic this behavior?
But even if an error is signaled here, it is certainly not a deadlock - for that you need at least two threads. This is rather something like "only blocking thread is suspended (with no chance to wake up)". Maybe that error should be controllable via a switch, e.g.
Thread.single_thread_block_is_error = true # default false
Do you have any practical case where you get bothered by the "false positive"?
If so, please open a feature request ticket with the use case.
Not at the moment. The case I thought I had evaporated because it was a "silent thread death" case (i.e. Thread.abort_on_exception was at default - false - and the only other thread had silently died where I thought it would still be running). But a better error message than "deadlock" as I have suggested above would have made me find the source of the issue much more quickly because it had hinted at the real problem.
I think I'll open a feature request to at least change the error message in case there is no other thread left. Btw, is this really a deadlock detection or does the code only check for blocked threads? From the code in check_deadlock_i() it is not totally obvious to me; I think the check only ensures at least one live thread - at least it does not seem to try to find the deadlock loop (as e.g. some relational databases do).
Kind regards
robert
- Status changed from Assigned to Closed
- % Done changed from 0 to 100
This issue was solved with changeset r36938.
Masaki, thank you for reporting this issue.
Your contribution to Ruby is greatly appreciated.
May Ruby be with you.
-
lib/thread.rb (Queue#pop): Fixed double registration issue when
mutex.sleep is interrupted. [Bug #5258] [ruby-dev:44448]
-
lib/thread.rb (SizedQueue#push): ditto.
-
test/thread/test_queue.rb (test_sized_queue_and_wakeup,
test_queue_pop_interrupt, test_sized_queue_pop_interrupt,
test_sized_queue_push_interrupt): new tests.
Also available in: Atom
PDF
Like0
Like0Like0Like0Like0Like0Like0Like0Like0