Project

General

Profile

Actions

Bug #9321

closed

rb_mod_const_missing does not generate a c-return event

Added by drkaes (Stefan Kaes) almost 11 years ago. Updated over 10 years ago.

Status:
Closed
Assignee:
-
Target version:
-
ruby -v:
ruby 2.1.0p1 (2013-12-26 revision 44431) [x86_64-darwin12.0]
[ruby-core:59398]

Description

We have had an issue reported for ruby-prof where execution times were attributed incorrectly in the call graph.

It turned out that the problem is caused by a missing c-return event for Module#const_missing.

ruby-prof simulates the ruby call stack by subscribing to line, call, c-call, return and c-return events.

Obviously, the missing return throws ruby-prof off the track.


Files

rb_mod_const_missing.patch (643 Bytes) rb_mod_const_missing.patch patch rb_const_missing to generate c-return event drkaes (Stefan Kaes), 12/30/2013 06:28 PM
rb_mod_const_missing_with_tests.patch (1.48 KB) rb_mod_const_missing_with_tests.patch drkaes (Stefan Kaes), 12/30/2013 08:05 PM
rb_mod_const_missing_with_tests_redefined.patch (3.01 KB) rb_mod_const_missing_with_tests_redefined.patch drkaes (Stefan Kaes), 12/31/2013 07:39 PM

Updated by drkaes (Stefan Kaes) almost 11 years ago

I believe the attached patch fixes the problem.

Updated by drkaes (Stefan Kaes) almost 11 years ago

I believe this bug also exists in 1.9.3 and 2.0.0.

Any chance to backport?

Updated by tmm1 (Aman Karmani) almost 11 years ago

Thanks for adding the test.

Seems like rb_frame_pop() should probably emit the event, but I'm not sure how to tell klass/mid in there.

BTW, doesn't this problem happen anytime an exception is raised? One or more stack frames can disappear without corresponding events.

Updated by drkaes (Stefan Kaes) almost 11 years ago

rb_frame_pop is called in only one place, and that's the one which the patch addresses.

So yes, we could move the code to rb_frame_pop(), if we change it's signature.

I'm not sure, under which circumstances the problem can show it's ugly head.

This breaks (without patch):

def foo; UNKNOWN_CONSTANT rescue nil; end

These don't:

def foo; 1/0 rescue nil; end
def bar; raise "shoot" rescue nil; end

Updated by drkaes (Stefan Kaes) almost 11 years ago

The patch has some issues with redefined Module#const_missing.

I'm in the process of preparing a new one.

Updated by drkaes (Stefan Kaes) almost 11 years ago

Here's the new patch. I have moved the hook to rb_frame_pop().

The new patch makes sure that the method names sent along with call and return events are identical.

But I'm not sure whether the current ruby behavior in the presence of aliased methods is really the desired behavior.

Looking forward to feedback on this topic.

Updated by ko1 (Koichi Sasada) almost 11 years ago

The last patch is not acceptable because of performance reason.

I'll think about it.

Updated by tmm1 (Aman Karmani) almost 11 years ago

I like the second patch, but now I'm not so sure about this technique.

Technically, const_missing never returns. So emitting a c_return event seems wrong. In fact, a raise event should be emitted from inside const_missing.

The "right" way to deal with this in ruby-prof would be to listen for the raise event and unwind the internal representation of the stack accordingly. Unfortunately, this is kind of tricky to implement.

Updated by tmm1 (Aman Karmani) almost 11 years ago

In rblineprof, I maintain a stack of CALL events. When a RETURN event comes in, CALL entries on the internal stack are popped until one matching the RETURN is found.

https://github.com/tmm1/rblineprof/blob/32795afc748f2f51cf76ce20ef6a85f50b9278c6/ext/rblineprof.c#L471-L475

Updated by drkaes (Stefan Kaes) almost 11 years ago

@ko1 (Koichi Sasada): Why do you think the second patch creates a performance problem?

Module#const_missing is an exceptional case. Am I missing something?

@tmm1 (Aman Karmani): the stack tracking approach breaks down if the method id reported on return by ruby does not match the one reported on call; which I have seen happening in the past.

Updated by drkaes (Stefan Kaes) almost 11 years ago

@tmm1's suspicion was right: c-return is also missing in the case when you try to call a nonexistent method.

It seems that c-return will be missing when raising an exception from C land.

Which means my patch doesn't really solve the underlying, more fundamental problem.

Updated by deivid (David Rodríguez) almost 11 years ago

But when raising an exception from ruby, return events are emitted... Shouldn't this 2 cases be consistent?

Updated by deivid (David Rodríguez) almost 11 years ago

In byebug, the problem is I don't manually mantain the call stack, but only keep track of the stack size through the TracePoint API events. The only time I load the call stack is through the DebugInspector API right before the debugger stops in order to make it available to the user.

If I was to implement the solution suggested by @tmm1 (Aman Karmani), how can I access the method_id for each frame from the C extension? Is the Thread::Backtrace::Locations API available to C extensions? Calling it through ruby for every c_return event will impact performance I guess...

Updated by tmm1 (Aman Karmani) almost 11 years ago

You can use rb_profile_frames() in 2.1, but it only reports ruby frames so it won't help identify C-call/return events.

You could also use TracePoint#method_id either from ruby or C, but @skaes reports the method ids sometimes don't match up. Maybe we can figure out why and fix it.

Updated by drkaes (Stefan Kaes) almost 11 years ago

I have extended the patch to also cover method_missing, by changing a single cfp pointer manipulation to a call of rb_frame_pop().

See https://github.com/skaes/rvm-patchsets/blob/master/patches/ruby/2.1.0/head/railsexpress/05-fix-missing-c-return-event.patch#L121

I have tested various other c functions raising exceptions. All of them work fine without the patches.

And my position is still that this is a ruby bug, which should be fixed instead of being worked around by all tool implementers.

The only thing stopping us from fixing it is @ko1's comment on performance impact, which I don't understand.

Updated by drkaes (Stefan Kaes) almost 11 years ago

The patch isn't ideal yet: in case of const-missing, the c-return event is generated too early. I know how to fix this. Will upload a modified patch later.

Updated by deivid (David Rodríguez) almost 11 years ago

@drkaes (Stefan Kaes) If this only happens in certain c-methods it certainly looks like a bug. Thanks for working on this.

Thanks @tmm1 (Aman Karmani), I guess I would have to store the method_id provided by the TracePoint API to be able to match it later.

Updated by ko1 (Koichi Sasada) almost 11 years ago

(2014/01/03 3:19), drkaes (Stefan Kaes) wrote:

@ko1 (Koichi Sasada): Why do you think the second patch creates a performance problem?

Module#const_missing is an exceptional case. Am I missing something?

Sorry for my misunderstanding.

Let me clarify the spec.

 ###
 n = 0
 TracePoint.new(){|tp|
   next if tp.event == :line
   case tp.event.to_s
   when /call/
     puts "#{' ' * 2 * n}#{tp.inspect}"
     n+=1
   when /return/
     n-=1 if n>0
     puts "#{' ' * 2 * n}#{tp.inspect}"
   else
     puts "#{' ' * 2 * n}#{tp.inspect}"
   end
 }.enable
 
 def f
   Object::XYZZY #=> const_missing
 end
 
 f
__END__
 #=>
 
 current behavior:
 
 #<TracePoint:c_return `enable'@test.rb:15>
 #<TracePoint:c_call `method_added'@test.rb:17>
 #<TracePoint:c_return `method_added'@test.rb:17>
 #<TracePoint:call `f'@test.rb:17>
   #<TracePoint:c_call `const_missing'@test.rb:18>
     #<TracePoint:c_call `initialize'@test.rb:18>
       #<TracePoint:c_call `initialize'@test.rb:18>
       #<TracePoint:c_return `initialize'@test.rb:18>
     #<TracePoint:c_return `initialize'@test.rb:18>
     #<TracePoint:c_call `exception'@test.rb:18>
     #<TracePoint:c_return `exception'@test.rb:18>
     #<TracePoint:c_call `backtrace'@test.rb:18>
     #<TracePoint:c_return `backtrace'@test.rb:18>
     #<TracePoint:raise@test.rb:18>
   #<TracePoint:return `f'@test.rb:18>
 
 expected behavior:
 
 #<TracePoint:c_return `enable'@test.rb:15>
 #<TracePoint:c_call `method_added'@test.rb:17>
 #<TracePoint:c_return `method_added'@test.rb:17>
 #<TracePoint:call `f'@test.rb:17>
   #<TracePoint:c_call `const_missing'@test.rb:18>
   #<TracePoint:c_return `const_missing'@test.rb:18> <-- NEW!
   #<TracePoint:c_call `initialize'@test.rb:18>
     #<TracePoint:c_call `initialize'@test.rb:18>
     #<TracePoint:c_return `initialize'@test.rb:18>
   #<TracePoint:c_return `initialize'@test.rb:18>
   #<TracePoint:c_call `exception'@test.rb:18>
   #<TracePoint:c_return `exception'@test.rb:18>
   #<TracePoint:c_call `backtrace'@test.rb:18>
   #<TracePoint:c_return `backtrace'@test.rb:18>
   #<TracePoint:raise@test.rb:18>
 #<TracePoint:return `f'@test.rb:18>

is it right?

This behavior is similar to "raise" behavior.

--
// SASADA Koichi at atdot dot net

Updated by ko1 (Koichi Sasada) almost 11 years ago

I modify the patch and all tests passes.
http://www.atdot.net/sp/raw/6ny0zm

If we can accept the spec of this behavior (order of tracing events), I
will commit it.

--
// SASADA Koichi at atdot dot net

Updated by drkaes (Stefan Kaes) almost 11 years ago

The order of trace events is acceptable.

It's not ideal though: I think one would expect the events for creating the exception and attaching the backtrace to it to happen inside const_missing/method_missing.

But it seems to me this would require more complex code changes.

Thx a lot for fixing this.

Is it possible to backport this to 2.0 and 1.9.3?

Updated by deivid (David Rodríguez) almost 11 years ago

It sounds good to me. Any behavior with a balanced count of call and return events work.

Big thanks @ko1 (Koichi Sasada).

Updated by drkaes (Stefan Kaes) almost 11 years ago

Compiling with clang produces warnings:

in file included from ./include/ruby/ruby.h:1694:
./include/ruby/intern.h:955:40: warning: unknown attribute 'warning' ignored [-Wattributes]
void rb_frame_pop(void) attribute((warning("rb_frame_pop is obsolete. It will be deleted after Ruby 2.2.0.")));;

You probably need to change the method signature to:

DEPRECATED(void rb_frame_pop(void));

Actions #25

Updated by ko1 (Koichi Sasada) almost 11 years ago

  • Status changed from Open to Closed
  • % Done changed from 0 to 100

This issue was solved with changeset r44535.
Stefan, thank you for reporting this issue.
Your contribution to Ruby is greatly appreciated.
May Ruby be with you.


  • vm.c (rb_vm_pop_cfunc_frame): added. It cares c_return event.
    The patch base by drkaes (Stefan Kaes).
    [Bug #9321]
  • variable.c (rb_mod_const_missing): use rb_vm_pop_cfunc_frame()
    instead of rb_frame_pop().
  • vm_eval.c (raise_method_missing): ditto.
  • vm_eval.c (rb_iterate): ditto.
  • internal.h (rb_vm_pop_cfunc_frame): add decl.
  • test/ruby/test_settracefunc.rb: add tests.
    provided by drkaes (Stefan Kaes).
  • vm.c, eval.c, include/ruby/intern.h (rb_frame_pop):
    move definition of rb_frame_pop() and deprecate it.
    It doesn't care about `return' events.

Updated by deivid (David Rodríguez) over 10 years ago

Hi @koichi, I'm still running into this issue... :( I really don't know what's going on, because the tests pass in ruby's test suite, but the c_return event is not generated when I use the TracePoint API.

require 'minitest/autorun'

class TestTracePoint < Minitest::Test
  def test_method_missing
    events = []
    assert !respond_to?(:missing_method_59398)
    TracePoint.new(:c_call, :c_return, :call, :return) { |tp|
      events << [tp.event, tp.method_id] if tp.method_id == :method_missing
    }.enable {
      missing_method_59398 rescue nil
    }
    assert_equal([[:c_call, :method_missing], [:c_return, :method_missing]], events)
  end
end

$ ruby --version
ruby 2.1.1p76 (2014-02-24 revision 45161) [i686-linux]

$ ruby test_tracepoint.rb 
Run options: --seed 31045

# Running:

F

Finished in 0.035420s, 28.2322 runs/s, 56.4645 assertions/s.

  1) Failure:
TestTracePoint#test_method_missing [test_tracepoint.rb:12]:
--- expected
+++ actual
@@ -1 +1 @@
-[[:c_call, :method_missing], [:c_return, :method_missing]]
+[[:c_call, :method_missing]]


1 runs, 2 assertions, 1 failures, 0 errors, 0 skips

Also I have a couple of failing tests in byebug's test suite because of this. https://travis-ci.org/deivid-rodriguez/byebug/jobs/21874423

Thanks a lot.

Updated by ko1 (Koichi Sasada) over 10 years ago

  • Status changed from Closed to Open

Updated by ko1 (Koichi Sasada) over 10 years ago

I check with test/unit (modified as follow) and I got no error. Do I miss anything?

require 'test/unit'

class TestTracePoint < Test::Unit::TestCase
  def test_method_missing
    events = []
    assert !respond_to?(:missing_method_59398)
    TracePoint.new(:c_call, :c_return, :call, :return) { |tp|
      events << [tp.event, tp.method_id] if tp.method_id == :method_missing
    }.enable {
      missing_method_59398 rescue nil
    }

    assert_equal([[:c_call, :method_missing], [:c_return, :method_missing]], events)
  end
end

And I got

1 tests, 2 assertions, 0 failures, 0 errors, 0 skips

ruby -v: ruby 2.2.0dev (2014-03-31 trunk 45486) [x86_64-linux]

Thanks,
Koichi

Updated by deivid (David Rodríguez) over 10 years ago

no no I was missing something.... :( I thought this bug fix was included in 2.1 but it's not. Last master works fine. I'm sorry for the confusion. And thanks a lot for having a look so quick!

Can I ask for this to be backported to 2.0 and 2.1?

Thanks!

Updated by ko1 (Koichi Sasada) over 10 years ago

Ah, I see.

I think it should be backported, but it changes a behavior.

@naruse-san, can we backport it?

Updated by naruse (Yui NARUSE) over 10 years ago

  • Status changed from Open to Closed
  • Backport changed from 1.9.3: UNKNOWN, 2.0.0: UNKNOWN, 2.1: UNKNOWN to 2.0.0: REQUIRED, 2.1: REQUIRED

ささださん曰く、

const_missing が呼ばれると、c_call は飛ばれるのに c_return が呼ばれない、ので、デバッガ作るときに困るので、
c_return 呼ばれるようにした。
これに依存した挙動は無いんじゃ無いかと思うのだが
とのこと。
具体的に結果がどう変わるかは
https://bugs.ruby-lang.org/issues/9321#note-20

Updated by ko1 (Koichi Sasada) over 10 years ago

This issues is available on 2.1 and 2.0.

This is possible that some applications implement workaround for this issue. This patch can breaks such applications. However, I'm not sure such applications exist (maybe don't exist).

Updated by deivid (David Rodríguez) over 10 years ago

As far as I'm concerned, this changes an (incorrect) behaviour and it should be backported. In byebug's case, having a balanced count of call and return events is critical and directly affects usability of the debugger. And when I thought of implementing a workaround inside byebug's didn't seem like an easy thing to do...

Stefan Kaes, did you implement a workaround for this in ruby-prof?

Cheers!

Updated by deivid (David Rodríguez) over 10 years ago

Thanks Koichi, appreciated. :)

Updated by deivid (David Rodríguez) over 10 years ago

Is there an official way to request a backport? Thanks!

Updated by nagachika (Tomoyuki Chikanaga) over 10 years ago

I agree that this should be backported to 2.1 branch.
Is there any opposite opinion?

Updated by deivid (David Rodríguez) over 10 years ago

Thanks for following up Tomoyuki!

Actions #39

Updated by nagachika (Tomoyuki Chikanaga) over 10 years ago

  • Backport changed from 2.0.0: REQUIRED, 2.1: REQUIRED to 2.0.0: REQUIRED, 2.1: DONE

Backported into ruby_2_1 branch at r46608.

Updated by usa (Usaku NAKAMURA) over 10 years ago

backported into ruby_2_0_0 at r46671.

Updated by usa (Usaku NAKAMURA) over 10 years ago

  • Backport changed from 2.0.0: REQUIRED, 2.1: DONE to 2.0.0: DONE, 2.1: DONE
Actions

Also available in: Atom PDF

Like0
Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0Like0