Bug #12100
closedCSV converters fail when using single arg proc
Description
CSV#parse (and others) throw an ArgumentError when passing a single argument Proc as a converter because of a performance optimization check that assumes Proc#arity is always positive. Lambdas with one argument work fine as do Procs and lambdas with two arguments. The documentation suggests to me that either should work. Supporting the Proc variant is trivial and allows to_proc and to_proc shortcut syntax to be used eliminating the surprising behavior and uninformative error message. Illustration of the problem is below and a patch with tests is attached. I'm also creating a PR on Github because I think this is a minor enough change to be merged quickly. Thank you.
require 'csv'
puts CSV.parse(" foo , bar ", converters: -> f { f.strip }).inspect
puts CSV.parse(" foo , bar ", converters: :strip.to_proc).inspect
# Without patch this outputs:
# [["foo", "bar"]]
# /home/mdaubert/.rbenv/versions/2.3.0/lib/ruby/2.3.0/csv.rb:2205:in `strip': wrong number of arguments (given 1, expected 0) (ArgumentError)
# With patch this outputs:
# [["foo", "bar"]]
# [["foo", "bar"]]
Files
Updated by recursive-madman (Recursive Madman) over 8 years ago
I'm pretty sure this will introduce new bugs.
The meaning of an arity of -1 is "any number of arguments".
In reality you're not passing a proc which takes 1 argument (which would
work w/o your patch), but a proc which takes any number of arguments.
The reason for the :strip.to_proc
Proc to take an arbitrary number of
arguments (while String#strip
) does not, is that Symbol#to_proc
cannot
know the arity of the method call it is wrapping, thus always accepting
any number of arguments.
To illustrate,
:strip.to_proc
is equivalent to -> receiver, *rest { receiver.strip(*rest) }
When you pass a proc like:
-> receiver { receiver.strip }
directly, everything should work fine.
Only by using Symbol#to_proc
you "break" the arity check within the CSV
library.
Updated by mdaubert (Matthew Daubert) over 8 years ago
Upon further investigation it looks like this might actually stem from Symbol#to_proc returning a Proc with unspecified number of arguments, which makes sense, but I think the fix here is valid regardless. What REcursive Madman said... thanks!
Updated by mdaubert (Matthew Daubert) over 8 years ago
Thanks for the explanation Madman, that's super helpful. So the actual bug is reproduced when passing a proc/non-lambda with any number of arguments but I agree, this would cause regressions in a case like this...
parser.convert do |*args|
field, info = *args
# do things and return
end
https://bugs.ruby-lang.org/issues/12100#
I don't think there's a way to fix this without causing a regression so this can be closed.
Updated by shyouhei (Shyouhei Urabe) over 8 years ago
- Status changed from Open to Closed
closing as per request
Updated by usa (Usaku NAKAMURA) over 8 years ago
- Status changed from Closed to Rejected