Project

General

Profile

Actions

Bug #18638

closed

CSV parse does not honor field_size_limit option unless and until a comma occurs in the data, and field_size_limit is off by one

Added by Capncavedan (Dan Buettner) about 2 years ago. Updated about 2 years ago.

Status:
Third Party's Issue
Target version:
-
[ruby-core:107927]

Description

When using CSV.parse or CSV.foreach and specifying option field_size_limit: 2_000, we do not consistently see an exception raised when a field contains over 2,000 characters.

I was finally able to reproduce the issue as occurring only after a comma has occurred within a data field.

I then also found that what could be considered an off-by-one error with respect to "field_size_limit": you need to set a value 1 higher than the maximum field length you want to allow.

This occurs on Ruby 2.7.5 and 3.1.1.

This is a simple ruby script to demonstrate both issues:

require "csv"

the_alphabet = ("a".."z").to_a.join
the_alphabet.size # => 26

# this does not honor field_size_limit; it should raise an exception but does not
CSV.parse("\"I am a working man\",\"#{the_alphabet}\"", field_size_limit: 20)

# this raises the proper exception
CSV.parse("\"I am a workin, man\",\"#{the_alphabet}\"", field_size_limit: 20)

# this raises a "Field size exceeded" error, even though field size equals field size limit
CSV.parse("\"I am a workin, man\",\"#{the_alphabet}\"", field_size_limit: the_alphabet.size)

# this works as expected, no exception raised
CSV.parse("\"I am a workin, man\",\"#{the_alphabet}\"", field_size_limit: the_alphabet.size+1)

Updated by kou (Kouhei Sutou) about 2 years ago

  • Status changed from Open to Third Party's Issue
  • Assignee set to kou (Kouhei Sutou)

Could you report this to https://github.com/ruby/csv/issues ?
https://github.com/ruby/csv is the upstream.

Actions

Also available in: Atom PDF

Like0
Like0