Bug #18027: test/ruby/enc/test_emoji_breaks.rb does not use the file emoji-variation-sequences.txt - Ruby - Ruby Issue Tracking System

While working on issue #17750, I found out that the number of assertions is essentially the same whether or not the file emoji-variation-sequences.txt is included in the files used to create the test data. This is the case both before and after the move of this file. So it seems unrelated to the move of the file, or to the version upgrade.

I have not yet been able to figure out a reason for this behavior; the structure of the file emoji-variation-sequences.txt looks just like the structure of the other three files (emoji-sequences.txt, emoji-test.txt, and emoji-zwj-sequences.txt).

This bug serves to track this issue. Any hints appreciated. In particular, I'm a puts debugger, but puts debugging doesn't seem to work well when testing.

Related issues 1 (0 open — 1 closed)

Updated by duerst (Martin Dürst) almost 5 years ago Actions
Copy link
#1

Related to Feature #17750: Update Unicode data to Unicode Version 13.0.0 added

Updated by duerst (Martin Dürst) almost 5 years ago Actions
Copy link
#2 [ruby-core:104944]

duerst (Martin Dürst) wrote:

In particular, I'm a puts debugger, but puts debugging doesn't seem to work well when testing.

I found that adding and using the following method in the Test... class allowed me to use log_test instead of puts for debugging. If you know of a better way, please tell me.

  def log_test(message)
    open('log_test.txt', 'a') { |f| f.write(message+"\n") }
  end

Updated by duerst (Martin Dürst) almost 5 years ago Actions
Copy link
#3 [ruby-core:104946]

Status changed from Open to Closed

Resolved with commit fd7f61c.

It turned out the problem was that in emoji-variation-sequences.txt, lines look like
0023 FE0E ; text style ; # (1.1) NUMBER SIGN
whereas in the other two files that we processed the same way, they look like
23F0 ; Basic_Emoji; alarm clock # E0.6 [1] (⏰)
(aligned to show the similarities and differences). The empty field between the second semicolon and the hash mark wasn't properly accounted for, but this has now been fixed in commit fd7f61c.

Actions

Copy link

Also available in: PDF Atom

Project

General

Profile

Ruby

Custom queries

Bug #18027

test/ruby/enc/test_emoji_breaks.rb does not use the file emoji-variation-sequences.txt

Updated by duerst (Martin Dürst) almost 5 years ago Actions
Copy link
#1

Updated by duerst (Martin Dürst) almost 5 years ago Actions
Copy link
#2 [ruby-core:104944]

Updated by duerst (Martin Dürst) almost 5 years ago Actions
Copy link
#3 [ruby-core:104946]

Project

General

Profile

Ruby

Custom queries

Bug #18027

test/ruby/enc/test_emoji_breaks.rb does not use the file emoji-variation-sequences.txt

Updated by duerst (Martin Dürst) almost 5 years ago ActionsCopy link #1

Updated by duerst (Martin Dürst) almost 5 years ago ActionsCopy link #2 [ruby-core:104944]

Updated by duerst (Martin Dürst) almost 5 years ago ActionsCopy link #3 [ruby-core:104946]

Updated by duerst (Martin Dürst) almost 5 years ago Actions
Copy link
#1

Updated by duerst (Martin Dürst) almost 5 years ago Actions
Copy link
#2 [ruby-core:104944]

Updated by duerst (Martin Dürst) almost 5 years ago Actions
Copy link
#3 [ruby-core:104946]