Actions
Misc #20406
openQuestion about Regexp encoding negotiation
Status:
Open
Assignee:
-
Description
I am wondering what are the rules to calculate Regexp literal encoding in case an encoding modifier is specified.
From the documentstion:
By default, a regexp with only US-ASCII characters has US-ASCII encoding:
...
A regular expression containing non-US-ASCII characters is assumed to use the source encoding. This can be overridden with one of the following modifiers.
//n ...
//u ...
//e ...
//s ...
Looking at the following examples I would assume that these rules are followed except one case:
p /\xc2\xa1/e .encoding # EUC-JP
p /#{ }\xc2\xa1/e .encoding # EUC-JP
p /a/e .encoding # EUC-JP
p /a #{} a/e .encoding # EUC-JP
p /#{} a/e .encoding # US-ASCII
The last Regexp /#{} a/e
is supposed to have EUC-JP
encoding but has US-ASCII
. So I am wondering what rule is applied in this case.
Actions
Like0
Like0Like0Like0Like0Like0Like0Like0Like0Like0