Bug #6268

Character encoding issues with UTF-8 for emoji and confusable symbols.

Added by Eugenie Lyzenko about 2 years ago. Updated about 2 years ago.

Status: New
Priority: Normal
Assignee: -
Target version: -
Start date:
Due date:
% Done: 0%
billable: No
vendor_id: GCD
case_num:

gcd_fwd-3821c-13749__refs_6193.v1.bzr.patch - FWD code patch (1.84 KB) Eugenie Lyzenko, 04/08/2022 06:31 PM

emojis_confusables_working_from_fwd_BUT_jvm_length_two_bytes_01.png - emoji and confusables (14.8 KB) Eugenie Lyzenko, 04/08/2022 06:33 PM

History

#1 Updated by Eugenie Lyzenko about 2 years ago

The patch to unlock negative ASC code conversion.

#2 Updated by Eugenie Lyzenko about 2 years ago

Small pic showing the end status for now. Please attach to the new bug issue along with this comment.

Using the gcd_fwd-3821c-13749__refs_6193.v1.bzr.patch patch, the xfer testcases pass for string equality comparison, but fail for length comparison.

Reproduction:
cd ~/testcases/deploy/client-chui && ./client-terminal.sh client:cmd-line-option:startup-procedure=field_io/test_fieldio.p

Pic:

emoji and confusables

#4 Updated by Tijs Wickardt about 2 years ago

  • Start date deleted (04/08/2022)

#5 Updated by Eugenie Lyzenko about 2 years ago

Greg,

What is our plan here? Do we need to work on this now, or can it be deferred?

#6 Updated by Greg Shah about 2 years ago

Defer.

#7 Updated by Tijs Wickardt about 2 years ago

The inconsistency in emoji length between the JVM and other runtimes is nicely explained in "Emojis, Java and Strings".
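For illustration, here is a minimal Java sketch of why a length comparison can fail while string equality still passes. It is not taken from the FWD code or the attached patch; the class and method names are hypothetical. It assumes the mismatch comes from `String.length()` counting UTF-16 code units rather than characters. Which measure the original runtime reports is not stated in this issue.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, for illustration only (not part of FWD).
public class EmojiLength {
    // Number of UTF-16 code units; this is what String.length() reports.
    static int utf16Units(String s) {
        return s.length();
    }

    // Number of Unicode code points (a surrogate pair counts as one).
    static int codePoints(String s) {
        return s.codePointCount(0, s.length());
    }

    // Number of bytes when the string is encoded as UTF-8.
    static int utf8Bytes(String s) {
        return s.getBytes(StandardCharsets.UTF_8).length;
    }

    public static void main(String[] args) {
        String emoji = "\uD83D\uDE00";   // U+1F600, outside the BMP, stored as a surrogate pair
        String confusable = "\u0430";    // U+0430 CYRILLIC SMALL LETTER A, looks like Latin 'a'

        System.out.println(utf16Units(emoji));      // 2
        System.out.println(codePoints(emoji));      // 1
        System.out.println(utf8Bytes(emoji));       // 4

        System.out.println(utf16Units(confusable)); // 1
        System.out.println(codePoints(confusable)); // 1
        System.out.println(utf8Bytes(confusable));  // 2

        // Equality compares the same code units on both sides, so it holds
        // regardless of which length measure is used.
        System.out.println(emoji.equals(new String(emoji.getBytes(StandardCharsets.UTF_8), StandardCharsets.UTF_8))); // true
    }
}
```

If the expected length is a character count, `codePointCount` is the closer match for a single emoji. Neither measure counts user-perceived characters for multi-code-point emoji sequences.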
