
The algorithm to compute the derived property should use EAW more

Open kojiishi opened this issue 1 year ago • 2 comments

This issue is a continuation of https://github.com/kojiishi/unicode-auto-spacing/issues/16, originally raised by @asmusf.

kojiishi avatar Apr 06 '24 08:04 kojiishi

@asmusf The algorithm to compute the property was adjusted to:

  1. Add by scripts and script_extensions.
  2. Remove by EAW.

In other words, the algorithm is an intersection of scripts and EAW.
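To illustrate the two steps, here is a minimal sketch, not the actual unicodetools code. The `CharInfo` record, the `compute_set` function, the choice of target scripts, and the choice of which EAW values to remove are all assumptions made for the example; the tiny character table is hand-written toy data rather than a read of the UCD.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CharInfo:
    """Hypothetical per-character record holding only what the sketch needs."""
    script: str
    script_extensions: frozenset = frozenset()
    eaw: str = "N"


def compute_set(chars, target_scripts, removed_eaw):
    """Add by script/script_extensions, then remove by EAW."""
    # Step 1: add every code point whose script, or any of its
    # script extensions, is one of the target scripts.
    added = {
        cp
        for cp, info in chars.items()
        if info.script in target_scripts
        or info.script_extensions & target_scripts
    }
    # Step 2: remove code points whose EAW value is in the removed set.
    return {cp for cp in added if chars[cp].eaw not in removed_eaw}


# Toy data for illustration only.
CHARS = {
    0x4E00: CharInfo("Hani", eaw="W"),
    0x3042: CharInfo("Hira", eaw="W"),
    0x30FC: CharInfo("Zyyy", frozenset({"Hira", "Kana"}), eaw="W"),
    0xFF76: CharInfo("Kana", eaw="H"),
    0x0041: CharInfo("Latn", eaw="Na"),
}

# Assumed parameters, chosen only to exercise both steps.
TARGET_SCRIPTS = frozenset({"Hani", "Hira", "Kana"})
REMOVED_EAW = frozenset({"H"})

result = compute_set(CHARS, TARGET_SCRIPTS, REMOVED_EAW)
print(sorted(f"U+{cp:04X}" for cp in result))
```

In this toy run, U+30FC is added through its script extensions, U+FF76 is added by script and then removed by EAW, and U+0041 is never added because its script is not a target.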

Does this make it clear that EAW plays one of the primary roles in this property?

kojiishi avatar Apr 06 '24 12:04 kojiishi

With https://github.com/unicode-org/unicodetools/issues/768, characters with ea=N|Na are excluded from the W set, unless their script is one of the CJK scripts.
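As a rough sketch of that rule (again, not the actual implementation), the check can be written as a single predicate. The function name `excluded_by_narrow_rule` and the contents of `CJK_SCRIPTS` are assumptions for illustration; the real list of scripts treated as CJK is defined by the tool, not here.

```python
# Placeholder list of script codes treated as CJK in this sketch (an assumption).
CJK_SCRIPTS = frozenset({"Hani", "Hira", "Kana", "Bopo", "Hang"})

NARROW_OR_NEUTRAL = frozenset({"N", "Na"})


def excluded_by_narrow_rule(script, eaw, cjk_scripts=CJK_SCRIPTS):
    """Return True if a character is dropped from the W set by this rule.

    A character is dropped when its East_Asian_Width is N or Na and its
    script is not one of the CJK scripts. Other EAW values are not
    affected by this rule.
    """
    return eaw in NARROW_OR_NEUTRAL and script not in cjk_scripts


print(excluded_by_narrow_rule("Latn", "Na"))  # non-CJK script, ea=Na
print(excluded_by_narrow_rule("Hani", "N"))   # CJK script, ea=N
print(excluded_by_narrow_rule("Hira", "W"))   # rule does not apply to ea=W
```

The predicate only covers this one exclusion; whether a character ends up in the W set still depends on the script-based step described earlier.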

The current algorithm to compute the property is here. It's in Python with a library, but I hope it's readable without prior knowledge.

kojiishi avatar Apr 21 '24 07:04 kojiishi

I think the changes so far have addressed the feedback. Please let me know if anything remains to be addressed. Thank you for the feedback.

kojiishi avatar Jun 29 '24 08:06 kojiishi