GraphemeSplitter
A C# implementation of the Unicode grapheme cluster breaking algorithm
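http://ufcppfree.azurewebsites.net/Grapheme Could the UTF-32 sequence links in the results on this page be zero-padded when a value has three or fewer digits? Searching does not work well unless there are at least four digits.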
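A minimal sketch of the requested padding, assuming the links are built from hexadecimal code point values (the page's actual link-building code is not shown here). `CodePointFormatter`, `EnumerateCodePoints`, and `ToPaddedHex` are hypothetical names used only for illustration; the `"X4"` format specifier pads to at least four hex digits and leaves longer values untouched.

```cs
using System;
using System.Collections.Generic;

static class CodePointFormatter
{
    // Hypothetical helper: yields each UTF-32 code point of a UTF-16 string.
    // char.ConvertToUtf32 throws ArgumentException on an unpaired surrogate.
    public static IEnumerable<int> EnumerateCodePoints(string text)
    {
        for (int i = 0; i < text.Length; i++)
        {
            int codePoint = char.ConvertToUtf32(text, i);
            if (char.IsSurrogatePair(text, i)) i++; // skip the low surrogate
            yield return codePoint;
        }
    }

    // Hypothetical helper: "X4" zero-pads to a minimum of four hex digits
    // and never truncates values that need five or six digits.
    public static string ToPaddedHex(int codePoint) => codePoint.ToString("X4");

    static void Main()
    {
        // Sample input (an assumption): 'a', 'é', and U+1F468.
        foreach (int cp in EnumerateCodePoints("a\u00E9\U0001F468"))
            Console.WriteLine(ToPaddedHex(cp)); // 0061, 00E9, 1F468
    }
}
```

With this formatting, `U+61` becomes `0061`, which matches the four-digit form commonly used when searching for a code point.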
.NET 5 will support [UAX29](https://www.unicode.org/reports/tr29/)-compliant grapheme splitting (as a breaking change to `StringInfo` and `TextElementEnumerator`). See: https://github.com/dotnet/runtime/pull/328

What should happen to this repo?

- Retire it?
- Backport `TextElementEnumerator` from .NET 5?
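For comparison, here is a small sketch of splitting a string with the built-in `StringInfo.GetTextElementEnumerator` API. The `TextElementDemo` class, the `SplitTextElements` helper, and the sample emoji sequence are assumptions made for illustration and are not part of this repository.

```cs
using System;
using System.Collections.Generic;
using System.Globalization;

static class TextElementDemo
{
    // Hypothetical helper: collects the text elements reported by the runtime.
    public static List<string> SplitTextElements(string text)
    {
        var elements = new List<string>();
        TextElementEnumerator enumerator = StringInfo.GetTextElementEnumerator(text);
        while (enumerator.MoveNext())
            elements.Add(enumerator.GetTextElement());
        return elements;
    }

    static void Main()
    {
        // Sample input (an assumption): man, ZWJ, woman, ZWJ, girl.
        string family = "\U0001F468\u200D\U0001F469\u200D\U0001F467";

        // The count depends on the runtime's segmentation rules.
        Console.WriteLine(SplitTextElements(family).Count);
    }
}
```

On .NET 5 or later the ZWJ sequence should be reported as a single text element, while runtimes that predate the change are expected to split it into several. Running the same input through both is one way to judge whether a backport is worth maintaining.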
```cs
var utf8 = new byte[] { 240, 159, 145, 168, 226, 128, 141, 240, 159, 145, 168, 226, 128, 141, 240, 159, 145, 167, 226, 128, 141, 240, 159,...
```
- [x] Hangul
- [ ] Emoji Flag Sequence
- [ ] http://www.unicode.org/Public/10.0.0/ucd/auxiliary/GraphemeBreakTest.txt