OpenCC.NET icon indicating copy to clipboard operation
OpenCC.NET copied to clipboard

Found one missing Chinese character.

Open zydjohnHotmail opened this issue 3 years ago • 5 comments

Hello: I used your repo., basically, it works well. However, there is at least one tranditional chinese character missing in the library. See the following picture, the first character is missing. Its unicode seems to be: \u8F17 Please check. Thanks, UFOChinese

zydjohnHotmail avatar Jun 12 '22 15:06 zydjohnHotmail

Do you mean missing 𫐐 <-> 輗?I will test it later.

CosineG avatar Jun 13 '22 13:06 CosineG

Hello: I don't know this Chinese character. This is a name for a horse. But whatever you can add it in your library, it will be OK.

zydjohnHotmail avatar Jun 13 '22 14:06 zydjohnHotmail

Sorry for taking so long to reply. I tested and found out that the Simplified 𫐐 of the character 輗 is too rare and located in Unicode CJK Extension C, so many fonts don't contain it and jieba.NET can't parse it correctly. Although this pair of conversions exists in the dictionary now, it doesn't work properly either.

CosineG avatar Jun 26 '22 14:06 CosineG

Do you have any solution?

zydjohnHotmail avatar Jun 26 '22 14:06 zydjohnHotmail

Maybe you can raise an issue to anderscui/jieba.NET? And I don't know if the font on your computer supports the character.

CosineG avatar Jun 26 '22 14:06 CosineG