natural icon indicating copy to clipboard operation
natural copied to clipboard

Korean Language Support

Open keon opened this issue 10 years ago • 2 comments

I am planning to add Korean Language support of this library. Is there any example or guide besides in the code in the package that might help me to add this functionality?

keon avatar Nov 05 '15 16:11 keon

depends on what algorithms you are extending, you can look around the library youll see alot of things with the suffix _fr.js for french etc... theres no formal documentation and not really a standard way of doing things yet though it has been discussed (see #228 and previously #159)

kkoch986 avatar Nov 05 '15 20:11 kkoch986

@kkoch986 Ok, I have been studying the code, but Japanese Tokenizer seems to be really apart from others. Which should be true because Asian languages are different from Roman & greek based language system.

I think I am going to imitate whats written on the Japanese tokenizer, since Korean and Japanese are really similar. But I am not sure how this will be organized in the future.

I think one of the urgent thing on the todo list should be organizing the language system.

keon avatar Nov 12 '15 18:11 keon