Andy Chow
Andy Chow
现在的buffer大小是多少? 是include/cppjieba/Trie.hpp这个中const size_t MAX_WORD_LENGTH = 512的大小?是的话可否设置为2048?
如题,在进行Java版本lac单线程分词时,出现: > (1) *** Error in `java': double free or corruption (out): 0x00007f9e6c2cbf40 *** > > (2)terminate called after throwing an instance of 'std::bad_alloc' > what(): std::bad_alloc > Aborted >
可否共享一下训练语料?
分词速度相比jieba、ltp(python版)还是相差约8倍左右?如何改进?
``` File "train.py", line 41, in main logger.info('\n'.join(sorted(["%s : %s" % (str(k), str(v)) for k, v in FLAGS.__dict__['__flags'].items()]))) KeyError: '__flags' ```
falcon=3.0.0.dev1 ``` resp.body = f.read() UnicodeDecodeError: 'utf-8' codec can't decode byte 0x89 in position 0: invalid start byte ```

As the tool supply the funcion of "NSFC Subject Classifier", which input is chinese. so how can i train the same model for english version? Q: A) What is the...
Is it possible to open source the English model without the need to call the translation software, similar to the ai model?