feng smith

Results 4 issues of feng smith

最近在看 ik 的代码的时候,发现了一个 bug 。AnalyzeContext 第 121 行、第 124 行有点问题。 ![image](https://user-images.githubusercontent.com/6457270/119466115-e6bf0100-bd76-11eb-8944-963d1f12ba64.png) 缓冲区的大小是 4096 ,超过 3996 就可能需要把未处理过的字符拷贝到缓冲区的前面。 我构造了个 4100( > 3996 ,需要把未处理过的字符拷贝能缓冲区,能触发 bug)长的字符串 ,都是 垚 字,正确的应该是输出 4100 个 垚 字,实际上输出了 4101...

用 ik_smart 分词英文的时候,英文句号 . 和 . 之前的单词分到一起了。样例如下: ``` GET /_analyze { "analyzer": "ik_smart", "text": "In 1997, a group of twenty British women made history. Working " } ``` 分词结果是: ```json...

When I run ``` python scripts/prepare_data.py ``` finally print Cleaning data... sh: /Users/xxxxx/xxxxx/image-to-latex/data/scripts/find_and_replace.sh: No such file or directory But I find out that find_and_replace.sh the file located in ``` /Users/xxxxx/xxxxx/image-to-latex/scripts/find_and_replace.sh...

建表语句报错了: ``` CREATE TABLE IF NOT EXISTS mytable ( f2 IDENTITY(1, 10) CONSTRAINT pk PRIMARY KEY HASH AUTO_INCREMENT, f5 int NOT NULL UNIQUE NOT NULL, f6 int NULL CHECK f6...