John Bauer comments

Results: 1062 comments of John Bauer

JSONDecodeError when calling Chinese jar in Python

I mean, removing the traceback defeats the purpose of sending us the error. However, the only line here which can cause that is the download, so whatever. Are you able...

Integration of Mocking framework

In general we try to avoid expanding the library footprint of CoreNLP. Do you have a few suggestions of classes you want tested more thoroughly?

Discrepancy tokensregex webserver and CoreNLP in Python?

I'm not sure this deserves to be closed. Didn't you have the expectation that the API does the escaping for you? Perhaps we should fix that in the stanza client.

Discrepancy tokensregex webserver and CoreNLP in Python?

I thought at first this was on the stanza side, but then I discovered it was an issue with the Java code. The result is that although I fixed it,...

Discrepancy tokensregex webserver and CoreNLP in Python?

https://github.com/stanfordnlp/CoreNLP/commit/5e54ae4862d38de1e36020380b9dab48ab73eebc

Python StanfordcorenlpServer client identify ::: NER ignorecase is not working

At a basic level you are using the truecase annotator after everything else, so there's no way it will affect the results of the previous annotators. Easier would be to...
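To illustrate the ordering point, here is a minimal sketch that checks whether one annotator comes before another in a comma-separated annotator list. The helper name `runs_before` and the example annotator strings are hypothetical, and whether a later annotator actually makes use of truecase output depends on configuration that is not shown in this comment.

```python
def runs_before(annotators, first, second):
    """Return True if `first` appears earlier than `second` in a
    comma-separated annotator list (both must be present)."""
    names = [name.strip() for name in annotators.split(",")]
    if first not in names or second not in names:
        return False
    return names.index(first) < names.index(second)


# truecase listed last: it runs after ner, so it cannot change ner's output
late = "tokenize,ssplit,pos,lemma,ner,truecase"
# truecase listed before ner: ner at least runs after it
early = "tokenize,ssplit,pos,lemma,truecase,ner"

print(runs_before(late, "truecase", "ner"))
print(runs_before(early, "truecase", "ner"))
```

This only inspects the order of names in the list; it does not start a pipeline or contact a server.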

Python StanfordcorenlpServer client identify ::: NER ignorecase is not working

https://stanfordnlp.github.io/CoreNLP/caseless.html You should be able to adapt those model paths for your Python code. On Wed, Jan 8, 2020 at 12:15 AM BVSREDDY82 wrote: > Can you provide the sample...

PTBTokenizer Unrecognizable: (U+2063, decimal: 8291)

I think these are two separate issues & it's just a coincidence that the errors are printed this way. 1) We could figure out something to do with the "invisible...

PTBTokenizer Unrecognizable: (U+2063, decimal: 8291)

I suggest only tokenizing files of less than 2GB until we figure it out
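One way to follow that suggestion, as a rough sketch: split a large text file into line-aligned pieces that each stay under a byte limit, then tokenize the pieces separately. The function name `split_by_size`, the `part00000.txt` naming, and the choice to cut at line boundaries are all assumptions made for illustration; nothing here is part of CoreNLP.

```python
import os


def split_by_size(path, out_dir, max_bytes):
    """Copy `path` into consecutive pieces in `out_dir`, cutting only at
    line boundaries so that each piece is at most `max_bytes` long.
    A single line longer than `max_bytes` ends up alone in an oversized piece."""
    os.makedirs(out_dir, exist_ok=True)
    pieces = []
    out = None
    written = 0
    with open(path, "rb") as src:
        for line in src:
            # Start a new piece if none is open, or if this line would overflow
            if out is None or (written > 0 and written + len(line) > max_bytes):
                if out is not None:
                    out.close()
                piece = os.path.join(out_dir, "part%05d.txt" % len(pieces))
                pieces.append(piece)
                out = open(piece, "wb")
                written = 0
            out.write(line)
            written += len(line)
    if out is not None:
        out.close()
    return pieces
```

For the case discussed here, something like `split_by_size("big.txt", "pieces", 2 * 1024**3 - 1)` would keep every piece below 2GB, provided no single line is that long. Sentences that span a cut point would be tokenized in separate runs.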

PTBTokenizer Unrecognizable: (U+2063, decimal: 8291)

Just to confirm regarding the invisible separator:
```
java edu.stanford.nlp.pipeline.StanfordCoreNLP -annotators tokenize -file foo.txt
[main] INFO edu.stanford.nlp.pipeline.StanfordCoreNLP - Adding annotator tokenize
Processing file /home/john/CoreNLP/foo.txt ... writing to /home/john/CoreNLP/foo.txt.out
Untokenizable: ⁣...
```
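As a possible workaround on the input side, the text can be checked for U+2063 and similar characters before it reaches the tokenizer. The sketch below is an assumption about how one might do that in Python; the helper names `find_format_chars` and `strip_chars` are made up for this example. It lists Unicode format characters (category `Cf`) with their positions and removes only the ones explicitly asked for, since some format characters (such as zero-width joiners) carry meaning in certain scripts.

```python
import unicodedata


def find_format_chars(text):
    """Return (index, code point, name) for every Unicode format (Cf) character."""
    return [
        (i, "U+%04X" % ord(ch), unicodedata.name(ch, "UNKNOWN"))
        for i, ch in enumerate(text)
        if unicodedata.category(ch) == "Cf"
    ]


def strip_chars(text, chars=("\u2063",)):
    """Remove only the listed characters; by default just U+2063."""
    return "".join(ch for ch in text if ch not in chars)


sample = "foo\u2063bar"
print(find_format_chars(sample))
print(strip_chars(sample))
```

Removing characters shifts the character offsets of everything after them, which matters if token offsets need to be mapped back to the original file.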