John Bauer

Results 1062 comments of John Bauer

I mean, removing the traceback defeats the purpose of sending us the error. However, the only line here which can cause that is the download, so whatever. Are you able...

In general we try to avoid expanding the library footprint of corenlp. Do you have a few suggestions of classes you want tested more thoroughly?

I'm not sure this deserves to be closed. Didn't you have the expectation that the API does the escaping for you? Perhaps we should fix that in the stanza client.

I thought at first this was on the stanza side, but then I discovered it was an issue with the java code. The result is that although I fixed it,...

https://github.com/stanfordnlp/CoreNLP/commit/5e54ae4862d38de1e36020380b9dab48ab73eebc

At a basic level you are using the truecase annotator after everything else, so there's no way it will affect the results of the previous annotators. Easier would be to...

https://stanfordnlp.github.io/CoreNLP/caseless.html You should be able to adapt those model paths for your python code On Wed, Jan 8, 2020 at 12:15 AM BVSREDDY82 wrote: > Can you provide the sample...

I think these are two separate issues & it's just a coincidence that the errors are printed this way. 1) We could figure out something to do with the "invisible...

I suggest only tokenizing files of less than 2GB until we figure it out

Just to confirm regarding the invisible separator: ``` java edu.stanford.nlp.pipeline.StanfordCoreNLP -annotators tokenize -file foo.txt [main] INFO edu.stanford.nlp.pipeline.StanfordCoreNLP - Adding annotator tokenize Processing file /home/john/CoreNLP/foo.txt ... writing to /home/john/CoreNLP/foo.txt.out Untokenizable: ⁣...