Julien Nioche
Julien Nioche
  There are two images attached that give the warning. When you use the -F switch in exiftool the offsets are fixed and some more fields are then visible...
@drewnoakes not yet, the comment above was a note to myself
@asciimoo did you manage to have a closer look at URLFrontier?
The API module is the best place to start. See [README](https://github.com/crawler-commons/url-frontier/blob/master/API/README.md) and [urlfrontier documentation](https://github.com/crawler-commons/url-frontier/blob/master/API/urlfrontier.md). If something is not clear, please ask a question on https://github.com/crawler-commons/url-frontier/discussions, we'd be more than happy...
@tikazyq did you manage to have a closer look at URLFrontier?
Hi. You know about [https://github.com/DigitalPebble/behemoth-elasticsearch]? It is probably in need of an update but should be a good starting point.
I've added a link to [https://github.com/DigitalPebble/behemoth/wiki/Behemoth-Modules]. > You want me to send a PR to add the elasticsearch module? > yep, would be the right place for it and the...
> I am however able to persist data into most recent release of ES now and want to push this into the codebase so I will send you a PR....
Hi Alex yes please create a patch if you can. Not clear which test creates this output but it could be a case of the main file being deleted but...
http://incubator.apache.org/opennlp/