Snify89

Results: 25 comments by Snify89

Use these settings and change them accordingly: Images greater than this size are skipped. Note that every image is loaded completely into memory for doing OCR. This is the pixel...

> As soon as Docspell receives any document, it should be hashed first. Then, either the document is processed right away (no matter how) or it is in...
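
The "hash on receipt" idea in the quote can be sketched as follows. This is only an illustration, not Docspell's actual implementation: the choice of SHA-256 and the names `sha256_of_file`, `is_duplicate`, and `seen_hashes` are assumptions made for the example.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA-256 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_duplicate(path, seen_hashes):
    """Hash the file first; report True if the same content was already received.

    `seen_hashes` is a hypothetical in-memory set standing in for whatever
    store a real system would use.
    """
    file_hash = sha256_of_file(path)
    if file_hash in seen_hashes:
        return True
    seen_hashes.add(file_hash)
    return False
```

Hashing before any processing starts means a second upload of the same file can be recognized even while the first one is still in the queue.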

> The large files should only be an example timewise. The size doesn't matter in this example, as long as an upload is quicker than the processing itself. I would...

If you use Docker, you should bind mount the Solr directory to a specific location and set the owner and group ID of that directory to 8983: ``` solr: image:...

Solr 9 works fine (afaik) when used with Docker-managed volumes. If you bind mount to a specific location, you need to set the UID/GID of that directory to 8983. https://stackoverflow.com/questions/63993993/docker-persisted-volum-has-no-permissions-apache-solr @eikek You...
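
A quick way to verify the ownership requirement before starting the container is to inspect the bind-mounted directory on the host. The sketch below is only an illustration: the helper name `owned_by_solr` is hypothetical, and the directory you pass in is whatever host path you bind mount.

```python
import os

# UID/GID mentioned in the comment above for the Solr container user.
SOLR_UID = 8983
SOLR_GID = 8983


def owned_by_solr(path, uid=SOLR_UID, gid=SOLR_GID):
    """Return True if `path` is owned by the given UID and GID on the host."""
    info = os.stat(path)
    return info.st_uid == uid and info.st_gid == gid
```

If the check returns False for your bind-mounted directory, changing the owner on the host (for example with `chown -R 8983:8983` on that directory, run as root) should bring it in line with what the comment describes.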

> * While the script doesn't see the dir as writeable you can create files and subdirectories inside /var/solr manually, so it obviously is writeable When you open the shell...

Depending on the joex settings/configuration, Docspell first tries to fetch the existing (OCRed) text embedded in the PDF. If the length of the fetched text is less than the value in the configuration...

I had the same issue. Some (most actually) of my documents were poorly OCRed and I had to deal with this. In general, Docspell always keeps the original file, which...

> I tried your suggestion 1 from above and it served my purposes well. > > When writing the above comment, I wasn't fully thinking about that docspell creates its...

Yes, only via the API. I'm not sure about all the client API stuff, but it would be a new setting for the collective or user, which saves, if the custom field...