martinreynaert

5 comments by martinreynaert

I still use --textclass, especially when working with OCRed text that needs to be post-corrected before tokenization.

Thanks for the further info! I'll check whether it would make more sense for me to use --inputclass and --outputclass instead.

Hi Jan, Thanks! I have access to 'Couperus' now! Thanks for changing the title too, this was exactly what was going wrong. And yes, this will work for us! You'll...

Hi Jan, This is great news! I have in the meantime been able to upload a 1.4M corpus ;0) And my colleagues are now exploring it all! Thank you very...

Hi, I would not take things as far as suggested in the last update here. Far easier to restrict things to a single dir. Also, if one happens to have...