docformer
docformer copied to clipboard
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
Can the target size be changed. Currently when I try to change it throws `mat1 and mat2 shapes cannot be multiplied (3072x768 and 192x128)` . I tried with target size...
I am training docformer, 6 epochs completed. The predictions are not good. Can you please tell what is minimum number of epochs required to get better prediction on document classification?...
Hi, I have been meaning to convert the final o/p of docformer in the document classification notebook from .ckpt to onnx but am unable to do so. I have tried...
Thanks for your impressive work. I have one question. will you share the code of other pre-training tasks? Thanks for your reply. @uakarsh @shabie
## TODO I specially like this answer with `tesserocr` (faster than `pytesseract`): https://stackoverflow.com/a/69131832/7996306
Hello, Is it possible to train DocFormer on key-value (or Question/Answer) extraction task? If so, could you please explain the approach? Thanks!
Ran into this error while running the example notebook. ``` --------------------------------------------------------------------------- Exception Traceback (most recent call last) /tmp/ipykernel_33283/863471981.py in () 1 ## Using a single batch for the forward propagation...
Thanks for the great work! Do you have any plan to release the pre-trained model of docformer?
whether I can fine-tune the model of Docformer? can you give some instruction please. thanks