ReadingBank: A Benchmark Dataset for Reading Order Detection

Results: 8 ReadingBank issues

Thank you for sharing such a great paper. I am a PhD student doing research in document understanding, so your paper was extremely interesting and useful to me. Can you...

Hello, the link to the preprocessed data (https://layoutlm.blob.core.windows.net/readingbank/dataset/ReadingBank.zip?sv=2022-11-02&ss=b&srt=o&sp=r&se=2033-06-08T16:48:15Z&st=2023-06-08T08:48:15Z&spr=https&sig=a9VXrihTzbWyVfaIDlIT1Z0FoR1073VB0RLQUMuudD4%3D) seems to be down. I get the following error: This XML file does not appear to have any style information...

Hello there, thank you for your fantastic work. Is it possible to release a filtered version of the dataset that excludes annotated tables? Background: The reading order of text can...

Do you have the corresponding images for the data?

I am assuming the version at https://huggingface.co/datasets/zilongwang/ReadingBank is faithful to your original. Could you release a version of this dataset that indicates which text is in which bounding box? Maybe...

Could you provide the script you used to generate the data for a document from an MS Word file? Thanks.