
Questions about the prediction of the model

Open StanleyGan opened this issue 5 years ago • 3 comments

The model's prediction is a list of 80 arrays. Which one holds the cell bounding boxes, and which one holds the table bounding boxes? I am interested in extracting the vertices of the table bounding box.

StanleyGan, Jun 08 '20 22:06

You can refer to main.py. I extract the model predictions in that file.

kshitijkapadni, Jun 11 '20 06:06

Loading the weights (checkpoint_file: epoch_36.pth, config_file: cascade_mask_rcnn_hrnetv2p_w32_20e.py) fails with this error: size mismatch for rpn_head.rpn_cls.weight: copying a param with shape torch.Size([3, 256, 1, 1]) from checkpoint, the shape in current model is torch.Size([9, 256, 1, 1]). size mismatch for rpn_head.rpn_cls.bias: copying a param with shape torch.Size([3]) from checkpoint, the shape in current model is torch.Size([9]). Does anyone know why? Thanks!

SdwHorizon, Jun 17 '20 07:06

@StanleyGan I think each array with 5 numbers is the bounding box of a cell or a table, and the first 4 numbers are the coordinates of the box. You can either plot them on your test image or work out the location manually.
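A rough sketch of reading the boxes out by hand, in case it helps. It assumes each of the 80 entries is a per-class collection of rows shaped `[x1, y1, x2, y2, score]`, with the fifth number being a confidence score; that is my reading of the output, not something confirmed in this thread. The helper names, the `TABLE_CLASS` index, and the sample numbers are all made up for illustration, so check main.py for which index actually holds tables and which holds cells.

```python
def boxes_for_class(prediction, class_index, score_thr=0.5):
    """Collect (x1, y1, x2, y2, score) rows for one class above a score threshold.

    `prediction` is assumed to be a list with one entry per class, each entry
    being a sequence of 5-number rows (plain lists or NumPy rows both work).
    """
    boxes = []
    for row in prediction[class_index]:
        x1, y1, x2, y2, score = (float(v) for v in row)
        if score >= score_thr:
            boxes.append((x1, y1, x2, y2, score))
    return boxes


def box_vertices(x1, y1, x2, y2):
    """Return the four corners, clockwise from the top-left corner."""
    return [(x1, y1), (x2, y1), (x2, y2), (x1, y2)]


# Synthetic stand-in for the model output: 80 per-class entries, mostly empty.
prediction = [[] for _ in range(80)]
prediction[0] = [
    [10.0, 20.0, 110.0, 220.0, 0.98],  # confident detection
    [5.0, 5.0, 50.0, 50.0, 0.12],      # low score, filtered out below
]

TABLE_CLASS = 0  # hypothetical index; verify against main.py

for x1, y1, x2, y2, score in boxes_for_class(prediction, TABLE_CLASS):
    print(box_vertices(x1, y1, x2, y2), score)
```

Most of the 80 entries will likely be empty, so looping over all indices and printing the ones with rows is a quick way to see which classes the model actually uses.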

stemgene, Jun 25 '20 15:06