In PETR, randomly initialized queries are used without a query positional encoding derived from reference points, so Section 3.3 seems consistent with the code of this repo.
In contrast, two-stage Deformable DETR embeds its queries with their initial bboxes.
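For illustration, here is a minimal sketch of the two initialization schemes in PyTorch; the names (`query_embed`, `sine_pos_from_reference`, `pos_proj`) and the exact encoding are my assumptions, not this repo's code:

```python
import math
import torch
import torch.nn as nn

embed_dims, num_queries = 256, 300

# PETR-style (as in this repo): free learnable query embeddings, with no
# positional encoding derived from reference points.
query_embed = nn.Embedding(num_queries, embed_dims)

# Two-stage Deformable-DETR-style: a query positional embedding generated
# from the initial reference boxes (sinusoidal encoding + projection).
def sine_pos_from_reference(ref_boxes, num_feats=64, temperature=10000):
    """ref_boxes: (num_queries, 4) normalized (cx, cy, w, h)
    -> (num_queries, 4 * num_feats)."""
    dim_t = torch.arange(num_feats, dtype=torch.float32)
    dim_t = temperature ** (2 * torch.div(dim_t, 2, rounding_mode='floor') / num_feats)
    pos = ref_boxes[..., None] * (2 * math.pi) / dim_t   # (Q, 4, num_feats)
    pos = torch.stack((pos[..., 0::2].sin(), pos[..., 1::2].cos()), dim=-1)
    return pos.flatten(1)                                # (Q, 4 * num_feats)

ref_boxes = torch.rand(num_queries, 4)       # initial reference boxes
pos_proj = nn.Linear(4 * 64, embed_dims)     # project to embed_dims
query_pos = pos_proj(sine_pos_from_reference(ref_boxes))
```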
_Hence, I think there is still some difference between Section 3.3 and the code, where the locations of the initial reference points are randomly initialized and learned._ -> Sorry, I checked it. `The initial...
I checked the code, and I think the two-stage mode (the default setting in the code) means that the initial reference points for the decoder are initialized from the top...
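For reference, a rough sketch of that two-stage selection, assuming Deformable-DETR-style encoder outputs; `enc_scores` and `enc_proposals` are hypothetical names, not taken from this repo:

```python
import torch

# Hypothetical encoder outputs: one objectness score and one proposal box
# (in logit space) per encoder token.
batch, num_tokens, num_queries = 2, 10000, 300
enc_scores = torch.rand(batch, num_tokens)         # per-token objectness
enc_proposals = torch.randn(batch, num_tokens, 4)  # box logits (unactivated)

# Two-stage mode: pick the top-k scoring proposals and use them as the
# initial reference points/boxes for the decoder.
topk = torch.topk(enc_scores, num_queries, dim=1).indices      # (batch, k)
init_reference = torch.gather(
    enc_proposals, 1, topk.unsqueeze(-1).expand(-1, -1, 4))    # (batch, k, 4)
init_reference = init_reference.sigmoid()  # normalized (cx, cy, w, h)
```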
Yes, the number of sampling offsets for one reference point (i.e., one keypoint) is set to **1 for each head and feature level**, so there are 32 sampling offsets for...
It can be visualized like this: green is the reference point and the other points are sampling points (sampling points with low attention weights are omitted).
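To make the count concrete, here is a small shape sketch of the offset prediction under the setting above (8 heads × 4 feature levels × 1 point; all variable names are illustrative):

```python
import torch
import torch.nn as nn

embed_dims, num_heads, num_levels, num_points = 256, 8, 4, 1

# One (x, y) offset per head, per level, per point:
# 8 heads * 4 levels * 1 point = 32 sampling offsets per reference point.
sampling_offsets = nn.Linear(embed_dims, num_heads * num_levels * num_points * 2)

query = torch.rand(2, 17, embed_dims)  # (batch, num_keypoints, embed_dims)
offsets = sampling_offsets(query).view(2, 17, num_heads, num_levels, num_points, 2)
print(num_heads * num_levels * num_points)  # -> 32
```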
**Evaluation**
`bash tools/dist_test.sh $CONFIG $CHECKPOINT $NUM_GPU --eval keypoints`
ex) `CUDA_VISIBLE_DEVICES=1,2 bash tools/dist_test.sh configs/petr/petr_r50_16x2_100e_coco.py checkpoint/petr_r50_16x2_100e_coco.pth 2 --eval keypoints`

**Training**
`bash tools/dist_train.sh $CONFIG $NUM_GPU`
ex) `CUDA_VISIBLE_DEVICES=1,2 bash tools/dist_train.sh configs/petr/petr_r50_16x2_100e_coco.py 2`

**Inference**
ex)...
Did you download the COCO dataset and annotation files? You should download the annotation files and images from [https://cocodataset.org/#download], and fix the data root (`data_root = '/dataset/public/coco/'`) in [configs/_base_/datasets/coco_keypoint.py].
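For example, the relevant excerpt of `configs/_base_/datasets/coco_keypoint.py` would look roughly like this (the path and layout below are illustrative; adjust to where you placed COCO):

```python
# configs/_base_/datasets/coco_keypoint.py (excerpt, illustrative)
data_root = '/path/to/your/coco/'  # replace '/dataset/public/coco/' with your path
# the downloaded files are then expected under, e.g.:
#   /path/to/your/coco/annotations/person_keypoints_train2017.json
#   /path/to/your/coco/train2017/
#   /path/to/your/coco/val2017/
```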
I couldn't find video inference code in this repository. I recommend converting mp4 videos into png frames for inference, or modifying the code to support video. I am sorry that I...
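A minimal frame-extraction sketch with OpenCV (`input.mp4` and `frames/` are placeholder paths):

```python
import os
import cv2

# Split an mp4 into numbered png frames so the image inference scripts
# can be run on them.
os.makedirs('frames', exist_ok=True)
cap = cv2.VideoCapture('input.mp4')
idx = 0
while True:
    ok, frame = cap.read()
    if not ok:
        break
    cv2.imwrite(os.path.join('frames', f'{idx:06d}.png'), frame)
    idx += 1
cap.release()
```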
Convert your dataset to the webdataset format, then specify the tar file location in the cfg file.
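A minimal conversion sketch with the `webdataset` package; the shard name, sample list, and the `jpg`/`json` keys are assumptions, so match them to whatever keys your dataloader expects:

```python
import json
import webdataset as wds

# (image path, annotation) pairs to pack; replace with your own dataset.
samples = [('img_000001.jpg', {'keypoints': []})]

with wds.TarWriter('train-0000.tar') as sink:
    for i, (img_path, ann) in enumerate(samples):
        with open(img_path, 'rb') as f:
            img_bytes = f.read()
        sink.write({
            '__key__': f'sample{i:06d}',       # unique key per sample
            'jpg': img_bytes,                  # raw jpeg bytes
            'json': json.dumps(ann).encode(),  # annotation as json bytes
        })
```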