global_translate_noise in CenterForm is different from that in CenterPoint.
Interesting work!
The translation of data aug in CenterForm is 0.5,
https://github.com/TuSimple/centerformer/blob/master/configs/waymo/voxelnet/waymo_centerformer.py#L132, while the translation in CenterPoint is 0. Also, I noticed that you used the np.random.uniform rather than np.random.normal like rotation and scale parameters. Could you explain the motivation of these modification and performance influence about them?
- In the newer version of Centerpoint, the global translation augmentation is also used: https://github.com/tianweiy/CenterPoint/blob/c4b4e0fc97ec6d42d6a31f559816d3305abdf750/configs/waymo/voxelnet/waymo_centerpoint_voxelnet_three_sweeps_20e_iou_pred.py#L116
- I used
np.random.uniformbased on other published papers and haven't triednp.random.normal. I don't think this will cause a big difference, but it could be worthwhile to try.
some sonfused... Do these data augmenting strategies ensure consistency between sequential frames? How exactly is the copy-paste strategy designed between sequential frames?
some sonfused... Do these data augmenting strategies ensure consistency between sequential frames? How exactly is the copy-paste strategy designed between sequential frames?
The pasted object will be added to all frames in the same way (same location and augmentation noises etc.). I just assume it is a static object in the scene. https://github.com/TuSimple/centerformer/blob/5a949b88ed7bb15aafb39bf78c95f1452063ebea/det3d/datasets/pipelines/preprocess_multiframe.py#L136-L141
some sonfused... Do these data augmenting strategies ensure consistency between sequential frames? How exactly is the copy-paste strategy designed between sequential frames?
The pasted object will be added to all frames in the same way (same location and augmentation noises etc.). I just assume it is a static object in the scene.
https://github.com/TuSimple/centerformer/blob/5a949b88ed7bb15aafb39bf78c95f1452063ebea/det3d/datasets/pipelines/preprocess_multiframe.py#L136-L141
I see, but if the network has velocity prediction branch, the static object assumption may confused the network, or maybe you have already set the velocity of the paste objects in gt_target to 0? By the way, why not use the velocity of the obejct label to figure out where the obejct is in the history frame and paste on it? Is it worth a try?