centerformer global_translate_noise in CenterForm is different from that in CenterPoint.

Interesting work! The translation of data aug in CenterForm is 0.5, https://github.com/TuSimple/centerformer/blob/master/configs/waymo/voxelnet/waymo_centerformer.py#L132, while the translation in CenterPoint is 0. Also, I noticed that you used the np.random.uniform rather than np.random.normal like rotation and scale parameters. Could you explain the motivation of these modification and performance influence about them？

Nov 21 '22 08:11 JingweiZhang12

In the newer version of Centerpoint, the global translation augmentation is also used: https://github.com/tianweiy/CenterPoint/blob/c4b4e0fc97ec6d42d6a31f559816d3305abdf750/configs/waymo/voxelnet/waymo_centerpoint_voxelnet_three_sweeps_20e_iou_pred.py#L116
I used np.random.uniform based on other published papers and haven't tried np.random.normal. I don't think this will cause a big difference, but it could be worthwhile to try.

Nov 22 '22 04:11 edwardzhou130

some sonfused... Do these data augmenting strategies ensure consistency between sequential frames? How exactly is the copy-paste strategy designed between sequential frames?

Jan 02 '23 04:01 Liaoqing-up

some sonfused... Do these data augmenting strategies ensure consistency between sequential frames? How exactly is the copy-paste strategy designed between sequential frames?

The pasted object will be added to all frames in the same way (same location and augmentation noises etc.). I just assume it is a static object in the scene. https://github.com/TuSimple/centerformer/blob/5a949b88ed7bb15aafb39bf78c95f1452063ebea/det3d/datasets/pipelines/preprocess_multiframe.py#L136-L141

Jan 02 '23 06:01 edwardzhou130

some sonfused... Do these data augmenting strategies ensure consistency between sequential frames? How exactly is the copy-paste strategy designed between sequential frames?

The pasted object will be added to all frames in the same way (same location and augmentation noises etc.). I just assume it is a static object in the scene.

https://github.com/TuSimple/centerformer/blob/5a949b88ed7bb15aafb39bf78c95f1452063ebea/det3d/datasets/pipelines/preprocess_multiframe.py#L136-L141

I see, but if the network has velocity prediction branch, the static object assumption may confused the network, or maybe you have already set the velocity of the paste objects in gt_target to 0? By the way, why not use the velocity of the obejct label to figure out where the obejct is in the history frame and paste on it? Is it worth a try?

Jan 02 '23 07:01 Liaoqing-up