Regarding acceleration in evaluating FPS
It appears that acceleration is used when evaluating FPS, which assumes fixed camera extrinsics and intrinsics for each sample. However, in the NuScenes dataset, camera parameters change even between frames. I suspect the appropriateness of making this assumption when measuring FPS, especially since other works don't seem to make this assumption.
In fact, we make this assumption following bevdet. Moreover, we believe that in practice, the extrinsics and intrinsics are fixed, so this assumption is reasonable, despite the performance degradation for nuscenes.
In fact, we make this assumption following bevdet. Moreover, we believe that in practice, the extrinsics and intrinsics are fixed, so this assumption is reasonable, despite the performance degradation for nuscenes.
Thanks for your response. Do you know why the camera parameters change between frames in nuscenes. Are they calibrated frame by frame?
In fact, we make this assumption following bevdet. Moreover, we believe that in practice, the extrinsics and intrinsics are fixed, so this assumption is reasonable, despite the performance degradation for nuscenes.
Thanks for your response. Do you know why the camera parameters change between frames in nuscenes. Are they calibrated frame by frame?
I don't think they calibrated it frame by frame but clip by clip. The dataset was recorded collectively from both Singapore and Boston, so the extrinsic may vary as they shipped the vehicle or they changed vehicle. It might be a more systematic way to attach intrins&extrins to each frame instead of adding extra tag to tell us when they recalibrated the cameras.
In fact, we make this assumption following bevdet. Moreover, we believe that in practice, the extrinsics and intrinsics are fixed, so this assumption is reasonable, despite the performance degradation for nuscenes.
Thanks for your response. Do you know why the camera parameters change between frames in nuscenes. Are they calibrated frame by frame?
I don't think they calibrated it frame by frame but clip by clip. The dataset was recorded collectively from both Singapore and Boston, so the extrinsic may vary as they shipped the vehicle or they changed vehicle. It might be a more systematic way to attach intrins&extrins to each frame instead of adding extra tag to tell us when they recalibrated the cameras.
Apologies for any confusion, and thank you for your patience. I have carefully checked, and it turns out that the camera parameters in the NuScenes dataset are indeed consistent across all scenes.