IVparagigm
1) In `Videodataset.py`, class `Videodataset`, method `_make_dataset()`: `frame_indices = list(range(segment[0], segment[1]))`, followed by `sample = { 'video': video_path, 'segment': segment, 'frame_indices': frame_indices, 'video_id': video_ids[i], 'label': label_id }`. And I got the annotation like...
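For anyone following along, here is a minimal standalone sketch of how that sample dict gets built. The helper name `make_sample` and the example values are hypothetical (they are not from the repository); only the two statements quoted above are taken from the code. Note that `range` is half-open, so `segment[1]` itself is not included in `frame_indices`.

```python
def make_sample(video_path, segment, video_id, label_id):
    """Build one dataset entry (hypothetical helper, for illustration only)."""
    # range() excludes the end value, so segment[1] is NOT in the list.
    frame_indices = list(range(segment[0], segment[1]))
    return {
        'video': video_path,
        'segment': segment,
        'frame_indices': frame_indices,
        'video_id': video_id,
        'label': label_id,
    }


# Made-up values, just to show the shape of the result.
sample = make_sample('videos/example', (1, 5), 'example', 0)
print(sample['frame_indices'])
```

So if the annotation's segment end is meant to be inclusive, the last frame would be dropped with this indexing.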
1) Yes, what I mentioned is in this file: `frame_indices = self.data[index]['frame_indices']`, then `if self.temporal_transform is not None: frame_indices = self.temporal_transform(frame_indices)`, then `clip = self.__loading(path, frame_indices)`. `self.temporal_transform` turns `frame_indices` into the...
> Due to the strides and max-pooling in I3D, it temporally downsamples the video by a factor of 8. So if you input 64 frames, you'll get 8 (temporal) output...
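To make the arithmetic in that quote concrete, here is a rough sketch. The function name `temporal_output_length` is hypothetical, and floor division is an assumption: the exact output length for frame counts that are not a multiple of 8 depends on the padding and rounding inside the network.

```python
def temporal_output_length(num_frames: int, factor: int = 8) -> int:
    """Approximate number of temporal outputs after downsampling by `factor`.

    Assumes plain floor division; the real model's padding may round differently.
    """
    if num_frames <= 0 or factor <= 0:
        raise ValueError("num_frames and factor must be positive")
    return num_frames // factor


print(temporal_output_length(64))  # 64 frames / 8 = 8 temporal outputs
```

Under that assumption, each temporal output position corresponds to roughly 8 input frames.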