Jasmine
Results
2
issues of
Jasmine
Thanks for sharing your code. I use your code to train on UCF101 with the suggested hyper-parameters (i.e., lr=0.001, trans_linear_out_dim=1152, img_size=224, tasks_per_batch =16, num_test_tasks=10000) and the same data loader. However,...
The training of the first layer of HCTransformer requires a lot of computational resources overhead. Limited by computing resources, I can only reproduce your results on a single Nvidia 3090...