ClassyVision icon indicating copy to clipboard operation
ClassyVision copied to clipboard

Implement resize and train XRayVideo A/V with only resizing

Open arjish opened this issue 3 years ago • 3 comments

Summary: We want to check whether training XRayVideo with simply video resizing (in addition to other existing transformation like horizontal flipping and normalization) without random corp is sufficient.

The resize dimension is used as 224*224.

workflow: f362077622 (Note: in the workflow fcc_mvit_dataset_v4p2_arkc.yaml is used which I renamed to fcc_mvit_dataset_v4p2_onlyresize.yaml in this diff.)

As can be seen, the validation MAP goes to around .422 as opposed to 0.46 when random resized crop is used (f355567669) and rest of the configuration is kept the same. Hence, it is better to keep random resized crop.

Differential Revision: D38522980

arjish avatar Aug 09 '22 18:08 arjish

This pull request was exported from Phabricator. Differential Revision: D38522980

facebook-github-bot avatar Aug 09 '22 18:08 facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D38522980

facebook-github-bot avatar Aug 09 '22 18:08 facebook-github-bot

This pull request was exported from Phabricator. Differential Revision: D38522980

facebook-github-bot avatar Aug 12 '22 14:08 facebook-github-bot