[KeyError: 542] Running SRe2L/validate/train_FKD.py with the given config after modifying PyTorch code
cd /home/zhanglf/lzw/code/SRe2L_ ; /usr/bin/env /home/zhanglf/anaconda3/envs/iid/bin/python /home/zhanglf/.vscode-server/extensions/ms-python.debugpy-2024.8.0-linux-x64/bundled/libs/debugpy/adapter/../../debugpy/launcher 35311 -- /home/zhanglf/lzw/code/SRe2L_/SRe2L/validate/train_FKD.py --batch-size 1000 --gradient-accumulation-steps 2 --model resnet18 --cos -j 4 -T 20 --mix-type cutmix --output-dir ./save/val_rn18_fkd/rn18_\[4K\]_T20/ --train-dir /home/zhanglf/lzw/code/SRe2L_/syn_data/rn18_bn0.01_\[4K\]_x_l2_x_tv.crop --val-dir /data/ImageNet/val --fkd-path /home/zhanglf/lzw/code/SRe2L_/FKD_cutmix_fp16
wandb: Currently logged in as: lsy. Use `wandb login --relogin` to force relogin
wandb: Tracking run with wandb version 0.16.6
wandb: Run data is saved locally in /home/zhanglf/lzw/code/SRe2L_/wandb/run-20240903_080243-ltj9xxlo
wandb: Run `wandb offline` to turn off syncing.
wandb: Syncing run generous-monkey-5
wandb: ⭐️ View project at https://wandb.ai/lsy/Temperature
wandb: 🚀 View run at https://wandb.ai/lsy/Temperature/runs/ltj9xxlo
======= FKD: dataset info ======
path: /home/zhanglf/lzw/code/SRe2L_/FKD_cutmix_fp16
num img: 1476
batch size: 1000
max epoch: 300
load data successfully
=> loading student model 'resnet18'
/home/zhanglf/anaconda3/envs/iid/lib/python3.9/site-packages/torchvision/models/_utils.py:208: UserWarning: The parameter 'pretrained' is deprecated since 0.13 and will be removed in 0.15, please use 'weights' instead.
warnings.warn(
/home/zhanglf/anaconda3/envs/iid/lib/python3.9/site-packages/torchvision/models/_utils.py:223: UserWarning: Arguments other than a weight enum or None for 'weights' are deprecated since 0.13 and will be removed in 0.15. The current behavior is equivalent to passing weights=None.
warnings.warn(msg)
Epoch: 0
Traceback (most recent call last):
File "/home/zhanglf/anaconda3/envs/iid/lib/python3.9/runpy.py", line 197, in _run_module_as_main
return _run_code(code, main_globals, None,
File "/home/zhanglf/anaconda3/envs/iid/lib/python3.9/runpy.py", line 87, in _run_code
exec(code, run_globals)
File "/home/zhanglf/.vscode-server/extensions/ms-python.debugpy-2024.8.0-linux-x64/bundled/libs/debugpy/adapter/../../debugpy/launcher/../../debugpy/main.py", line 39, in
Could you print the values of self.epoch and img_idx to provide more information for debugging?
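One way to capture these values is to wrap the failing dictionary lookup so that the KeyError itself reports them. The snippet below is only a minimal sketch: `lookup_with_context`, `mapping`, and `batch_config` are hypothetical names used for illustration and are not taken from train_FKD.py; only `self.epoch` and `img_idx` come from this thread.

```python
def lookup_with_context(mapping, img_idx, epoch):
    """Return mapping[img_idx], or raise a KeyError that carries debugging context.

    `mapping` is a hypothetical stand-in for whatever dict is indexed by img_idx.
    """
    try:
        return mapping[img_idx]
    except KeyError:
        keys = sorted(mapping)
        # Report the missing index, the epoch, and a preview of the valid keys.
        raise KeyError(
            f"img_idx={img_idx} not found at epoch={epoch}; "
            f"{len(keys)} keys available, first few: {keys[:5]}"
        ) from None


# Hypothetical usage: a small dict that does not contain index 542.
batch_config = {0: "a", 1: "b", 2: "c"}
try:
    lookup_with_context(batch_config, 542, epoch=0)
except KeyError as err:
    print(err.args[0])
```

Seeing how many keys exist next to the missing index can hint at whether the stored data and the current run settings disagree.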
hi, thanks for replying
Epoch: 0 self.epoch: 0 img_idx: 542
I can't reproduce your error. Did you follow the instructions at https://github.com/VILA-Lab/SRe2L/tree/main/SRe2L/validate to run the experiments? Or did you modify any settings, such as batch-size? Please share more details so I can figure out why the error happened.
OK, I will provide more info. Thanks for the reply.