Error in outputs, batch_y = accelerator.gather_for_metrics((outputs, batch_y))
Hi, has anyone encountered this error? (I tried to increase batch_size to 64, 128) Training runs, but when it comes to vali_loss, vali_mae_loss = vali(args, accelerator, model, vali_data, vali_loader, criterion, mae_metric) in this row outputs, batch_y = accelerator.gather_for_metrics((outputs, batch_y)) I get the following error:
File ".local/lib/python3.11/site-packages/accelerate/accelerator.py", line 2242, in gather_for_metrics
data = self.gather(input_data)
^^^^^^^^^^^^^^^^^^^^^^^
File ".local/lib/python3.11/site-packages/accelerate/accelerator.py", line 2205, in gather
return gather(tensor)
^^^^^^^^^^^^^^
File ".local/lib/python3.11/site-packages/accelerate/utils/operations.py", line 378, in wrapper
return function(*args, **kwargs)
^^^^^^^^^^^^^^^^^^^^^^^^^
File ".local/lib/python3.11/site-packages/accelerate/utils/operations.py", line 439, in gather
return _gpu_gather(tensor)
^^^^^^^^^^^^^^^^^^^
File ".local/lib/python3.11/site-packages/accelerate/utils/operations.py", line 358, in _gpu_gather
return recursively_apply(_gpu_gather_one, tensor, error_on_other_type=True)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File ".local/lib/python3.11/site-packages/accelerate/utils/operations.py", line 107, in recursively_apply
return honor_type(
^^^^^^^^^^^
File ".local/lib/python3.11/site-packages/accelerate/utils/operations.py", line 81, in honor_type
return type(obj)(generator)
^^^^^^^^^^^^^^^^^^^^
File ".local/lib/python3.11/site-packages/accelerate/utils/operations.py", line 110, in