8 * v100 parallel training
I can train normally on 2v100, but the following error occurred on 8v100. Have you encountered it? Thanks!
Sorry, but this seems not to be a problem of our code but a problem of multiprocessing in python. Could you please provide details about your running environment, the commands you're using, and any shell scripts involved?
Sorry, but this seems not to be a problem of our code but a problem of multiprocessing in python. Could you please provide details about your running environment, the commands you're using, and any shell scripts involved?
cuda=12.0,python=3.8, I ran the coco fully label experimental setup on 8 GPUs.
Sorry, but this seems not to be a problem of our code but a problem of multiprocessing in python. Could you please provide details about your running environment, the commands you're using, and any shell scripts involved?