AutoTransition icon indicating copy to clipboard operation
AutoTransition copied to clipboard

LOADER配置读取问题

Open qizailiu opened this issue 3 years ago • 4 comments

Traceback (most recent call last): File "tools/run_net.py", line 60, in main() File "tools/run_net.py", line 28, in main mp.spawn(run_worker, args=(cfg, train), nprocs=cfg.SYS.NUM_GPU) File "/root/anaconda3/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 230, in spawn return start_processes(fn, args, nprocs, join, daemon, start_method='spawn') File "/root/anaconda3/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 188, in start_processes while not context.join(): File "/root/anaconda3/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 150, in join raise ProcessRaisedException(msg, error_index, failed_process.pid) torch.multiprocessing.spawn.ProcessRaisedException:

-- Process 0 terminated with the following error: Traceback (most recent call last): File "/root/anaconda3/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 59, in _wrap fn(i, *args) File "VideoTransitionEffects/AutoTransition-main/tools/run_net.py", line 55, in run_worker func(cfg) File "VideoTransitionEffects/AutoTransition-main/tools/train_net.py", line 111, in train dataloader: dict = build_loader(cfg=cfg, mode=("train", "val")) File "VideoTransitionEffects/AutoTransition-main/data/build.py", line 25, in build_loader assert not cfg.LOADER.USE_DALI or cfg.SYS.NUM_GPU > 0, "DALI dataloader need GPU to work properly" File "/root/anaconda3/lib/python3.7/site-packages/yacs/config.py", line 141, in getattr raise AttributeError(name) AttributeError: USE_DALI

cfg.LOADER.USE_DALI中看到没有相关的配置,请问一下这个逻辑是否必要的。

qizailiu avatar Aug 28 '22 05:08 qizailiu

我把这行注释掉了,就可以正常运行了。

cgnerds avatar Aug 29 '22 09:08 cgnerds

Traceback (most recent call last): File "tools/run_net.py", line 60, in main() File "tools/run_net.py", line 28, in main mp.spawn(run_worker, args=(cfg, train), nprocs=cfg.SYS.NUM_GPU) File "/root/anaconda3/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 230, in spawn return start_processes(fn, args, nprocs, join, daemon, start_method='spawn') File "/root/anaconda3/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 188, in start_processes while not context.join(): File "/root/anaconda3/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 150, in join raise ProcessRaisedException(msg, error_index, failed_process.pid) torch.multiprocessing.spawn.ProcessRaisedException:

-- Process 0 terminated with the following error: Traceback (most recent call last): File "/root/anaconda3/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 59, in _wrap fn(i, *args) File "VideoTransitionEffects/AutoTransition-main/tools/run_net.py", line 55, in run_worker func(cfg) File "VideoTransitionEffects/AutoTransition-main/tools/train_net.py", line 111, in train dataloader: dict = build_loader(cfg=cfg, mode=("train", "val")) File "VideoTransitionEffects/AutoTransition-main/data/build.py", line 25, in build_loader assert not cfg.LOADER.USE_DALI or cfg.SYS.NUM_GPU > 0, "DALI dataloader need GPU to work properly" File "/root/anaconda3/lib/python3.7/site-packages/yacs/config.py", line 141, in getattr raise AttributeError(name) AttributeError: USE_DALI

cfg.LOADER.USE_DALI中看到没有相关的配置,请问一下这个逻辑是否必要的。

Sorry for this mistake, we have imported DALI for performance test before. We will remove this in the next update. Thanks for your feedback!

很抱歉,我们在之前做性能测试的时候使用了DALI。这个判断是没必要的,之后更新会删除掉。感谢你的反馈!

[Fixed]

Yaojie-Shen avatar Aug 29 '22 11:08 Yaojie-Shen

感谢大佬,但是我在我的环境运营依然报错。我感觉是版本的问题,请问方便更新一下requirement中依赖包的版本嘛。 我是在默认单机四卡的运营环境。目前还没有定位出来问题。

qizailiu avatar Aug 31 '22 09:08 qizailiu

我把这行注释掉了,就可以正常运行了。

按照文档,有段错误。你那边遇到过吗请问

Traceback (most recent call last): File "tools/run_net.py", line 65, in main() File "tools/run_net.py", line 32, in main mp.spawn(run_worker, args=(cfg, train), nprocs=cfg.SYS.NUM_GPU) File "/root/anaconda3/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 230, in spawn return start_processes(fn, args, nprocs, join, daemon, start_method='spawn') File "/root/anaconda3/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 188, in start_processes while not context.join(): File "/root/anaconda3/lib/python3.7/site-packages/torch/multiprocessing/spawn.py", line 136, in join signal_name=name torch.multiprocessing.spawn.ProcessExitedException: process 0 terminated with signal SIGSEGV

一直没有找到解决方法

qizailiu avatar Sep 01 '22 08:09 qizailiu