yirunwang

Results: 1 issue by yirunwang

A single GPU works fine, but the system hangs when I use multiple GPUs. Can someone help solve this? Thanks.

python build.py --model_dir meta-llama/Llama-2-7b-chat-hf \
    --dtype float16 \
    --remove_input_padding \
    --use_gpt_attention_plugin float16 \...

triaged