Luning Wang

Results 2 issues of Luning Wang

Hi, I encountered the error described in the title of this issue, while trying to run the gpt-2 example. Here is my command: ``` export CUDA_VISIBLE_DEVICES=7 torchrun --nnodes=1 --nproc_per_node=1 --rdzv_id=1...

An error occurred when I tried to run Mistral-7b-instruct-v0.2 model: ``` ... File "~/anaconda/envs/streamingllm/lib/python3.9/site-packages/transformers/models/mistral/modeling_mistral.py", line 88, in _make_sliding_window_causal_mask mask = torch.triu(mask, diagonal=-sliding_window) TypeError: bad operand type for unary -: 'NoneType'...