BitNet
error loading model: PrefetchVirtualMemory unavailable
When I run the command "python run_inference.py -m models/ggml-model-i2_s.gguf", an error occurs. It prints "llama_model_load: error loading model: PrefetchVirtualMemory unavailable". Does anyone know why?
Can you tell us your environment and the model you are using?
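One way to gather those details is a short script like the one below. It is only a sketch using the Python standard library. The helper name `describe_environment` is hypothetical and not part of BitNet, and the default model path is simply the one from the command in the report.

```python
import platform


def describe_environment(model_path="models/ggml-model-i2_s.gguf"):
    """Collect basic details worth pasting into a bug report.

    Hypothetical helper, not part of BitNet. The model path is only
    echoed back so it appears next to the OS details.
    """
    return {
        "os": platform.system(),            # e.g. Windows, Linux, Darwin
        "os_release": platform.release(),   # OS release string
        "os_version": platform.version(),   # detailed build/version string
        "machine": platform.machine(),      # CPU architecture
        "python": platform.python_version(),
        "model": model_path,
    }


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

Pasting the output into the thread, together with the compiler and build steps used, should give maintainers enough to start narrowing the problem down.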