BitNet

Official inference framework for 1-bit LLMs

Results 227 BitNet issues
Sort by recently updated

I see the inference code - Can you please share the training code?

Error information: ggml/src/ggml.c:21302: GGML_ASSERT(0 <= info->type && info->type < GGML_TYPE_COUNT) failed. Error occurred while running command: Command '['3rdparty/llama.cpp/build/bin/llama-cli', '-m', 'models/BitNet-b1.58-2B-4T/ggml-model-i2_s.gguf', '-n', '128', '-t', '2', '-p', 'You are a helpful assistant', '-ngl',...

As other open issues (#158 and #180) mention, the compilation process stops after `ggml-bitnet-lut.cpp`, as `logs/compile.log` shows. I'm trying to build the BitNet project on macOS 15.4.1, and tried...

Hello. The README advises launching the setup_env Python script, which recompiles the llama-cpp-cli executable. I want to easily integrate the new optimized llama-cpp into a Gradio or Streamlit Python app:...

I used Intel's AI Boost, and it's too slow; NPU usage is 37%.

```shell
cd .\transformers-bitnet
pip install .
pip install torch accelerate intel-npu-acceleration-library gradio
```

```python
import torch
import torch._dynamo
torch._dynamo.config.suppress_errors...
```

I would like to keep this model loaded in memory instead of reloading it for each run, and I can't run it from the command line. That would be useful. Is there a...

This pull request introduces a new script, run_inference_server.py, which leverages llama.cpp's built-in server for more convenient and efficient inference. The script is designed to start the server with various configurable...
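The idea in this pull request, keeping the model resident by starting llama.cpp's built-in server once, can be sketched as a small command builder. This is a minimal sketch and not the contents of run_inference_server.py: the function name `build_server_command`, its defaults, and the assumption that a `llama-server` binary sits next to `llama-cli` in `3rdparty/llama.cpp/build/bin/` are all hypothetical. The flags `-m`, `-t`, `-c`, `--host`, and `--port` are standard llama.cpp server options.

```python
import shlex


def build_server_command(
    model_path,
    host="127.0.0.1",
    port=8080,
    threads=2,
    ctx_size=2048,
    # Assumption: the server binary is built alongside llama-cli.
    server_bin="3rdparty/llama.cpp/build/bin/llama-server",
):
    """Return the argument list for starting a llama.cpp server process."""
    return [
        server_bin,
        "-m", model_path,      # GGUF model file to load once at startup
        "-t", str(threads),    # number of CPU threads
        "-c", str(ctx_size),   # context window size in tokens
        "--host", host,        # interface to bind
        "--port", str(port),   # TCP port to listen on
    ]


# Build (but do not run) the command for the model path used elsewhere on this page.
command = build_server_command("models/BitNet-b1.58-2B-4T/ggml-model-i2_s.gguf")
print(shlex.join(command))
```

Passing the resulting list to `subprocess.Popen` would start the server as a long-lived process, so a Gradio or Streamlit app could send HTTP requests to it instead of reloading the model for every prompt.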

When I ran the command "python setup_env.py -md models/BitNet-b1.58-2B-4T -q i2_s" it got stuck after displaying the message "INFO:root:Compiling the code using CMake." info: Apple M4 Pro cmake version 4.0.1...

This feels like a waste of a good pull request. Microsoft configured their newly released bitnet model as using the `BitNetForCausalLM` architecture ([see](https://huggingface.co/microsoft/bitnet-b1.58-2B-4T-bf16/blob/main/config.json#L3)) while the conversion script in `utils` and...
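A mismatch of this kind can be spotted before conversion by reading the `architectures` field from the model's `config.json`. The following is a minimal sketch under stated assumptions: the helper names `read_architectures` and `is_supported` are hypothetical, and the set of names a converter accepts must be supplied by the caller, since the snippet above is truncated before it says what the conversion script expects.

```python
import json
from pathlib import Path


def read_architectures(config_path):
    """Return the list of architecture names declared in a Hugging Face config.json."""
    config = json.loads(Path(config_path).read_text(encoding="utf-8"))
    return list(config.get("architectures", []))


def is_supported(config_path, supported):
    """Return True if any declared architecture is in the caller-supplied set."""
    return any(name in supported for name in read_architectures(config_path))
```

Calling `is_supported("config.json", {"BitNetForCausalLM"})` returns `True` only when the checkpoint declares that architecture name; a converter that recognizes a different spelling would report `False` for the same file.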