BitNet

Official inference framework for 1-bit LLMs

Results 227 BitNet issues

This modification fixes a wrong-size memory allocation (#143).

This modification fixes some data type inconsistencies (#161).

1. In the ggml_compute_forward_mul_mat() function in ggml.c, ggml_qgemm_lut() is executed first, which I think is an accumulation operation.
2. Subsequently, in the ggml_compute_forward_mul_mat_one_chunk() function, ggml_vec_dot_i2_i8_s() is executed, which performs a...

A new SOTA BitNet model, Bonsai 0.5B, has come out. It seems to outperform larger BitNet models such as Falcon 1B and 3B and TriLM 700M. It seems like they are going to release a...

I encountered this on an Orange Pi 5 Plus (RK3588) running Ubuntu 22.04 LTS. The build environment (in a Conda env) is:
- Python 3.9.21
- CMake 3.31.2
- Clang 18.1.8...

There are some declarations of the build mode in the `setup_env.py` script. Hence, I suggest adding the following code to it. # Additional argument ## Target: https://github.com/microsoft/BitNet/blob/caf17ec438808bcf9b3da012216c8624a29b8cdc/setup_env.py#L205 ```python parser.add_argument("--build-mode", type=str, choices=["Release",...
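
The suggestion above is cut off, so the following is only a minimal sketch of what a `--build-mode` option could look like with `argparse`. It is not the code proposed in the issue. The `"Debug"` choice, the `"Release"` default, and the helper names `build_parser` and `cmake_build_type_flag` are all assumptions made for illustration. The only outside fact relied on is that `CMAKE_BUILD_TYPE` is a standard CMake variable.

```python
import argparse

# Hypothetical choices: the original list is truncated after "Release",
# so "Debug" is an assumption used purely for illustration.
BUILD_MODES = ["Release", "Debug"]


def build_parser():
    """Return a parser exposing a --build-mode option (illustrative sketch)."""
    parser = argparse.ArgumentParser(description="Sketch of a --build-mode option")
    parser.add_argument(
        "--build-mode",
        type=str,
        choices=BUILD_MODES,
        default="Release",  # assumed default, not taken from the issue
        help="Build type to forward to CMake",
    )
    return parser


def cmake_build_type_flag(build_mode):
    """Format the selected mode as a CMake command-line definition."""
    # CMAKE_BUILD_TYPE is the standard CMake variable for single-config generators.
    return f"-DCMAKE_BUILD_TYPE={build_mode}"
```

With these assumptions, `build_parser().parse_args(["--build-mode", "Debug"])` yields a namespace whose `build_mode` attribute can be passed to `cmake_build_type_flag`; `argparse` rejects any value outside the `choices` list.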

We built BitNet.cpp from source using clang. However, we could not build it using a clang version > 18. According to the README, we can install clang...

int16x8_t vec_c[4];
int8x16x2_t vec_v_left_0 = vzipq_s8(vec_v_0_left_tmp1, vec_v_0_left_tmp0);
int8x16x2_t vec_v_right_0 = vzipq_s8(vec_v_0_right_tmp1, vec_v_0_right_tmp0);
vec_c[0] += vec_v_left_0.val[0];
vec_c[0] += vec_v_right_0.val[0];
vec_c[1] += vec_v_left_0.val[1];
vec_c[1] += vec_v_right_0.val[1];

When I built BitNet.cpp from source using gcc, some data type inconsistency errors occurred. However, these errors did not occur when I used clang. I investigated the root...

I installed Debian on my Android phone using Termux, and then followed the instructions step by step until I ran `python3 setup_env.py -md models/Falcon3-7B-Instruct-1.58bit -q i2_s` the following...