
0️⃣1️⃣🤗 BitNet-Transformers: Hugging Face Transformers implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch with the Llama(2) architecture

9 BitNet-Transformers issues

Training may be on GPU, but deployment has to be on CPU for high scalability.

I just want to reproduce the paper results. Since the paper uses only the BitNet Transformer, I wonder if I can replace the FC (nn.Linear) layers with this BitLinear in a Transformer.
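For context, a minimal sketch of swapping every nn.Linear in a model for BitLinear; the import path and the assumption that BitLinear is a drop-in nn.Linear replacement are mine, not necessarily this repo's exact API.

```python
# Minimal sketch: recursively swap every nn.Linear in a model for BitLinear.
# The import path and BitLinear signature are assumptions (a drop-in
# nn.Linear-compatible layer), not necessarily this repo's exact API.
import torch.nn as nn
from bit_llama import BitLinear  # hypothetical import path


def replace_linear_with_bitlinear(module: nn.Module) -> None:
    """Replace nn.Linear submodules with BitLinear, copying weights over."""
    for name, child in module.named_children():
        if isinstance(child, nn.Linear):
            bit = BitLinear(child.in_features, child.out_features,
                            bias=child.bias is not None)
            bit.weight.data.copy_(child.weight.data)
            if child.bias is not None:
                bit.bias.data.copy_(child.bias.data)
            setattr(module, name, bit)
        else:
            replace_linear_with_bitlinear(child)
```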

Hello. First of all, thank you for sharing the code. I have one question about your work. I am wondering if you checked the accuracy after training was completed. When...

Hi, I have a question about your BitLinear.forward() implementation. The BitNet paper says the output should take the form y = binarized_weight(W) @ AbsMaxQuant(LN(x)) * beta*gamma/Q_b (LN is...
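For reference, a minimal sketch of that equation as I read the paper, with beta = mean|W|, gamma = max|x|, and Q_b = 2^(b-1); this illustrates the formula itself, not necessarily what this repo's BitLinear.forward() does.

```python
# Sketch of the forward pass as described in the BitNet paper (b = 8 activation bits):
#   y = sign(W - mean(W)) @ AbsMaxQuant(LN(x)) * (beta * gamma / Q_b)
# This is an illustration of the equation, not this repo's exact implementation.
import torch
import torch.nn.functional as F


def bitlinear_forward(x, weight, eps=1e-5, bits=8):
    Q_b = 2 ** (bits - 1)
    # LayerNorm over the feature dimension (no affine parameters, for simplicity)
    x_norm = F.layer_norm(x, x.shape[-1:], eps=eps)
    # Absmax-quantize activations into [-Q_b + eps, Q_b - eps]
    gamma = x_norm.abs().max()
    x_q = torch.clamp(x_norm * Q_b / (gamma + eps), -Q_b + eps, Q_b - eps)
    # Binarize zero-centered weights to {-1, +1}; beta rescales the output
    beta = weight.abs().mean()
    w_bin = torch.sign(weight - weight.mean())
    # Dequantize the output with beta * gamma / Q_b
    return F.linear(x_q, w_bin) * (beta * gamma / Q_b)
```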

I took the code for BitLinearOptimized and added a small thing so I can run it standalone: `super(BitLinearOptimized, self).__init__(in_features, out_features, bias, dtype=torch.bfloat16)  # just added the right dtype`. Running the...
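For context, a hypothetical standalone run after that dtype change, assuming BitLinearOptimized behaves like nn.Linear with bfloat16 weights; the import path and sizes are placeholders.

```python
# Hypothetical standalone run after the dtype change above; assumes
# BitLinearOptimized behaves like nn.Linear with bfloat16 weights.
import torch
from bit_llama import BitLinearOptimized  # hypothetical import path

layer = BitLinearOptimized(512, 512, bias=False)    # dtype fix applied in __init__
x = torch.randn(2, 16, 512, dtype=torch.bfloat16)   # match the bfloat16 weights
y = layer(x)
print(y.shape, y.dtype)
```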

Huggingface -> Hugging Face

Just wondering when you were planning on implementing the BitLinear layer with true 1-bit weights and a custom CUDA kernel for the 1-bit weights? Super thirsty for the code, ha. Appreciate you.
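For reference, a rough pure-PyTorch sketch of the storage half of that request, packing sign-binarized weights into 1 bit each; computing directly on the packed format would still need the custom CUDA kernel, and all names here are illustrative.

```python
# Sketch of packing sign-binarized weights into 1 bit each (uint8 storage).
# This only covers storage; a custom CUDA kernel would be needed to compute
# on the packed format directly. Function names here are illustrative.
import torch
import torch.nn.functional as F


def pack_signs(weight: torch.Tensor) -> torch.Tensor:
    """Map {-1, +1} signs to bits and pack 8 of them per uint8 along the last dim."""
    bits = (torch.sign(weight) > 0).to(torch.uint8)  # 1 for +1, 0 for -1/0
    pad = (-bits.shape[-1]) % 8
    if pad:
        bits = F.pad(bits, (0, pad))
    bits = bits.reshape(*bits.shape[:-1], -1, 8)
    shifts = torch.arange(8, dtype=torch.uint8, device=bits.device)
    return (bits << shifts).sum(dim=-1).to(torch.uint8)


def unpack_signs(packed: torch.Tensor, n: int) -> torch.Tensor:
    """Inverse of pack_signs: recover a {-1, +1} float tensor of width n."""
    shifts = torch.arange(8, dtype=torch.uint8, device=packed.device)
    bits = (packed.unsqueeze(-1) >> shifts) & 1
    return bits.reshape(*packed.shape[:-1], -1)[..., :n].float() * 2 - 1
```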

I was testing in Colab, and when I ran `model.model.layers[0].mlp.gate_proj.weight`, I received very different results from yours. You got: Parameter containing: tensor([[ 0.0032, -0.0339, 0.0150, ..., 0.0041, -0.0048, 0.0061], [-0.0105,...
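For context, a sketch of that kind of inspection; the checkpoint name is a placeholder, and freshly initialized (untrained) weights will differ between runs unless the same pretrained checkpoint or seed is used.

```python
# Sketch of inspecting the first layer's gate_proj weights; the checkpoint
# name is a placeholder. Values will differ if the model is randomly
# initialized rather than loaded from the same pretrained checkpoint.
import torch
from transformers import LlamaForCausalLM

torch.manual_seed(0)  # only makes random initialization reproducible
model = LlamaForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")  # placeholder
print(model.model.layers[0].mlp.gate_proj.weight)
```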