BitNet
Official inference framework for 1-bit LLMs
Dear Authors, thanks for this amazing project. When I tested the BitNet inference kernel on an RTX 3090 under Ubuntu, I followed the commands in README.md, but I got...
### Description
This pull request fixes a grammatical error and improves the clarity of a sentence in the `README.md` file.
### Changes Made
- In the "Benchmark" section, the sentence...
Why not use a lightweight all-in-one Python tool like uv? Conda is too bloated.
When I try to download the model in order to compile it, the download stalls in this state: (miniconda-bitnet-p3_9) bitnet@BitNet:~/BitNet$ hf download microsoft/BitNet-b1.58-2B-4T-gguf --local-dir models/BitNet-b1.58-2B-4T Fetching 3 files: 0%| | 0/3 [00:00
Hey all! I am attempting to run and benchmark TL1 inference on my Mac M1 Pro. After successfully following the build guide, I ran `python setup_env.py...
== Running in interactive mode. == - Press Ctrl+C to interject at any time. - Press Return to return control to the AI. - To return control without starting a...
On Apple M2 silicon, the BitNet code appears to trigger an LLVM optimization bug. Others have reported this as the build running for hours: issues #251, #260, etc....
I noticed that we’ve defined several custom functions for BitNet inference. However, I’m curious: how does llama.cpp know when and where to call these functions? Specifically, how are these BitNet-related functions...
Before submitting this pull request, check the changes.