dbrx
dbrx copied to clipboard
Code examples and resources for DBRX, a large language model developed by Databricks
The tech report described the methodology of the inference efficiency measurement but not in detail. It compared the Llama2-70B and DBRX. We have great interests in the comparison. So we...
``` File "/home/paas/vllm/vllm/engine/llm_engine.py", line 222, in _init_tokenizer self.tokenizer: BaseTokenizerGroup = get_tokenizer_group( File "/home/paas/vllm/vllm/transformers_utils/tokenizer_group/__init__.py", line 20, in get_tokenizer_group return TokenizerGroup(**init_kwargs) File "/home/paas/vllm/vllm/transformers_utils/tokenizer_group/tokenizer_group.py", line 23, in __init__ self.tokenizer = get_tokenizer(self.tokenizer_id, **tokenizer_config) File...
Thanks for your great efforts first. I read the PR you opened in the TensorRT-LLM repo and noticed that EP +TP, PP + TP, and TP are supported during inference....
I can load the instruct model using the transformers loader and 8bit bits and bytes, I can get it to load evenly among multiple gpus. However, I cannot seem to...
HuggingFace -> Hugging Face
Corrected Text formatting
The most models' pretrainModel class attribute _supports_sdpa are True, why DBRX set False?
* Copy .env_example to .env and insert your huggingface token * docker compose up * wait
When evaluating humaneval, does dbrx use some specical prompts to improve performance?
just a space between words