ccdv-ai issues

Results: 9 issues of ccdv-ai

Fail to load a tokenizer (CroissantLLM)

Trying to run the [colab](https://colab.research.google.com/drive/19lwcRk_ZQ_ZtX-qzFP3qZBBHZNcMD1hh?usp=sharing#scrollTo=2eSvM9zX_2d3) using a small model:

```python
from unsloth import FastLanguageModel
import torch
max_seq_length = 2048 # Gemma sadly only supports max 8192 for now
dtype =...
```

Bad generation with GGUF and OpenAI api

Hi, I tried to generate some text using a [mixtral instruct GGUF model](https://huggingface.co/TheBloke/Mixtral-8x7B-Instruct-v0.1-GGUF), but the model only predicts nonsense. Something is wrong with either the tokenizer or the chat template....

[Bug]: "Prompt logprob is not supported by multi step workers" for ngram speculative decoding

### Your current environment

```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.04.4...
```

bug

Edge case for local dataset loading if its a folder

### ⚠️ Please check that this feature request hasn't been suggested before.

- [X] I searched previous [Ideas in Discussions](https://github.com/axolotl-ai-cloud/axolotl/discussions/categories/ideas) didn't find any similar feature requests.
- [X] I searched...

enhancement

Pass `trust_remote_code` to `load_dataset(...)` for `datasets>=2.20.0`

enhancement

help wanted

good first issue

Support loading a local hf dataset with `load_dataset`

### ⚠️ Please check that this feature request hasn't been suggested before.

- [X] I searched previous [Ideas in Discussions](https://github.com/OpenAccess-AI-Collective/axolotl/discussions/categories/ideas) didn't find any similar feature requests.
- [X] I searched...

enhancement

Can't use `chat_template: phi_3` with `type: sharegpt`

Liger kernel support

Difference between encoder_only and decoder_only?