
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

58 issues

Hi all, thank you for your awesome work! Is it possible to integrate Medusa into [Whisper's decoder](https://huggingface.co/openai/whisper-large-v2) to speed up decoding? Do you have any plans to support Whisper? Thanks in...

@ctlllll Please provide a Dockerfile for Medusa. I had to resolve a lot of errors while doing the setup. It would be good to have a containerized environment that supports both training and inference. Its...

I checked. They are alternate.
```
  File "/root/Medusa/medusa/train/train_legacy.py", line 183, in preprocess
    prompt = tokenizer.apply_chat_template(conversation, tokenize=False)
  File "/root/miniconda3/envs/fschat/lib/python3.9/site-packages/transformers/tokenization_utils_base.py", line 1743, in apply_chat_template
    rendered = compiled_template.render(
  File "/root/miniconda3/envs/fschat/lib/python3.9/site-packages/jinja2/environment.py", line 1304, in render
...
```

How to use a fine-tuned Mistral model for inference with Medusa?

When using Axolotl, the training loss drops to 0 after the gradient accumulation steps. Is this expected behaviour? With torchrun, the training loss is consistently NaN. Thanks for the help!!...

There is a slight bug in the preprocess function that leaves the content of targets identical to input_ids. Relevant modifications are proposed here: https://github.com/FasterDecoding/Medusa/pull/83#discussion_r1582080343
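For context, if targets end up identical to input_ids, the training loss is also computed on prompt tokens instead of only on the response. A typical fix masks the prompt positions in targets with the loss-ignore index. A minimal sketch of that idea (not the actual Medusa code; the function name and prompt-length handling here are illustrative):

```python
import copy

# PyTorch's CrossEntropyLoss skips positions labeled with ignore_index=-100 by default.
IGNORE_TOKEN_ID = -100

def mask_prompt(input_ids, prompt_len):
    """Return targets that differ from input_ids: prompt tokens carry no loss."""
    targets = copy.deepcopy(input_ids)
    targets[:prompt_len] = [IGNORE_TOKEN_ID] * prompt_len
    return targets

# Example: a 3-token prompt followed by a 2-token answer.
ids = [101, 7592, 2088, 3437, 102]
print(mask_prompt(ids, 3))  # [-100, -100, -100, 3437, 102]
```

After this change, targets and input_ids genuinely differ, and the loss only trains the model on the answer tokens.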

Does the Medusa-1 model generate tokens identically to the base model without Medusa heads? I found that changing the Medusa choices changes the output.

I got an error when following https://github.com/FasterDecoding/Medusa to prepare to run the demo. 1. The basic environment was installed without any errors. ``` git clone https://github.com/FasterDecoding/Medusa.git...

Hi there, thank you for the great work! I have a problem. In the Google Colab environment: ``` !git clone https://github.com/FasterDecoding/Medusa.git %cd Medusa !pip install -e . !python -m medusa.inference.cli...

I just tested gen_model_answer_baseline.py and gen_model_answer_medusa.py. The Medusa version generates normally, but there are some problems with gen_model_answer_baseline.py. Can you run this .py file normally?