Medusa issues

The implementation of stage 2 with axolotl

Thanks for the wonderful work. I am trying to improve the performance with medusa2. But when I start the training of stage 2 based on the model from stage 1,...

boxiaowave

PPL compute

Is there a plan to write script to calculate the PPL (perplexity) of the Medusa model?

yuyangxie96

Instruct data format

Currently training script's data loaders only supports chat based data, not instruct, I've made changes to my local to have this done properly can't seem to be able to open...

orhan6116

Are Medusa Heads computed in parallel or serially?

Hello authors, While reading your code, I noticed that the multiple Medusa Heads you proposed are computing results in parallel ``` for i in range(self.medusa): medusa_logits.append(self.medusa_head[i](hidden_states)) ``` (although the later...

userljz

jinja2.exceptions.UndefinedError: dict object has no element 0

2

I followed the training steps to train the llama2 model, but encountered the following error. I searched a lot, but still couldn't solve it. ``` UndefinedError File "/home/hs/anaconda3/envs/onebit/lib/python3.10/site-packages/torch/utils/data/dataloader.py", line 678,...

LLLL114

updated medusa models in huggingface?

Hi, based on https://github.com/FasterDecoding/Medusa/blob/main/notebooks/medusa_introduction.ipynb, "FasterDecoding/medusa-vicuna-7b-v1.3" should have 4 medusa_num_heads. However, in huggingface, it only has 2, https://huggingface.co/FasterDecoding/medusa-vicuna-7b-v1.3/blob/main/config.json. Do you have any plan to share the trained medusa-heads in huggingface?

hustxiayang

[ISSUE] The Pull Request at https://github.com/FasterDecoding/Medusa/pull/97 from Narsil/medusa2 needs to be rolled back.

Hello. After fine-tuning the Medusa head, I discovered an issue affecting inference performance and would like to share my findings. Normally, when a model is trained correctly, using TGI to...

super-ahn

train_legacy.py: try to fix indices bug in preprocess.

This seems a bug, also [reported by @xiezipeng-ML](https://github.com/FasterDecoding/Medusa/issues/101). Please review.

k-l-lambda

Medusa
Medusa copied to clipboard

Metadata

[New feature] exllama support

The implementation of stage 2 with axolotl

PPL compute

Fix TGI's medusa link

Instruct data format

Are Medusa Heads computed in parallel or serially?

jinja2.exceptions.UndefinedError: dict object has no element 0

updated medusa models in huggingface?

[ISSUE] The Pull Request at https://github.com/FasterDecoding/Medusa/pull/97 from Narsil/medusa2 needs to be rolled back.

train_legacy.py: try to fix indices bug in preprocess.

← Metadata

Owner

Metadata

Medusa Medusa copied to clipboard

Metadata

← Metadata

Owner

Metadata

Medusa
Medusa copied to clipboard