Deepanway comments

Results 24 comments of


                                            Deepanway

Save trained model

@ChasonLee , @huydan , @Muugii-bs Hey guys, I have made some modifications in the code so that further predictions can be made on some test examples. You can found it...

Save trained model

@Muugii-bs I don't actually save the parameters. Along with train and validation set I also pass the test set in the function train_conv_net() and it returns predicted test labels. In...

A question about DialogRNN

Adding @nmder to this thread as he is the author of the DialogueRNN code.

How to install mustango?

Please follow the steps here for installation: https://github.com/AMAAI-Lab/mustango#installation

Some weights of the model checkpoint at declare-lab/flacuna-13b-v1.0 were not used when initializing LlamaForCausalLM:

Hi, Flacuna is a LoRA-based model. You can refer to the [flacuna.py](https://github.com/declare-lab/flacuna/blob/main/flacuna.py) file to see how we load the weights in `self.model`.

Some weights of the model checkpoint at declare-lab/flacuna-13b-v1.0 were not used when initializing LlamaForCausalLM:

Can you add the warning message here? The warning you are encountering should also show the parameter names for which the pre-trained weights were not loaded. Also, assuming you have...

Some weights of the model checkpoint at declare-lab/flacuna-13b-v1.0 were not used when initializing LlamaForCausalLM:

This warning should come when you try to initialize `LlamaForCausalLM`directly from `"declare-lab/flacuna-13b-v1.0"`. In flacuna.py, we first initialize Llama from `"TheBloke/vicuna-13B-1.1-HF"` and then load the LoRA weights from our checkpoint. Otherwise,...

16 GB of GPU memory runs out

Hey, you can try the following: 1. Use a smaller text encoder and a smaller diffusion model if you are training from scratch. 2. Use the Adafactor / 8 Bit...

about tango-full-ft-audiocaps

Yes, you can use the `--hf_model` argument to pass the tango-full model checkpoint for doing that. The full command would be: ``` accelerate launch train.py \ --train_file="data/train_audiocaps.json" --validation_file="data/valid_audiocaps.json" --test_file="data/test_audiocaps_subset.json" \...

Question about inference_hf.py

The `inferece_hf.py` will compute the evaluation metrics (reported in the paper) from the checkpoints we uploaded in huggingface. For each text prompt, only one audio sample will be generated. The...