ZhiQi Huang comments

Results 11 comments of


                                            ZhiQi Huang

How to train Stable Video Diffusion model?

> sorry，ask me Hello！ Do you have a training script for Stable Video Diffusion? Could you please contact me

建个群抱团取暖~

求再发一次 +1

Testing the model with new data

> Hi, it seems like you made some modifications on the `audio_transformer.py` (Since your line 146 in `audio_transformer.py` is different from the line in this repo). And now the length...

Testing the model with new data

> The shape of your waveforms is incorrect. You should resample them to 16kHz. > > To do so, you could refer to `resample_data_generate_durations` function (actually the `resample_folder` func in...

Testing the model with new data

> 1. Yes, this ATST-SED model is designed for DESED dataset. These 10 classes are exactly the classes defined by the DESED dataset (DCASE challenge task 4). If you want...

Hello, may I ask if ATST Frame can publicly disclose the models and scripts used for inference?

Thanks for your reply~！ Yes, that is exactly what i need! So I wonder When you will release the “checkpoint and reference code of ATST Frame refined on strongly labeled...

Hello, may I ask if ATST Frame can publicly disclose the models and scripts used for inference?

> @Angelalilyer You could try this [checkpoint file](https://drive.google.com/file/d/1RWjnEnkDb-7fmNlqIcM-MovTnddOzLsh/view?usp=sharing), hope it helps! Thank you so much!!

Hello, may I ask if ATST Frame can publicly disclose the models and scripts used for inference?

> @Angelalilyer You could try this [checkpoint file](https://drive.google.com/file/d/1RWjnEnkDb-7fmNlqIcM-MovTnddOzLsh/view?usp=sharing), hope it helps! Is there a complete inference code available? I tried to modify "audiossl/audiossl/methods/atstframe/downstream/train_strong. py" but kept reporting errors. Sorry to...

Hello, may I ask if ATST Frame can publicly disclose the models and scripts used for inference?

> Did you solve the problem? I tried to write inference code myself, but I couldn't output predicted labels，may I ask if there is a relatively complete inference code for...

Hello, may I ask if ATST Frame can publicly disclose the models and scripts used for inference?

> I write a quick solution in a new pull request #13 , can you test it ? thanks!! I can run this code, but there are some labels in...