ZhiQi Huang

Results 11 comments of ZhiQi Huang

> sorry,ask me Hello! Do you have a training script for Stable Video Diffusion? Could you please contact me

求再发一次 +1

> Hi, it seems like you made some modifications on the `audio_transformer.py` (Since your line 146 in `audio_transformer.py` is different from the line in this repo). And now the length...

> The shape of your waveforms is incorrect. You should resample them to 16kHz. > > To do so, you could refer to `resample_data_generate_durations` function (actually the `resample_folder` func in...

> 1. Yes, this ATST-SED model is designed for DESED dataset. These 10 classes are exactly the classes defined by the DESED dataset (DCASE challenge task 4). If you want...

Thanks for your reply~! Yes, that is exactly what i need! So I wonder When you will release the “checkpoint and reference code of ATST Frame refined on strongly labeled...

> @Angelalilyer You could try this [checkpoint file](https://drive.google.com/file/d/1RWjnEnkDb-7fmNlqIcM-MovTnddOzLsh/view?usp=sharing), hope it helps! Thank you so much!!

> @Angelalilyer You could try this [checkpoint file](https://drive.google.com/file/d/1RWjnEnkDb-7fmNlqIcM-MovTnddOzLsh/view?usp=sharing), hope it helps! Is there a complete inference code available? I tried to modify "audiossl/audiossl/methods/atstframe/downstream/train_strong. py" but kept reporting errors. Sorry to...

> Did you solve the problem? I tried to write inference code myself, but I couldn't output predicted labels,may I ask if there is a relatively complete inference code for...

> I write a quick solution in a new pull request #13 , can you test it ? thanks!! I can run this code, but there are some labels in...