deepvqe icon indicating copy to clipboard operation
deepvqe copied to clipboard

Why the input tensor shape is (B,F,T,2)?

Open steven8274 opened this issue 2 years ago • 2 comments

Hi, Xiaobin, thanks for implementation for 'DeepVQE'.I'v read the paper and your codes.But I have some questions: Why the input tensor shape is (B,F,T,2)? What does 'B','F','T' mean? Thanks in advance.

steven8274 avatar Nov 10 '23 08:11 steven8274

Sorry for the confusion caused by my uncertainty. The input tensor is a batch of noisy spectrograms, where B means the batch size, and F and T refer to frequency bins and time frames, respectively. The final dimension is composed of the real and imaginary parts of the spectrogram.

Xiaobin-Rong avatar Nov 10 '23 10:11 Xiaobin-Rong

Sorry for the confusion caused by my uncertainty. The input tensor is a batch of noisy spectrograms, where B means the batch size, and F and T refer to frequency bins and time frames, respectively. The final dimension is composed of the real and imaginary parts of the spectrogram.

Thanks!

steven8274 avatar Nov 13 '23 03:11 steven8274