nateanl issues

Results 14 issues of


                                            nateanl

The rir_waveform should be flipped in reverberate, when use_fft=False?

When applying convolution on `waveforms` and `rir_waveform`, the `rir_waveform` is flipped in numpy or scipy implementations. But in speechbrain's implementation, the `rir_waveform` is directly used when [the solution is not...

bug

[Migration] TorchAudio Beamforming Module Migration

# TorchAudio Beamforming Module Migration ## Overview `torchaudio` supports an integrated [`MVDR`](https://pytorch.org/audio/stable/transforms.html#mvdr) module under `torchaudio.transforms`. To use it, users need to provide `ref_channel` and `solution` (options: [`ref_channel`, `evd`, `power`]) when...

Add training recipes for HuBERT model pre-training and ASR fine-tuning

### 🚀 The feature [Hidden-Unit BERT (HuBERT)](https://arxiv.org/pdf/2106.07447.pdf?fbclid=IwAR3hI4uGqc4mV5j-ob8R5yLu-BaamVoe9ncxUoVmgFLjJXsE1IevP0rdNYY), a self-supervised model for speech representations was proposed and wildly used in down-stream tasks, such as speech recognition, speech diarization, speaker identification, etc....

Add a feeze option in Wav2Vec2 and HuBERT bundles

### 🚀 The feature In some research cases, the Wav2Vec2 or HuBERT is expected to be frozen (i.e. make ``reuqires_grad=False`` for all params). - Users use it as a feature...

improvement

module: pipelines

triaged

nateanl

The rir_waveform should be flipped in reverberate, when use_fft=False?

[Migration] TorchAudio Beamforming Module Migration

Add training recipes for HuBERT model pre-training and ASR fine-tuning

Add a feeze option in Wav2Vec2 and HuBERT bundles

Add a DNN beamformer training pipeline to demonstrate usage of torchaudio.transforms.MVDR

Replace _get_mat_trace when torch.linalg.trace is ready to use.

torchaudio.io._compat.load_audio_fileobj returns shorter waveform than torchaudio.load for certain FLAC files

Add unit test for LibriMix dataset

Add simulate_rir_ism method for simulating RIR with Image Source Method

Implement L-BFGS-B optimizer and update InverseMelScale