NeMo issues

Adding new tokens to pretrained asr vocab

2

Is it possible to add new tokens to the tokeniser of a pretrained model. say I want to fine tune on some new data, which contains few new tokens eg....

evilc3

adapter tuning for Megatron GPT models

1

# What does this PR do ? Add a one line overview of what this PR aims to accomplish. **Collection**: [Note which collection this PR will affect] # Changelog -...

arendu

Add support for Apex distributed Adam optimizer with GPT-3

4

# What does this PR do ? Adds support for training GPT-3 with the [Apex implementation of the ZeRO optimizer](https://github.com/NVIDIA/apex/blob/master/apex/contrib/optimizers/distributed_fused_adam.py). **Collection**: NLP # Changelog - Add option for `distributed_fused_adam` optimizer...

timmoon10

[WIP] TTS tokenizers moved to collections.common.tokenizers

3

# What does this PR do ? Move TTS tokenizers to collections.common.tokenizers so they can be used in cross-collections pipelines. For implementing ASR/ST model training which generates samples on-the-fly with...

AlexGrinch

Mutiscale Diarization Decoder (MSDD) model and module files

1

Signed-off-by: Taejin Park # What does this PR do ? Add multiscale diarization decoder model module, and yaml files. Additionally, yaml files are also uploaded. - msdd_models.py Three classes are...

tango4j

Data Simulator

4

# What does this PR do ? Adds a multispeaker audio session simulator and corresponding documentation and tutorials. Collections: ASR, Tools # Changelog - Adds data simulator - Adds a...

chooper1

yidong72

prefix tuning for Megatron gpt models

14

# What does this PR do ? implements prefix tuning for Megatron GPT models. **Collection**: `examples/nlp/languge_modeling` # Changelog - Add specific line by line info of high level changes in...

arendu

NeMo
NeMo copied to clipboard

Metadata

Adding new tokens to pretrained asr vocab

adapter tuning for Megatron GPT models

Add support for Apex distributed Adam optimizer with GPT-3

[WIP] TTS tokenizers moved to collections.common.tokenizers

Mutiscale Diarization Decoder (MSDD) model and module files

Data Simulator

Update Aligner model and tutorial to add NGC checkpoint loading

upgrade to PTL 1.7

Adding RETRO model Faiss sharding index and KNN sharding index

prefix tuning for Megatron gpt models

← Metadata

Owner

Metadata

NeMo NeMo copied to clipboard

Metadata

← Metadata

Owner

Metadata

NeMo
NeMo copied to clipboard