Zhaofeng Lin comments

Results 8 comments of


                                            Zhaofeng Lin

Hello, how to add scores (lm, ctc, ngram) for each token in data.json?

I also have the same question, but it looks like the scores are merged in [self.merge_scores](https://github.com/espnet/espnet/blob/0d0428d3498a904fc5ee63e218fa392da7807a9b/espnet/nets/beam_search.py#L371)

Hydra Conflict Problem

I also have this issue, I find that renaming "config.yaml" in folder /configs works for me.

About adapting Tutel in fairseq

@ghostplant Thanks for the quick response. I removed `system.cache()`, because I added the aux_loss alongside with `return x, attn, aux_loss` in the transformer layer and stacked the aux_loss from each...

About adapting Tutel in fairseq

Hi, thanks for the very detailed explanation! I very appreciate your help. I think I have made it work by setting 'skip_allreduce' to True, and also inequivalent_tokens=True in forward (due...

About adapting Tutel in fairseq

However, I still remain confused about the `parallel_type`. I set `parallel_type = "data" `, and basically had `num_experts_per_device = int(args.moe_expert_num / num_gpus)`. In my current case, moe_expert_num=8, num_gpus=2, so num_experts_per_device=4....