Léon
> I was able to run GPipe with lm.one_billion_wds.OneBWdsGPipeTransformerWPM on a single node with multiple GPUs.
>
> However, I am a little confused about how to run GPipe with...
> Could you point me to (or share) instructions on how to run the one_billion_wds using GPipe on multiple local GPUs?

You can run it using the...
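For reference, a single-node multi-GPU lingvo trainer invocation generally follows the pattern below. This is a sketch rather than the reply that is cut off above: the log directory, GPU count, and split size are illustrative assumptions, and the flag names should be double-checked against `lingvo/trainer.py` in your checkout.

```sh
# Hedged sketch of a single-node, multi-GPU run of the GPipe 1B-words model.
# Flag values (logdir, GPU counts) are placeholders; verify the flag names
# against lingvo/trainer.py for your version of the code.
bazel run -c opt //lingvo:trainer -- \
  --logtostderr \
  --model=lm.one_billion_wds.OneBWdsGPipeTransformerWPM \
  --mode=sync \
  --logdir=/tmp/one_billion_wds/log \
  --run_locally=gpu \
  --worker_gpus=4 \
  --worker_split_size=4  # devices per split, i.e. the GPUs GPipe partitions the model across
```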
> Hey, were you able to resolve this issue? I'm facing the same problem.

Hi, have you found the solution to this issue?
> I did a lot of debugging and found out that there is an issue with input generation (for example, a tensor shaped (32, ) is being converted to (32,...
> You can find it [here](https://github.com/adis98/Lingvo_modified)

Thanks
> You have a system with multiple GPUs, right? Could you let me know if it is running fine? If there are any bugs, do let me know.

Sure, I...
> I've tested the code (the image processing model) with 2 GPUs (8 GB Tesla M60) and it seems to be working fine.

Oh, nice! I haven't had time to...
Hi, I have 4 GPUs (V100) and I want to try running this model, but I don't know what the values of _saver_max_to_keep_ and _worker_replicas_ mean. Should I set the same...
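My understanding (please verify against `lingvo/trainer.py`) is that these are trainer flags rather than model hyperparameters: `worker_replicas` is the number of worker machines in a distributed job, so it stays at 1 on a single node, and `saver_max_to_keep` only controls how many recent checkpoints are retained in the log directory. A hedged sketch for a single-node, 4-GPU run, with illustrative values:

```sh
# Assumed meaning of the two flags (check lingvo/trainer.py for your version):
#   --worker_replicas   : number of worker *machines* in a distributed job;
#                         a single node with 4 GPUs keeps this at 1.
#   --saver_max_to_keep : how many recent checkpoints are kept under --logdir;
#                         it does not affect training itself.
bazel run -c opt //lingvo:trainer -- \
  --model=lm.one_billion_wds.OneBWdsGPipeTransformerWPM \
  --run_locally=gpu --worker_replicas=1 --worker_gpus=4 \
  --saver_max_to_keep=5 --logdir=/tmp/one_billion_wds/log --logtostderr
```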