anj-s
anj-s
This is something we have on our roadmap. Is it a different effort from what we have planned? 1. We need to land the tensor parallel code that is already...
I think this says "Documentation" but you meant tests? Can you update the description?
can you specify --max-tokens?
Can you try running this on 8 GPUs? I have a hunch of what might be wrong...
Lets try world size = 8 and MP size =2. Can you also confirm model params and inputs are on the same device before the FW pass? BTW the error...
Glad you are unblocked :) but we should still not fail if TP=2 and world size =2. Can you update the title of the issue to reflect this? I (or...
Yes, definitely. We do plan to support torchdynamo as the frontend in the near future.
Can you use /auth and use the Login with Google option which will map to using your Code Assist license. See instructions [here](https://github.com/google-gemini/gemini-cli/blob/main/docs/cli/authentication.md).
We should also fix the exist status to represent this more effectively.
Other related issues: b/427299140