Clément Dumas
Clément Dumas
Oh wait, seems like token indexing is supposed to work only with `tracer.invoke` calls. It would be nice if it also works on directly batched input, not sure how easy...
Ok but with UnifiedTransformer, token indexing doesn't work as padding side is right by default : ```py l = ["ab dfez zd", "a", "b"] from nnsight import LanguageModel model =...
@JadenFiotto-Kaufman I guess we can close this as UnifiedTransformer is no longer supported?
Isn't that already supported ? I've run remote code with `torch.arange` and it worked iirc
Hi, sorry I posted the wrong trace, I got a recursion error too 😅 I'll look into it
So if I understand correctly, as every stopping criteria is a different class inheriting StoppingCriteria it might not be possible for nnsight to support this argument on remote execution ?
Stopping criteria is an abstract class meant to be inherited. If I understand correctly you'd need to manually add a set of class inheriting StoppingCriteria right ?
Almost missed this. Would be good if it were flagged indeed
Not sure why you're using NNsight rather than LanguageModel here. Here is a MWE of the issue. If NNsight is used, both example fail, if LanguageModel is used both example...
Just to make sure: there is no magic uv flags to make this work on Windows without going through custom wheels / full build from scratch (according to https://github.com/Dao-AILab/flash-attention/issues/1469 this...