Brendan Dolan-Gavitt
Brendan Dolan-Gavitt
@hsheil Could you elaborate on exactly what you changed? I've tried some obvious things, like changing the `vocab_size` passed to `LSTM.lstm` to `vocab_size + 4000`, but still hit the same...
This is a really interesting idea! Do you know if Google Colab has any way to listen on a network port that can be reached from the outside world?
I also just found this, which looks like it might be a good fit since the FauxPilot server is already using Flask: https://www.geeksforgeeks.org/how-to-run-flask-app-on-google-colab/ I will look into putting together a...
Hmm would serverless work when the models are really big though? Loading the 16B model from disk -> GPU takes almost a minute, so I wouldn't want to have to...
I think the network latency involved would make that pretty slow?
Hmm, FasterTransformer has only been tested on Compute Capability >= 7.0, and the 1070 is 6.0. So it's possible something it uses is limited to more recent cards. For now...
It appears so unfortunately! It is probably possible to bypass this by changing the code of the extension, but that's going a bit beyond what I'm comfortable with. You may...
Sure, feel free to use the name!
Do you mean it doesn't work without having a valid token for Copilot? Or is something going wrong after that too?
You may want to dig into the code of the Copilot IntelliJ plugin to see if they have a similar debug setting to `debug.overrideProxyUrl` and `debug.overrideEngine` in the VSCode plugin?...