Philipp
Philipp
Hello, I try to re-implement your paper in Keras. Now, I'm struggling with your LSTM implementation. You use num_layers as n for the LSTM initialization, but the num_layers should be...
I managed to download the 7B and 13B models; from 30B onwards the URL did suddenly not work anymore, but only returned "Forbidden" (even for the 7B now)... ``` Connecting...
Hello everyone, thanks for the contribution of the benchmark! Is there any chance to run this from within a docker container (as our infrastructure requires to do so)? We run...
I just wanted to serve the `CohereForAI/c4ai-command-r-plus-4bit` model, but after I installed `bitsandbytes` I get this error when running: ``` entrypoint: [ "python3.9", "-m", "fastchat.serve.model_worker", "--model-names", "command-r-plus-4bit", "--model-path", "CohereForAI/c4ai-command-r-plus-4bit", "--worker-address",...
### ❓ Question Hi and thanks for the great work! I am using RecurrentPPO in a current project and it strikes me that on [L294](https://github.com/Stable-Baselines-Team/stable-baselines3-contrib/blob/25b43266e08ebe258061ac69688d94144799de75/sb3_contrib/ppo_recurrent/ppo_recurrent.py#L294) the `self._last_lstm_states` added to the...