Jean-Gab
The bug is worse than expected: only one of the blocks is actually deleted in the editor's internal structure. If we call editor.save(), the other deleted blocks are still...
We are in a similar situation: 1 master and 8 workers running 1.21. We are working right now on taking control of that cluster with Kubespray; we created a smaller cluster...
We actually ran into so much trouble that we gave up on it for the time being. I don't remember exactly what the last issue was, but it was a...
I'm in; I have hardware sleeping right now. I'll take it up with @nathanielsimard to start coordinating this effort next week; he just poked me about this on Discord.
Thanks for suggesting OpenAI; it did work for me, although I had to mess with the parameters a bit. I ended up with: ```python llm = OpenAI(api_key="somestring",...
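For anyone landing here later, a minimal sketch of that kind of setup, assuming the official openai Python SDK; the parameter values below are illustrative placeholders, not the exact ones I settled on:

```python
# Minimal sketch, assuming the official openai Python SDK (pip install openai).
# All parameter values here are illustrative placeholders, not known-good settings.
from openai import OpenAI

llm = OpenAI(
    api_key="somestring",  # placeholder key from the snippet above
    timeout=30.0,          # assumption: give slow responses more headroom
    max_retries=3,         # assumption: retry transient failures
)

response = llm.chat.completions.create(
    model="gpt-4o-mini",   # assumption: any chat-capable model works here
    messages=[{"role": "user", "content": "Say hello."}],
)
print(response.choices[0].message.content)
```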
Same issue here but with **Llama3 8B** on an RTX 4090 with CUDA, and it also completely breaks the server. When one generation goes beyond the context limit, all subsequent...
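As a stopgap while this is open, a minimal sketch of bounding generation so a request cannot run past the context window, assuming the llama-cpp-python bindings; the model path and sizes are placeholders:

```python
# Minimal sketch, assuming the llama-cpp-python bindings (pip install llama-cpp-python).
# The model path and sizes are placeholders; the point is capping max_tokens so that
# prompt + completion stays within the context window configured at load time.
from llama_cpp import Llama

N_CTX = 8192  # assumption: context window set when the model is loaded

llm = Llama(
    model_path="./llama3-8b-instruct.Q4_K_M.gguf",  # placeholder path
    n_ctx=N_CTX,
    n_gpu_layers=-1,  # offload all layers to the GPU (CUDA build)
)

prompt = "Explain KV cache eviction in one paragraph."
prompt_tokens = len(llm.tokenize(prompt.encode("utf-8")))

# Leave headroom for the prompt so the generation cannot exceed n_ctx.
out = llm(prompt, max_tokens=N_CTX - prompt_tokens - 8)
print(out["choices"][0]["text"])
```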
I just tried with the version suggested above, and it does not work either. version: 2960 (https://github.com/ggerganov/llama.cpp/commit/6369bf04336ab60e5c892dd77a3246df91015147) The behavior is the same, but it is a lot slower on...
Indeed, my behavior is slightly different, but it still shows degradation WITHIN the context length. I posted in this thread instead of opening a new issue since it had enough similarities that...