5 comments by Phil

Really excited about optimized kernels for inference! Worth looking at https://github.com/zeux/calm, where the forward pass is implemented as a single CUDA kernel. It uses fp8 rather than int4/int8 quantization.

That's an interesting idea and worth experimenting with. My intuition is that it would be too generic and difficult to get working reliably.

Thanks! I don't think we should have the agent install dependencies, though. How about adding a top-level requirements.txt?
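
For illustration, a top-level requirements.txt is just a newline-separated list of pinned packages (the package choices and version numbers here are hypothetical, not from the project):

```text
# requirements.txt — install with: pip install -r requirements.txt
retry==0.9.2        # mentioned below as a candidate for retry logic
requests>=2.31,<3   # hypothetical example of a range pin
```

Pinning exact versions keeps the agent's environment reproducible without letting it install arbitrary packages at runtime.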

How about https://pypi.org/project/retry/?
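
For context, the `retry` package provides a decorator that re-invokes a function on a given exception, with configurable tries, delay, and backoff. A minimal stdlib-only sketch of that pattern (my own stand-in, not the package's actual source):

```python
import functools
import time


def retry(exceptions=Exception, tries=3, delay=0.01, backoff=2):
    """Re-run the wrapped function on `exceptions`, up to `tries` attempts,
    sleeping `delay` seconds between attempts and multiplying by `backoff`.
    Mirrors the shape of the decorator in the `retry` PyPI package."""
    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(*args, **kwargs):
            wait = delay
            for attempt in range(1, tries + 1):
                try:
                    return fn(*args, **kwargs)
                except exceptions:
                    if attempt == tries:
                        raise  # out of attempts: propagate the last error
                    time.sleep(wait)
                    wait *= backoff
        return wrapper
    return decorator


calls = {"n": 0}


@retry(exceptions=ValueError, tries=3)
def flaky():
    # Fails on the first two calls, succeeds on the third.
    calls["n"] += 1
    if calls["n"] < 3:
        raise ValueError("transient failure")
    return "ok"


result = flaky()
print(result, calls["n"])  # succeeds on the third attempt
```

The real package's decorator takes similarly named parameters (`exceptions`, `tries`, `delay`, `backoff`), so swapping this sketch for `from retry import retry` should be mechanical.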