Kedar Potdar

Results 10 comments of Kedar Potdar

@ryletd Can you please try a fresh install and report back?

Feel free to clone the repos locally and point the application to it!

Thanks for reporting, we are looking into it.

Hello, is this an issue with Chat with RTX or the trt-llm-rag-windows repo?

Hello, can you please share the build.py command used for engine generation?

Reviewing from NV side. Please hold on merging till we finish review.

I'm not sure the mtp-mode: on actually triggers the dsr1_b200_fp4/fp8_trt_mtp.sh scripts.

@kimbochen can you please guide how to add logic in runner/launch b200 to account for mtp flag?

Thanks @csahithi . i have made changes accordingly. ran several sample sweeps: https://github.com/InferenceMAX/InferenceMAX/actions/runs/18050118322 https://github.com/InferenceMAX/InferenceMAX/actions/runs/18050505472 https://github.com/InferenceMAX/InferenceMAX/actions/runs/18051017127 https://github.com/InferenceMAX/InferenceMAX/actions/runs/18049715475