Shubhadeep Das

Results: 14 comments by Shubhadeep Das

@mohammedpithapur Thanks for fixing the error message! It should be either `model_engine: "triton-trt-llm"` or `model_engine: "ai-playground"`.
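For context, the setting above is an either/or choice in the example's configuration. A minimal sketch of what such a config fragment might look like (the surrounding key names are assumptions for illustration, not taken from the repository):

```yaml
# Hypothetical config excerpt -- only model_engine is confirmed by the comment
# above; the "llm" parent key and other fields are illustrative.
llm:
  model_engine: "triton-trt-llm"   # or "ai-playground", but not both
```

The two values select mutually exclusive backends, so only one should appear in a given config.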

@mohammedpithapur We now have a dedicated example showcasing AI Playground in the latest codebase! No config file changes are needed (this avoids the confusion you faced). Please check out the example...

@OlegSudakov Thanks for pointing this bug out. This should have been fixed with the latest v0.2.0 release. https://github.com/NVIDIA/GenerativeAIExamples/blob/main/notebooks/05_dataloader.ipynb Feel free to close this issue after checking.

@OlegSudakov Thanks for pointing this bug out. This should have been fixed with the latest v0.2.0 release. Can you please verify with the latest codebase and close this issue?

Hi @dsbyprateekg, thanks for reporting this! There seem to be some stability issues in the server backend that cause this. Are you able to reproduce this error consistently for all...

Hey @rbgo404, you can deploy the TensorRT-based LLM model by following the steps here: https://nvidia.github.io/GenerativeAIExamples/latest/local-gpu.html#using-local-gpus-for-a-q-a-chatbot This notebook interacts with the model deployed behind the `llm-inference-server` container, which should get started up...

Thanks for reporting this. We are looking into it and will get back to you shortly. @nealvaidya, please help check this issue.

Hey @saivineethabvns, yes, this is possible! As a prerequisite to running this notebook, you need to deploy this example by following the steps here: https://nvidia.github.io/GenerativeAIExamples/latest/local-gpu.html#using-local-gpus-for-a-q-a-chatbot Now, in this step where...

Thanks for reporting this bug. @mikemckiernan, please help fix this.

@CaptainMcCrank Here is the correct link until we fix this in our codebase: https://nvidia.github.io/GenerativeAIExamples/latest/api-catalog.html#get-an-api-key-for-the-mixtral-8x7b-instruct-api-endpoint