Enable running ChatQnA example on Docker Kubernetes
One of OPEA's goals is to empower developers to tackle GenAI on their laptops and/or single-node settings. The GenAI examples run on single-node Docker instances and, for production environments, have been vetted on Kubernetes. Being able to test on Kubernetes on a single node will make the transition from dev to prod even smoother. https://docs.docker.com/desktop/kubernetes/
/assigntome
Welcome @arcyleung ! For the ChatQnA example you could try using, say, TinyLlama/TinyLlama-1.1B-step-50K-105b, and a smaller model for the re-ranker too.
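For anyone else trying this on a memory-constrained machine, a minimal sketch of swapping in smaller models might look like the following. The variable names (`LLM_MODEL_ID`, `RERANK_MODEL_ID`) follow the pattern the ChatQnA `set_env.sh` scripts tend to use, but verify them against your checkout; the reranker model shown is just one small option.

```shell
# Hedged sketch: override the default models with smaller ones before
# bringing up the ChatQnA services. Variable names are assumptions --
# check the example's set_env.sh for the exact ones it reads.
export LLM_MODEL_ID="TinyLlama/TinyLlama-1.1B-step-50K-105b"
export RERANK_MODEL_ID="BAAI/bge-reranker-base"   # a smaller reranker choice

echo "LLM model:    $LLM_MODEL_ID"
echo "Rerank model: $RERANK_MODEL_ID"
```

With these exported, the usual compose or manifest deployment would pick up the smaller models, assuming the deployment reads these variables.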
If I understand correctly, the task is to add an example of running the ChatQnA example locally, using minikube or kind?
I'm leaning more towards kind, since there is already a Docker-based deployment and I'd just have to create another Dockerfile for kind to spin up for this single-node testing use case.
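For reference, a single-node kind cluster for this kind of test can be described with a small config file. This is an illustrative sketch only; the port numbers are assumptions and should be adjusted to whatever NodePorts the ChatQnA manifests actually expose.

```yaml
# kind-config.yaml -- hypothetical single-node cluster for local testing.
kind: Cluster
apiVersion: kind.x-k8s.io/v1alpha4
nodes:
  - role: control-plane
    extraPortMappings:
      - containerPort: 30888   # example NodePort for the ChatQnA UI (assumption)
        hostPort: 8888
```

One would then create the cluster with `kind create cluster --config kind-config.yaml` and apply the example's manifests into it.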
@arcyleung How are you doing with this issue?
I've gotten the deployment working on minikube and will write up a README with the steps shortly. The cloud instance I was using had run out of credits, and my laptop only has 16GB RAM, so deploying with the Xeon manifests didn't work (one pod went OOM), but the rest of the services did appear to work:
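On a 16GB laptop, one workaround for the OOM is to lower the memory requests/limits of the heaviest pod (typically the LLM serving container) in the manifest. A hedged sketch of such a patch, with made-up container name and values to tune for your machine:

```yaml
# Illustrative resource override for the LLM serving container.
# The container name and memory figures are assumptions -- match them
# to the actual Xeon manifests and your available RAM.
containers:
  - name: llm-serving
    resources:
      requests:
        memory: "4Gi"
      limits:
        memory: "8Gi"
```

Even with this, a 7B-class model may not fit; combining the limit with a smaller model such as TinyLlama is likely necessary on 16GB.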
I'll try to find another Xeon machine in the meantime.
@arcyleung Please place the PR URL here so I can close out this issue when this is complete.
Please see the following PR for the instructions: https://github.com/opea-project/GenAIExamples/pull/1058
@mkbhanda Could we close this issue?
Thank you @arcyleung and @xiguiw !