Krzysztof Sandowicz
Is there a way to extend the timeout so that it waits longer than 10 minutes for the shard to be ready? I have a similar issue while running OPEA's ChatQnA deployed as kata-qemu-tdx (with TDX...
Thanks a lot. I extended it to 40 minutes, but unfortunately shard preparation did not finish within that time when I deploy the TGI service as kata-qemu-tdx (with TDX protection). Any hint how...
The TD VM (kata-qemu-tdx) pod is created without persistent storage, so when a new TGI pod is deployed, it has to download the model from the network. Each instance of the TGI service will do...
Yes, for security reasons. Persistent storage should be used to share the model among multiple TGI replicas. Even a single TGI pod needs to download a model (>32GB) from...
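To put the 10-minute and 40-minute timeouts in perspective, here is a rough back-of-the-envelope sketch of the sustained download rate a pod would need just to fetch a 32GB model before the timeout expires. The function name is hypothetical, decimal units (1 GB = 1000 MB) are assumed, and the calculation ignores the time needed to load the shards after the download, so the real requirement is higher.

```python
def required_throughput_mb_s(model_size_gb: float, timeout_min: float) -> float:
    """Minimum sustained download rate in MB/s to fetch the model before the timeout.

    Assumes decimal units (1 GB = 1000 MB) and counts download time only.
    """
    return model_size_gb * 1000 / (timeout_min * 60)


# 32 GB is the lower bound for the model size mentioned in this thread.
for timeout_min in (10, 40):
    rate = required_throughput_mb_s(32, timeout_min)
    print(f"{timeout_min} min timeout: at least {rate:.1f} MB/s sustained")
```

If the network throughput available inside the TD VM is below these figures, the download alone cannot finish within the timeout, which would be consistent with the behaviour described here.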
I agree that this can be closed. We have already successfully implemented the ChatQnA deployment with TDX support enabled in [#799](https://github.com/opea-project/GenAIInfra/pull/799).
That is a good workaround. The TGI service downloaded all the files it needs to start successfully.
Thanks a lot. The workaround works for me. One more piece of information: the TGI service starts successfully if I deploy it on Kubernetes as runc and as kata-qemu (runtimeClass=kata-qemu). It only fails one...
@xiguiw, I have reproduced this issue on both 1.0 and the latest version available as of 11/06/2024. The same error is logged by the ChatQnA UI while trying to get...