Krzysztof Sandowicz comments

Results: 8 comments of Krzysztof Sandowicz

tgi gaudi fails with health test in ChatQnA

Is there a way to extend the timeout so that it waits longer than 10 minutes for the shard to be ready? I have a similar issue while running OPEA's ChatQnA deployed as kata-qemu-tdx (with TDX...

tgi gaudi fails with health test in ChatQnA

Thanks a lot. I extended it to 40 minutes, but unfortunately shard preparation doesn't finish within this time if I deploy the TGI service as kata-qemu-tdx (with TDX protection). Any hint how...

tgi gaudi fails with health test in ChatQnA

A TD VM (kata-qemu-tdx) pod is created without persistent storage, so when deploying a new TGI pod, it has to download the data model from the network. Each instance of the TGI service will do...

tgi gaudi fails with health test in ChatQnA

Yes, for security reasons. Persistent storage should be used to share the data model among multiple TGI replicas. Even a single TGI pod needs to download a data model (>32GB) from...

tgi gaudi fails with health test in ChatQnA

I agree that this can be closed. We have already successfully implemented the ChatQnA deployment [#799](https://github.com/opea-project/GenAIInfra/pull/799) with TDX support enabled.

[Bug] TGI doesn't start due to permission denied

That is a good workaround. The TGI service downloaded all required files to start successfully.

[Bug] TGI doesn't start due to permission denied

Thanks a lot. The workaround works for me. One more piece of information: the TGI service starts successfully if I deploy it using Kubernetes as runc and as kata-qemu (runtimeClass=kata-qemu). It only fails one...

[Bug] ChatQnA unable to retrieve info from file and link

@xiguiw, I have reproduced this issue on both 1.0 and the latest version, available from 11/06/2024. The same error is logged by the ChatQnA UI while trying to get...