Harshini Komali
Related PR: https://github.com/triton-inference-server/core/pull/338 Changes: Updated the DetermineStatsModelVersion() and MergeStatistics() functions to handle the cache-hit scenario in which the top-level ensemble request is served from the cache and the composing models are therefore never executed. Added tests for DetermineStatsModelVersion().
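The stats handling itself lives in Triton core (C++), but the intended behavior can be observed from a client: after a top-level cache hit on the ensemble, the ensemble's cache-hit count grows while the composing models gain no new executions. A minimal sketch using the Python tritonclient statistics API; the model names ensemble_model and composing_model are placeholders, not part of the PR:

```python
# Hedged sketch, not part of the PR: observe the stats behavior from a client.
# "ensemble_model" and "composing_model" are placeholder model names.
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient("localhost:8000")

ensemble = client.get_inference_statistics("ensemble_model")["model_stats"][0]
composing = client.get_inference_statistics("composing_model")["model_stats"][0]

# On a top-level cache hit the ensemble records the hit, while the
# composing model's execution count stays unchanged.
print("ensemble cache hits:", ensemble["inference_stats"]["cache_hit"]["count"])
print("composing executions:", composing["execution_count"])
```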
ref slack thread: https://nvidia.slack.com/archives/CAZKCU4UV/p1677717244222069 Previously, caching of the top-level request sent to the ensemble scheduler was not supported. Implemented caching of top-level requests for ensemble models. In case of a cache hit,...
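A minimal sketch of the resulting behavior, assuming a server started with caching enabled (e.g. --cache-config local,size=1048576) and an ensemble whose config.pbtxt sets response_cache { enable: true }; the model and tensor names below are placeholders:

```python
# Hedged sketch: two identical requests to a cache-enabled ensemble.
# "ensemble_model", "INPUT0", and "OUTPUT0" are placeholder names.
import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient("localhost:8000")

data = np.ones((1, 16), dtype=np.float32)
inp = httpclient.InferInput("INPUT0", list(data.shape), "FP32")
inp.set_data_from_numpy(data)

# First request misses the cache and runs the composing models.
first = client.infer("ensemble_model", [inp])
# An identical second request is served from the response cache, so the
# composing models are not executed at all.
second = client.infer("ensemble_model", [inp])

assert np.array_equal(first.as_numpy("OUTPUT0"), second.as_numpy("OUTPUT0"))
```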
Related PR: https://github.com/triton-inference-server/core/pull/338 Added 4 new tests in L0_response_cache to test top-level request caching for ensemble models. Test 1: When the cache and decoupled mode are both enabled in the ensemble model config: Error...
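A sketch of the kind of check Test 1 performs, assuming a server running with --model-control-mode=explicit; the model name is a placeholder:

```python
# Hedged sketch: loading a decoupled model with response caching enabled
# should be rejected. "decoupled_cache_ensemble" is a placeholder name.
import tritonclient.http as httpclient
from tritonclient.utils import InferenceServerException

client = httpclient.InferenceServerClient("localhost:8000")

try:
    client.load_model("decoupled_cache_ensemble")
    raise AssertionError(
        "expected load to fail for decoupled model with cache enabled")
except InferenceServerException as e:
    # The server refuses this config combination with an error message.
    print("load rejected as expected:", e)
```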
Top-level response caching for ensemble models
This check is needed so that the test_inference_profiler unit test doesn't fail.
Added a version folder for ensemble_model inside the model repository. Changed shm-size from 256m to 1G. These changes are required to run the ensemble model example on Triton 23.12.