Gangmuk Lim

Results 19 issues of Gangmuk Lim

The meaning of the flag "use_datasets" is confusing. Below two flags are flags "use_datasets" and one more which is related to it named "datasets_use_prefetch". flags.DEFINE_boolean('use_datasets', True, 'Enable use of datasets...

Hi, I am using this bufferbloater and it is very useful tool to get insights with different load shedding configs. I found this repo is behind the bufferbloater version that...

## Pull Request Description Made it more thread safe especially regarding accessing TreeNode data structure. Made variables of TreeNode private and made all them accessed through Getter functions. **Important: Before...

### 🐛 Describe the bug Tagging related people, @varungup90 @Jeffwan Lint check was not kicked in for some PRs. (e.g., #701 ) For example, lint check for `test/e2e/model_adapter_test.go` should have...

area/github
area/cicd

### 🚀 Feature Description and Motivation Currently, AiBrix is supporting a simple prefix-aware routing. From data structure perspective, it is using hash table with fixed size of a block. A...

kind/enhancement
area/gateway
kind/feature
area/performance
area/scheduling
area/kv-cache

### 🚀 Feature Description and Motivation Currently, the radix tree cache does not support varying number of GPUs (pods). The corresponding tree nodes in RadixTree should be updated accordingly in...

kind/enhancement
area/gateway
priority/important-longterm

### 🐛 Describe the bug KPA never scales down after scaling up. Scaling up works but scaling down never happens even when there is 0 load, basically gpu_cache_usage_perc is 0....

kind/bug
kind/support
area/autoscaling
area/performance

### 🐛 Describe the bug This is the screenshot of kubectl get pod. You can see the STATUS is 'running' even if the pod is not actually ready (see the...

## Pull Request Description This PR fixes a few bugs in client and also improves failure request handling. Bugs - Header setting in AsyncOpenAI was not being done (routing-strategy header)...

### 🐛 Describe the bug Intermittently CI test is skipped in PR. Couldn't figure out when and why it happens. Leaving issue for future reference. ### Steps to Reproduce In...

triage/accepted
area/github
area/cicd