Mark Campbell

Results: 15 issues by Mark Campbell

Added a Kubernetes Kueue batch job scheduler based on the Kubernetes Scheduler. Note: the variable `local_kueue="local-kueue-name"` is required in the scheduler args for the `queue-name` label; for priority, add `kueue_priority_class="kueue-priority-class-name"`...

CLA Signed

## Why are these changes needed? Ingresses are created when the `enableIngress` bool is true but this offers no customisation for users who have specific needs with their ingresses/routes. Closes:...

# Issue link Closes #612 # What changes have been made Removed occurrences where an entire appwrapper is logged. Bumped large informational logs up to level 4 and kept warning...

lgtm

# Issue link # What changes have been made # Verification steps ## Checks - [ ] I've made sure the tests are passing. - Testing Strategy - [ ]...

do-not-merge/work-in-progress
needs-rebase

# Issue link [RHOAIENG-6450](https://issues.redhat.com/browse/RHOAIENG-6450) # What changes have been made Updated CFSDK ray dependency to 2.20.0 # Verification steps ## Setup ### Notebook server ODH/RHOAI/Local * Clone this repository with...

lgtm

# Issue link [RHOAIENG-52](https://issues.redhat.com/browse/RHOAIENG-52), [RHOAIENG-4375](https://issues.redhat.com/browse/RHOAIENG-4375) # What changes have been made For `TokenAuthentication` the SDK will use the cert injected into an ODH/RHOAI Notebook by default in the `/etc/pki/tls/custom-certs/ca-bundle.crt` location...
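
The default-certificate behaviour described above could be sketched roughly as follows. This is only an illustration, not the SDK's code: `resolve_ca_cert` and its parameters are hypothetical names, and the precedence shown (explicit path first, then the injected bundle, then nothing) is an assumption, since the snippet is truncated before it describes the details.

```python
import os
from typing import Optional

# Path taken from the PR description above.
DEFAULT_CA_BUNDLE = "/etc/pki/tls/custom-certs/ca-bundle.crt"


def resolve_ca_cert(
    explicit_path: Optional[str] = None,
    default_path: str = DEFAULT_CA_BUNDLE,
) -> Optional[str]:
    """Hypothetical helper: choose which CA bundle to use.

    Assumed precedence (not confirmed by the source):
    1. a path the caller passed explicitly,
    2. the bundle injected into the notebook, if that file exists,
    3. None, leaving the HTTP client on its own defaults.
    """
    if explicit_path:
        return explicit_path
    if os.path.isfile(default_path):
        return default_path
    return None
```

Calling `resolve_ca_cert()` with no arguments inside a notebook that has the bundle injected would return the default path; outside such a notebook it would return `None`.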

## WHY Why is this change being made? There have been leftover mentions of the `mcad.ibm.com` domain in the SDK. The majority of the work was completed in #341...

# Issue link Closes: [RHOAIENG-9259](https://issues.redhat.com/browse/RHOAIENG-9259) # What changes have been made Split the head cpu and memory resources to requests/limits similar to #547. Added deprecation warnings to the old vars...

needs-rebase
do-not-merge/hold
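
For the requests/limits split with deprecation warnings described in the RHOAIENG-9259 entry above, a minimal sketch of the general pattern might look like this. All names here (`resolve_head_cpu`, `cpu_requests`, `cpu_limits`, `old_cpus`) are hypothetical and chosen for illustration; the snippet does not show the SDK's actual parameter names or fallback rules.

```python
import warnings
from typing import Dict, Optional, Union

Quantity = Union[int, str]


def resolve_head_cpu(
    cpu_requests: Optional[Quantity] = None,
    cpu_limits: Optional[Quantity] = None,
    old_cpus: Optional[Quantity] = None,
) -> Dict[str, Dict[str, Optional[Quantity]]]:
    """Hypothetical helper: map a single legacy CPU value to requests/limits.

    If the legacy argument is supplied, emit a DeprecationWarning and use its
    value for whichever of requests/limits was not set explicitly.
    """
    if old_cpus is not None:
        warnings.warn(
            "old_cpus is deprecated; use cpu_requests and cpu_limits instead",
            DeprecationWarning,
            stacklevel=2,
        )
        if cpu_requests is None:
            cpu_requests = old_cpus
        if cpu_limits is None:
            cpu_limits = old_cpus
    return {
        "requests": {"cpu": cpu_requests},
        "limits": {"cpu": cpu_limits},
    }
```

Keeping the legacy argument working while warning gives existing callers time to migrate before the old variable is removed.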

# Issue link [RHOAIENG-7805](https://issues.redhat.com/browse/RHOAIENG-7805) # What changes have been made Added a demo notebook and python script based on the [Ray Train & PyTorch Lightning example](https://docs.ray.io/en/latest/train/getting-started-pytorch-lightning.html) provided by Ray. #...

do-not-merge/hold
approved
lgtm

**What this PR does / why we need it**: The following unit tests are added with this PR. * get_job_conditions * is_job_created * is_job_running * is_job_restarting * is_job_failed * is_job_succeeded...

size/XL