Alexei Karve
Alexei Karve
Look at the later blogs in the series for example. Search for "Build and Install the OKD Web Console" in https://community.ibm.com/community/user/cloud/blogs/alexei-karve/2022/03/02/microshift-9 and later blogs
The microshift binary from Apr 20, 2022 installed in /usr/bin by the rpm installer does not work with Fedora 36. It causes a crash-loop as seen with journalctl command. Replacing...
I had used cri-o:1.23 ``` rpm-ostree ex module enable cri-o:1.23 # Experimental enable rpm-ostree install cri-o cri-tools microshift ```
We can update the alt-names as shown in section "Update the kube-api certificate" in https://community.ibm.com/community/user/cloud/blogs/alexei-karve/2022/01/03/microshift-8
Works fine for me https://community.ibm.com/community/user/cloud/blogs/alexei-karve/2022/04/24/microshift-14 This issue should be closed if there are no further comments
The Ray cluster is missing the status.state https://github.com/ray-project/kuberay/issues/991 ``` oc create -k "github.com/ray-project/kuberay/ray-operator/config/default?ref=v0.5.0&timeout=90s" ``` The status from a RayCluster shows: ``` status: availableWorkerReplicas: 2 desiredWorkerReplicas: 1 endpoints: client: "10001" dashboard:...
Please ignore my previous comment, it works with the 0.5.0 (I was using 0.4.0 previously)
The problem is not with long documents, I tried by [splitting the long documents into chunks](https://github.com/foundation-model-stack/fms-fsdp/commit/c2a5c0253825f8eb98cbaea073b86408ce0060ea#diff-ab619e2534b6868a10d7d8f8cfe5c13d424585e16a3ac6c6e310c64ffdcf23a6R362-R409) Removing the SamplingDataSet that is used in multi-dataset handing allows us to bypass the...
Pull request https://github.com/foundation-model-stack/fms-fsdp/pull/90
I faced this problem when using the datasets/builder.py with multi node fine tuning. The default [filelock](https://github.com/tox-dev/filelock/tree/main/src/filelock) FileLock code uses the UnixFileLock because it finds the "import fcntl". On gpfs, the...