Matthew Beckers
Another option I was taught is to use DVC's run-cache. In the CML runner:
```
dvc repro
dvc push --run-cache
```
On the local machine: ``` dvc pull --run-cache...
@apls777 Thank you for the feedback, I have made all the requested changes.
~~@pmrowla That gives me a different error~~ Never mind - my authentication was slightly off. The error is the same 🙃
The error has since changed to ``` dvc exp push origin tipsy-more -v 2023-09-06 19:29:41,621 DEBUG: v3.18.0 (pip), CPython 3.9.16 on Linux-5.10.167-147.601.amzn2.x86_64-x86_64-with-glibc2.31 2023-09-06 19:29:41,621 DEBUG: command: /opt/conda/bin/dvc exp push origin...
I am experiencing the same issue. The only pattern I have noticed is that more jobs fail as I increase the number of parallel jobs: ``` DVC version:...
@skshetry I replayed @nv-pipo's demonstration of the issue, and I can see the following error in the logs of the various Celery workers for each failed job: ``` [2025-08-12 09:03:12,827: ERROR/MainProcess] Task...