Jack Urbanek

Results: 20 comments by Jack Urbanek

I'm having a hard time parsing how the above is occurring. It would appear two agents are being established; however, I would need more log info to debug, like the...

> I'm finding that the test fails because the messages aren't getting saved until the test class's `._teardown()` method, presumably during `self.operator.force_shutdown()` or `self.server.shutdown_mock()`, and thus the part of the...

Hm this is bizarre. Honestly I haven't worked on the chat service setup in a _long_ time, but over in Mephisto we've specified this sort of thing in our deployed...

Ran into this bug as well. I did some digging into this as Flyte is looking to be a perfect fit for us, but this list issue is a clear...

Unfortunately if we disable force download, we end up stalling out in multiple places, and the run has a high chance of failing outright early in changing. Perhaps the (_band-aid_)...

Hi @bhimrazy - launching a new run on litdata `main` this week, will report back.

@bhimrazy Unfortunately no dice - we end up with an overfull cache directory containing every `dat-0-x.bin` with x from 0 to 40, and then around 1 in...

Hi @bhimrazy, of course, here's what I've got!

- Number of nodes: 4
- Number of CPU cores and devices per node: 8 GPUs per node, 20 CPUs per GPU...

1. In this case, we're using just one
2. It's globally allocated and pointed to a directory on a shared (network) drive. We were having issues with the drive size...

I'm 100% interested in contributing here. I've been following along on the JSON IDL discussions, but that seems to be just the first step towards something like this being that...