axlearn icon indicating copy to clipboard operation
axlearn copied to clipboard

An integration of orbax checkpointer.

Open markblee opened this issue 1 year ago • 1 comments

Depends on https://github.com/apple/axlearn/pull/650.

The relevant changes are in checkpointer.py, checkpointer_test.py, and pyproject.toml. It also depends on an unreleased commit for orbax (for concurrency bounded serialization).

markblee avatar Aug 14 '24 21:08 markblee

Thanks Mark! Adding @cpgaffney1 to take a look as well (this will be the PR that gets merged for Orbax fyi)

jiya-zhang avatar Aug 14 '24 21:08 jiya-zhang