
Add tests to test each component

Open wanchaol opened this issue 2 years ago • 2 comments

The tentative tests we could add:

  1. test that the llama debug model initializes and that forward/backward run
  2. test that checkpoint save/load works
  3. test metrics logging (metrics to be added)
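
As a rough illustration of the shape item 2 could take, here is a minimal round-trip test sketch. It is framework-agnostic on purpose: `save_checkpoint` and `load_checkpoint` are hypothetical stand-ins built on `pickle`, not torchtitan's checkpoint code, and the toy `state` dict stands in for a real model/optimizer state dict.

```python
import pickle
import tempfile
from pathlib import Path


def save_checkpoint(state: dict, path: Path) -> None:
    """Hypothetical stand-in for the real checkpoint writer."""
    with path.open("wb") as f:
        pickle.dump(state, f)


def load_checkpoint(path: Path) -> dict:
    """Hypothetical stand-in for the real checkpoint reader."""
    with path.open("rb") as f:
        return pickle.load(f)


def test_checkpoint_round_trip() -> None:
    # Toy state (assumption); a real test would use the debug model's state.
    state = {"step": 10, "weights": [0.1, -0.2, 0.3]}
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "step-10.pkl"
        save_checkpoint(state, path)
        assert path.exists()
        restored = load_checkpoint(path)
    # Loading must reproduce the saved values in a fresh object.
    assert restored == state
    assert restored is not state
```

The function follows the pytest naming convention, so a test runner would pick it up once the stand-ins are swapped for the real save/load calls.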

wanchaol, Feb 02 '24 19:02

TODO: test correctness of checkpointable data loading in #279

tianyu-l, May 15 '24 23:05

@wconstab is going to help set up an example multi-GPU unit test. We can add more functional unit tests following that example.

tianyu-l, May 16 '24 19:05