torchtitan
torchtitan copied to clipboard
Add tests to test each component
The tentative tests we could add:
- test the llama debug model init and forward/backward works
- test checkpoint save/load works
- metrics logging test (metrics to be added)
TODO: test correctness of checkpointable data loading in #279
@wconstab is going to help set up an example multi-gpu unit test. We can add more functionality unit test following the example.