lhy101

Results 2 issues of lhy101

# Motivation

- **Support Context Parallel (CP)**: Hetu CP is compatible with heterogeneous training and packing scenarios.
- **Refactor Python APIs**: Consolidate shared functions and model & config definitions within...

Hi @insujang,

Thank you for open-sourcing Oobleck—it’s an impressive piece of work! I noticed in the paper that there is a parameter f that controls the fault tolerance threshold. However,...