DeepSpeed
DeepSpeed copied to clipboard
Hooks to add sharding and parallelism for custom layers (Not Conv1D or Linear)
Hi, I've been trying to find where in DeepSpeed one would go about adding sharding and parallelism for a custom layer, that has more than 1 input.
https://www.deepspeed.ai/training/ lists Support for Custom Model Parallelism on it's feature overview, but the header above it (Model Parallelism) links to https://www.deepspeed.ai/features/#model-parallelism
Which results in a 404.
Is there any external hooks or guides on how to go about this?