Results 3 issues of Shanshan Zhong

There is currently a need to port [RWKV-v4](https://github.com/BlinkDL/RWKV-LM/tree/main/RWKV-v4) to libai. Because [RWKV-v4](https://github.com/BlinkDL/RWKV-LM/tree/main/RWKV-v4) defines a custom CUDA operator, [wkv](https://github.com/BlinkDL/RWKV-CUDA), that operator has to be ported to a OneFlow operator. This PR ports the [v2 version of wkv](https://github.com/BlinkDL/RWKV-CUDA/blob/main/wkv/cuda/wkv_cuda_v2.cu).
- [wkv](https://github.com/BlinkDL/RWKV-CUDA).forward computes the value of y -> flow._C.wkv
- [wkv](https://github.com/BlinkDL/RWKV-CUDA).backward computes the gradients of the parameters ->...
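For readers unfamiliar with the operator, the following is a minimal single-channel sketch of what the wkv forward pass computes, written in plain Python after the recurrence used in the upstream `wkv_cuda_v2.cu` kernel. It is not the OneFlow implementation from this PR; the function name `wkv_forward` and its argument layout are hypothetical and chosen only for illustration. It assumes `w` is the (negative) per-step decay, `u` is the bonus applied to the current token, and `k`, `v` are per-token sequences for one channel.

```python
import math


def wkv_forward(w, u, k, v):
    """Hypothetical reference for the wkv forward pass on one channel.

    w: decay per step (assumed negative), u: bonus for the current token,
    k, v: sequences of floats of equal length T. Returns a list y of length T.
    """
    # p and q hold the running numerator and denominator, both scaled by exp(-o)
    # so that no exponential overflows.
    p, q, o = 0.0, 0.0, -1e38
    y = []
    for kt, vt in zip(k, v):
        # Output: past state plus the current token weighted by exp(u + k_t).
        no = max(o, u + kt)
        a = math.exp(o - no)
        b = math.exp(u + kt - no)
        y.append((a * p + b * vt) / (a * q + b))
        # State update: decay the past by exp(w), then add the current token.
        no = max(w + o, kt)
        a = math.exp(w + o - no)
        b = math.exp(kt - no)
        p = a * p + b * vt
        q = a * q + b
        o = no
    return y
```

Each `y[t]` is a weighted average of `v[0..t]`, so a constant `v` yields the same constant output, which is a convenient sanity check when porting the kernel.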

eager
api
community

[objaverse](https://github.com/allenai/objaverse-xl?tab=readme-ov-file) is a universe of 10M+ 3D objects, and its [rendering script](https://github.com/allenai/objaverse-xl/tree/main/scripts/rendering) is based on `bpy`, which is not differentiable. So I want to **use PyTorch3D to render objaverse**. However,...

**Describe the bug** With ZeRO stage 3 (zero3), all of the parameters are partitioned, so the weight of `nn.Linear` has shape 0, and grad cannot be set for some dimensions of the weight. **To...

bug
training