Didan Deng

Results 11 issues of Didan Deng

Hi, I am curious about the parameter size and FLOPs (Floating Points Operations) of adanet ensemble model.

Hi, authors. Thanks for providing this repo. I'm currently using the Orthogonal module and define it as part of my model weights. When I tried to resume training from a...

I have read your paper and found it quite impressive, especially for your comparison results in Figure 6. This should be a regression task with a two-layer linear model. Unfortunately,...

Mindcv 实现的[GPSA layer代码](https://github.com/mindspore-lab/mindcv/blob/20d54a9f383a2332eb344cee748be63b0dedf437/mindcv/models/convit.py#L68) 中`get_attention` 函数没有对`attn` 进行normalization: ``` def get_attention(self, x: Tensor) -> Tensor: B, N, C = x.shape q = ops.reshape(self.q(x), (B, N, self.num_heads, C // self.num_heads)) q = ops.transpose(q,...

bug

**Problem** During validation, if I pass a non-existent checkpoint_path to the validate.py, it won't throw an error or give a warning. The model has the initialized weights, and the validation...

This PR allows user to set Ulysses Attention for diffusion model, e.g., qwen-image. ## Purpose To support various parallelism inference algorithms, this PR introduce: - `DiffusionParallelConfig` in `vllm_omni/diffusion/data.py`: Configuration for...

### 🚀 The feature, motivation and pitch For long sequence (image & video) generation, SP (Sequence Parallelism) is widely adopted to accelerate the inference speed. Especially, Ulysses-SP and Ring-Attention are...

good first issue

### Motivation. This RFC discusses about the API design for DIT model acceleration plugins: parallelism, attention backends, and cache methods. ### Proposed Change. - Attention Backends and Parallelism Supports: 1....

enhancement
good first issue

PLEASE FILL IN THE PR DESCRIPTION HERE ENSURING ALL CHECKLIST ITEMS (AT THE BOTTOM) HAVE BEEN CONSIDERED. ## Purpose This PR aims to run Qwen-Image with CFG Parallellism, that is...

### Your current environment The output of python collect_env.py ```text Your output of `python collect_env.py` here ``` `wget https://raw.githubusercontent.com/vllm-project/vllm-omni/main/vllm_omni/collect_env.py` 404 not found. ### 🐛 Describe the bug The main branch...

bug
help wanted