philipwan issues

Results 10 issues of


                                            philipwan

Fix IR gap for shape inference: update input from initializer (#2901)

* Update input from initializer if missing * add a test and comments * remove useless modification * fix flake8 * do not update shape if exisit in input *...

build tf v2.5.0 falied

According to your method, I compiled tensorflow_aarch64 on tf2.5, but it reported an error: `ERROR: /root/tensorflow/tensorflow/python/BUILD:2531:29: Linking of rule '//tensorflow/python:gen_candidate_sampling_ops_py_wrappers_cc' failed (Exit 1): gcc failed: error executing command /usr/bin/gcc @bazel-out/host/bin/tensorflow/python/gen_candidate_sampling_ops_py_wrappers_cc-2.params...

efficient_attention_forward_cutlass op is incompatible with Torch JIT

# 🐛 Bug RuntimeError: unsupported output type: int, from operator: xformers::efficient_attention_forward_cutlass ## Command ## To Reproduce StableDiffusionXL model use with torch.jit and enable_xformers_memory_efficient_attention function ``` m.enable_xformers_memory_efficient_attention torch.jit.trace(m) ``` Steps to...

Only StramDiffusion run same prompt?

According to examples file and implementation of **pipeline.py**, every time using new prompt, you need to warm up at least **denoising_steps_num** times. Am I understanding it wrong? If different prompts...

Inference Only

Can Customers use FM module for inference without training？

support for open-sora-plan v1.2?

open-sora-plan update to v1.2 version, model struct changed from spatial + temporal to (T*H*W) transformer block; It has a huge sequence_length, This inference cost is unacceptable，Can **PAB** provide additional support?...

引擎修改了原模型结构，有性能数据展示吗

如题，想了解下在新的结构中模型运行效果如何

Can nunchaku support more Video-generator model？

In addition to flux, does nunchaku support prompt-to-video models, such as OpenSoraPlan, Cogvideo，etc. Flux is also a transformer-based model

enhancement

video model

[custom_mask] RuntimeError: single_prefill_with_kv_cache_sm90 failed with error: operation not supported

When I calling flashinfer.single_prefill_with_kv_cache with param custom_mask, the error is ``` (VllmWorker rank=1 engine_index=0 pid=3101376) ERROR 10-26 11:09:15 [logger.py:146] File "/usr/local/lib/python3.12/dist-packages/flashinfer/prefill.py", line 1162, in single_prefill_with_kv_cache (VllmWorker rank=1 engine_index=0 pid=3101376) ERROR...

bug

Will TeaCache support SD3(.5) ？

SD3 also has MMDiT module, can TeaCache support this model and bring small loss of accuracy