Chenghua
@ZizhouJia Have you successfully integrated onnx-mlir into your MLIR project? I'm writing a small inference framework that needs to take onnx-mlir output and convert it to VM bytecode for this framework. Is...
Feature description: In SAM, "generate masks by sampling a grid over the image with this many points to a side," as in Meta's SAM everything demo. [ref code](https://github.com/facebookresearch/segment-anything/blob/6fdee8f2727f4506cfbbe553e23b895e27956588/segment_anything/automatic_mask_generator.py#L35). Use case: producing masks for all the different objects in an image for low-level tasks...
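For reference, the grid that `points_per_side` controls can be sketched like this: evenly spaced points in normalized [0, 1] coordinates, offset half a cell from the image borders (a minimal re-implementation of the idea, not the exact SAM code):

```python
def build_point_grid(points_per_side: int):
    """Build an evenly spaced point grid in normalized [0, 1]
    coordinates, with points offset half a cell from the borders.
    E.g. points_per_side=32 yields 32*32 = 1024 prompt points."""
    offset = 1.0 / (2 * points_per_side)
    coords = [offset + i / points_per_side for i in range(points_per_side)]
    # Row-major list of (x, y) pairs covering the whole image.
    return [(x, y) for y in coords for x in coords]
```

Each point is then fed to SAM as a single-point prompt, and the resulting masks are filtered and deduplicated.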
I have encountered the same issue. The dynamic libraries fail to load because the `pip install ./foo` command only copies the `_core.xxxxxx.pyd` file to the `site-packages/foo/` directory, without also copying...
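One workaround on Windows is to register the directory holding the dependent DLLs before the extension module is imported, e.g. in the package's `__init__.py`. A minimal sketch (the `libs` subfolder name is an assumption; adjust it to wherever your build actually places the DLLs):

```python
import os
import sys

def ensure_dll_dirs(package_dir: str, dll_subdir: str = "libs") -> bool:
    """On Windows, add an extra DLL search path so the compiled
    extension (_core.*.pyd) can resolve its dependencies.
    Returns True if a directory was registered, False otherwise."""
    dll_dir = os.path.join(package_dir, dll_subdir)
    if sys.platform == "win32" and os.path.isdir(dll_dir):
        # Available since Python 3.8; affects DLL resolution for
        # extension modules imported afterwards.
        os.add_dll_directory(dll_dir)
        return True
    return False
```

The longer-term fix is to have the build copy those DLLs into the wheel (e.g. via `package_data` or a tool such as delvewheel) so no runtime path tweaking is needed.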
The mllm project currently only provides a C++ API. Since the mllm front end has not yet fully stabilized, we have not prioritized work on Python bindings. If you...
Got it, thanks for the reply!
I used the following code to test the performance of w8a8.

```python
@torch.no_grad()
def generate(model, tokenizer, device, prompt, max_new_tokens):
    inputs = tokenizer(prompt, return_tensors="pt", padding=True)
    start = time.time()
    outputs = model.generate(
```
...
Unfortunately, after using `torch.compile`, there was not much speed improvement; the inference time went from 60 seconds to 42 seconds. It is still much slower than the model using FP16....
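When comparing wall-clock times like 60 s vs. 42 s, it helps to average over several runs after a warmup pass (the first `torch.compile` call pays a one-time compilation cost). A minimal, framework-agnostic harness for this (the function and parameter names here are illustrative, not from the original code):

```python
import time

def tokens_per_second(generate_fn, n_tokens: int, warmup: int = 1, iters: int = 3) -> float:
    """Run `generate_fn` (a zero-argument generate call) `warmup`
    times to exclude one-time costs, then average `iters` timed
    runs and report throughput in tokens/sec."""
    for _ in range(warmup):
        generate_fn()
    start = time.perf_counter()
    for _ in range(iters):
        generate_fn()
    elapsed = (time.perf_counter() - start) / iters
    return n_tokens / elapsed
```

Note that with `torch.compile`, timing only the second and later calls is what reflects steady-state inference speed.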
I mean the inputs to ONNX ops, such as the parameters of the convolution operator. These parameters appear to be embedded in the MLIR code? For my second question, I misunderstood...
May I ask which Qualcomm chip model your device uses?