a3413209 issues

Repositories
Issues
Comments

Results 3 issues of

a3413209

Using iphone 12 run model time consuming different

Hello, I used the model you provided to measure the time consumption, iphone12 results vary greatly, while iphone13 is basically the same

iOS deployment of mlc-llm has compilation problems

By process: 1、Install TVM Unity and compile successfully 2、Get the model weight 3、Build the model to the library exist python3 build.py --model vicuna-v1-7b --type float16 --target iphone --quantization-mode int3 --quantization-sym...

documentation

The accuracy of cpu only, cpu and gpu, and ALL are different, and the result of cpu only is accurate.

The accuracy of cpu only, cpu and gpu, and ALL are different, and the result of cpu only is accurate.