automaticcat
@CameronNguyen130820 we postponed solving this issue since it might be related to an outdated GPU that doesn't support some operations.
I tried to change the code just like the fix, but it's not working.
https://www.npmjs.com/package/systeminformation
Intel optimization, by instruction set:
- AVX2: consumer grade
- AVX512: before Gen 12 or consumer grade
- AVX_VNNI: consumer grade with oneAPI
- AVX512_VNNI: server grade
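A minimal sketch of how the tiers above could be selected at runtime from a CPU flags string (for example, the string returned by the systeminformation package's `si.cpuFlags()`). The function name and the priority order are assumptions for illustration, not the project's actual code:

```javascript
// Pick the best supported instruction-set tier from a space-separated
// CPU flags string. Tier order mirrors the list above (an assumption).
function bestInstructionSet(flagString) {
  const flags = new Set(flagString.toLowerCase().split(/\s+/));
  if (flags.has('avx512_vnni')) return 'AVX512_VNNI'; // server grade
  if (flags.has('avx512f'))     return 'AVX512';      // before Gen 12
  if (flags.has('avx_vnni'))    return 'AVX_VNNI';    // consumer, with oneAPI
  if (flags.has('avx2'))        return 'AVX2';        // consumer grade
  return 'none';
}

console.log(bestInstructionSet('fpu sse sse2 avx avx2')); // → 'AVX2'
```

In practice the flags string would come from something like `(await si.cpuFlags())`, which needs no elevated permissions.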
Related issues:
- Windows WSL: https://github.com/janhq/jan/issues/912
- oneAPI for Windows: https://github.com/janhq/jan/issues/911
- AMD GPU for Windows: https://github.com/janhq/jan/issues/913
- AMD CPU for Windows: https://github.com/janhq/jan/issues/914
- Intel GPU (maybe crossing over with oneAPI): https://github.com/janhq/jan/issues/915
> I believe that Intel oneMKL should actually run on an Intel GPU: https://www.intel.com/content/www/us/en/docs/oneapi/optimization-guide-gpu/2023-0/offloading-onemkl-computations-onto-the-gpu.html

I think we should bring this issue back; iGPU offloading of at least the prompt eval is...
Notes on some principles to follow:
- Try to avoid requiring system permissions (infer hardware in a more generic way, without elevated permissions)
- Windows, Linux, and macOS are different...
I think this feature is important to spread llama.cpp usage even further.
Please re-open; everyone is using 4-5 bit quantization now.
Hi, stop wasting the earth's energy and open-source it already, thanks.