cortex.cpp issues

[WIP] Nitro model management

3

Feature for https://github.com/janhq/nitro/issues/175 - [x] Load multiple models - [ ] Add GET `models` to return models list - [ ] CUDA support for multiple model request at the same...

hiro-v

P2: nice to have

How can one set the parameters via CLI when starting a server?

2

Is this already possible? If not, could this be a feature? For example: ``` .\nitro 4 127.0.0.1 5000 --ngl 20 ``` on a Windows11 machine with NVIDIA GPU.

syddharth

I still got 'Illegal instruction' bug under cuda with ubuntu 22.04.

1

I still got this bug. _Originally posted by @lovehunter9 in https://github.com/janhq/nitro/issues/273#issuecomment-1878192726_

lovehunter9

P1: important

good first issue

help wanted

epic: Support legacy hardware

2

**Problem** AVX2 is not available on older gen coreI and a lot of users cannot use Jan app due to this issue **Success Criteria** One more distribution for AVX only

tikikun

P0: critical

type: epic

Chore: setup sample integration test

duongcongtoai

bug: nitro cuda windows low performance on machine has multiple GPUs - tested using Jan App

3

**Describe the bug** My windows machine has 3 GPUs, when I enabled all 3 GPUs, the token speed was slow (6-9/s) and it even not able to load tinyllama 1B....

hiento09

type: bug

bug: Busy waiting is causing cpu usage

**Describe the bug** The current wait of dealing with waiting is not very optimal and cause many issues regarding performance FIX: - Need to implement wait using CV properly to...

tikikun

type: bug

feat: Nitro with Chatgptbox

1

**Problem** Add docs about using Nitro with Chatgptbox https://github.com/josStorer/chatGPTBox

hahuyhoang411

type: feature request

feat: Nitro chat completion with image supporting local image path

1

**Problem** - The current implementation for `chat/completion` with only support for base64 as `image_url.url` makes it hard for using curl to test out quickly. Using something like `file://` makes it...

hiro-v

good first issue

type: feature request

cortex.cpp
cortex.cpp copied to clipboard

Metadata

Update README.md

[WIP] Nitro model management

How can one set the parameters via CLI when starting a server?

I still got 'Illegal instruction' bug under cuda with ubuntu 22.04.

epic: Support legacy hardware

Chore: setup sample integration test

bug: nitro cuda windows low performance on machine has multiple GPUs - tested using Jan App

bug: Busy waiting is causing cpu usage

feat: Nitro with Chatgptbox

feat: Nitro chat completion with image supporting local image path

← Metadata

Owner

Metadata

cortex.cpp cortex.cpp copied to clipboard

Metadata

← Metadata

Owner

Metadata

cortex.cpp
cortex.cpp copied to clipboard