robberlang

5 issues by robberlang

This PR allows inputs of batch queries (using `input_batch`) to be sent to the model container all together (limited to the maximum batch size), rather than individually. It does this...
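The batching idea described above can be illustrated with a minimal sketch. It is not the PR's code: the helper name `split_into_batches` and its parameters are hypothetical, and it only shows how a list of inputs might be grouped so that no group exceeds a maximum batch size.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper (not from the PR): split `inputs` into consecutive
// groups of at most `max_batch_size` elements, preserving input order.
std::vector<std::vector<std::string>> split_into_batches(
    const std::vector<std::string>& inputs, std::size_t max_batch_size) {
  std::vector<std::vector<std::string>> batches;
  if (max_batch_size == 0) {
    return batches;  // a zero batch size cannot hold any input
  }
  for (std::size_t start = 0; start < inputs.size(); start += max_batch_size) {
    const std::size_t end = std::min(start + max_batch_size, inputs.size());
    batches.emplace_back(
        inputs.begin() + static_cast<std::ptrdiff_t>(start),
        inputs.begin() + static_cast<std::ptrdiff_t>(end));
  }
  return batches;
}
```

With five inputs and a maximum batch size of two, this yields three groups of sizes 2, 2, and 1, so the container would receive three messages instead of five.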

This fixes Issue #758 by using an upgrade lock. It also makes `RPCService::send_message` thread-safe by using an atomic int, removes a redundant call to `TaskExecutionThreadPool::create_queue`, and uses `std::unique_ptr` rather...
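As a rough illustration of the atomic-int technique mentioned above, here is a minimal sketch. It assumes the atomic integer serves as a message id counter, which the summary does not state; the class `MessageIdSource` is hypothetical and is not part of `RPCService`.

```cpp
#include <atomic>

// Hypothetical stand-in (not the actual RPCService): hands out message ids
// from a shared counter that may be used from several threads at once.
class MessageIdSource {
 public:
  // fetch_add increments the counter atomically and returns the previous
  // value, so two concurrent callers can never receive the same id.
  int next_id() { return counter_.fetch_add(1); }

  // Number of ids handed out so far.
  int issued() const { return counter_.load(); }

 private:
  std::atomic<int> counter_{0};
};
```

A plain `int` incremented with `++` from multiple threads would be a data race. `std::atomic<int>` avoids that without taking a lock, which matters when the surrounding code already holds other locks.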

A deadlock can occur when a model container replica is removed, crippling all communication between the frontend and all model containers. Any requests that are not in the cache will...

### Description

`marian-conv` does not work properly with models trained with `tied-embeddings` and `tied-embeddings-all` both set to false. This PR removes quantization of parameters whose names end in `_Wt`, which...

### Bug description

A fatal error occurs when decoding with a model that was converted to `packed8avx512` GEMM type from a .npz formatted model using `marian-conv`. I have a few...

bug