Minh-Thuc issues

Results 11 issues of


                                            Minh-Thuc

Rviz can't be build in meta-ros1-melodic

Hello, I'm trying to use meta-ros on `branch zeus` for my card which run only version zeus of yocto. i used steps in [wiki].(https://github.com/ros/meta-ros/wiki/OpenEmbedded-Build-Instructions#add-meta-ros-to-an-existing-openembedded-project) I want to run the simulation...

tensor parallel by nccl + mpi

WIP for the feature tensor parallel. There are some points to investigate: - Make new version of converter to move forward the number heads before the appearance of weight, bias...

convert opennmt py on the fly and inference with model in memory

Convert the model (currently support opennmt-py) and save to the memory => inference without saving model. Customizing the wrapper to save the memory used. TODO: implementation for other converters

Mixtral support

I tried to support Mixtral by implementing MoE. The current problem is the inaccurancy in tensor after passing the MHA layer causes the wrong selection between the experts for certain...

update cpp standard + new way of linking with find_package

This changes will enable the library to be integrated with conan2.

Quantzation AWQ GEMM + GEMV

Support quantization 4 bit with AWQ. There are 2 stable versions available: ``gemm`` and ``gemv``. Currently, I only add AWQ for Llama and Mistral converter. Other models could be added...

Ctranslate2 Pypi exceeds limit 20GB

Currently, the Pypi hub of Ctranslate2 exceeds 20GB, the release 4.3.0 is missing for python 3.8 and 3.9. Request is sent to pypi to upgrade the limit. Some points to...

build

python-release

Minh-Thuc