Liubov Talamanova
Liubov Talamanova
### Details: - *Add SoftSign to GNA hardware config* - *...* ### Tickets: - *88345*
### Context After applying quantization to the ov.Model in Neural Network Compression Framework ([NNCF](https://github.com/openvinotoolkit/nncf/tree/develop)), the quantization parameters, including `subset_size`, are dumped to the meta section of the OpenVINO IR. `subset_size`...
### Changes Represent symmetrically quantized weights in signed data type with no zero point ### Reason for changes * To detect the quantization type without analyzing zero-point values * Signed...
### Changes ### Reason for changes Fix bug in `_get_ratio_defining_params` method ### Tests test_shared_gather_all_layers
### Details: - *item1* - *...* ### Tickets: - *ticket-id*
I am trying to load the [meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct-SpinQuant_INT4_EO8) model, but I am encountering issues with the output. I converted the model to Hugging Face format using the following command: ```bash python...
Ticket: 161649