Shivam Sahni
Shivam Sahni
## Summary Monkey patches layer norm in mllama for conditional generation ## Testing Done Tested monkey patching works as intended - Hardware Type: - [ ] run `make test` to...
### 🚀 The feature, motivation and pitch Currently we only have examples for text based models here: https://github.com/linkedin/Liger-Kernel/tree/main/examples/huggingface. An example showing how to run mllama vision model end to end...
### System Info Platform: Linux-5.15.148.2-2.cm2-x86_64-with-glibc2.35 Python version: 3.10.14 Bitsandbytes version: 0.43.1 Safetensors version: 0.4.5 Accelerate version: 0.34.2 Accelerate config: not found PyTorch version (GPU?): 2.4.0+cu124 (True) Tensorflow version (GPU?): 2.16.2...
## Summary Remove redundant code by refactoring ## Testing Done - Hardware Type: - [ ] run `make test` to ensure correctness - [x] run `make checkstyle` to ensure code...
### 🚀 The feature, motivation and pitch We want to support various alignment and distillation loss functions. Refer this PR on ORPO: #362 ## Progress ### Alignment - [x] ORPO...
### 🐛 Describe the bug Getting `ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?)` when doing inference using HF `from_pretrained()` with `device_map="auto"`. ### Error ``` File...