Shivam Sahni issues

Results 6 issues of


                                            Shivam Sahni

Monkey patches layer norm in mllama

## Summary Monkey patches layer norm in mllama for conditional generation ## Testing Done Tested monkey patching works as intended - Hardware Type: - [ ] run `make test` to...

Add end-to-end example of mllama

### 🚀 The feature, motivation and pitch Currently we only have examples for text based models here: https://github.com/linkedin/Liger-Kernel/tree/main/examples/huggingface. An example showing how to run mllama vision model end to end...

Paged optimizer resuming from checkpoint - attributeError: 'int' object has no attribute 'cpu'

### System Info Platform: Linux-5.15.148.2-2.cm2-x86_64-with-glibc2.35 Python version: 3.10.14 Bitsandbytes version: 0.43.1 Safetensors version: 0.4.5 Accelerate version: 0.34.2 Accelerate config: not found PyTorch version (GPU?): 2.4.0+cu124 (True) Tensorflow version (GPU?): 2.16.2...

Refactor chunked preference functions and distillation base class

## Summary Remove redundant code by refactoring ## Testing Done - Hardware Type: - [ ] run `make test` to ensure correctness - [x] run `make checkstyle` to ensure code...

[RFC] Liger FlexChunkLoss: Alignment and Distillation loss

### 🚀 The feature, motivation and pitch We want to support various alignment and distillation loss functions. Refer this PR on ORPO: #362 ## Progress ### Alignment - [x] ORPO...

ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?)

### 🐛 Describe the bug Getting `ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?)` when doing inference using HF `from_pretrained()` with `device_map="auto"`. ### Error ``` File...