Alberto Mario Ceballos-Arroyo comments

Results 7 comments of


                                            Alberto Mario Ceballos-Arroyo

Xml encoding in xml_tools.py

Hi Peter! Sorry for being late with this, the situation in my country (Colombia) has been delicate the last few weeks and I wasn't able to do the MR. In...

add Unified-IO

I'd like to work on this issue, is there any documentation on adding new models that I should follow?

add Unified-IO

Hi @kumar-devesh , I'm working on it (made some progress toward getting a working version of the Discrete VAE in Torch) but @osanseviero told me that it would be better...

create new page for translations contributors

Hi all! Are we (translators) supposed to just put our info here as a comment? Thx!

[BUG] <title> gradient reduction issues when running training script with latest released model 2-6

I'm having the same issue on Python 3.10, CUDA 12.1 and Torch 2.3.1. If I train without Zero 2/3 the issue goes away but this limits me to training only...

memory issue, prepare_model_for_kbit_training

Having the the same issue when using a non-quantized base model and trying to finetune with QLORA int4, getting the following mem comsumption (Ministral 8B, seq length 4096, bsz 1),...

memory issue, prepare_model_for_kbit_training

Thanks for the prompt reply @matthewdouglas . If that's the case, I suppose it might be a matter of "savings at scale" since I'm using a single GPU and bsz=1....