Alexia Jolicoeur-Martineau
Alexia Jolicoeur-Martineau
With these small changes, you can get the fid statistics by running with --mode "fid_stats". It loops through the dataset for 1 epoch and extract the FID statistics. That makes...
https://colab.research.google.com/drive/1m_QXVbHbeoho5tF9TUYq6R-PSz3536EU?usp=sharing
I am almost always getting non-significant results with _bartCause_ and almost always getting very significant results with the _tmle_ package (when only using dbarts in tmle). I have no idea...
Wether I'm using mpi or not, if I have 2 gpus with one node, it will always ignore the second gpu. I am unable to use multiple gpus. How do...
I cannot replicate the DPO results for zephyr. I use a modified version of config_full.yaml with the only difference being that I set gradient_accumulation_steps: 4 instead of 2 because I...
In run_sft.py and run_dpo.py, it says that it applies the chat template. But this is not actually done. In the code below, column_names contains all the names of the columns,...