Rafael Josip Penić

15 comments by Rafael Josip Penić

@awaelchli Sadly, I cannot share the full code. However, I probably should have mentioned in my original post that I am using gradient checkpointing during training. After additional experimentation I...

@kylesargent Are you using gradient checkpointing? It seems that there was some sort of issue with gradient checkpointing in older PyTorch versions (
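For context, gradient checkpointing in PyTorch is usually wired up along the lines of the sketch below; the block is a stand-in module, not RiNALMo's actual code, and the explicit `use_reentrant=False` flag assumes a reasonably recent PyTorch release.

```python
import torch
from torch.utils.checkpoint import checkpoint

class Block(torch.nn.Module):
    """Stand-in feed-forward block; illustrative only, not RiNALMo's code."""
    def __init__(self, dim=64):
        super().__init__()
        self.ff = torch.nn.Sequential(
            torch.nn.Linear(dim, 4 * dim),
            torch.nn.GELU(),
            torch.nn.Linear(4 * dim, dim),
        )

    def forward(self, x):
        return x + self.ff(x)

block = Block()
x = torch.randn(8, 64, requires_grad=True)

# Recompute activations during the backward pass instead of storing them.
# On recent PyTorch versions, passing use_reentrant=False explicitly
# sidesteps several known problems with the older re-entrant variant.
y = checkpoint(block, x, use_reentrant=False)
y.sum().backward()
```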

Hello, thank you for using RiNALMo! 😄
- Can you provide us with the script `/projects/p32327/RNAFOLD/RiNALMo-main/try.py` you are using to get RiNALMo's embeddings? Is it the same inference example code...

- What PyTorch version are you using? (You can check this with `python -c 'import torch;print(torch.__version__)'`)
- Have you tried installing RiNALMo into a "fresh" Conda environment? Something like this: ```bash...
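A small sanity check along these lines can help confirm what a given environment actually resolves; the `flash_attn` import name is the usual one for the flash-attention package, but treat the details as an assumption rather than RiNALMo's own tooling.

```python
# Quick environment sanity check (illustrative, not part of RiNALMo itself).
import torch

print("PyTorch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
print("CUDA build:", torch.version.cuda)

try:
    import flash_attn  # import name assumed for the flash-attention package
    print("flash-attn:", flash_attn.__version__)
except ImportError as exc:
    print("flash-attn not importable:", exc)
```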

Also, please check your gcc/g++ versions with `g++ --version` and `gcc --version`. According to [this issue](https://github.com/Dao-AILab/flash-attention/issues/224) you might want to update your gcc/g++ compiler if you have an outdated version...
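If it is more convenient to run the same checks from Python, a rough wrapper over those commands could look like this (illustrative only):

```python
import subprocess

# Print the first line of `gcc --version` and `g++ --version`, if available.
for compiler in ("gcc", "g++"):
    try:
        result = subprocess.run(
            [compiler, "--version"], capture_output=True, text=True, check=True
        )
        print(result.stdout.splitlines()[0])
    except (FileNotFoundError, subprocess.CalledProcessError) as exc:
        print(f"{compiler} not available: {exc}")
```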

I think it is very possible that an outdated gcc version is creating problems for the flash-attention package (hence the C code errors). Try updating your `g++` version. Not sure how complicated that...

Hello, we are currently focused on some other aspects of the model, and therefore a downstream-task inference script is not high on our priority list (at least right now). You...

Hello Marek! Hope you are doing well! Please check the `non_flash` branch in the repository. There we replaced the FA mechanism with "ordinary" attention, which should be runnable on CPUs and...
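For intuition, "ordinary" attention here can be read as plain scaled dot-product attention; the sketch below is a generic illustration under that assumption, not the actual code on the `non_flash` branch.

```python
import math
import torch
import torch.nn.functional as F

def ordinary_attention(q, k, v, mask=None):
    """Plain scaled dot-product attention; runs fine on CPU, no flash-attn needed.

    q, k, v: (batch, heads, seq_len, head_dim) tensors.
    mask: optional boolean tensor, True where attention should be blocked.
    """
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
    if mask is not None:
        scores = scores.masked_fill(mask, float("-inf"))
    weights = F.softmax(scores, dim=-1)
    return weights @ v

# Example usage on CPU
q = torch.randn(1, 4, 16, 32)
k = torch.randn(1, 4, 16, 32)
v = torch.randn(1, 4, 16, 32)
print(ordinary_attention(q, k, v).shape)  # torch.Size([1, 4, 16, 32])
```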

(1) Ah, this is true. Currently, we use the rotary positional embedding implementation from the flash attention library. To resolve this issue, you need to either install flash-attention (which should...
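For reference, rotary positional embeddings themselves do not require flash-attention; in plain PyTorch they amount to something like the sketch below (this uses the common "rotate-half" convention, which may differ from the flash-attention implementation's layout).

```python
import torch

def apply_rotary(x, base=10000.0):
    """Minimal plain-PyTorch rotary embedding sketch (not the flash-attn code).

    x: (batch, seq_len, heads, head_dim) tensor with an even head_dim.
    Rotates channel pairs by position-dependent angles ("rotate-half" style).
    """
    _, seq_len, _, head_dim = x.shape
    half = head_dim // 2
    inv_freq = 1.0 / (base ** (torch.arange(half, dtype=torch.float32) / half))
    angles = torch.arange(seq_len, dtype=torch.float32)[:, None] * inv_freq[None, :]
    cos = angles.cos()[None, :, None, :]  # (1, seq_len, 1, half)
    sin = angles.sin()[None, :, None, :]
    x1, x2 = x[..., :half], x[..., half:]
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)

q = torch.randn(1, 16, 4, 32)    # (batch, seq_len, heads, head_dim)
print(apply_rotary(q).shape)     # torch.Size([1, 16, 4, 32])
```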

The non-FA code on the main branch is legacy code that is not compatible with the FA weights. As I said, please check the `non_flash` branch (`git checkout non_flash`), which contains a...