onur güngör

6 comments by onur güngör

I had the same issue. However, when I switched to this code ``` gcloud compute tpus tpu-vm ssh tpu-vm-name --zone=europe-west4-a --worker=all --command="python3 -c 'import jax; jax.distributed.initialize(); jax.process_index() == 0 and...

Hi, I couldn't reproduce this problem. When does the condition pred_tag == 4 occur? Does it happen when the training data is large? My model gives this output: ```...

Hi, thanks for the interest. We're working on it 👍🏼

Thank you @helizac! How did you do this? The llama.cpp repo did not support T5 models, though I see there were some developments yesterday: https://github.com/ggerganov/llama.cpp/issues/5763 Did you do it yourself, if so...

Hi, thank you for your comment. We saw them and started to use them but we hit a wall again, which we solved by fixing some part of the code....