Akim Tsvigun

Nebius AI Amsterdam, Netherlands

Results 19 comments of


                                            Akim Tsvigun

How to use my own additional vocabulary dictionary?

@peregilk Good afternoon, and thank you so much for your comprehensive responses. I would like to ask you a small question, you say: _"Bert will learn an embedding for ("good"-"##ness")...

Unsure whether the behavior is expected

This does not seem to affect: the following code returns with a success. ``` log_probs_N_K_C = torch.Tensor([ [[0.1, 0.2, 0.3, 0.4], [0.15, 0.15, 0.3, 0.4]], [[0.1, 0.2, 0.3, 0.4], [0.15,...

Layer Normalization

I see this code is damaged. Here is the image (A.5 in the paper):

Layer Normalization

A similar question regards dropout in the FeedForward layer. You have it added twice, while in the paper they add it only in the end:

Integration with Nebius AI Studio added

@ksolo may I kindly ask you to review it before it diverges too much from the main? thanks!

Integration with Nebius AI Studio added

@ksolo could you please approve? Fixed all your suggestions.

Integration with Nebius AI Studio added

Hi @ksolo, thank you! Sure, will do.

Integration with Nebius AI Studio added

@dcbartlett fixed your suggestions. Kindly merge when you are available!

Integration with Nebius AI Studio added

@dcbartlett kind ping here, fixed your suggestions. Could you please approve?

Integration with Nebius AI Studio added

@dcbartlett sorry for tagging you, you approved the changes but didn't merge. May I ask you to approve it again and merge?

1
2
›