FasterTransformer icon indicating copy to clipboard operation
FasterTransformer copied to clipboard

Support for mbart models?/ Could we get the output logits of decoders before the beam search?

Open junleiz opened this issue 4 years ago • 5 comments

The mbart model is implemented by hugging face

junleiz avatar Jul 28 '21 04:07 junleiz

It requires to modify the source codes.

byshiue avatar Jul 28 '21 04:07 byshiue

@byshiue Thank you for your reply. If I just use the python interface, could I get the logits output of the decoder?

junleiz avatar Jul 28 '21 04:07 junleiz

If you use decoder op, then the output is the results of transformer block. If you use decoding op, then you need to modify the op and FT source codes.

byshiue avatar Jul 28 '21 04:07 byshiue

@byshiue Thank you. I will try to implement the mbart model

junleiz avatar Jul 28 '21 04:07 junleiz

@leoozy hi,I also want to use mbart model is implemented by hugging face, can you implement it for mbart model? and if yes, can you share it with me? Thank you very much.

xiuzhilu avatar Oct 22 '21 09:10 xiuzhilu