Abheesht
Hello! Have the pretrained models for TK been released? If not, could you please share them? Would be of great help! :)
@mattdangerw, I'd like to take this up. Thanks!
Hey, @innat! Are you still working on this? If not, I'd like to work on this. Thanks! :)
Thanks, @innat! @mattdangerw, I can take this up (if we want this layer) :)
Awesome! Thanks, @chenmoneygithub. Will add it to the doc :)
Hello, @mattdangerw! As discussed, I'd be glad to take this up. Thanks! :)
Linking the PR here: https://github.com/keras-team/keras-io/pull/890
Hello, @chenmoneygithub! I think the reason is as follows: XLNet has multiple factorisation orders since it permutes the input sequence. Suppose our input text is [1, 2, 3, 4], and...
This figure explains it well: 
A more concrete explanation: https://www.borealisai.com/en/blog/understanding-xlnet/
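To make the factorisation-order point a bit more tangible, here is a rough sketch in plain Python. It only illustrates the idea for the toy input [1, 2, 3, 4]; it is not how XLNet itself is implemented (as far as I understand, the real model samples orders and uses attention masks rather than enumerating anything). The helper names `factorisation_orders` and `visible_context` are made up for this example, and it assumes the tokens are distinct.

```python
from itertools import permutations


def factorisation_orders(tokens):
    """Return every order in which the tokens could be predicted."""
    return list(permutations(tokens))


def visible_context(order):
    """Map each token to the tokens that come before it in `order`.

    Under a given factorisation order, a token is predicted from the
    tokens that precede it in that order, not in the original sequence.
    Assumes the tokens are distinct, since they are used as dict keys.
    """
    return {tok: list(order[:i]) for i, tok in enumerate(order)}


tokens = [1, 2, 3, 4]
orders = factorisation_orders(tokens)
print(len(orders))  # 4! = 24 possible orders

# With the order 3 -> 2 -> 4 -> 1, token 1 is predicted last and sees
# all the other tokens, while token 3 is predicted first and sees none.
print(visible_context((3, 2, 4, 1)))
```

So the same token gets a different context depending on which order is picked, which is why a single left-to-right causal mask is not enough here.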