Irene

Results: 8 comments by Irene

Hi @stefan-it, thank you for the information! What about the vocabulary size? Could you kindly tell me how large the vocabulary is, in GB? Thanks

Ok, I see. Thank you very much, you have been very helpful!

I would like to use the output of BERTNLU as the input for the dst; however, it is not clear to me how to pass the data from one module...

Thank you for the info. However, the Colab tutorial refers to an overall evaluation (nlu + dst + nlg). What if I would like to evaluate only nlu + dst, in order...

From the code you posted it doesn't seem that the module is evaluated with F1 scores or a similar measure... perhaps I don't understand your point...

Ok, thanks, I'll try it this way!

Ok, so evaluate.py is used to compare the performance of different NLUs, while if I want to test only BERTNLU I should use test.py? It is not clear to me...

Hi, thanks for your reply. Could you please specify whether the Recall, Precision and F1 scores in test.py are micro-averaged?