Have you tried Condenser pretraining on RoBERTa?

Open 1024er opened this issue 3 years ago • 1 comment

I pretrained a condenser-roberta-base model using the same data and hyperparameters, but its results on downstream tasks were not high.

Have you ever tried Condenser pretraining on RoBERTa-base?

Thank you

1024er, May 26 '22 04:05

Not on the same data, no. I have trained a RoBERTa base architecture on OpenWebText (an open version of WebText, which is part of the RoBERTa training data). Compared with the BERT Condenser, it does better on sentence similarity tasks but not on retrieval tasks. As a side note, we observed previously that vanilla RoBERTa base is typically inferior to vanilla BERT base on retrieval tasks.

We have just started test runs with condenser-roberta-large, so there is not much to say about it yet.

luyug, May 26 '22 12:05