Hongwu Peng
Results: 2 issues of Hongwu Peng
GC-DPR has two steps: 1. The first step does a full-batch forward pass without gradients, to get the full-batch contrastive learning loss and the corresponding embedding gradients. 2. The...
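For readers following along, here is a rough sketch of what step 1 computes, in plain Python. It is not the GC-DPR code. It assumes dot-product similarity, in-batch negatives, and a mean cross-entropy loss, and the function name `contrastive_loss_and_grads` is hypothetical. In the real setting the embeddings would come from an encoder run with gradient tracking disabled; here they are plain lists of floats. Only step 1 is sketched.

```python
import math


def contrastive_loss_and_grads(queries, passages):
    """Sketch of step 1: full-batch loss plus gradients w.r.t. the embeddings.

    queries, passages: lists of equal-length float vectors. passages[i] is
    assumed to be the positive for queries[i]; every other passage in the
    batch acts as an in-batch negative (an assumption of this sketch).
    """
    n, dim = len(queries), len(queries[0])
    # scores[i][j] = dot product between query i and passage j
    scores = [[sum(a * b for a, b in zip(q, p)) for p in passages] for q in queries]

    loss = 0.0
    coeff = []  # coeff[i][j] = d(loss) / d(scores[i][j])
    for i, row in enumerate(scores):
        m = max(row)  # subtract the row max for numerical stability
        exps = [math.exp(s - m) for s in row]
        z = sum(exps)
        # mean cross-entropy with the diagonal entry as the target
        loss -= (row[i] - m - math.log(z)) / n
        coeff.append([(e / z - (1.0 if i == j else 0.0)) / n
                      for j, e in enumerate(exps)])

    # Chain rule through the dot products gives the embedding gradients.
    grad_q = [[sum(coeff[i][j] * passages[j][d] for j in range(n))
               for d in range(dim)] for i in range(n)]
    grad_p = [[sum(coeff[i][j] * queries[i][d] for i in range(n))
               for d in range(dim)] for j in range(n)]
    return loss, grad_q, grad_p


if __name__ == "__main__":
    q = [[0.1, 0.2], [0.3, -0.4]]
    p = [[0.5, 0.1], [-0.2, 0.7]]
    loss, grad_q, grad_p = contrastive_loss_and_grads(q, p)
    print(loss, grad_q, grad_p)
```

The returned `grad_q` and `grad_p` have the same shapes as the inputs, so they could be stored per example and reused later without keeping the full-batch computation graph in memory.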
May I ask how you transform STS datasets into triplets? STS has a similarity score as its ground truth, so how do you include this information in the triplet...