Andy Yang
Results: 2 issues of Andy Yang
Confused about output_multiplier_scale: from the code, the logits are multiplied by output_multiplier_scale before output, and output_multiplier=0.5773502691896257, which is the same as applying a default temperature of sqrt(3). But why? Quite confused,...
At first, I had a Tokenizer that worked very well, and I saved it. Later I wanted to add a PostProcessor to it, so I wrote a template. It just...
Stale
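
On the first issue above: the claim that multiplying logits by 0.5773502691896257 matches a softmax temperature of sqrt(3) can be checked numerically. The snippet below is only a sketch of that arithmetic. The `softmax` helper and the example logits are made up for illustration and are not taken from the model's code, and it does not explain why that value was chosen.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats (illustrative helper)."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

# Value quoted in the issue text.
OUTPUT_MULTIPLIER = 0.5773502691896257
# Temperature the issue says it corresponds to.
TEMPERATURE = math.sqrt(3)

# Hypothetical logits, chosen only for the demonstration.
logits = [2.0, 0.5, -1.0]

# Path 1: scale the logits by the multiplier, then apply softmax.
scaled = softmax([x * OUTPUT_MULTIPLIER for x in logits])
# Path 2: divide the logits by the temperature, then apply softmax.
tempered = softmax([x / TEMPERATURE for x in logits])

print(math.isclose(OUTPUT_MULTIPLIER, 1 / TEMPERATURE, rel_tol=1e-12))
print(all(math.isclose(a, b, rel_tol=1e-9) for a, b in zip(scaled, tempered)))
```

Both paths give the same distribution up to floating-point rounding, and because the temperature is greater than 1, that distribution is flatter than the softmax of the unscaled logits.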