Andy Yang

Results 2 issues of Andy Yang

confused about output_multiplier_scale, from the code they will multiply an output_multiplier_scale to logits before output. and that output_multiplier=0.5773502691896257, same as apply a default temperature with sqrt(3) but why? quite confused,...

At first, I have a Tokenizer that works very well. I saved it. But later I want to add an PostProcessor to it, and I write a template. It just...

Stale