Jize Cao

Results: 3 comments by Jize Cao

The loss is a valid value. Anomaly detection reports the following backtrace: ` queries = self._query_projection(queries) File "/home/caojize/anaconda3/envs/r2c/lib/python3.6/site-packages/torch/nn/modules/module.py", line 493, in __call__ result = self.forward(*input, **kwargs) File "/home/caojize/anaconda3/envs/r2c/lib/python3.6/site-packages/torch/nn/modules/linear.py", line...

What do you mean by "which queries tensor creates this issue?" It seems that the output of ``_query_projection`` doesn't contain any inf/NaN values. I checked the weights of that function,...

It has been about a month and still no one has replied... The issue is relatively critical because I have tried different language models with the current codebase and none of them has...