wxwmd

Results 4 issues of wxwmd

thanks for your work. it helps me a lot.👍

The attention computation is the most time-consuming part during inference. The attention implementation in this project is ```python class DecoderScaledDotProductAttention(nn.Module): def __init__(self, temperature): super().__init__() self.temperature = temperature self.INF = float("inf")...

### Description i want to build wheel of ray and follow the instruction of [README-building-wheels.md](https://github.com/ray-project/ray/blob/master/python/README-building-wheels.md). However, i get the following error: This is because Bazel fails to load the dependencies...

core
community-contribution