Rafael Alberto Rivera Soto

Results: 1 issue by Rafael Alberto Rivera Soto

Hello everyone, I'm looking to use the standalone S4D layer [here](https://github.com/HazyResearch/state-spaces/blob/main/src/models/sequence/ss/standalone/s4d.py) as a drop-in replacement for the attention mechanism in a transformer model. I'm wondering what is the best...