Mitch Naylor

Results 2 issues of Mitch Naylor

# What does this PR do? Fixes #19982 This pull request adds [Mega: Moving Average Equipped Gated Attention](https://arxiv.org/abs/2209.10655), which is the current leader of the [LRA benchmark](https://paperswithcode.com/sota/long-range-modeling-on-lra). Adapted from the...

Hi Jimmy, I'm trying to illustrate GOSDT with the diabetes dataset located [here](https://www.kaggle.com/uciml/pima-indians-diabetes-database), and it seems that the time limit is being ignored. I've tried with continuous features, as well...