Yannic Kilcher
Yannic Kilcher
Reddit could provide a good source for training data, especially since the tree-like structure allows for multiple continuations of a conversation, which is amenable to ranking. Probably, not every subreddit...
We want to be able to run fully on user-submitted data. Thus, when a user is asked to provide a prompt, and does so, we would like to be able...
May community members spend a lot of time scraping, building, or otherwise assembling datasets that could be useful for training the assistant. We want to collect all of this data...
Text labels are described in #40 . Make sure the user can submit those for every post they see.
we want to run integration tests of the entire stack: backend + discord bot frontend. this requires pulling both up, including possibly a temporary postgres database for the backend, initializing...
we want to run integration tests of the entire stack: backend + nextjs frontend. this requires pulling both up, including possibly a temporary postgres database for each, initializing the databases,...
- [x] research pros and cons of different testing frameworks for nextjs applications - [x] decide on a test framework (jest) - [ ] implement a few example tests -...
there are several frameworks that simulate browser-interactions and are able to quickly test whether the UI is behaving as expected. - [x] research the pros and cons of different frameworks...
use pytest for this note: this issue concerns unit tests, meaning if e.g. a database is required, it should be mocked
The frontends will want to display leaderboards. The backend needs to be able to compute these, and should expose API endpoints for the frontends to request their data. Leaderboards can...