api
api copied to clipboard
Query crawler
Changes
Fetch query from journal, validate query, execute query and push to bucket if required.
How to test these changes?
Tested locally
Related issues
@bugout-dev check
- [ ] Add env MOONSTREAM_S3_DATA_BUCKET
- [ ] Add env MOONSTREAM_S3_DATA_BUCKET_PREFIX
We need to cherry pick from this PR later.
Although we will not use queries_crawler to crawl public data, we will use it as a replacement for the existing Query API data producer (which is currently in mooncrawl/stats_worker/queries.py.
The queries_crawler should be a CLI which exposes the same functionality but in a more modular way (and should be invoked from systemd on prod).
We will revisit this after our current batch of urgent work.