possible to store in S3 and query via Athena
Request
I would like to know if it is possible to store CLP compressed data in S3 and query it through Athena using SQL?
Possible implementation
NA
Hi @tannaga,
Thanks for your interest. We have support for storing to S3 in our cloud, but we haven't open-sourced it yet. We are also working on a connector for Presto/Athena. We'll let you know when a prototype is available to try out.
Out of curiosity (if you can share), how much data are you looking to query and what kind of queries do you typically run?
Thank you! Looking forward to the connector. If there is anyway I can contribute, please let me know.
CLP compression and search on compressed data seems promising. So I wanted to explore that for logs first and then expand it for other data type like metrics/traces. We have terabytes of logs per day. Was planning to experiment with S3/Athena.
Any update?
Hey @gerilya, we're working on adding support to CLP to allow it to read and write from S3. We hope to have something you can try in about a month or sooner. We haven't yet prioritized the plugin for Athena/Presto, but we'll be considering our priorities for the next quarter soon.
To guide our development, could you share what kind of use cases you have that you'd like to use CLP with S3 and/or Athena/Presto?