Does Iceberg support Iceberg-based joined table of other Iceberg tables?
Feature Request / Improvement
Does Iceberg support Iceberg-based joined table of other Iceberg tables?
Background
- We want to make transactional data lake with AWS Glue + Iceberg + AWS S3.
- Let's call that as L1 (layer 1 S3 bucket)
- L0 (layer 0) data is from AWS DMS (full load + CDC)
- It will be L0 -> Glue + Iceberg -> L1
- There will be more than hundreds of tables in L1.
Question
- We want to make a L2 table which is a joined table from multiple L1 tables.
- We want the L2 table as transactional data lake as well.
- Is this possible?
- Is it possible to get CDC (difference) of Iceberg from time between timeA and timeB?
- If so, we can build L2 Iceberg table from the CDC file set.
Any guidance or opinion is welcomed.
Query engine
Spark
I assume that there is no such support yet. So I raised this question in Feature Request.
This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.
This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale'