
Does Iceberg support Iceberg-based joined table of other Iceberg tables?

Open byongwu opened this issue 3 years ago • 1 comments

Feature Request / Improvement

Does Iceberg support an Iceberg table that is built by joining other Iceberg tables?

Background

  • We want to build a transactional data lake with AWS Glue + Iceberg + AWS S3.
    • Let's call it L1 (the layer 1 S3 bucket).
  • L0 (layer 0) data comes from AWS DMS (full load + CDC).
  • The flow will be L0 -> Glue + Iceberg -> L1.
  • There will be hundreds of tables in L1.
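
For the L0 -> L1 step, one common way to apply CDC rows to an Iceberg table from Spark is a `MERGE INTO` statement. The sketch below only builds the SQL string. It assumes the DMS output has an `Op` column with `I`/`U`/`D` values and that the staged CDC batch has already been reduced to the latest row per key. The helper name, table names, and column names are hypothetical.

```python
from typing import Sequence


def build_cdc_merge_sql(
    target: str,
    source_view: str,
    key_cols: Sequence[str],
    data_cols: Sequence[str],
) -> str:
    """Build a Spark SQL MERGE INTO statement that applies CDC rows to an Iceberg table.

    Assumptions (not from the original post): the source view has an `Op`
    column ('I', 'U', 'D') and at most one row per key.
    """
    on_clause = " AND ".join(f"t.{c} = s.{c}" for c in key_cols)
    set_clause = ", ".join(f"t.{c} = s.{c}" for c in data_cols)
    all_cols = list(key_cols) + list(data_cols)
    insert_cols = ", ".join(all_cols)
    insert_vals = ", ".join(f"s.{c}" for c in all_cols)
    return (
        f"MERGE INTO {target} t USING {source_view} s ON {on_clause} "
        f"WHEN MATCHED AND s.Op = 'D' THEN DELETE "
        f"WHEN MATCHED THEN UPDATE SET {set_clause} "
        f"WHEN NOT MATCHED AND s.Op <> 'D' THEN "
        f"INSERT ({insert_cols}) VALUES ({insert_vals})"
    )


# Hypothetical usage inside a Glue/Spark job with an Iceberg catalog configured:
# spark.sql(build_cdc_merge_sql("glue.l1.orders", "orders_cdc", ["id"], ["status"]))
```

The statement would be passed to `spark.sql(...)` in a session where the Iceberg SQL extensions are enabled.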

Question

  • We want to create an L2 table that is a join of multiple L1 tables.
  • We want the L2 table to be part of the transactional data lake as well.
    • Is this possible?
  • Is it possible to get the CDC (the set of changes) of an Iceberg table between timeA and timeB?
    • If so, we could build the L2 Iceberg table from that CDC file set.
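
As a rough sketch of the "changes between timeA and timeB" idea: Iceberg's Spark integration supports incremental reads between two snapshots via the `start-snapshot-id` and `end-snapshot-id` read options (append-only snapshots), and newer Iceberg releases also provide a `create_changelog_view` procedure that accepts `start-timestamp` and `end-timestamp` options. The helpers below only build the read options and the `CALL` statement. The function names, catalog name, and table names are hypothetical.

```python
from typing import Dict


def incremental_append_options(start_snapshot_id: int, end_snapshot_id: int) -> Dict[str, str]:
    """Read options for an Iceberg incremental read.

    Returns rows appended after start_snapshot_id (exclusive) up to
    end_snapshot_id (inclusive). Only append snapshots are supported.
    """
    return {
        "start-snapshot-id": str(start_snapshot_id),
        "end-snapshot-id": str(end_snapshot_id),
    }


def build_changelog_view_sql(
    catalog: str, table: str, start_ts_ms: int, end_ts_ms: int, view_name: str
) -> str:
    """Build a CALL statement for Iceberg's create_changelog_view procedure.

    Timestamps are epoch milliseconds. The resulting view exposes the changed
    rows together with a `_change_type` column.
    """
    return (
        f"CALL {catalog}.system.create_changelog_view("
        f"table => '{table}', "
        f"options => map('start-timestamp', '{start_ts_ms}', "
        f"'end-timestamp', '{end_ts_ms}'), "
        f"changelog_view => '{view_name}')"
    )


# Hypothetical usage in a Spark session with an Iceberg catalog named `glue`:
# opts = incremental_append_options(1111, 2222)
# appended = spark.read.format("iceberg").options(**opts).load("glue.l1.orders")
#
# spark.sql(build_changelog_view_sql("glue", "l1.orders", 1663862400000,
#                                    1663948800000, "orders_changes"))
# changes = spark.sql("SELECT * FROM orders_changes")
```

The changed rows from each L1 table could then be joined and merged into the L2 Iceberg table. Whether that keeps L2 correct depends on the join semantics, so this should be read as a starting point, not a complete design.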

Any guidance or opinions are welcome.

Query engine

Spark

byongwu avatar Sep 22 '22 16:09 byongwu

I assume there is no such support yet, so I raised this question as a feature request.

byongwu avatar Sep 22 '22 16:09 byongwu

This issue has been automatically marked as stale because it has been open for 180 days with no activity. It will be closed in next 14 days if no further activity occurs. To permanently prevent this issue from being considered stale, add the label 'not-stale', but commenting on the issue is preferred when possible.

github-actions[bot] avatar Mar 22 '23 00:03 github-actions[bot]

This issue has been closed because it has not received any activity in the last 14 days since being marked as 'stale'

github-actions[bot] avatar Apr 05 '23 00:04 github-actions[bot]