dbt-databricks

Allow users to opt out of optimize calls on liquid-clustering/zordering

Open benc-db opened this issue 1 year ago • 5 comments

Describe the feature

Users want the capability to schedule their own optimize runs.

Describe alternatives you've considered

Currently dbt-databricks runs optimize after every call to merge in data, but some users would rather schedule it out of band.

Who will this benefit?

Users that update incremental tables frequently and want to batch their optimize runs.

benc-db avatar Jun 17 '24 15:06 benc-db

Isn't this handled by --vars '{DATABRICKS_SKIP_OPTIMIZE: true}'? If we create another way to do it I'd be all for it, as the current vars argument is a pain, especially in a Databricks workflow with a dbt task, but functionality-wise I thought we had this already.

NodeJSmith avatar Aug 06 '24 21:08 NodeJSmith

Oh yeah! I had forgotten that existed, but I do think it would be better to opt out on the config level.

benc-db avatar Aug 06 '24 21:08 benc-db

I support this being on the config level, especially as Databricks now offers predictive optimization, which runs optimize automatically as needed.

Bazsy avatar Sep 26 '24 18:09 Bazsy

Config level would be a lot easier for larger projects running many models with different requirements and/or writing to different catalogs where predictive optimization is enabled on only some of them.

mmansikka avatar Oct 16 '24 07:10 mmansikka

Hey @benc-db, why does dbt send the OPTIMIZE at all? I think it's kind of weird for dbt to send the OPTIMIZE to Databricks. If users want an OPTIMIZE, they can execute one themselves.

gaoshihang avatar May 12 '25 20:05 gaoshihang
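
For readers following the thread, here is a minimal sketch of how a config-level opt-out could sit alongside the existing DATABRICKS_SKIP_OPTIMIZE var. It is not the adapter's actual implementation: the function name `should_run_optimize` and the per-model key `skip_optimize` are hypothetical, chosen only for illustration. The sketch assumes a model is a candidate for optimize when it sets `liquid_clustered_by` or `zorder`.

```python
from typing import Any, Mapping


def _as_bool(value: Any) -> bool:
    """Interpret common truthy spellings, since var values may arrive as strings."""
    if isinstance(value, str):
        return value.strip().lower() in {"true", "1", "yes"}
    return bool(value)


def should_run_optimize(
    model_config: Mapping[str, Any],
    project_vars: Mapping[str, Any],
) -> bool:
    """Decide whether to issue OPTIMIZE after a model run (illustrative only)."""
    # Project-wide switch mentioned in the thread, passed via --vars.
    if _as_bool(project_vars.get("DATABRICKS_SKIP_OPTIMIZE", False)):
        return False
    # Hypothetical per-model opt-out; the key name is an assumption.
    if _as_bool(model_config.get("skip_optimize", False)):
        return False
    # Assumption: optimize only matters for liquid-clustered or z-ordered models.
    return bool(model_config.get("liquid_clustered_by") or model_config.get("zorder"))


if __name__ == "__main__":
    # A z-ordered model with no opt-out would still be optimized.
    print(should_run_optimize({"zorder": "id"}, {}))
    # The same model with the hypothetical per-model flag would be skipped.
    print(should_run_optimize({"zorder": "id", "skip_optimize": True}, {}))
```

With a per-model flag like this, a project could skip optimize only for the models that write to catalogs where predictive optimization is enabled, while leaving the rest unchanged.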