databricks-accelerators
databricks-accelerators copied to clipboard
Accelerate the use of Databricks for customers [public repo]
Two joint dfs should be grouped by a third (clock) df. But Error is thrown: ```java.lang.NoClassDefFoundError: Could not initialize class com.twosigma.flint.rdd.function.group.Intervalize$``` ```flint_0_6_0_databricks-68352.jar/ts/flint/dataframe.py in groupByInterval(self, clock, key, inclusion, rounding) 724 with...
documentation enhanced to be simplify steps to make it compatible to currently available daabricks runtimes.
What versions of Spark (or Databricks) has the current Jar in this repository been tested against?
Thanks @kevrasm for solving the clock issue. I tried to use the new jar but I am facing another issue with databricks 5.2 ML. After successfully creating a clock, I...
**The groupByInterval is not working with clock.** **Code** clock = clocks.uniform(sqlContext, frequency="1day", offset="0ns", begin_date_time="2016-01-01", end_date_time="2017-01-01") intervalized = flint_ShipMode.groupByInterval(clock) print(type(ShipMode)) ShipMode.printSchema() ShipMode.count() class 'pyspark.sql.dataframe.DataFrame'> root |-- uid: string (nullable = true)...