eel-sdk
Big Data Toolkit for the JVM
You don't filter a _SUCCESS file, can't read a parquet dir?
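For context on the `_SUCCESS` question: Hadoop-style readers conventionally treat files whose names start with `_` or `.` as hidden and skip them when listing a data directory. The sketch below illustrates that convention only; it is not eel-sdk's implementation, and the class name, method names and sample file names are hypothetical.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical helper: illustrates the "skip hidden files" convention, not eel-sdk code.
final class HiddenFileFilterSketch {

    // A name counts as a data file unless it is empty or starts with '_' or '.'.
    static boolean isDataFile(String fileName) {
        return !fileName.isEmpty()
                && !fileName.startsWith("_")
                && !fileName.startsWith(".");
    }

    // Keeps only the names that look like data files, preserving order.
    static List<String> dataFiles(List<String> fileNames) {
        return fileNames.stream()
                .filter(HiddenFileFilterSketch::isDataFile)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Sample names are made up for illustration.
        List<String> listing = Arrays.asList(
                "_SUCCESS", "part-00000.snappy.parquet", ".part-00000.snappy.parquet.crc");
        System.out.println(dataFiles(listing));
    }
}
```

A reader that applies a filter like this before opening files would never try to parse `_SUCCESS` as Parquet.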
I'm trying to use `JsonSource` and write it as Parquet using `ParquetSink`. The issue I'm running into is that I get a `ClassCastException` whenever the Parquet writer encounters a `Row`...
I get this error when using HiveSink to write to a Hive table with a `MAP` column.
```
java.lang.RuntimeException: Unsupported hive type [map]
    at scala.sys.package$.error(package.scala:27)
    at io.eels.component.hive.HiveSchemaFns$.fromHiveType(HiveSchemaFns.scala:56)
    at io.eels.component.hive.HiveSchemaFns$.fromHive(HiveSchemaFns.scala:25)
    at...
```
- Spark can partition data on a **JDBC** data frame by specifying the following binding parameters, which are all **longs**: **lowerBound**, **upperBound**, **numPartitions** and the **partition key** column -...
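To make the partitioning idea concrete, here is a minimal Java sketch of how the four parameters could be turned into one `WHERE` predicate per partition. It is a simplified illustration, not Spark's or eel-sdk's actual code: the class and method names are hypothetical, and it assumes the bounds only set the stride, so the first and last partitions are left open-ended and rows outside the bounds are still read.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of range partitioning for a JDBC read.
final class JdbcPartitionSketch {

    // Builds one WHERE predicate per partition for a numeric partition column.
    static List<String> partitionPredicates(String column, long lowerBound,
                                            long upperBound, int numPartitions) {
        if (numPartitions < 1) {
            throw new IllegalArgumentException("numPartitions must be >= 1");
        }
        if (lowerBound >= upperBound) {
            throw new IllegalArgumentException("lowerBound must be < upperBound");
        }
        List<String> predicates = new ArrayList<>();
        if (numPartitions == 1) {
            predicates.add("1 = 1"); // single partition: no range filter
            return predicates;
        }
        // Width of each range; at least 1 so the ranges always advance.
        // Assumes upperBound - lowerBound does not overflow a long.
        long stride = Math.max(1L, (upperBound - lowerBound) / numPartitions);
        long current = lowerBound;
        for (int i = 0; i < numPartitions; i++) {
            String lower = (i == 0) ? null : column + " >= " + current;
            current += stride;
            String upper = (i == numPartitions - 1) ? null : column + " < " + current;
            if (lower == null) {
                // First partition also picks up NULL keys and values below lowerBound.
                predicates.add(upper + " OR " + column + " IS NULL");
            } else if (upper == null) {
                // Last partition picks up everything from its lower edge upwards.
                predicates.add(lower);
            } else {
                predicates.add(lower + " AND " + upper);
            }
        }
        return predicates;
    }

    public static void main(String[] args) {
        for (String p : partitionPredicates("id", 0L, 100L, 4)) {
            System.out.println(p);
        }
    }
}
```

Each predicate can then be appended to the base query and executed on its own connection, which is what allows the read to run in parallel.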
Fixes #385
I'm trying to use eel to write data from a JDBC query to ORC format. Unfortunately, when I run the code, it fails with `scala.MatchError (of class java.sql.Date)` because one...
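As background for this kind of failure: ORC stores `DATE` values as a count of days since 1970-01-01, so a writer has to map `java.sql.Date` to that representation before it can serialise the value. The sketch below shows one plausible conversion using only the JDK; the class and method names are hypothetical, and this is not the fix that eel-sdk applied.

```java
import java.sql.Date;

// Hypothetical helper: converts a JDBC date to days since the Unix epoch.
final class SqlDateSketch {

    // Uses the date's calendar fields, so the result does not depend on the JVM time zone.
    static long toEpochDay(Date date) {
        return date.toLocalDate().toEpochDay();
    }

    public static void main(String[] args) {
        System.out.println(toEpochDay(Date.valueOf("2017-01-01")));
    }
}
```

A serialiser that pattern-matches on value types would need a case like this for `java.sql.Date` alongside its handling of strings and numbers.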
An overwrite option for all sink types that write to HDFS

**Proposal** Sink.withOverwrite

**AffectedSinks**
- Parquet
- AvroParquet
- Avro
- Orc
- Csv
Hi! I set up a ParquetSource with a JDBCSink and ran into memory issues. The Parquet files are stored in an S3 bucket and were written by Spark (snappy-compressed...
Users of EEL will find it useful to have eel distributed without the hadoop dependencies (hive/parquet/orc etc) as these are provided by the hadoop distribution. Whilst they are needed at...
Hi, I am encountering a problem. I need to get JBoss started before I can attempt to run another piece of software. The following is the error message after executing the Standalone...