spark-avro issues

How to parse Avro messages while reading a stream of messages from Kafka in Spark 2.2.0?

21

The code below reads messages from Kafka. The messages are in Avro, so how do I parse them and put them into a DataFrame in Spark 2.2.0?...

kant111

Add method for conversion of RDD[GenericRecord] to DataFrame

9

In response to issues #211 and #201

cbyn

Consume Avro stream from Kafka topic

3

Hi, the current version reads .avro files from HDFS or any other file system path. I am storing my Avro file stream in Kafka. Do you have a utility to read...

ananth3010

spark-structured-streaming with Avro kafka messages

2

Hello, I want to use spark-structured-streaming to process data fetched from Kafka messages and then store it as rows in a Cassandra database. I need one clarification. The messages in Kafka are serialized...

amoussoubaruch

Support for logical datatypes like Decimal type

14

Avro doesn't support very big numbers directly. It supports them through logicalTypes, where you can specify the value as a string type but send the actual data type of the field as...

cpbhagtani

read decimal logicalType

2

Hi, the decimal logicalTypes are seen in the DataFrame as binary and the values are in hexadecimal. Example for "**3.12**":
```
org.apache.spark.sql.DataFrame = [col1: binary]
df.select("col1").show()
+-------+
|   col1|
+-------+
|[01...
```

eliviu
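
For context on the binary column reported above: the Avro specification stores a `decimal` logical type as the big-endian two's-complement bytes of the unscaled integer, with the scale kept in the schema. The following is a minimal Python sketch of that decoding, not spark-avro code. The function name `decode_avro_decimal` is hypothetical, and the scale of 2 and the bytes `0x01 0x38` are assumptions chosen so the example yields 3.12; they are not taken from the truncated output in the issue.

```python
from decimal import Decimal


def decode_avro_decimal(raw: bytes, scale: int) -> Decimal:
    """Decode Avro decimal bytes (big-endian two's complement) into a Decimal.

    `scale` comes from the schema's logicalType attributes, not from the bytes.
    """
    # The bytes hold the unscaled integer, e.g. 312 for 3.12 at scale 2.
    unscaled = int.from_bytes(raw, byteorder="big", signed=True)
    # Shift the decimal point left by `scale` digits without float rounding.
    return Decimal(unscaled).scaleb(-scale)


# Assumed input: unscaled 312 is 0x0138, so two bytes 0x01 0x38 with scale 2.
value = decode_avro_decimal(bytes([0x01, 0x38]), scale=2)
print(value)
```

The same function handles negative values, because `signed=True` interprets the leading bit as the sign.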

spark-avro merged into spark 2.4

Would it be possible to add a notice at the beginning of the readme to warn that this datasource is merged into spark 2.4 and all users of spark 2.4+...

fxbonnet

Avro schemas with a self-reference yield StackOverflowError

When I have a schema which has a reference to itself, it causes an infinite recursion and thus a StackOverflowError. SchemaConverters should have some sort of bail-out if it reads...

quadrokeith
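
One way to picture the kind of bail-out the issue above asks for is to track which records are currently being expanded and stop when a field refers back to one of them. The sketch below is a Python illustration over a schema parsed from JSON, not the SchemaConverters implementation. The function name `recursive_names` and the `Node` schema are hypothetical, and namespaces are ignored for brevity.

```python
def recursive_names(schema, open_records=frozenset()):
    """Return the names of records that are referenced from inside themselves."""
    if isinstance(schema, str):
        # A bare string is a primitive or a reference to a named type.
        return {schema} if schema in open_records else set()
    if isinstance(schema, list):
        # A union: check every branch.
        found = set()
        for branch in schema:
            found |= recursive_names(branch, open_records)
        return found
    kind = schema.get("type")
    if kind == "record":
        # Mark this record as open while its fields are being walked.
        inner = open_records | {schema["name"]}
        found = set()
        for field in schema["fields"]:
            found |= recursive_names(field["type"], inner)
        return found
    if kind == "array":
        return recursive_names(schema["items"], open_records)
    if kind == "map":
        return recursive_names(schema["values"], open_records)
    if kind is None:
        return set()
    # Nested type definition or a named reference wrapped in {"type": ...}.
    return recursive_names(kind, open_records)


# Hypothetical linked-list schema whose "next" field points back to Node.
node_schema = {
    "type": "record",
    "name": "Node",
    "fields": [
        {"name": "value", "type": "long"},
        {"name": "next", "type": ["null", "Node"]},
    ],
}
print(recursive_names(node_schema))
```

A converter could call a check like this first and raise a descriptive error when the result is non-empty, instead of recursing until the stack overflows.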

can't add avro_2.11:4.0.0 in spark-shell

3

LM-SJC-11001988:~ mayangupta$ spark-shell --packages com.databricks:spark-avro_2.11:4.0.0 Ivy Default Cache set to: /Users/mayangupta/.ivy2/cache The jars for the packages stored in: /Users/mayangupta/.ivy2/jars :: loading settings :: url = jar:file:/Library/spark-2.2.1-bin-hadoop2.7/jars/ivy-2.4.0.jar!/org/apache/ivy/core/settings/ivysettings.xml com.databricks#spark-avro_2.11 added as a...

seekmayank

Custom read schema support for Spark 1.6

We are using Spark 1.6 in our clusters and want to use this library to read Avro files. As part of reading the Avro files, we want to be able to...

nshah99
