spline-spark-agent

Spline agent for Apache Spark

Results 82 spline-spark-agent issues

This PR was automatically created by Snyk using the credentials of a real user. Snyk has created this PR to upgrade io.github.classgraph:classgraph from 4.8.168 to 4.8.170. :information_source: Keep your dependencies up-to-date....

This PR was automatically created by Snyk using the credentials of a real user. Snyk has created this PR to upgrade commons-io:commons-io from 2.15.1 to 2.16.0. :information_source: Keep your dependencies up-to-date....

Addressing #705:
* Makes changes to failing tests to perform different behavior based on Spark version
* No new functionality actually required to meet compatibility
TODO:
- [x] Create...

With code like this, Spline cannot get the `Input Data Source`:
```python
from pyspark.sql import SparkSession

spark = (SparkSession.builder
    .config('spark.sql.queryExecutionListeners', 'za.co.absa.spline.harvester.listener.SplineQueryExecutionListener')
    .config('spark.spline.producer.url', 'http://master-1-1:8080/producer')
    .enableHiveSupport()
    .getOrCreate()
)

def generate_data():
    data = [...
```

feature
investigating

I found that Spline in PySpark doesn't track some in-memory operations such as `collect`, `head` and `toPandas`. The `count` and `show` operations are tracked as expected. I used Spline with bundle-3.2...

bug
help wanted

There seem to be binary-incompatible changes in the APIs of Delta and Spark SQL that Spline core, compiled against Spark 2.4, cannot work with, e.g. RDDPlugin. Todo: 1. Check...

enhancement
help wanted
dependency: Spark 3.4+

I have a statement in a Databricks notebook:
```sql
create table lineage_data.lineagedemo.dinner_1 AS
SELECT recipe_id, concat(app," + ", main," + ",dessert) AS full_menu
FROM lineage_data.lineagedemo.menu
```
the...

bug
help wanted
dependency: Spark 3.0+

Hi @wajda, I have some Spark code, written in Scala, that I got from someone in the organization. They have a Configuration.conf like this. If I add in the...

When adding the Spline agent bundle to an AWS Glue Python script (Spark 3.3, Python 3), lineage is produced when using the standard patterns like `df = spark.read.csv(file_path, header=True, inferSchema=True)`...

feature

Hi, I have an AWS EMR v6.11 cluster (Spark 3.2) where I want to run a Spark application via spark-submit with a Java 17 runtime. My application uses the Spline agent as a...