bigdata-docker-compose issues

How to install matplotlib and other python libraries?

I need to use matplotlib to create data visualization. I have many methods from internet. however they are not worked. Can you share the way to install python libraries in...

khairulas

master:7077 port should be master:8088 in docker-compose

shubhendu-jain

Hue support

Hi, ever think of adding hue to this docker-compose.yml?

natilivni

Spark is not connected to Hive correctly.

I have already create a new database in Hive, however, when I excute the following code in spark-shell, it seems that spark goes to the wrong metastore_db as shown in...

dukechain2333

How to use local IntelliJ IDEA to write spark(scala)?

Hello! I'm new to docker and Big Data Dev. I want to use local IntelliJ IDEA to connect to the master and write spark code in scala. I've already created...

dukechain2333

FAILED: SemanticException Failed to get a spark session: org.apache.hadoop.hive.ql.metadata.HiveException: Failed to create Spark client for Spark session 5cc55cce-4bfa-4609-9322-7931a736689f

I was trying to excute hive-sql in hive cli and this happened ``` hive> SELECT ip, dt, count(*) as count > FROM case_data_sample > GROUP BY ip,dt > ORDER BY...

dukechain2333

spark sql can not read hive tables

1

This is a known issue (see [README of the base image](https://github.com/panovvv/hadoop-hive-spark-docker#version-compatibility-notes)). I've managed establish a connection from spark to hive by simply upgrading to Spark 3.0.1, but I'm getting a...

maxhardt

Zeppelin Spark ISSUE

When trying to run a basic spark function on Zeppelin : spark.range(1000 * 1000 * 1000).count(). I get the following error : java.lang.RuntimeException: java.io.IOException: org.apache.thrift.transport.TTransportException: java.net.ConnectException: Connection refused

Moltomay

Default setting for Zeppelin livy.sql interpreter causes RuntimeException

2

Ran into another issue when using Zeppelin 9 image. When using the livy interpreter for SQL (`%livy.sql`), there is a stactrace (`java.lang.RuntimeException: Fail to callRemoteFunction, because connection is broken`) that...

AtticusJoy

Hive COUNT query fails with both Spark and MapReduce execution engines: Spark client creation failed and MapRedTask returns error code 2

1

hive> SELECT COUNT(*) FROM grades; Query ID = root_20251008102316_5a87bba2-fac6-467e-8d5c-07233e835b4d Total jobs = 1 Launching Job 1 out of 1 In order to change the average load for a reducer (in...

fwv

bigdata-docker-compose
bigdata-docker-compose copied to clipboard

Metadata

How to install matplotlib and other python libraries?

master:7077 port should be master:8088 in docker-compose

Hue support

Spark is not connected to Hive correctly.

How to use local IntelliJ IDEA to write spark(scala)?

FAILED: SemanticException Failed to get a spark session: org.apache.hadoop.hive.ql.metadata.HiveException: Failed to create Spark client for Spark session 5cc55cce-4bfa-4609-9322-7931a736689f

spark sql can not read hive tables

Zeppelin Spark ISSUE

Default setting for Zeppelin livy.sql interpreter causes RuntimeException

Hive COUNT query fails with both Spark and MapReduce execution engines: Spark client creation failed and MapRedTask returns error code 2

← Metadata

Owner

Metadata

bigdata-docker-compose bigdata-docker-compose copied to clipboard

Metadata

← Metadata

Owner

Metadata

bigdata-docker-compose
bigdata-docker-compose copied to clipboard