bigdata-docker-compose icon indicating copy to clipboard operation
bigdata-docker-compose copied to clipboard

Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.

Results 10 bigdata-docker-compose issues
Sort by recently updated
recently updated
newest added

I need to use matplotlib to create data visualization. I have many methods from internet. however they are not worked. Can you share the way to install python libraries in...

Hi, ever think of adding hue to this docker-compose.yml?

I have already create a new database in Hive, however, when I excute the following code in spark-shell, it seems that spark goes to the wrong metastore_db as shown in...

Hello! I'm new to docker and Big Data Dev. I want to use local IntelliJ IDEA to connect to the master and write spark code in scala. I've already created...

I was trying to excute hive-sql in hive cli and this happened ``` hive> SELECT ip, dt, count(*) as count > FROM case_data_sample > GROUP BY ip,dt > ORDER BY...

This is a known issue (see [README of the base image](https://github.com/panovvv/hadoop-hive-spark-docker#version-compatibility-notes)). I've managed establish a connection from spark to hive by simply upgrading to Spark 3.0.1, but I'm getting a...

When trying to run a basic spark function on Zeppelin : spark.range(1000 * 1000 * 1000).count(). I get the following error : java.lang.RuntimeException: java.io.IOException: org.apache.thrift.transport.TTransportException: java.net.ConnectException: Connection refused

Ran into another issue when using Zeppelin 9 image. When using the livy interpreter for SQL (`%livy.sql`), there is a stactrace (`java.lang.RuntimeException: Fail to callRemoteFunction, because connection is broken`) that...

hive> SELECT COUNT(*) FROM grades; Query ID = root_20251008102316_5a87bba2-fac6-467e-8d5c-07233e835b4d Total jobs = 1 Launching Job 1 out of 1 In order to change the average load for a reducer (in...