bigdata-docker-compose
bigdata-docker-compose copied to clipboard
Hadoop, Hive, Spark, Zeppelin and Livy: all in one Docker-compose file.
I need to use matplotlib to create data visualization. I have many methods from internet. however they are not worked. Can you share the way to install python libraries in...
Hi, ever think of adding hue to this docker-compose.yml?
I have already create a new database in Hive, however, when I excute the following code in spark-shell, it seems that spark goes to the wrong metastore_db as shown in...
Hello! I'm new to docker and Big Data Dev. I want to use local IntelliJ IDEA to connect to the master and write spark code in scala. I've already created...
I was trying to excute hive-sql in hive cli and this happened ``` hive> SELECT ip, dt, count(*) as count > FROM case_data_sample > GROUP BY ip,dt > ORDER BY...
This is a known issue (see [README of the base image](https://github.com/panovvv/hadoop-hive-spark-docker#version-compatibility-notes)). I've managed establish a connection from spark to hive by simply upgrading to Spark 3.0.1, but I'm getting a...
When trying to run a basic spark function on Zeppelin : spark.range(1000 * 1000 * 1000).count(). I get the following error : java.lang.RuntimeException: java.io.IOException: org.apache.thrift.transport.TTransportException: java.net.ConnectException: Connection refused
Ran into another issue when using Zeppelin 9 image. When using the livy interpreter for SQL (`%livy.sql`), there is a stactrace (`java.lang.RuntimeException: Fail to callRemoteFunction, because connection is broken`) that...
hive> SELECT COUNT(*) FROM grades; Query ID = root_20251008102316_5a87bba2-fac6-467e-8d5c-07233e835b4d Total jobs = 1 Launching Job 1 out of 1 In order to change the average load for a reducer (in...