CaffeOnSpark

Distributed deep learning on Hadoop and Spark clusters.

Results 79 CaffeOnSpark issues

I got an error when trying to join two datasets, one from Postgres and one from a CSV file. The error message is: org.apache.spark.SparkException: Job aborted due to...

[root@node93 ~]# pyspark --master yarn \
> --driver-library-path "${CAFFE_ON_SPARK}/caffe-grid/target/caffe-grid-0.1-SNAPSHOT-jar-with-dependencies.jar" \
> --driver-class-path "${CAFFE_ON_SPARK}/caffe-grid/target/caffe-grid-0.1-SNAPSHOT-jar-with-dependencies.jar" \
> --conf spark.driver.extraLibraryPath="${LD_LIBRARY_PATH}" \
> --py-files ${CAFFE_ON_SPARK}/caffe-grid/target/caffeonsparkpythonapi.zip \
> --files ${CAFFE_ON_SPARK}/data/caffe/_caffe.so \
> --jars "${CAFFE_ON_SPARK}/caffe-grid/target/caffe-grid-0.1-SNAPSHOT-jar-with-dependencies.jar"
Python...

Hi, I'd like to share a couple of things. First, as I'm new to Hadoop, I followed a couple of tutorials to install it for teaching purposes. Then I overcame the...

I'm installing Hive on Windows, following the tutorial at http://sandeeppatil101.blogspot.mx/2017/05/step-1-download-hive-2.html... When I reached the point of running "schematool -initSchema -dbType mysql", I got the following error: ---- 2018-05-14 10:18:22,583 main WARN Unable...

Hello, after I run 100 iterations, there is an error: "java.lang.UnsupportedOperationException: empty.reduceLeft". It looks like something is wrong with my DataFrame data, but the same data works well in another...
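For context on the error quoted above: Scala raises `UnsupportedOperationException: empty.reduceLeft` when `reduceLeft` is called on an empty collection. The sketch below is a plain-Python analogue of that failure mode, not CaffeOnSpark code; `safe_reduce` is a hypothetical helper introduced only for illustration, and whether an empty batch or partition is the actual cause in this report is an assumption.

```python
from functools import reduce


def safe_reduce(values, combine):
    """Reduce `values` with `combine`, returning None when there is nothing to reduce.

    Hypothetical helper: it mirrors the idea behind Scala's reduceLeftOption,
    which returns an empty Option instead of throwing on an empty collection.
    """
    items = list(values)
    if not items:
        # An unguarded reduce would raise here, much like empty.reduceLeft in Scala.
        return None
    return reduce(combine, items)


def unguarded_reduce_fails_on_empty():
    """Show that reduce without an initial value raises on empty input."""
    try:
        reduce(max, [])
    except TypeError:
        return True
    return False
```

Checking whether the input is empty before reducing (or using a variant that returns an optional result) turns the crash into a condition the caller can handle.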

Can I use data_param instead of memory_data_param to define the DataSource in a DataLayer? If not, how can I use images of different sizes as a training dataset?

Hi there. I'm using CaffeOnSpark to extract deep features (4096 dimensions) from pictures. The model I use is [vgg_face](http://www.robots.ox.ac.uk/~vgg/software/vgg_face/), and the content of solver.prototxt is `net: "VGG_FACE_deploy.prototxt" type: "Adam" test_iter: 30 test_interval:...

If the solver config file contains mistakes, the cluster does not fail quickly; instead, after a long time, it returns core dumps. How can I solve this problem?

I ran into the same problem. RDMABuffer::RDMABuffer(RDMAChannel* channel, uint8_t* addr, size_t size) : channel_(channel), addr_(addr), size_(size) { //******************************************************* // case 1: Use cpu memory ibv_reg_mr() is ok, but some code is...