CaffeOnSpark
CaffeOnSpark copied to clipboard
Distributed deep learning on Hadoop and Spark clusters.
I got an error when I was trying to join two datasets which are from postgres and csv file the error message is like this: org.apache.spark.SparkException: Job aborted due to...
[root@node93 ~]# pyspark --master yarn \ > --driver-library-path "${CAFFE_ON_SPARK}/caffe-grid/target/caffe-grid-0.1-SNAPSHOT-jar-with-dependencies.jar" \ > --driver-class-path "${CAFFE_ON_SPARK}/caffe-grid/target/caffe-grid-0.1-SNAPSHOT-jar-with-dependencies.jar" \ > --conf spark.driver.extraLibraryPath="${LD_LIBRARY_PATH}" \ > --py-files ${CAFFE_ON_SPARK}/caffe-grid/target/caffeonsparkpythonapi.zip \ > --files ${CAFFE_ON_SPARK}/data/caffe/_caffe.so \ > --jars "${CAFFE_ON_SPARK}/caffe-grid/target/caffe-grid-0.1-SNAPSHOT-jar-with-dependencies.jar" Python...
Hi... I Like to share a couple of things: First as I'm new to hadoop, for teaching purposes I followed a couple of tutorials for installing, Then, I overcome the...
I'm installing hive on windows, following tutorial from http://sandeeppatil101.blogspot.mx/2017/05/step-1-download-hive-2.html...when I reached the póint to run "schematool -initSchema -dbType mysql" I've got the following error: ---- 2018-05-14 10:18:22,583 main WARN Unable...
hello, After i run 100 iters, there is an err "java.lang.UnsupportedOperationException: empty.reduceLeft", it look like somthing wrong with my DataFrame data, but the same data can work well in another...
Can't figure out how to fix this. I'm trying to download the program Alice on my MacBook Pro and I already updated java but I keep getting an error when...
Can I use data_param instead of memory_data_param to define DataSource in DataLayer? If not, how to apply different size images as training dataset?
Hi, there. I'm using CaffeOnSpark to extract deep feature(dimention is 4096) from pictures. The model I use is [vgg_face](http://www.robots.ox.ac.uk/~vgg/software/vgg_face/),the content of solver.prototxt is `net: "VGG_FACE_deploy.prototxt" type: "Adam" test_iter: 30 test_interval:...
If solver config file have some mistakes, cluster won't return failed soon, after a long time,return core dumps. How can I solve this problem.
I met the same problem. RDMABuffer::RDMABuffer(RDMAChannel* channel, uint8_t* addr, size_t size) : channel_(channel), addr_(addr), size_(size) { //******************************************************* // case 1: Use cpu memory ibv_reg_mr() is ok, but some code is...