tensorflow-opencl icon indicating copy to clipboard operation
tensorflow-opencl copied to clipboard

Issue with running Tensorflow with OpenCL - Ubuntu 14.04.3 (Trusty) - AMD R5 Radeon M335 GPU

Open rahasayantan opened this issue 8 years ago • 8 comments


System information

  • Have I written custom code (as opposed to using a stock example script provided in TensorFlow): No

  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04):14.04.3-->Trusty

  • TensorFlow installed from (source or binary):Source

  • TensorFlow version (use command below):1.0 (Steps-> Downloaded tensorflow from https://github.com/benoitsteiner/tensorflow-opencl, ./configure - to configure project)

  • Bazel version (if compiling from source):0.4.5

  • CUDA/cuDNN version:NA

  • OPENCL Version: Number of platforms: 1 Platform Profile: FULL_PROFILE Platform Version: OpenCL 2.0 AMD-APP (1800.11) Platform Name: AMD Accelerated Parallel Processing Platform Vendor: Advanced Micro Devices, Inc. Platform Extensions: cl_khr_icd cl_amd_event_callback cl_amd_offline_devices

  • GPU model and memory: Platform Name: AMD Accelerated Parallel Processing Number of devices: 2 Device Type: CL_DEVICE_TYPE_GPU Board name: AMD Radeon (TM) R5 M335 Memory: 4096M

  • Exact command to reproduce: run the python script -- ipython keras_code.py

  • ** G++/GCC version**: g++-4.9 (Ubuntu 4.9.4-2ubuntu1~14.04.1) 4.9.4

I have compiled CPP programs, they work fine.

  • ComputeCPP: 0.1.1

-- ** Python**: I am using Anaconda distribution Python for 2.7.2. (Anaconda - 2.4.3)

Describe the problem

I have compile tensorflow, and deployed the same -> No issues here. when I try to run the code I run into the following error:

2017-04-23 14:01:15.180795: W ./tensorflow/core/common_runtime/sycl/sycl_util.h:44] No OpenCL GPU found that is supported by ComputeCpp, trying OpenCL CPU 2017-04-23 14:01:15.180843: F ./tensorflow/core/common_runtime/sycl/sycl_util.h:53] No OpenCL GPU nor CPU found that is supported by ComputeCpp Aborted (core dumped)

I have attached the code file. Please note this is a simplified version of the file. The logic is:

  1. Read data from files,
  2. Pass it through a NN I am using Keras as the Functional programming API on top of Tensorflow.

tensorflow-code-throwing-error.txt Please let me know if there are any fixes or if I can do something to get round this issue. Thanks and regards Sayantan

rahasayantan avatar Apr 23 '17 08:04 rahasayantan

More details: Output of Computecpp_info:


ComputeCpp Info (CE 0.1.1)


Toolchain information:

GLIBCXX: 20150426 This version of libstdc++ is supported.


Device Info:

Discovered 1 devices matching: platform : device type :


Device 0:

Device is supported : NO - Vendor is supported, device not CL_DEVICE_NAME : Hainan CL_DEVICE_VENDOR : Advanced Micro Devices, Inc. CL_DRIVER_VERSION : 1800.11 (VM) CL_DEVICE_TYPE : CL_DEVICE_TYPE_GPU




I dont think Compute cpp supports Hainan AMD GPU. Not sure if this is a tensorflow issues at all. If possible please advice how, of at all, can I proceed with this issue.

Regards

rahasayantan avatar Apr 23 '17 09:04 rahasayantan

Further updates: I upgraded ComputeCPP to 0.1.4 version. But I still have the same error. Please advise.

Regards Sayantan

rahasayantan avatar Apr 24 '17 04:04 rahasayantan

Hello @rahasayantan,

It is indeed an issue with computecpp so you could probably have more help here: https://github.com/lukeiwanski/tensorflow

Just a quick question though did you install ocl-icd-opencl-dev opencl-headers? Also the issue may come from your kernel version as this page advises to downgrade to 3.19: http://deep-beta.co.uk/setting-up-tensorflow-with-opencl-using-sycl/

Zakor94 avatar Apr 24 '17 11:04 Zakor94

@Zakor I did install ocl-icd-opencl-dev opencl-headers. I will see if I find any help on the sites you shared. Thanks for your help.

Regards Sayantan

rahasayantan avatar Apr 25 '17 18:04 rahasayantan

@rahasayantan : Could you share performance numbers in relation to pip-only tensorflow and optimized tensorflow? I am interested to see how much more performant tensorflow-opencl is with a beefier dGPU. Thank you ahead of time! For my iGPU, since all computation was shifted from CPU to the tiny iGPU, it turned out to be less performant. But there is hope for you :-)

thornhale avatar Apr 26 '17 06:04 thornhale

@thornhale: Currently the Tensorflow on GPU I have is crashing because of Device unsupported error. Raised an issue on Tensorflow Git. They have asked me to wait till the latest merge is complete and then I need to recompile and retest. I will let you know my findings once I have GPU setup and working.

Regards Sayantan

rahasayantan avatar Apr 27 '17 10:04 rahasayantan

Updates: I downloaded the latest version from https://github.com/lukeiwanski/tensorflow-opencl. This one gives coredump. There are challenges installing from tensorflow's main git repo and there seems to be issues on compiling protobuf.

rahasayantan avatar Jun 04 '17 06:06 rahasayantan

@rahasayantan do you have any updates on this?

p4vook avatar Nov 06 '20 18:11 p4vook