Poisson_flow icon indicating copy to clipboard operation
Poisson_flow copied to clipboard

Error running after environment is configured on the server

Open WanLang0 opened this issue 3 years ago • 1 comments

Hi,I use the Ubuntu 20.04 system, and the server in the Python 3.8/cuda11.6 environment runs this command python3 main.py --config ./configs/poisson/cifar10_ddpmpp.py --mode train --workdir poisson_ddpmpp The following problems have occurred. Would you please check them? Thank you

WARNING:tensorflow:From /root/miniconda3/envs/myconda/lib/python3.8/site-packages/tensorflow_gan/python/estimator/tpu_gan_estimator.py:42: The name tf.estimator.tpu.TPUEstimator is deprecated. Please use tf.compat.v1.estimator.tpu.TPUEstimator instead.

WARNING:tensorflow:From /root/miniconda3/envs/myconda/lib/python3.8/site-packages/tensorflow_gan/python/estimator/tpu_gan_estimator.py:42: The name tf.estimator.tpu.TPUEstimator is deprecated. Please use tf.compat.v1.estimator.tpu.TPUEstimator instead.

1 Physical GPUs, 1 Logical GPUs
I0129 10:36:33.654870 140394431960256 xla_bridge.py:356] Unable to initialize backend 'tpu_driver': NOT_FOUND: Unable to find driver in registry given worker: 
I0129 10:36:33.660157 140394431960256 xla_bridge.py:356] Unable to initialize backend 'rocm': NOT_FOUND: Could not find registered platform with name: "rocm". Available platform names are: Interpreter Host CUDA
I0129 10:36:33.661039 140394431960256 xla_bridge.py:356] Unable to initialize backend 'tpu': module 'jaxlib.xla_extension' has no attribute 'get_tpu_client'
I0129 10:36:33.661505 140394431960256 xla_bridge.py:356] Unable to initialize backend 'plugin': xla_extension has no attributes named get_plugin_device_client. Compile TensorFlow with //tensorflow/compiler/xla/python:enable_plugin_device set to true (defaults to false) to enable this.
2023-01-29 10:36:33.680217: W tensorflow/core/platform/cloud/google_auth_provider.cc:184] All attempts to get a Google authentication bearer token failed, returning an empty token. Retrieving token from files failed with "NOT_FOUND: Could not locate the credentials file.". Retrieving token from GCE failed with "FAILED_PRECONDITION: Error executing an HTTP request: libcurl code 6 meaning 'Couldn't resolve host 

WanLang0 avatar Jan 29 '23 11:01 WanLang0

Hi,

It seems that the snippet only contains partial error message. Could you please provide full one? Thanks.

In addition, we recommend using our latest model PFGM++ for research projects (https://github.com/Newbeeer/pfgmpp)

Best, Yilun

Newbeeer avatar Feb 09 '23 03:02 Newbeeer