oabuhamdan
oabuhamdan
Hi @svekars. I signed the CLA. Thanks
I have the same issue. Any solution yet?
> changing line 956 of /home/jiapy/virtualEnv/py3.6torch1.1/lib/python3.6/site-packages/torch/utils/cpp_extension.py from ['ninja', '-v'] to ['ninja', '--version'] seems solved the problem. Worked for me!
You can do one of two: First solution ``` inputs = keras.Input(shape=(500, ), dtype="int64") one_hot = tf.keras.layers.Lambda(lambda x: tf.one_hot(x, depth=max_tokens), output_shape=(500, max_tokens))(inputs) x = keras.layers.Bidirectional(keras.layers.LSTM(32))(one_hot) x = keras.layers.Dropout(0.5)(x) outputs =...
I came here to open this issue, and you already did. I second this issue. I fixed the package itself by adding ``` if scaler is not None and self.precision...
@delock that's not the case with me. I tried `DS_BUILD_CPU_ADAM=1 pip install deepspeed` and I got ``` gcc -pthread -B /opt/shared/anaconda/2024.02/compiler_compat -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /opt/shared/anaconda/2024.02/include -fPIC...
Hi @delock I followed your advice. The compilation is stuck at some point when building from source or when using pip. This is when using ```shell DS_BUILD_CPU_ADAM=1 pip install deepspeed...
Now when using ``` DS_BUILD_CPU_ADAM=1 ./install.sh ``` It gives the same above error but telling which line ``` gcc: fatal error: Killed signal terminated program cc1plus compilation terminated. error: command...
When running ``` DS_BUILD_CPU_ADAM=1 pip install . -vv ``` or ``` DS_BUILD_CPU_ADAM=1 pip install deepspeed -vv ``` I have this error ``` running build_ext building 'deepspeed.ops.adam.cpu_adam_op' extension creating build/temp.linux-x86_64-cpython-311 creating...
OK so the building is stuck sometimes due to having ~/.cache/torch_extensions/. Removing the dir and rebuilding causes the error above.