❗ NOTE: for Ubuntu 20.04 and 18.04 there is no need to build TensorFlow from sources.
- Install TensorFlow for all users on Ubuntu 18.04.
- cuDNN Installation Guide
- Which NVIDIA cuDNN release type for TensorFlow on Ubuntu 16.04
- Install CUDA 9.2 and cuDNN 7.1 for PyTorch (GPU) on Ubuntu 16.04
- Install TensorFlow with pip
- Part1: How to install Tensorflow GPU with CUDA 10.0 for python on Ubuntu
- Part2: How to install Tensorflow GPU with CUDA 10.0 for python on Ubuntu
sudo apt update
sudo apt upgrade
sudo apt install python-pip python3-pip \
python2.7 python3 \
python-numpy python3-numpy \
python-six python3-six \
python-wheel python3-wheel \
python-mock python3-mock
# Keras — deep learning framework running on Theano or TensorFlow
sudo apt install python3-keras
sudo apt install keras-doc # documents
# Keras for Python 2.7
sudo su
cd ~
umask 022
sudo pip install -U Keras # Deep Learning for humans
# Lasagne: deep learning library built on top of Theano
sudo apt install python-lasagne
sudo apt install python3-lasagne
sudo apt install lasagne-doc # documents
# Check for necessary packages
python3 --version # 3.6 or higher
pip3 --version # 9.0 or higher
virtualenv --version # 15.1 or higher
nvidia-smi # NVIDIA driver version
nvcc --version # CUDA version, 10.0 or higher
which nvcc # usually /usr/local/cuda/bin/nvcc
# Verify Theano, Keras
python -c "import theano; print(theano.__version__);"
python3 -c "import theano; print(theano.__version__);"
python -c "import keras; print(keras.__version__);"
python3 -c "import keras; print(keras.__version__);"
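The checks above list minimum versions ("3.6 or higher", "9.0 or higher", and so on) but leave the comparison to the reader. A minimal sketch of an automated comparison follows; the helper name `version_ge` is hypothetical and it assumes GNU `sort -V`, which Ubuntu ships.

```shell
#!/bin/sh
# version_ge ACTUAL REQUIRED: succeed when ACTUAL >= REQUIRED.
# Assumption: GNU sort with -V (version sort) is available.
version_ge() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Example: compare the installed Python 3 against the 3.6 minimum.
if command -v python3 >/dev/null 2>&1; then
    py_ver="$(python3 -c 'import platform; print(platform.python_version())')"
    if version_ge "$py_ver" 3.6; then
        echo "python3 $py_ver: OK"
    else
        echo "python3 $py_ver: too old, need 3.6 or higher"
    fi
fi
```

The same helper can be reused for the pip, virtualenv and CUDA thresholds by passing the version string each tool reports.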
# Set permissions 644 for files and 755 for directories
sudo su
cd ~
umask 022
sudo pip install -U --user keras_applications==1.0.5 --no-deps
sudo pip install -U --user keras_preprocessing==1.0.3 --no-deps
Download cuDNN from the NVIDIA website.
To download it, you have to register, fill in the web form, and complete the survey.
Choose 3 packages:
- cuDNN Runtime Library for Ubuntu18.04 (Deb)
- cuDNN Developer Library for Ubuntu18.04 (Deb)
- cuDNN Code Samples and User Guide for Ubuntu18.04 (Deb)
Install only the runtime library if you are using precompiled binaries that are ready to go. If you want to build or rebuild TensorFlow, or develop your own API, install the developer library after the runtime library. Install the code samples package if you need examples.
# Install the Runtime library
sudo dpkg -i libcudnn7_7.4.1.5-1+cuda10.0_amd64.deb
# Install the Developer library
sudo dpkg -i libcudnn7-dev_7.4.1.5-1+cuda10.0_amd64.deb
# Install the code samples and the cuDNN Library User Guide
sudo dpkg -i libcudnn7-doc_7.4.1.5-1+cuda10.0_amd64.deb
# Check which cuDNN version is installed (7.4.1 in this case)
cat /usr/include/x86_64-linux-gnu/cudnn_v*.h | grep CUDNN_MAJOR -A 2
#define CUDNN_MAJOR 7
#define CUDNN_MINOR 4
#include "driver_types.h"
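The `grep` above prints the raw `#define` lines. As a sketch, the version can also be assembled into a single string; the function name `cudnn_version` is hypothetical, and it assumes the header defines `CUDNN_MAJOR`, `CUDNN_MINOR` and `CUDNN_PATCHLEVEL` on separate `#define` lines.

```shell
#!/bin/sh
# cudnn_version HEADER: print MAJOR.MINOR.PATCHLEVEL read from a cuDNN header.
cudnn_version() {
    awk '
        $1 == "#define" && $2 == "CUDNN_MAJOR"      { major = $3 }
        $1 == "#define" && $2 == "CUDNN_MINOR"      { minor = $3 }
        $1 == "#define" && $2 == "CUDNN_PATCHLEVEL" { patch = $3 }
        END { if (major != "") print major "." minor "." patch }
    ' "$1"
}

# The glob matches the header path used in this guide; adjust it if your
# installation keeps the header elsewhere.
for header in /usr/include/x86_64-linux-gnu/cudnn_v*.h; do
    if [ -f "$header" ]; then
        echo "$header: cuDNN $(cudnn_version "$header")"
    fi
done
```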
# Copy the cuDNN sample to a writable path
cp -r /usr/src/cudnn_samples_v7/ $HOME/Documents
# Go to your writable samples
cd $HOME/Documents/cudnn_samples_v7/mnistCUDNN
# Compile the mnistCUDNN sample
make clean && make
# Run the mnistCUDNN sample
./mnistCUDNN
If cuDNN is properly installed and running on your Linux system, you will see a message similar to the following:
Result of classification: 1 3 5
Test passed!
# Create and modify cuDNN.conf file in directory /etc/ld.so.conf.d/
sudo touch /etc/ld.so.conf.d/cuDNN.conf
sudo nano /etc/ld.so.conf.d/cuDNN.conf
Add path for CUPTI (CUDA Profiling Tools Interface) libraries into cuDNN.conf
# CUPTI libraries
Press <Ctrl>+<X> and save on exit.
# Reload ldconfig environment
sudo ldconfig
NVIDIA Collective Communications Library (NCCL) implements multi-GPU and multi-node collective communication primitives that are performance optimized for NVIDIA GPUs.
Go to https://developer.nvidia.com/nccl/nccl-download and complete the survey to download NVIDIA NCCL.
After completing the survey, download the following package.
Download NCCL v2.3.7, for CUDA 10.0, Nov 8, 2018 --> Local installer for Ubuntu 18.04. File: nccl-repo-ubuntu1804-2.3.7-ga-cuda10.0_1-1_amd64.deb
# Install NCCL deb package
sudo apt install ./nccl-repo-ubuntu1804-2.3.7-ga-cuda10.0_1-1_amd64.deb
# Error
The public CUDA GPG key does not appear to be installed.
To install the key, run this command:
sudo apt-key add /var/nccl-repo-2.3.7-ga-cuda10.0/7fa2af80.pub
# Run command to install the key
sudo apt-key add /var/nccl-repo-2.3.7-ga-cuda10.0/7fa2af80.pub
sudo apt update
sudo apt install libnccl2 libnccl-dev
A 64-bit Python installation is required; TensorFlow does not work on 32-bit Python.
On Ubuntu 18.04, sudo pip install packagename does not install into the global path (it keeps looking at the local-user Python path). To install the modules in the global path, use the commands below.
Unfortunately, this installation does not work with CUDA 10.0, the latest version at the moment.
# Change to root user
sudo su
# Change directory to HOME
cd ~
# Set permissions 644 for files and 755 for directories
umask 022
# Install TensorFlow current release with GPU support
# in the global path for Python 2.7
pip install tensorflow-gpu
# For Python 3.x
pip3 install tensorflow-gpu
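The `umask 022` step is what gives newly installed files mode 644 and directories mode 755, so that other users can read the packages installed by root. A small sketch that demonstrates the effect in a throwaway directory (the function name is hypothetical; `stat -c` assumes GNU coreutils):

```shell
#!/bin/sh
# Show the permissions that new files and directories get under umask 022.
# The body runs in a subshell, so the caller's umask is left untouched.
show_umask_effect() (
    umask 022
    dir="$(mktemp -d)"
    touch "$dir/file"
    mkdir "$dir/subdir"
    # stat -c %a prints the octal permission bits.
    echo "file: $(stat -c %a "$dir/file") dir: $(stat -c %a "$dir/subdir")"
    rm -r "$dir"
)

show_umask_effect
```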
Bazel is an open-source build and test tool similar to Make, Maven, and Gradle. It uses a human-readable, high-level build language. Bazel supports projects in multiple languages and builds outputs for multiple platforms. Bazel supports large codebases across multiple repositories, and large numbers of users.
How-to install Bazel for Ubuntu
Download the Bazel Linux installer from the Bazel releases page on GitHub.
Downgrade to Bazel 0.17.2; otherwise there will be errors during the TensorFlow build.
# Install required packages
sudo apt install pkg-config zip g++ zlib1g-dev unzip python
# Uninstall Bazel
rm -fr ~/.bazel
rm -fr ~/.cache/bazel
rm ~/bin/bazel
# Install Bazel
# Create ~/Documents/Install/bazel folder
mkdir -p ~/Documents/Install/bazel
cd ~/Documents/Install/bazel
# Get installer for version 0.17.2
wget https://github.com/bazelbuild/bazel/releases/download/0.17.2/bazel-0.17.2-installer-linux-x86_64.sh
# Set executable
chmod +x bazel-0.17.2-installer-linux-x86_64.sh
# Install for current user. Use --help for additional options
./bazel-0.17.2-installer-linux-x86_64.sh --user
# Set up your environment
echo 'export PATH="$PATH:$HOME/bin"' >> ~/.bashrc
# Reload environment variables
source ~/.bashrc
sudo ldconfig
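After reloading the environment it can be useful to confirm that `$HOME/bin`, where the `--user` installer places Bazel, really is on the `PATH`. A minimal sketch, with a hypothetical helper name:

```shell
#!/bin/sh
# path_contains DIR [LIST]: succeed when DIR is an entry of the colon-separated
# LIST (defaults to the current PATH).
path_contains() {
    case ":${2:-$PATH}:" in
        *":$1:"*) return 0 ;;
        *)        return 1 ;;
    esac
}

if path_contains "$HOME/bin"; then
    echo "$HOME/bin is on PATH"
else
    echo "$HOME/bin is not on PATH yet; open a new shell or source ~/.bashrc"
fi
```

Wrapping the list in colons before matching avoids false positives such as `/usr` matching `/usr/bin`.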
SYCL / CUDA / ROCm are mutually exclusive
❗ SYCL / CUDA / ROCm are mutually exclusive. At most 1 GPU platform can be configured. Don't install them together ❗
In other words, you can build with either CUDA or ComputeCpp, but not both at the same time.
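The "at most one GPU platform" rule can be checked before starting a long build. The following is only an illustrative sketch: the function name is hypothetical and the three arguments are plain 0/1 flags chosen for this example, not something the TensorFlow tooling provides.

```shell
#!/bin/sh
# at_most_one_gpu_platform CUDA SYCL ROCM: each argument is 1 (enabled) or 0.
# Succeeds when no more than one platform is enabled.
at_most_one_gpu_platform() {
    enabled=0
    for flag in "$@"; do
        if [ "$flag" = "1" ]; then
            enabled=$((enabled + 1))
        fi
    done
    [ "$enabled" -le 1 ]
}

# Values mirror the choices made in this guide: CUDA yes, SYCL no, ROCm no.
if at_most_one_gpu_platform 1 0 0; then
    echo "GPU platform selection is valid"
else
    echo "More than one GPU platform selected; pick only one"
fi
```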
Install OpenCL and OpenGL for Python
# OpenCL and OpenGL for Python
# OpenCL (Open Computing Language) header files
sudo apt install opencl-headers
# OpenCL (Open Computing Language) C header files
sudo apt install opencl-c-headers
# C++ headers for OpenCL development
sudo apt install opencl-clhpp-headers
# Python module to access OpenCL parallel computation API
sudo apt install python-pyopencl
# Python 3 module to access OpenCL parallel computation API
sudo apt install python3-pyopencl
Install ComputeCpp. Download ComputeCpp Community Edition from CodePlay website.
# Get ComputeCpp-CE-1.0.3-Ubuntu.16.04-64bit.tar.gz and unpack it
tar xvzf ComputeCpp-CE-1.0.3-Ubuntu.16.04-64bit.tar.gz
cd ComputeCpp-CE-1.0.3-Ubuntu-16.04-x86_64/
# Copy files and dirs to /usr/local/
sudo cp -a ./bin/. /usr/local/bin/
sudo cp -a ./doc/. /usr/local/doc/
sudo cp -a ./include/. /usr/local/include/
sudo cp -a ./lib/. /usr/local/lib/
# Check installation
computecpp_info
Install cuBLAS: Dense Linear Algebra on GPUs
Download CUDA Toolkit with cuBLAS: https://developer.nvidia.com/cublas
sudo apt install ./cuda-repo-ubuntu1804-10-0-local-10.0.130-410.48_1.0-1_amd64.deb
sudo apt update
sudo apt install cuda
sudo reboot
Download TensorRT file from NVIDIA website https://developer.nvidia.com/tensorrt
# Install TensorRT
sudo apt install ./nv-tensorrt-repo-ubuntu1804-cuda10.0-trt5.0.2.6-ga-20181009_1-1_amd64.deb
sudo apt update
#sudo apt upgrade
sudo apt install tensorrt
sudo apt install \
libnvinfer5 \
libnvinfer-dev \
python-libnvinfer \
python3-libnvinfer \
python-libnvinfer-dev \
python3-libnvinfer-dev
# Verify installation
dpkg -l | grep -i TensorRT
# Install TensorFlow
# Create ~/Documents/Install/TensorFlow folder
mkdir -p ~/Documents/Install/TensorFlow
cd ~/Documents/Install/TensorFlow
# Get sources
git clone https://github.com/tensorflow/tensorflow.git
cd tensorflow
# Switch to stable version of TensorFlow, current is r1.12
# TensorFlow API Versions: https://www.tensorflow.org/versions
git checkout r1.12
Run ./configure and answer the following questions:
Please specify the location of python. [Default is /usr/bin/python]: /usr/bin/python3
Please input the desired Python library path to use. Default is [/usr/lib/python3/dist-packages]
Do you wish to build TensorFlow with Apache Ignite support? [Y/n]: Y
Do you wish to build TensorFlow with XLA JIT support? [Y/n]: Y
# Warning: CUDA and SYCL are mutually exclusive.
Do you wish to build TensorFlow with OpenCL SYCL support? [y/N]: N
Do you wish to build TensorFlow with ROCm support? [y/N]: N
Do you wish to build TensorFlow with CUDA support? [y/N]: Y
Please specify the CUDA SDK version you want to use. [Leave empty to default to CUDA 9.0]: 10.0
Please specify the location where CUDA 10.0 toolkit is installed. Refer to README.md for more details. [Default is /usr/local/cuda]: /usr/local/cuda
Please specify the cuDNN version you want to use. [Leave empty to default to cuDNN 7]: 7.4.1
Please specify the location where cuDNN 7 library is installed. Refer to README.md for more details. [Default is /usr/local/cuda]: /usr/local/cuda
Do you wish to build TensorFlow with TensorRT support? [y/N]: Y
Please specify the location where TensorRT is installed. [Default is /usr/lib/x86_64-linux-gnu]: /usr/lib/x86_64-linux-gnu
Please specify the NCCL version you want to use. If NCCL 2.2 is not installed, then you can use version 1.3 that can be fetched automatically but it may have worse performance with multiple GPUs. [Default is 2.2]: 2.3.7
NCCL libraries found in /usr/lib/x86_64-linux-gnu/libnccl.so
This looks like a system path.
Assuming NCCL header path is /usr/include
Please specify a list of comma-separated Cuda compute capabilities you want to build with.
You can find the compute capability of your device at: https://developer.nvidia.com/cuda-gpus.
Please note that each additional compute capability significantly increases your build time and binary size. [Default is: 6.1]: 5.2,6.1
Do you want to use clang as CUDA compiler? [y/N]: N
Please specify which gcc should be used by nvcc as the host compiler. [Default is /usr/bin/x86_64-linux-gnu-gcc-6]: /usr/bin/gcc
Do you wish to build TensorFlow with MPI support? [y/N]: N
Please specify optimization flags to use during compilation when bazel option "--config=opt" is specified [Default is -march=native]: -march=native
Would you like to interactively configure ./WORKSPACE for Android builds? [y/N]: N
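The answers above can also be supplied through environment variables, so that `./configure` asks fewer questions on repeated builds. This is a sketch under an assumption: the variable names below are the ones I believe the r1.12 configure script reads, and they should be checked against `configure.py` in your checkout. Any question without a matching variable is still asked interactively.

```shell
#!/bin/sh
# Answers from the list above, expressed as environment variables.
# Assumption: the r1.12 configure script honors these names; verify them in
# configure.py before relying on this.
export PYTHON_BIN_PATH=/usr/bin/python3
export PYTHON_LIB_PATH=/usr/lib/python3/dist-packages
export TF_NEED_IGNITE=1
export TF_ENABLE_XLA=1
export TF_NEED_OPENCL_SYCL=0
export TF_NEED_ROCM=0
export TF_NEED_CUDA=1
export TF_CUDA_VERSION=10.0
export CUDA_TOOLKIT_PATH=/usr/local/cuda
export TF_CUDNN_VERSION=7.4.1
export CUDNN_INSTALL_PATH=/usr/local/cuda
export TF_NEED_TENSORRT=1
export TENSORRT_INSTALL_PATH=/usr/lib/x86_64-linux-gnu
export TF_NCCL_VERSION=2.3.7
export TF_CUDA_COMPUTE_CAPABILITIES=5.2,6.1
export TF_CUDA_CLANG=0
export GCC_HOST_COMPILER_PATH=/usr/bin/gcc
export TF_NEED_MPI=0
export CC_OPT_FLAGS=-march=native
export TF_SET_ANDROID_WORKSPACE=0

echo "Environment prepared; now run ./configure from the TensorFlow source tree"
```

Source this file in the shell where `./configure` will run, so that the exported values are visible to it.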
Full input and output text
./configure
WARNING: An illegal reflective access operation has occurred
WARNING: Illegal reflective access by com.google.protobuf.UnsafeUtil (file:/home/lab225/.cache/bazel/_bazel_lab225/install/792a28b07894763eaa2bd870f8776b23/_embedded_binaries/A-server.jar) to field java.lang.String.value
WARNING: Please consider reporting this to the maintainers of com.google.protobuf.UnsafeUtil
WARNING: Use --illegal-access=warn to enable warnings of further illegal reflective access operations
WARNING: All illegal access operations will be denied in a future release
WARNING: --batch mode is deprecated. Please instead explicitly shut down your Bazel server using the command "bazel shutdown".
You have bazel 0.17.2 installed.
Please specify the location of python. [Default is /usr/bin/python]: /usr/bin/python3
Found possible Python library paths:
  /usr/lib/python3.6/dist-packages
  /usr/lib/python3/dist-packages
  /usr/local/lib/python3.6/dist-packages
Please input the desired Python library path to use. Default is [/usr/lib/python3.6/dist-packages]
/usr/lib/python3/dist-packages
Do you wish to build TensorFlow with Apache Ignite support? [Y/n]: Y
Apache Ignite support will be enabled for TensorFlow.
Do you wish to build TensorFlow with XLA JIT support? [Y/n]: Y
XLA JIT support will be enabled for TensorFlow.
Do you wish to build TensorFlow with OpenCL SYCL support? [y/N]: N
No OpenCL SYCL support will be enabled for TensorFlow.
Do you wish to build TensorFlow with ROCm support? [y/N]: N
No ROCm support will be enabled for TensorFlow.
Do you wish to build TensorFlow with CUDA support? [y/N]: Y
CUDA support will be enabled for TensorFlow.
Please specify the CUDA SDK version you want to use. [Leave empty to default to CUDA 9.0]: 10.0
Please specify the location where CUDA 10.0 toolkit is installed. Refer to README.md for more details. [Default is /usr/local/cuda]: /usr/local/cuda
Please specify the cuDNN version you want to use. [Leave empty to default to cuDNN 7]: 7.4.1
Please specify the location where cuDNN 7 library is installed. Refer to README.md for more details. [Default is /usr/local/cuda]: /usr/local/cuda
Do you wish to build TensorFlow with TensorRT support? [y/N]: Y
TensorRT support will be enabled for TensorFlow.
Please specify the location where TensorRT is installed. [Default is /usr/lib/x86_64-linux-gnu]:/usr/lib/x86_64-linux-gnu
Please specify the NCCL version you want to use. If NCCL 2.2 is not installed, then you can use version 1.3 that can be fetched automatically but it may have worse performance with multiple GPUs. [Default is 2.2]: 2.3.7
NCCL libraries found in /usr/lib/x86_64-linux-gnu/libnccl.so
This looks like a system path.
Assuming NCCL header path is /usr/include
Please specify a list of comma-separated Cuda compute capabilities you want to build with.
You can find the compute capability of your device at: https://developer.nvidia.com/cuda-gpus.
Please note that each additional compute capability significantly increases your build time and binary size. [Default is: 3.5,7.0]: 5.2,6.1
Do you want to use clang as CUDA compiler? [y/N]: N
nvcc will be used as CUDA compiler.
Please specify which gcc should be used by nvcc as the host compiler. [Default is /usr/bin/x86_64-linux-gnu-gcc-6]: /usr/bin/gcc
Do you wish to build TensorFlow with MPI support? [y/N]: N
No MPI support will be enabled for TensorFlow.
Please specify optimization flags to use during compilation when bazel option "--config=opt" is specified [Default is -march=native]: -march=native
Would you like to interactively configure ./WORKSPACE for Android builds? [y/N]: N
Not configuring the WORKSPACE for Android builds.
Preconfigured Bazel build configs. You can use any of the below by adding "--config=<>" to your build command. See tools/bazel.rc for more details.
    --config=mkl            # Build with MKL support.
    --config=monolithic     # Config for mostly static monolithic build.
    --config=gdr            # Build with GDR support.
    --config=verbs          # Build with libverbs support.
    --config=ngraph         # Build with Intel nGraph support.
Configuration finished
For your information:
- TensorRT is installed in the same folder as its library file, which is /usr/lib/x86_64-linux-gnu by default.
- Show the NCCL version: dpkg -l | grep -i nccl (2.3.7).
- Show where the NCCL library is installed: find / -name '*libnccl*' 2>/dev/null (/usr/lib/x86_64-linux-gnu).
- Compute capability for Nvidia GeForce GTX TITAN X: 5.2.
- Compute capability for Nvidia GeForce GTX 1080 Ti: 6.1 (see https://developer.nvidia.com/cuda-gpus).
- I have not used MPI support myself. Read its documentation before MPI installation.
Build a pip package for TensorFlow
cd ~/Documents/Install/TensorFlow/tensorflow
# Create $TMP variable for Bazel version 0.17.2
export TMP=~/tmp
mkdir -p $TMP
bazel build --config=opt \
--config=cuda \
//tensorflow/tools/pip_package:build_pip_package
If you see this warning:
WARNING: The following rc files are no longer being read,
please transfer their contents or import their path into one of the standard rc files:
This is an issue with Bazel 0.19.0. Either use Bazel 0.18.0, or add the contents of the file ~/Documents/Install/TensorFlow/tensorflow/tools/bazel.rc to the top of the (hidden) file ~/Documents/Install/TensorFlow/tensorflow/.tf_configure.bazelrc
I used Bazel version 0.17.2. It's old, but it works for now (date 2019.02.13).
# Create backup
cd ~/Documents/Install/TensorFlow/tensorflow
cp .bazelrc .bazelrc.2019.02.13.backup
# Import path to bazel.rc file on top of hidden .bazelrc
echo import ~/Documents/Install/TensorFlow/tensorflow/tools/bazel.rc > temp_file.txt
cat .bazelrc >> temp_file.txt
mv temp_file.txt .bazelrc
# Build a pip package for TensorFlow
bazel build --config=opt \
--config=cuda \
//tensorflow/tools/pip_package:build_pip_package
Build time on an Intel Core i7-7800X CPU: 1 hour and 10 min
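The backup-and-prepend step for .bazelrc shown earlier adds the import line again every time it is run. A sketch of a repeatable variant follows; the helper name `prepend_once` and the `.backup` suffix are choices made for this example.

```shell
#!/bin/sh
# prepend_once FILE LINE: put LINE at the top of FILE unless FILE already
# contains an identical line. The previous contents are kept in FILE.backup.
prepend_once() {
    file="$1"
    line="$2"
    if grep -qxF -- "$line" "$file" 2>/dev/null; then
        return 0
    fi
    touch "$file"
    cp "$file" "$file.backup"
    { printf '%s\n' "$line"; cat "$file.backup"; } > "$file"
}
```

For the case in this guide it would be called from the TensorFlow source directory as `prepend_once .bazelrc "import $HOME/Documents/Install/TensorFlow/tensorflow/tools/bazel.rc"`.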
Build whl (wheel) file
bazel-bin/tensorflow/tools/pip_package/build_pip_package tensorflow_pkg_python3
Install tensorflow with pip
# Install TensorFlow for all users
cd ~/Documents/Install/TensorFlow/tensorflow
sudo mkdir -p /root/Install/tensorflow_pkg_python3
sudo cp -a tensorflow_pkg_python3/. /root/Install/tensorflow_pkg_python3
sudo su
cd ~
umask 022
cd ~/Install/tensorflow_pkg_python3
sudo pip3 install tensorflow*.whl
# For virtual environment use
##sudo apt-get install virtualenv
##virtualenv tf_1.12_cuda10.0 -p /usr/bin/python3
##source tf_1.12_cuda10.0/bin/activate
##pip3 install tensorflow*.whl
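Because this guide builds separate wheels for Python 3 and Python 2.7, it helps to confirm which interpreter a wheel targets before handing it to `pip3` or `pip`. Wheel file names end in `-{python tag}-{abi tag}-{platform tag}.whl`, so the tag can be read from the name. A minimal sketch; the function name and the example file name are hypothetical.

```shell
#!/bin/sh
# wheel_python_tag FILE: print the Python tag of a wheel file name, e.g. cp36.
# The tag is the third field from the end when the name is split on "-".
wheel_python_tag() {
    basename "$1" .whl | awk -F- '{ print $(NF - 2) }'
}

# Example with a hypothetical file name of the kind build_pip_package produces.
wheel_python_tag tensorflow-1.12.0-cp36-cp36m-linux_x86_64.whl
```

A tag starting with `cp3` belongs with `pip3`, and `cp27` belongs with the Python 2.7 `pip`.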
For Python 2.7 there are several differences:
# During config enter path to Python 2.7
Please specify the location of python. [Default is /usr/bin/python]: /usr/bin/python
Please input the desired Python library path to use. Default is [/usr/lib/python2.7/dist-packages]
# Build whl (wheel) file
bazel-bin/tensorflow/tools/pip_package/build_pip_package tensorflow_pkg_python2
# Install TensorFlow for Python 2.7 and for all users
cd ~/Documents/Install/TensorFlow/tensorflow
sudo mkdir -p /root/Install/tensorflow_pkg_python2
sudo cp -a tensorflow_pkg_python2/. /root/Install/tensorflow_pkg_python2
sudo su
cd ~
umask 022
cd ~/Install/tensorflow_pkg_python2
sudo pip install tensorflow*.whl
# Check for Python 3.6
python3.6 -c "import tensorflow as tf; \
tf.enable_eager_execution(); \
print(tf.reduce_sum(tf.random_normal([1000, 1000])));"
# Check for Python 2.7
python2.7 -c "import tensorflow as tf; \
tf.enable_eager_execution(); \
print(tf.reduce_sum(tf.random_normal([1000, 1000])));"
There may be a runtime error reporting that NumPy is too old:
RuntimeError: module compiled against API version 0xc but this version of numpy is 0xb
RuntimeError: module compiled against API version 0xc but this version of numpy is 0xb
In this case you should reinstall NumPy through pip and not through sudo apt install python-numpy.
In my case I had NumPy 1.13 installed through apt. However, NumPy 1.15 is required for TensorFlow 1.12, so I had to reinstall NumPy through pip for Python 2.7.
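The two hexadecimal numbers in the error are NumPy C-API versions: the first is what the module was compiled against, the second is what the installed NumPy provides. A small sketch that extracts and compares them (the function name is hypothetical; `grep -o` assumes GNU grep):

```shell
#!/bin/sh
# numpy_api_versions MESSAGE: print "required installed" as decimal numbers,
# parsed from the two hexadecimal API versions in the RuntimeError text.
numpy_api_versions() {
    # Word splitting is intentional: each hex value becomes one argument.
    set -- $(printf '%s\n' "$1" | grep -oE '0x[0-9a-fA-F]+')
    printf '%d %d\n' "$1" "$2"
}

msg='RuntimeError: module compiled against API version 0xc but this version of numpy is 0xb'
numpy_api_versions "$msg"
```

When the first number is larger than the second, the installed NumPy is older than the one the module expects, which is the situation described above.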
# Uninstall python-numpy (NumPy 1.13 for Python 2.7)
sudo apt purge --auto-remove python-numpy
# There is an error when "import tensorflow as tf" for Python 2.7
# Error: Couldn't import dot_parser, loading of dot files will not be possible.
# To fix it, uninstall pydot2 and install pydot instead
# But first uninstall pydot through APT
sudo apt purge --auto-remove python-pydot
# Reinstall all necessary packages through pip
sudo su
cd ~
umask 022
# Reinstall packages via pip for Python 2.7
sudo pip install -U numpy # NumPy 1.15
sudo pip install -U scipy
sudo pip install -U matplotlib
sudo pip install -U scikit-image
sudo pip install -U scikit-learn
sudo pip install -U opencv-contrib-python
sudo pip install -U pandas
sudo pip install -U Theano
sudo pip install -U Lasagne
sudo pip install -U TheanoLM
# Hierarchical datasets for Python
sudo pip install -U tables
# A collection of tools for Python
sudo pip install -U pytools
# Statistical computations and models for Python
sudo pip install -U statsmodels
# Python wrapper for OpenCL — Compilation failure
##sudo pip install -U pyopencl
# Python Webtrends connector — No matching version
##sudo pip install -U pywt
# Fix error with import of dot_parser
sudo pip uninstall pydot2
sudo pip install pydot
exit # exit from root
# Check installation of TensorFlow for Python 2.7 once again
python2.7 -c "import tensorflow as tf; \
tf.enable_eager_execution(); \
print(tf.reduce_sum(tf.random_normal([1000, 1000])));"