J'ai mis à jour Tensorflow version 1.0 et installé CUDA 8.0 avec le version cudnn 5.1 et les pilotes nvidia à jour 375.39. Mon matériel NVIDIA est celui qui est sur Amazon Web Services en utilisant l'instance p2.xlarge, un Tesla K-80. Mon système d'exploitation est Linux 64 bits.Comment installer CUDA 8.0 dans la dernière version de Tensorflow (1.0) dans l'instance AWS p2.xlarge, pilotes AMI ami-edb11e8d et nvidia à jour (375.39)
Je reçois le message d'erreur suivant chaque fois que j'utilise la commande: tf.Session()
[[email protected] CUDA]$ python
Python 2.7.12 (default, Sep 1 2016, 22:14:00)
[GCC 4.8.3 20140911 (Red Hat 4.8.3-9)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import tensorflow as tf
I tensorflow/stream_executor/dso_loader.cc:135] successfully opened CUDA library libcublas.so.8.0 locally
I tensorflow/stream_executor/dso_loader.cc:135] successfully opened CUDA library libcudnn.so.5 locally
I tensorflow/stream_executor/dso_loader.cc:135] successfully opened CUDA library libcufft.so.8.0 locally
I tensorflow/stream_executor/dso_loader.cc:135] successfully opened CUDA library libcuda.so.1 locally
I tensorflow/stream_executor/dso_loader.cc:135] successfully opened CUDA library libcurand.so.8.0 locally
>>> sess = tf.Session()
W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE3 instructions, but these are available on your machine and could speed up CPU computations.
W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.1 instructions, but these are available on your machine and could speed up CPU computations.
W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.2 instructions, but these are available on your machine and could speed up CPU computations.
W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX instructions, but these are available on your machine and could speed up CPU computations.
W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX2 instructions, but these are available on your machine and could speed up CPU computations.
W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use FMA instructions, but these are available on your machine and could speed up CPU computations.
E tensorflow/stream_executor/cuda/cuda_driver.cc:509] failed call to cuInit: CUDA_ERROR_NO_DEVICE
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:158] retrieving CUDA diagnostic information for host: ip-172-31-7-96
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:165] hostname: ip-172-31-7-96
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:189] libcuda reported version is: Invalid argument: expected %d.%d or %d.%d.%d form for driver version; got "1"
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:363] driver version file contents: """NVRM version: NVIDIA UNIX x86_64 Kernel Module 375.39 Tue Jan 31 20:47:00 PST 2017
GCC version: gcc version 4.8.3 20140911 (Red Hat 4.8.3-9) (GCC)
"""
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:193] kernel reported version is: 375.39.0
Je suis complètement désemparés sur la façon de résoudre ce problème. J'ai essayé différentes versions de drivers Nvidia et CUDA mais ça ne marche toujours pas.
Toutes les indications seront appréciées.
Peut-être votre pilote de GPU n'est pas installé correctement. Quel est le résultat de l'exécution de 'nvidia-smi'? Avez-vous effectué une vérification de l'installation de CUDA comme indiqué dans le [guide d'installation de cuda linux] (http://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html#abstract)? –
Merci pour votre réponse rapide. Le nvidia-smi a fonctionné, et je n'ai pas suivi la "vérification" décrite sur le site. J'ai décidé de repartir de zéro sur un système Redhat 7.3. Cela a fonctionné au début, donc aucune aide supplémentaire n'était nécessaire. – basuam