最終更新:2018-02-23 (金) 12:40:22 (2226d)
NVIDIA Deep Learning SDK
Top / NVIDIA Deep Learning SDK
http://docs.nvidia.com/deeplearning/sdk/index.html
ツール
Deep Learning Primitives (CUDA® Deep Neural Network library™ (cuDNN))
- High-performance building blocks for deep neural network applications including convolutions, activation functions, and tensor transformations.
Deep Learning Inference Engine (TensorRT )
- High-performance deep learning inference runtime for production deployment.
Deep Learning for Video Analytics (NVIDIA DeepStream SDK?)
- High-level C++ API and runtime for GPU-accelerated transcoding and deep learning inference.
Linear Algebra (CUDA® Basic Linear Algebra Subroutines library™ (cuBLAS))
- GPU-accelerated BLAS functionality that delivers 6x to 17x faster performance than CPU-only BLAS libraries,
Sparse Matrix Operations (NVIDIA CUDA® Sparse Matrix library™ (cuSPARSE))
- GPU-accelerated linear algebra subroutines for sparse matrices that deliver up to 8x faster performance than CPU BLAS (MKL), ideal for applications such as natural language processing.
Multi-GPU Communication (NVIDIA® Collective Communications Library ™ (NCCL?))
- Collective communication routines, such as all-gather, reduce, and broadcast that accelerate multi-GPU deep learning training on up to eight GPUs.