Quadro Volta

PNY NVIDIA Authorized Partner

Reinventing the Workstation with Real-Time Ray Tracing and AI

Compute. Deep Learning. Rendering. Simulation. Visualization.

The NVIDIA Quadro® GV100 is reinventing the workstation to meet the demands of next-generation real-time ray tracing, AI, simulation, and VR enhanced workflows. It’s powered by NVIDIA Volta, delivering the extreme memory capacity, scalability, and performance that designers, architects, and scientists need to create, build, and solve the impossible.

Volta GPU Architecture

Based on state-of-the-art 12nm FFN (FinFET NVIDIA) high-performance manufacturing process customized for NVIDIA to incorporate 5120 CUDA cores, the Quadro GV100 GPU is the most powerful computing platform for HPC, AI, VR and graphics workloads on professional desktops. It includes 21.1 billion transistors on die size of 815 mm2. Able to deliver more than 7.4 TFLOPS of double precision (FP64), 14.8 TFLOPS of single-precision (FP32), 29.6 TFLOPS of half-precision (FP16), 59.3 TOPS of integer-precision (INT8), and 118.5 TFLOPs of tensor operation capability, it supports a wide range of compute-intensive workloads flawlessly.

Tensor Cores

New mixed-precision cores purpose-built for deep learning matrix arithmetic, delivering 8x TFLOPS for training, compared to previous generation. Quadro GV100 utilizes 640 Tensor Cores; each Tensor Core performs 64 floating point fused multiply-add (FMA) operations per clock, and each SM performs a total of 1024 individual floating point operations per clock.

 

High Speed HBM2 Memory

Built with Volta’s vastly optimized 32GB HBM2 memory subsystem for the industry’s fastest graphics memory (870 GB/s peak bandwidth), Quadro GV100 is the ideal platform for latency-sensitive applications handling large datasets. Quadro GV100 offers 2x memory capacity and delivers 20% more memory bandwidth compared to previous generation. HBM2 also provides native support for Error Correcting Code (ECC) without capacity or performance penalties.

Mixed-Precision Computing

Double the throughput and reduce storage requirements with 16-bit floating point precision computing to enable the training and deployment of larger neural networks. With independent parallel integer and floating point data paths, the Volta SM (Streaming Multiprocesssor) is also much more efficient on workloads with a mix of computation and addressing calculations.

VOLTA Quadro Lineup

 
Quadro P600
  • CUDA Parallel Processing Cores 5120
  • TENSOR PROCESSING CORES 640
  • COMPUTE PERFORMANCE
    FP64 7.4 TFLOPS, FP32 14.8 TFLOPS, FP16 29.6 TFLOPS, INT8 118.5 TFLOPS
  • DEEP LEARNING TFLOPS
    118.5 (FP16 matrix multiply with FP16 or FP32 accummulate)
  • Memory Bandwidth 870 GB/s
  • Maximum Power Consumption 250 W
  • System Interface PCI Express 3.0 x16
  • Display Connectors 4x DP 1.4
  • Maximum Display Resolution 4K, 5K or 8K HDR
  • Form Factor 10.5” L x 4.376” H Dual-Slot
NEW Quadro GV100
 

Volta Architecture Accessories

 
Quadro Sync II
  • Per Sync II Up to 16 Displays | Up to 4 GPUs
  • Per System with 2 Sync II Up to 32 Displays | Up to 8 GPUs
  • Per Cluster (1 or 2 Sync II Per Node) Up to 50 Nodes | Up to 200 GPUs
  • Power Connectors 6-pin PCIe or SATA
  • Form Factor 6.0" L x 4.2" H Single Slot
  • Compatibility Quadro GV100, GP100, P6000, P5000 and P4000
Quadro Sync II
NV Link
  • Bandwidth Up to 200 GB/s (Bidirectional (Two Bridges Required)
  • 2-way 2-Slot Spacing
  • GPU Peer-to- Peer Communications
  • Low Latency CPU to CPU Communications
  • Compatibility Quadro GV100
  • PNY Part Number NVLINK2-2W2S-KIT
NVIDIA NVLink