Nvidia Tesla
<templatestyles src="Module:Hatnote/styles.css"></templatestyles>
Nvidia Tesla is Nvidia's brand name for their products targeting stream processing and/or general purpose GPU. Products use GPUs from the G80 series onward. Both the underlying microarchitecture of the initial GPUs "Tesla" and the Tesla product line take their name from pioneering electrical engineer Nikola Tesla.
Contents
Overview
With their very high computational power (measured in floating point operations per second or FLOPS) compared to microprocessors, the Tesla products target the high performance computing market.[1] As of 2012[update], Nvidia Teslas power some of the world's fastest supercomputers, including Titan at Oak Ridge National Laboratory and Tianhe-1A, in Tianjin, China.
The lack of ability to output images to a display was the main difference between Tesla products and the consumer level GeForce cards and the professional level Quadro cards, but the latest Tesla C-class products include one Dual-Link DVI port.[2] For equivalent single precision output, Fermi-based Nvidia GeForce cards have four times less dual-precision performance[citation needed]. Tesla products primarily operate:[3]
- in simulations and in large scale calculations (especially floating-point calculations)
- for high-end image generation for applications in professional and scientific fields
- with the use of OpenCL or CUDA.
Nvidia intends to offer ARMv8 processor cores embedded into future Tesla GPUs as part of Project Denver.[4] This will be a 64-bit follow on to the 32-bit Tegra chips.
Market
The defense industry currently accounts for less than a sixth of Tesla sales, but Sumit Gupta predicts further sales to the geospatial intelligence market.[5]
Specifications and configurations
- 1 Specifications not specified by NVIDIA assumed to be based on the GeForce 8800GTX
- 2 Specifications not specified by NVIDIA assumed to be based on the GeForce GTX 280
- 3 Specifications not specified by NVIDIA are assumed to be based on the GeForce 400 Series
- 4 With ECC on, a portion of the dedicated memory is used for ECC bits, so the available user memory is reduced by 12.5%. (e.g. 3 GB total memory yields 2.625 GB of user available memory.)
- 5 For calculating the processing power see Tesla (microarchitecture)#Performance, Fermi (microarchitecture)#Performance, Kepler (microarchitecture)#Performance, or Maxwell (microarchitecture)#Performance. A number range specifies the minimum and maximum processing power at, respectively, the base clock and maximum boost clock.
- 6 Specifications not specified by NVIDIA assumed to be based on the Quadro FX 5800
- 7 GPU Boost is a default feature that increases the core clock rate while remaining under the card's predetermined power budget. Multiple boost clocks are available, but this table lists the highest clock supported by each card.[6]
- 8 Core architecture version according to the CUDA programming guide.
- For the basic specifications of Tesla, refer to the GPU Computing Processor specifications.
- Due to Tesla's non-output nature, fillrate and graphics API compatibility are not applicable.
Model | Micro-architecture | Chips | Core clock (MHz) |
Shaders | Memory | Processing Power (GFLOPS)5 | Compute capability8 |
TDP (watts) |
Notes/Form factor | ||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Thread Processors (total) |
Base Clock (MHz) | Max Boost Clock (MHz)7 |
Bus type | Bus width (bit) |
Size (MB) |
Clock (MT/s) |
Bandwidth (GB/s) |
Single Precision (MAD+MUL) |
Single Precision (MAD or FMA) |
Double Precision (FMA) |
|||||||
C870 GPU Computing Module1 | Tesla | 1× G80 | 600 | 128 | 1350 | N/A | GDDR3 | 384 | 1536 | 1600 | 76.8 | 518.4 | 345.6 | No | 1.0 | 170.9 | Internal PCIe GPU (full-height, dual-slot) |
D870 Deskside Computer1 | 2× G80 | 600 | 256 | 1350 | N/A | GDDR3 | 2× 384 | 2× 1536 | 1600 | 2× 76.8 | 1036.8 | 691.2 | No | 1.0 | 520 | Deskside or 3U rack-mount external GPUs | |
S870 GPU Computing Server1 | 4× G80 | 600 | 512 | 1350 | N/A | GDDR3 | 4× 384 | 4× 1536 | 1600 | 4× 76.8 | 2073.6 | 1382.4 | No | 1.0 | 1U rack-mount external GPUs, connect via 2x PCIe (x16) | ||
C1060 GPU Computing Module2 | 1× GT200 | 602 | 240 | 1296[7] | N/A | GDDR3 | 512 | 4096 | 1600 | 102.4 | 933.12 | 622.08 | 77.76 | 1.3 | 187.8 | Internal PCIe GPU (full-height, dual-slot) | |
S1070 GPU Computing Server "400 configuration"2 | 4× GT200 | 602 | 960 | 1296 | N/A | GDDR3 | 4× 512 | 4× 4096 | 1538.4 | 4× 98.5 | 3732.5 | 2488.3 | 311.0 | 1.3 | 800 | 1U rack-mount external GPUs, connect via 2x PCIe (x8 or x16) | |
S1070 GPU Computing Server "500 configuration"2 | 1440 | N/A | 4147.2 | 2764.8 | 345.6 | ||||||||||||
S1075 GPU Computing Server2[8] | 4× GT200 | 602 | 960 | 1440 | N/A | GDDR3 | 4× 512 | 4× 4096 | 1538.4 | 4× 98.5 | 4147.2 | 2764.8 | 345.6 | 1.3 | 1U rack-mount external GPUs, connect via 1x PCIe (x8 or x16) | ||
Quadro Plex 2200 D2 Visual Computing System6 | 2× GT200GL | 648 | 480 | 1296 | N/A | GDDR3 | 2× 512 | 2× 4096 | 1600 | 2× 102.4 | 1866.2 | 1244.2 | 155.5 | 1.3 | Deskside or 3U rack-mount external GPUs with 4 dual-link DVI outputs | ||
Quadro Plex 2200 S4 Visual Computing System6 | 4× GT200GL | 648 | 960 | 1296 | N/A | GDDR3 | 4× 512 | 4× 4096 | 1600 | 4× 102.4 | 3732.5 | 2488.3 | 311.0 | 1.3 | 1200 | 1U rack-mount external GPUs, connect via 2x PCIe (x8 or x16) | |
C2050 GPU Computing Module[9] | Fermi | 1× GF100 | 575 | 448 | 1150 | N/A | GDDR5 | 384 | 30724 | 3000 | 144 | No | 1030.4 | 515.2 | 2.0 | 247 | Internal PCIe GPU (full-height, dual-slot) |
M2050 GPU Computing Module[10] | N/A | 3092 | 148.4 | No | 225 | ||||||||||||
C2070 GPU Computing Module[9] | 1× GF100 | 575 | 448 | 1150 | N/A | GDDR5 | 384 | 61444 | 3000 | 144 | No | 1030.4 | 515.2 | 2.0 | 247 | Internal PCIe GPU (full-height, dual-slot) | |
C2075 GPU Computing Module[11] | N/A | 3000 | 144 | No | 225 | ||||||||||||
M2070/M2070Q GPU Computing Module[12] | N/A | 3132 | 150.336 | No | 225 | ||||||||||||
M2090 GPU Computing Module[13] | 1× GF110 | 650 | 512 | 1300 | N/A | GDDR5 | 384 | 61444 | 3700 | 177.6 | No | 1331.2 | 665.6 | 2.0 | 225 | Internal PCIe GPU (full-height, dual-slot) | |
S2050 GPU Computing Server | 4× GF100 | 575 | 1792 | 1150 | N/A | GDDR5 | 4× 384 | 4× 30724 | 3092 | 4× 148.4 | No | 4121.6 | 2060.8 | 2.0 | 900 | 1U rack-mount external GPUs, connect via 2x PCIe (x8 or x16) | |
S2070 GPU Computing Server | N/A | 4× 61444 | No | ||||||||||||||
K10 GPU Accelerator[14] | Kepler | 2× GK104 | N/A | 3072 | 745 | ? | GDDR5 | 2× 256 | 2× 4096 | 5000 | 2× 160 | No | 4577 | 190.7 | 3.0 | 225 | Internal PCIe GPU (full-height, dual-slot) |
K20 GPU Accelerator[15][16] | 1× GK110 | N/A | 2496 | 706 | ? | GDDR5 | 320 | 5120 | 5200 | 208 | No | 3524 | 1175 | 3.5 | 225 | Internal PCIe GPU (full-height, dual-slot) | |
K20X GPU Accelerator[17] | 1× GK110 | N/A | 2688 | 732 | ? | GDDR5 | 384 | 6144 | 5200 | 250 | No | 3935 | 1312 | 3.5 | 235 | Internal PCIe GPU (full-height, dual-slot) | |
K40 GPU Accelerator[18] | 1× GK110B | N/A | 2880 | 745 | 875 | GDDR5 | 384 | 12288 | 6000 | 288 | No | 4291-5040 | 1430-1680 | 3.5 | 235 | Internal PCIe GPU (full-height, dual-slot) | |
K80 GPU Accelerator[19] | 2× GK210 | N/A | 4992 | 560 | 875 | GDDR5 | 2× 384 | 2× 12288 | 5000 | 2× 240 | No | 5591-8736 | 1864-2912 | 3.7 | 300 | Internal PCIe GPU (full-height, dual-slot) | |
M4 GPU Accelerator[20][21] | Maxwell | 1× GM206 | N/A | 1024 | 872 | 1072 | GDDR5 | 128 | 4096 | 5500 | 88 | No | 1786-2195 | 55.81-68.61 | 5.2 | 50-75 | Internal PCIe GPU (half-height, single-slot) |
M40 GPU Accelerator[21][22] | 1× GM200 | N/A | 3072 | 948 | 1114 | GDDR5 | 384 | 12288 | 6000 | 288 | No | 5825-6844 | 182.0-213.9 | 5.2 | 250 | Internal PCIe GPU (full-height, dual-slot) | |
M6 GPU Accelerator[23] | 1× GM204 | N/A | 1536 | 722 | 1051 | GDDR5 | 256 | 8192 | 4600 | 147.2 | No | 2218-3229 | 69.3-100.9 | 5.2 | 75-100 | Internal MXM GPU | |
M60 GPU Accelerator[24] | 2× GM204 | N/A | 4096 | 899 | 1178 | GDDR5 | 2× 256 | 2× 8192 | 5000 | 2× 160 | No | 7365-9650 | 230.1-301.6 | 5.2 | 225-300 | Internal PCIe GPU (full-height, dual-slot) | |
P100 GPU Accelerator[25] | Pascal | 1× GP100 | N/A | 3584 | 1328 | 1480 | HBM2 | 4096 | 16384 | 1406 | 720 | No | 9519-10609 | 4760-5304 | 6.0 | 300 | NVLink card |
Model | Micro-architecture | Chips | Core clock (MHz) |
Shaders | Memory | Processing Power (GFLOPS)5 | Compute capability |
TDP (watts) |
Notes/Form factor | ||||||||
Thread Processors (total) |
Base Clock (MHz) | Max Boost Clock (MHz)7 |
Bus type | Bus width (bit) |
Memory (MB) |
Clock (MT/s) |
Bandwidth (total) (GB/s) |
Single Precision (MAD+MUL) |
Single Precision (MAD or FMA) |
Double Precision (FMA) |
See also
References
- ↑ High Performance Computing - Supercomputing with Tesla GPUs
- ↑ [1]
- ↑ Tesla Technical Brief (PDF)
- ↑ "Nvidia to Integrate ARM Processors in Tesla." (Article from 2012)
- ↑ "Nvidia chases defense, intelligence ISVs with GPUs."
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ 9.0 9.1 Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ 21.0 21.1 Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
- ↑ Lua error in package.lua at line 80: module 'strict' not found.
External links
Wikimedia Commons has media related to Nvidia Tesla series. |
- Articles which use infobox templates with no data rows
- Articles containing potentially dated statements from 2012
- Articles with unsourced statements from June 2015
- Pages with broken file links
- Commons category link is locally defined
- Computer architecture
- Graphics processing units
- GPGPU
- Nvidia
- Parallel computing
- Video cards