The nVidia Titan V is not a gaming card, but gives us some insights as to how the Volta architecture could react to different games and engines. The point here isn’t to look at raw performance in a hundred different titles, but to think about what the performance teaches us for future cards. This will teach us about the Volta architecture; obviously, you shouldn’t be spending $3000 to use a scientific card on gaming, but that doesn’t mean we can’t learn from it. Our tear-down is already online, but now we’re focusing on Titan V overclocking and FPS benchmarks, and then we’ll move on to production, power, and thermal content.
This nVidia Titan V gaming benchmark tests the Volta architecture versus Pascal architecture across DirectX 11, DirectX 12, Vulkan, and synthetic applications. We purchased the Titan V for editorial purposes, and will be dedicating the next few days to dissecting every aspect of the card, much like we did for Vega: Frontier Edition in the summer.
NVidia Titan V Specs vs. Titan Xp, 1080 Ti
Titan V | Tesla V100 | Tesla P100 | GTX 1080 Ti | GTX 1080 | |
GPU | GV100 | GV100 | GP100 Cut-Down Pascal | GP102 Pascal | GP104-400 Pascal |
Transistor Count | 21.1B | 21.1B | 15.3B | 12B | 7.2B |
Fab Process | 12nm FFN | 12nm FFN | 16nm FinFET | 16nm FinFET | 16nm FinFET |
CUDA Cores / Tensor Cores | 5120 / 640 | 5120 / 640 | 3584 / 0 | 3584 / 0 | 2560 / 0 |
TMUs | 320 | 224 | 224 | 160 | |
ROPs | ? | 96 (?) | 88 | 64 | |
Core Clock | 1200MHz | 1328MHz | - | 1607MHz | |
Boost Clock | 1455MHz | 1370MHz | 1480MHz | 1600MHz | 1733MHz |
FP32 TFLOPs | 15TFLOPs | 14TFLOPs | 10.6TFLOPs | ~11.4TFLOPs | 9TFLOPs |
Memory Type | HBM2 | HBM2 | HBM2 | GDDR5X | GDDR5X |
Memory Capacity | 12GB | 16GB | 16GB | 11GB | 8GB |
Memory Clock | 1.7Gbps HBM2 | 1.75Gbps HBM2 | ? | 11Gbps | 10Gbps GDDR5X |
Memory Interface | 3072-bit | 4096-bit | 4096-bit | 352-bit | 256-bit |
Memory Bandwidth | 653GB/s | 900GB/s | ? | ~484GBs | 320.32GB/s |
Total Power Budget ("TDP") | 250W | 250W | 300W | 250W | 180W |
Power Connectors | 1x 8-pin 1x 6-pin |
? | 1x 8-pin 1x 6-pin |
1x 8-pin | |
Release Date | 12/07/2017 | 4Q16-1Q17 | TBD | 5/27/2016 | |
Release Price | $3000 | $10000 | - | $700 | Reference: $700 MSRP: $600 Now: $500 |
The nVidia Titan V graphics card is not targeted at gamers, but rather at scientific and machine/deep learning applications. That does not, however, mean that the card is incapable of gaming, nor does it mean that we can’t extrapolate future key performance metrics for Volta. The Titan V is a derivative of the earlier-released GV100 GPU, part of the Tesla accelerator card series. The key differentiator is that the Titan V ships at $3000, whereas the Tesla V100 was available as part of a $10,000 developer kit. The Tesla V100 still offers greater memory capacity by 4GB – 16GB HBM2 versus 12GB HBM2 – and has a wider memory interface, but other core features remain matched or nearly matched. Core count, for one, is 5120 CUDA cores on each GPU, with 640 Tensor cores (used for Tensorflow deep/machine learning workloads) on each GPU.
The Titan V runs significantly lower clocks than what we’re used to seeing in gaming, but that’s the nature of hosting so many cores. We can make up for most of this with overclocking, which we’ll detail below:
NVidia Titan V Overclocking Results
GamersNexus Titan V Overclock Stepping (Stock Cooler) |
|||||||||
Peak Clock (MHz) | AVG Clock (MHz) | Core Offset (MHz) | MEM CLK (MHz) | MEM Offset (MHz) | Power Target | Voltage | Pass/Fail | GPU TMP | Fan Spd |
1682 | 1507 | 850.5 | 100 | Stock | P | 83 | 2371 | ||
1830 | 1605 | 850.5 | 100 | Stock | P | 78 | 3500 | ||
1830 | 1672 | 850.5 | 120 | Stock | P | 78 | 3500 | ||
1830 | 1755 | 100 | 850.5 | 120 | Stock | P | 81 | 4000 | |
1830 | 1770 | 125 | 850.5 | 120 | Stock | P | 81 | 4000 | |
1830 | 1785 | 150 | 850.5 | 120 | Stock | P | 81 | 4000 | |
1837 | 1807 | 175 | 850.5 | 120 | Stock | P - Artifacts | 81 | 4000 | |
1852 | 1822 | 200 | 850.5 | 120 | Stock | P - Artifacts | 81 | 4000 | |
1950 | 1822 | 225 | 850.5 | 120 | Stock | F - Driver Crash | 81 | 4000 | |
Restart | |||||||||
1927 | 1875 | 200 | 850.5 | 120 | Stock | P - Cool GPU | 63 | 4000 | |
1927 | 1837 | 200 | 945 | 100 | 120 | Stock | P | 74 | 4000 |
1927 | 1830 | 200 | 972 | 125 | 120 | Stock | P | 76 | 4000 |
1927 | 1830 | 200 | 999 | 150 | 120 | Stock | P | 77 | 4000 |
1927 | 1822 | 200 | 1026 | 175 | 120 | Stock | P | 78 | 4000 |
1927 | 1800 | 200 | 1039 | 200 | 120 | Stock | P - Artifacts | 81 | 4000 |
1927 | 1800 | 200 | ? | 225 | 120 | Stock | F - System Lock | 84 | 4000 |
This is our overclock “stepping table,” as we call it. For overclocking, we observed the complete stock card operating a peak frequency of 1682MHz in Firestrike Extreme’s looping stress test, with an average of 1507MHz after thermal limitations. Simply increasing the fan speed to 90% immediately pushes us to 1605MHz average, with no other changes. We next increased the power target and fan speed, dragging us up to 1672MHz average. From there, we incrementally stepped core offset upwards, eventually encountering stability issues at around 225MHz.
Our final core offset was 200MHz, and the final HBM offset was also 200MHz. We left the fan at 100% speeds and were still bound by thermals, sitting around 81-84C. We’ll be talking about power consumption in a separate content piece, as that’s another topic entirely, and will require much more research.
Volta’s Clock Behavior
This card has a lot more room for overclocking in it, but we need to liquid cool it and might do some shunt mods. As of right now, we’re hitting a hard thermal limit and a hard power limit, both of which constrain our clock potential. As with Pascal, we’ve learned that Volta’s Boost behavior is much the same as Pascal’s, and that temperatures sub-60C are the most beneficial for clock performance. Tipping past 60C, then 66C, then ~71-72C, then into the 80s does cost performance.
The card still seems to follow a similar hard limitation policy of ~84C for its thermal wall, but often remains closer to 81C under our overclock, a 100% load, and with fans at 90-100%. This is constraining us, and we’ll need a solution. Our thermal analysis content will look into that aspect of performance, but we need to get back to gaming.
Again, the point here is to take multiple types of games, key APIs, and other performance indicators to study the future of nVidia’s Volta architecture. We suspect that some of this may carry-over to Ampere, which has an indeterminate launch window.
Synthetics are important to begin with, as they’ll establish our baseline and grant us a means to analyze why performance is the way it is. With synthetics, we know specifically what each test is stressing, and so can pinpoint strengths and weaknesses in the Titan V.
Testing Platform
GN Test Bench 2017 | Name | Courtesy Of | Cost |
Video Card | This is what we're testing | - | - |
CPU | Intel i7-7700K 4.5GHz locked | GamersNexus | $330 |
Memory | GSkill Trident Z 3200MHz C14 | Gskill | - |
Motherboard | Gigabyte Aorus Gaming 7 Z270X | Gigabyte | $240 |
Power Supply | NZXT 1200W HALE90 V2 | NZXT | $300 |
SSD | Plextor M7V Crucial 1TB |
GamersNexus | - |
Case | Top Deck Tech Station | GamersNexus | $250 |
CPU Cooler | Asetek 570LC | Asetek | - |
BIOS settings include C-states completely disabled with the CPU locked to 4.5GHz at 1.32 vCore. Memory is at XMP1. The launch drivers were used for Titan V.