Titan V Gaming Benchmarks: Accelerating Async Performance in Dx12 & Vulkan

By Published December 12, 2017 at 9:33 am

The nVidia Titan V is not a gaming card, but gives us some insights as to how the Volta architecture could react to different games and engines. The point here isn’t to look at raw performance in a hundred different titles, but to think about what the performance teaches us for future cards. This will teach us about the Volta architecture; obviously, you shouldn’t be spending $3000 to use a scientific card on gaming, but that doesn’t mean we can’t learn from it. Our tear-down is already online, but now we’re focusing on Titan V overclocking and FPS benchmarks, and then we’ll move on to production, power, and thermal content.

This nVidia Titan V gaming benchmark tests the Volta architecture versus Pascal architecture across DirectX 11, DirectX 12, Vulkan, and synthetic applications. We purchased the Titan V for editorial purposes, and will be dedicating the next few days to dissecting every aspect of the card, much like we did for Vega: Frontier Edition in the summer.

NVidia Titan V Specs vs. Titan Xp, 1080 Ti

 
  Titan V  Tesla V100 Tesla P100 GTX 1080 Ti GTX 1080
GPU GV100 GV100 GP100 Cut-Down Pascal GP102 Pascal GP104-400 Pascal
Transistor Count 21.1B 21.1B 15.3B 12B 7.2B
Fab Process 12nm FFN 12nm FFN 16nm FinFET 16nm FinFET 16nm FinFET
CUDA Cores / Tensor Cores 5120 / 640 5120 / 640 3584 / 0 3584 / 0  2560 / 0
TMUs 320   224 224 160
ROPs ?   96 (?) 88 64
Core Clock 1200MHz   1328MHz - 1607MHz
Boost Clock 1455MHz 1370MHz 1480MHz 1600MHz 1733MHz
FP32 TFLOPs 15TFLOPs 14TFLOPs 10.6TFLOPs ~11.4TFLOPs 9TFLOPs
Memory Type HBM2 HBM2 HBM2 GDDR5X GDDR5X
Memory Capacity 12GB 16GB 16GB 11GB 8GB
Memory Clock 1.7Gbps HBM2 1.75Gbps HBM2 ? 11Gbps 10Gbps GDDR5X
Memory Interface 3072-bit 4096-bit 4096-bit 352-bit 256-bit
Memory Bandwidth 653GB/s 900GB/s ? ~484GBs 320.32GB/s
Total Power Budget ("TDP") 250W 250W 300W 250W 180W
Power Connectors 1x 8-pin
1x 6-pin
  ? 1x 8-pin
1x 6-pin
1x 8-pin
Release Date 12/07/2017   4Q16-1Q17 TBD 5/27/2016
Release Price $3000 $10000 - $700 Reference: $700
MSRP: $600
Now: $500

The nVidia Titan V graphics card is not targeted at gamers, but rather at scientific and machine/deep learning applications. That does not, however, mean that the card is incapable of gaming, nor does it mean that we can’t extrapolate future key performance metrics for Volta. The Titan V is a derivative of the earlier-released GV100 GPU, part of the Tesla accelerator card series. The key differentiator is that the Titan V ships at $3000, whereas the Tesla V100 was available as part of a $10,000 developer kit. The Tesla V100 still offers greater memory capacity by 4GB – 16GB HBM2 versus 12GB HBM2 – and has a wider memory interface, but other core features remain matched or nearly matched. Core count, for one, is 5120 CUDA cores on each GPU, with 640 Tensor cores (used for Tensorflow deep/machine learning workloads) on each GPU.

The Titan V runs significantly lower clocks than what we’re used to seeing in gaming, but that’s the nature of hosting so many cores. We can make up for most of this with overclocking, which we’ll detail below:

NVidia Titan V Overclocking Results

GamersNexus Titan V Overclock Stepping (Stock Cooler)

Peak Clock (MHz) AVG Clock (MHz) Core Offset (MHz) MEM CLK (MHz) MEM Offset (MHz) Power Target Voltage Pass/Fail GPU TMP Fan Spd
1682 1507 850.5 100 Stock P 83 2371
1830 1605 850.5 100 Stock P 78 3500
1830 1672 850.5 120 Stock P 78 3500
1830 1755 100 850.5 120 Stock P 81 4000
1830 1770 125 850.5 120 Stock P 81 4000
1830 1785 150 850.5 120 Stock P 81 4000
1837 1807 175 850.5 120 Stock P - Artifacts 81 4000
1852 1822 200 850.5 120 Stock P - Artifacts 81 4000
1950 1822 225 850.5 120 Stock F - Driver Crash 81 4000
Restart
1927 1875 200 850.5 120 Stock P - Cool GPU 63 4000
1927 1837 200 945 100 120 Stock P 74 4000
1927 1830 200 972 125 120 Stock P 76 4000
1927 1830 200 999 150 120 Stock P 77 4000
1927 1822 200 1026 175 120 Stock P 78 4000
1927 1800 200 1039 200 120 Stock P - Artifacts 81 4000
1927 1800 200 ? 225 120 Stock F - System Lock 84 4000

This is our overclock “stepping table,” as we call it. For overclocking, we observed the complete stock card operating a peak frequency of 1682MHz in Firestrike Extreme’s looping stress test, with an average of 1507MHz after thermal limitations. Simply increasing the fan speed to 90% immediately pushes us to 1605MHz average, with no other changes. We next increased the power target and fan speed, dragging us up to 1672MHz average. From there, we incrementally stepped core offset upwards, eventually encountering stability issues at around 225MHz.

Our final core offset was 200MHz, and the final HBM offset was also 200MHz. We left the fan at 100% speeds and were still bound by thermals, sitting around 81-84C. We’ll be talking about power consumption in a separate content piece, as that’s another topic entirely, and will require much more research.

Volta’s Clock Behavior

This card has a lot more room for overclocking in it, but we need to liquid cool it and might do some shunt mods. As of right now, we’re hitting a hard thermal limit and a hard power limit, both of which constrain our clock potential. As with Pascal, we’ve learned that Volta’s Boost behavior is much the same as Pascal’s, and that temperatures sub-60C are the most beneficial for clock performance. Tipping past 60C, then 66C, then ~71-72C, then into the 80s does cost performance.

The card still seems to follow a similar hard limitation policy of ~84C for its thermal wall, but often remains closer to 81C under our overclock, a 100% load, and with fans at 90-100%. This is constraining us, and we’ll need a solution. Our thermal analysis content will look into that aspect of performance, but we need to get back to gaming.

Again, the point here is to take multiple types of games, key APIs, and other performance indicators to study the future of nVidia’s Volta architecture. We suspect that some of this may carry-over to Ampere, which has an indeterminate launch window.

Synthetics are important to begin with, as they’ll establish our baseline and grant us a means to analyze why performance is the way it is. With synthetics, we know specifically what each test is stressing, and so can pinpoint strengths and weaknesses in the Titan V.

Testing Platform

GN Test Bench 2017 Name Courtesy Of Cost
Video Card This is what we're testing - -
CPU Intel i7-7700K 4.5GHz locked GamersNexus $330
Memory GSkill Trident Z 3200MHz C14 Gskill -
Motherboard Gigabyte Aorus Gaming 7 Z270X Gigabyte $240
Power Supply NZXT 1200W HALE90 V2 NZXT $300
SSD Plextor M7V
Crucial 1TB
GamersNexus -
Case Top Deck Tech Station GamersNexus $250
CPU Cooler Asetek 570LC Asetek -

BIOS settings include C-states completely disabled with the CPU locked to 4.5GHz at 1.32 vCore. Memory is at XMP1. The launch drivers were used for Titan V.

Continue to the next page  for gaming & synthetic benchmarks.


Prev Next »

Last modified on December 12, 2017 at 9:33 am
Steve Burke

Steve started GamersNexus back when it was just a cool name, and now it's grown into an expansive website with an overwhelming amount of features. He recalls his first difficult decision with GN's direction: "I didn't know whether or not I wanted 'Gamers' to have a possessive apostrophe -- I mean, grammatically it should, but I didn't like it in the name. It was ugly. I also had people who were typing apostrophes into the address bar - sigh. It made sense to just leave it as 'Gamers.'"

First world problems, Steve. First world problems.

We moderate comments on a ~24~48 hour cycle. There will be some delay after submitting a comment.

  VigLink badge