The Titan X Hybrid mod we hand-crafted for a viewer allowed the card to stretch its boost an additional ~200MHz beyond the spec. This was done for Sam, the owner who loaned us the Titan XP, and was completed back in August. We also ran benchmarks before tearing the card down, albeit on drivers from mid-August, and never did publish a review of the card.
This content revisits the Titan XP for a review from a gaming standpoint. We'd generally recommend such a device for production workloads or CUDA-accelerated render/3D work, but that doesn't stop that the card is marketed as a top-of-the-line gaming device with GeForce branding. From that perspective, we're reviewing the GTX Titan X (Pascal) for its gaming performance versus the GTX 1080, hopefully providing a better understanding of value at each price-point. The Titan X (Pascal) card is priced at $1200 from nVidia directly.
Review content will focus on thermal, FPS, and overclocking performance of the GTX Titan X (Pascal) GP102 GPU. If you're curious to learn more about the card, our previous Titan XP Hybrid coverage can be found here:
NVIDIA Titan X (Pascal) Specs
NVIDIA Pascal vs. Maxwell Specs Comparison | |||||||
Titan X | GTX 1080 | GTX 1070 | GTX 1060 | GTX 980 Ti | GTX 980 | GTX 960 | |
GPU | GP102-? Pascal | GP104-400 Pascal | GP104-200 Pascal | GP106 Pascal | GM200 Maxwell | GM204 Maxwell | GM204 |
Transistor Count | 12B | 7.2B | 7.2B | 4.4B | 8B | 5.2B | 2.94B |
Fab Process | 16nm FinFET | 16nm FinFET | 16nm FinFET | 16nm FinFET | 28nm | 28nm | 28nm |
CUDA Cores | 3584 | 2560 | 1920 | 1280 | 2816 | 2048 | 1024 |
GPCs | 6 | 4 | 3 | 2 | 6 | 4 | 2 |
SMs | 28 | 20 | 15 | 10 | 22 | 16 | 8 |
TPCs | 28 | 20 | 15 | 10 | - | - | - |
TMUs | 224 | 160 | 120 | 80 | 176 | 128 | 64 |
Core Clock | 1417MHz | 1607MHz | 1506MHz | 1506MHz | 1000MHz | 1126MHz | 1126MHz |
Boost Clock | 1530MHz | 1733MHz | 1683MHz | 1708MHz | 1075MHz | 1216MHz | 1178MHz |
FP32 TFLOPs | 11TFLOPs | 9TFLOPs | 6.5TFLOPs | 4.4TFLOPs | 5.63TFLOPs | 5TFLOPs | 2.4TFLOPs |
Memory Type | GDDR5X | GDDR5X | GDDR5 | GDDR5 | GDDR5 | GDDR5 | GDDR5 |
Memory Capacity | 12GB | 8GB | 8GB | 6GB | 6GB | 4GB | 2GB, 4GB |
Memory Clock | 10Gbps | 10Gbps GDDR5X | 8Gbps | 8Gbps | 7Gbps GDDR5 | 7Gbps GDDR5 | 7Gbps |
Memory Interface | 384-bit | 256-bit | 256-bit | 192-bit | 384-bit | 256-bit | 128-bit |
Memory Bandwidth | 480GB/s | 320.32GB/s | 256GB/s | 192GB/s | 336GB/s | 224GB/s | 115GB/s |
TDP | 250W | 180W | 150W | 120W | 250W | 165W | 120W |
Power Connectors | 1x 8-pin 1x 6-pin |
1x 8-pin | 1x 8-pin | 1x 6-pin | 1x 8-pin 1x 6-pin |
2x 6-pin | 1x 6-pin |
Release Date | 8/2/2016 | 5/27/2016 | 6/10/2016 | 7/19/2016 | 6/01/2015 | 9/18/2014 | 01/22/15 |
Release Price | $1200 | Reference: $700 MSRP: $600 |
Reference: $450 MSRP: $380 |
Reference: $300 MSRP: $250 |
$650 | $550 | $200 |
NVIDIA Titan X (Pascal) GP102 Architecture
The Titan XP uses a GP102 GPU under Pascal architecture, the largest Pascal chip presently available on a GeForce-branded card. The only current Pascal chip that's equipped in a flashier fashion than the GP102 is GP100, used for the Tesla P100 Accelerator and not meant for gaming. We detailed GP100 a few months ago, if interested in learning more about Pascal's intricacies.
GP102 hosts a total of 6 GPCs, for 28 SMs and 3584 CUDA cores. This isn't the biggest Pascal chip out, in terms of total SMs, but it's the biggest in the GeForce line. GP102 follows GP104's architecture and splits GPCs into sets of 5SMs, as opposed to the GP100 GPC-SM alignment of 10 simultaneous multiprocessors per GPC. This is why we often remind folks that cores and SMs can't be compared cross-generation, or sometimes even intra-generation. The GP100 chip used for the Tesla P100 Accelerator is built for simulation and deep learning, which means it's got a completely different implementation of cores. GP100 makes use of FP64 cores in a way that the Titan XP does not, and so is more suitable for double-precision tasks than the FP32-focused Titan X card of this generation.
Also like GP104-400, the Titan XP runs 8 TMUs per SM, totaling 224 texture map units. Clock-rate natively operates at 1531MHz with the stock cooler and hits 1700MHz or higher during load. Note also that we pushed to nearly 2000MHz in our liquid content without needing to overclock – that's GPU Boost 3.0 in action. The Boost behavior has detected thermal headroom to push the clock higher, and so it increases the operating frequency until a point at which a new limiter is encountered (normally power or voltage).
Cache size also plays a big role in the Titan XP. The TiXP expands cache to 3072KB over the 2048KB on the GTX 1080. VRAM on-board is 12GB for the Titan XP, versus 8GB for the GTX 1080. Both use GDDR5X at 10Gbps native.
Titan XP and GP102 host 12 billion total transistors and, despite a focus on FP32, works to introduce a new INT8 deep-learning instruction set. We're not benchmarking that today, though.
NVIDIA Titan X (Pascal) Tear-Down, PCB, & Cooler
Below are a few photos from our tear-down content:
The disassembly process for the Titan XP is almost exactly the same as what is used to disassemble the GTX 1080. The card uses a vapor chamber cooler for its thermal solution, coupled with a blower fan and usual PWM control. An aluminum baseplate rests between the PCB and the faceplate, which uses thermal pads to conduct heat into the plate. Air from the blower fan then pushes air through fins atop the baseplate, dissipating heat and pulling it away from VRAM and the VRM. This air is eventually pushed through the vapor chamber cooler, then out the back of the card.
The PCB hosts two power connectors and a third set of solder points for an additional 8-pin header, located at the right side of the board. The core VRM uses a 7-phase power design, with the memory VRM using a 2-phase design.
Continue to page 2 for test methodology.