NVIDIA's massive A100 GPU isn't for you

Ampere's long-awaited debut comes inside a $200,000 data center computer.

Thu, May 14, 2020, 9:00 AM·4 min read

In this mini-episode of our explainer show, Upscaled, we break down NVIDIA's latest GPU, the A100, and its new graphics architecture Ampere. Announced at the company's long-delayed GTC conference, the A100 isn't intended for gamers, or even for workstation users. Instead, it's the direct replacement to the Volta-based V100 — a 2017 GPU purpose-built for data centers.

Volta never directly came to consumers — aside from the Titan V and a Quadro workstation card — but the improvements and tensor cores it introduced were a key part of Turing, the architecture which underpins almost all of NVIDIA's current GeForce and Quadro cards. Whether the next generation of GeForce and Quadro cards are also called Ampere, or something else entirely, you can expect some of the A100’s new features to make their way over — and a lot of them not to.

NVIDIA was a little hazy on the finer details of Ampere, but what we do know is that the A100 GPU is huge. Its die size is 826 square millimeters, which is larger than both the V100 (815mm2) and the NVIDIA’s flagship gaming card, the RTX 2080 Ti (754mm2).

Those might not sound like big differences, but the A100 is NVIDIA's first GPU to be built on TSMC's 7nm process -- its current models are on 12nm. That means there’s a roughly 40-percent reduction in the amount of space required for each transistor, which has let NVIDIA, apparently, squeeze 54 billion transistors into the A100. We say apparently because that's such an enormous increase over, say, the 2080 Ti’s 18.6 billion transistors, that it almost feels like someone's done their math wrong. With that said, some quick calculations put the A100's transistor density at around 65 million per square mm, which is within the realms of possibility on TSMC's 7nm process.

Moving away from transistors, the A100 has 6,912 FP32 CUDA cores, 3,456 FP64 CUDA cores and 422 Tensor cores. Compare that to the V100, which has 5,120 CUDA cores and 640 Tensor cores, and you can see just how much of an impact the new process has had in allowing NVIDIA to squeeze more components into a chip that’s only marginally larger than the one it replaces.

The A100 is being sold packaged in the DGX A100, a system with 8 A100s, a pair of 64-core AMD server chips, 1TB of RAM and 15TB of NVME storage, for a cool $200,000. For context, the DGX-1, a similar system with 8 V100s, cost around $150,000 at launch. That equates to a 33-percent generational price hike, but NVIDIA claims the A100 is 20 times faster at AI inference and training compared to the V100. And AI is really all these cards are likely to be used for — NVIDIA has already sold DGX A100s to partners in the space, and has sent one to the Argonne National Lab to aid in the fight against COVID-19.

This 20x performance jump is partially due to the massive increase in cores, but architectural improvements and new ways of doing math (which we dive into in our video) are likely contributing much more. The A100 is also helped by its memory: It has 40GB of HBM2 memory, compared to the 16GB the V100 launched with (the company bumped the memory on Volta cards to 32GB later), which means each DGX A100 system has 320GB of VRAM to play with.

So what can this tell us about NVIDIA's much-anticipated new gaming cards? Well, concretely, some of these AI improvements will find their way into GeForce cards, improving performance in upscaling tasks like DLSS, or denoising, which is a key aspect of ray-tracing.

In a briefing with reporters, NVIDIA CEO Jensen Huang all-but-confirmed that, while there’s “great overlap in the architecture” between Ampere and the upcoming consumer cards, these gaming cards won’t feature HBM2 memory, and the sizing of the different elements in the chips will be very different, as they’ll be more focused on graphics performance than high-precision math. That means you should expect far higher gains in FP32 compute (which is where that TFLOP figure you hear about whenever a new GPU or console is launched comes from) for consumer cards, given so much of the A100’s die is given over to FP64-focused hardware.

Moving into theory crafting, a GeForce GPU the size of a 2080 Ti with a density approaching that of the A100, entirely targeted at gaming, would probably be twice as fast. To be clear, that's highly unlikely to happen: NVIDIA will probably shrink the chip down considerably, reducing its costs and letting it sell much faster cards at similar prices to its current generation. The current rumor du jour comes from the YouTuber Moore's Law is Dead, who suggests that the anticipated "3080 Ti" will have around 30-percent more cores than the 2080 Ti, which could make for a flagship GPU on a much-more reasonable 450mm2 die. Will a smaller die mean GPU prices go down? Given those specs will likely put NVIDIA’s offerings ahead of AMD’s rumored new cards, we doubt it.

Engadget
ISPs are fighting to raise the price of low-income broadband
Internet service providers are objected to the lower rates they need to offer lower income customers if they want to obtain government funds from a new Internet access program.
Engadget
Amazon is giving The Boys the prequel treatment
The cast and crew of Amazon's The Boys announced a bunch of new spinoffs for the supe action series.
Engadget
You can date everything in Date Everything!
Date Everything! is an upcoming dating sim game that lets you date evert
Engadget
The Bioshock movie is still happening but with a reduced budget
The Bioshock movie is still happening, but with steep budget cuts. It’s being reconfigured to become a ‘more personal’ film.
Engadget
Warner Bros. Discovery sues the NBA in a last-ditch effort to block Amazon’s new streaming package
Warner Bros. Discovery followed through on its threat to “take appropriate action” against the NBA for rejecting its broadcasting rights offer. On Friday, the media company sued the league after the NBA turned down its bid to match Amazon’s streaming package.
Engadget
Apple’s M3 MacBook Air with 16GB of RAM is $200 off right now
Apple’s M3 MacBook Air combines Apple’s lightest and thinnest laptop design with the cutting-edge horsepower of the latest Apple silicon chip. You can get the 2024 model on sale for $200 off right now.
Engadget
Here's how to stop Grok's AI models using your tweets for training
X automatically opted users into letting Grok's AI models train on their tweets and interactions with the chatbot. Here's how to opt out.
Engadget
The 10th-generation iPad is back down to $300, plus the rest of this week's best tech deals
The week after Amazon's Prime Day can be a bit sleepy for deals, but we still found a few decent discounts on gear we've tested and recommend.
Engadget
The 65-inch LG C3 OLED TV is nearly half off for today only
The 65-inch LG C3 OLED TV is nearly half off for today only. That brings the set down to a record low of $1,300.
Engadget
NASA's Perseverance rover found a rock on Mars that could indicate ancient life
A Martian rock sample collected by Perseverance contains "chemical signatures and structures" that could've been formed by ancient microbial life from billions of years ago.
Engadget
Apple agrees to stick by Biden administration's voluntary AI safeguards
Apple has joined more than a dozen other tech companies in signing up for the Biden administration's voluntary AI code of practice.
Engadget
North Korean who used ransomware to attack US healthcare providers has been indicted
A grand jury in Kansas City has indicted Rim Jong Hyok, a North Korean intelligence operative who allegedly used ransomware to attack health providers' systems in the US.
Engadget
Samsung Galaxy Ring review: A bit basic, a bit pricey
The Galaxy Ring is comfortable and seemingly basic, but actually delivers detailed insight on your sleep, walks and runs.
Engadget
Apple's 14-inch MacBook Pro laptop with an M3 Pro chip is $300 off at Amazon
Apple's well-specked 14-inch MacBook Pro with an M3 Pro chip, 18GB of memory and 512GB of storage is on sale for the lowest price we've seen yet at Amazon.
Engadget
Gran Turismo 7's more realistic physics update is launching cars into orbit
Gran Turismo 7's latest update is causing some bizarre problems, making cars bounce violently or launch completely into the air.
Engadget
The Morning After: OpenAI reveals its AI-powered search engine, SearchGPT
The biggest news stories this morning: AI video startup Runway reportedly trained on ‘thousands’ of YouTube videos without permission, The best cameras for 2024, WhatsApp hits 100 million monthly active US users.
Engadget
The best fitness trackers for 2024
Here's a list of the best fitness trackers you can buy, as chosen by Engadget editors.
Engadget
The best cameras for 2024
Here's a list of the best cameras you can buy, as chosen by Engadget editors.
Engadget
X's Grok chatbot is misleading voters about the presidential election
Grok's AI chatbot claims that President Biden's name must stay on the ballot in nine states, a claim that is categorically false.
Engadget
Comic-Con leak sparks rumors of two remastered Soul Reaver games
A photo from Comic-Con has leaked possible remasters of two Soul Reaver games from Crystal Dynamics.

NVIDIA's massive A100 GPU isn't for you

Ampere's long-awaited debut comes inside a $200,000 data center computer.

Latest Stories

ISPs are fighting to raise the price of low-income broadband

Amazon is giving The Boys the prequel treatment

You can date everything in Date Everything!

The Bioshock movie is still happening but with a reduced budget

Warner Bros. Discovery sues the NBA in a last-ditch effort to block Amazon’s new streaming package

Apple’s M3 MacBook Air with 16GB of RAM is $200 off right now

Here's how to stop Grok's AI models using your tweets for training

The 10th-generation iPad is back down to $300, plus the rest of this week's best tech deals

The 65-inch LG C3 OLED TV is nearly half off for today only

NASA's Perseverance rover found a rock on Mars that could indicate ancient life

Apple agrees to stick by Biden administration's voluntary AI safeguards

North Korean who used ransomware to attack US healthcare providers has been indicted

Samsung Galaxy Ring review: A bit basic, a bit pricey

Apple's 14-inch MacBook Pro laptop with an M3 Pro chip is $300 off at Amazon

Gran Turismo 7's more realistic physics update is launching cars into orbit

The Morning After: OpenAI reveals its AI-powered search engine, SearchGPT

The best fitness trackers for 2024

The best cameras for 2024

X's Grok chatbot is misleading voters about the presidential election

Comic-Con leak sparks rumors of two remastered Soul Reaver games

About

Sections

Contribute

Buying Guides