An alleged company slide detailing the architecture of AMD's upcoming "Hawaii" GPU was leaked to the web, revealing a monstrous combination of components. The GPU maintains the same component hierarchy as "Tahiti." The most distinguishing feature here is that whereas "Tahiti" features two shader engines, "Hawaii" features four. What it translates to, is double the geometry processing power, four independent geometry processors with a tessellation unit each, and double the number of ROPs, at 64. Each shader unit features 11 compute units (CU), the number-crunching machinery of the GPU. Each CU holds 4 TMUs (texture memory units), and 64 stream processors.The four shader engines of "Hawaii" are tied to a unified command processing structure, a 1 megabyte L2 cache, a 512-bit wide GDDR5 memory interfaces, and the ancillaries, that include the PCI-Express 3.0 x16 bus interface, six display controllers (six TMDS links in all), CrossFireX XDMA, and multimedia accelerators that include UVD (accelerates high-def video), VCE (video codec engine, accelerates multimedia codecs), and the new TrueAudio hardware DSP.

17 Comments on AMD "Hawaii" Architecture Diagram Leaked

#1 Delta6326

Beastly. Posted on Oct 14th 2013, 7:12 Reply

#2 Lionheart

That description just makes me want one of these even more :cry: Posted on Oct 14th 2013, 7:46 Reply

#3 MxPhenom 216

ASIC Engineer :twitch: :eek: :twitch: :D Posted on Oct 14th 2013, 8:20 Reply

#4 jigar2speed

On paper this GPU seems be a computing beast. O_O Posted on Oct 14th 2013, 8:30 Reply

#5 1d10t





-=edited=- McSteel The power density would have been too great on 22nm, it'd be extremely difficult to cool the GPU properly. Don't confusing yourself with your uArch processor :slap:

it explain well you didn't have proper cooling for your own :troll: :nutkick: Interesting fact that this is still on 28nm :toast:-=edited=-Don't confusing yourself with your uArch processor :slap:it explain well you didn't have proper cooling for your own :troll: :nutkick: Posted on Oct 14th 2013, 10:17 Reply

#6 McSteel

The power density would have been too great on 22nm, it'd be extremely difficult to cool the GPU properly. Posted on Oct 14th 2013, 10:24 Reply

#7 buggalugs

Yep, I want....



If somebody can tell me how a 512 bit memory interface uses less area than a 384 bit one that would be great.



I assume they're talking about the memory interface at the GPU end not the board. Posted on Oct 14th 2013, 13:51 Reply

#8 NC37

A GPU with L2 cache...huh...thats something I didn't expect to see. Wonder if they did that based on their experience working with APUs.



If AMD wasn't rehashing some of the 7x00 series models for this gen then I'd almost think we may finally be coming out of the stagnation period that we've been in since Fermi launched. Not that it has been bad, we've had some great GPUs. But it is about time we moved on. Posted on Oct 14th 2013, 14:16 Reply

#9 TheHunter

NC37 A GPU with L2 cache...huh...thats something I didn't expect to see. Wonder if they did that based on their experience working with APUs.



. Its nothing new, I think its been around since ATI 5000 series.. Even 6?year old G80 aka 8800gtx had L1, L2 cache.





Its here mostly for compute and yeah its very small, obviously. :) Its nothing new, I think its been around since ATI 5000 series.. Even 6?year old G80 aka 8800gtx had L1, L2 cache.Its here mostly for compute and yeah its very small, obviously. :) Posted on Oct 14th 2013, 15:08 Reply

#10 acperience7

Really curious to see if 2 of these could take advantage of the extra lanes in a 4820k or a 4960x. Posted on Oct 14th 2013, 17:08 Reply

#11 HisDivineOrder

buggalugs Yep, I want....



If somebody can tell me how a 512 bit memory interface uses less area than a 384 bit one that would be great.



I assume they're talking about the memory interface at the GPU end not the board. It appears they optimized the memory interface since GCN 1.0 for GCN 1.1. Makes you wish they'd updated the 7970/R9 280X, 7950/R9 280, 7870/R9 270X, and 7850/R9 270 with GCN 1.1 just to see what improvements would have come from the optimization of diespace. If I recall correctly, the rumored improvements of Volcanic Islands (when it was a full GPU update instead of reheated leftovers with a new high end) were going to be "higher clockspeeds" and/or "smaller dies."



It kinda makes a bit more sense when given how optimized R9 290X seems to be. It appears they optimized the memory interface since GCN 1.0 for GCN 1.1. Makes you wish they'd updated the 7970/R9 280X, 7950/R9 280, 7870/R9 270X, and 7850/R9 270 with GCN 1.1 just to see what improvements would have come from the optimization of diespace. If I recall correctly, the rumored improvements of Volcanic Islands (when it was a full GPU update instead of reheated leftovers with a new high end) were going to be "higher clockspeeds" and/or "smaller dies."It kinda makes a bit more sense when given how optimized R9 290X seems to be. Posted on Oct 14th 2013, 17:57 Reply

#12 W1zzard

HisDivineOrder If somebody can tell me how a 512 bit memory interface uses less area than a 384 bit one that would be great the memory controller uses less silicon surface area _of the gpu_



the pcb with the memory controller signals will be much more complicated, crowded and expensive, more than a factor of two, when compared to 256 bit. the memory controller uses less silicon surface area _of the gpu_the pcb with the memory controller signals will be much more complicated, crowded and expensive, more than a factor of two, when compared to 256 bit. Posted on Oct 14th 2013, 18:13 Reply

#13 itsakjt

Freaking awesome!



512 bit bus width! That's freaking amazing. I wonder how much these will cost! Posted on Oct 14th 2013, 19:48 Reply

#14 theoneandonlymrk

W1zzard the memory controller uses less silicon surface area _of the gpu_



the pcb with the memory controller signals will be much more complicated, crowded and expensive, more than a factor of two, when compared to 256 bit. very nice likeing the look of 8 Ace engines and 4 geometry/tess engines, anyone after a 7970:mad:

also 340Kb cache per Cu compute unit makes for a lot of on die cache x44 very nice likeing the look of 8 Ace engines and 4 geometry/tess engines, anyone after a 7970:mad:also 340Kb cache per Cu compute unit makes for a lot of on die cache x44 Posted on Oct 14th 2013, 20:37 Reply

#15 EpicShweetness

So....

2816 SP's at a clock speed of ??? (maybe 1050)

64 ROP's

176 TMU's

A Bus that is capable of 64 bytes (512 bits) in a single clock, and does 5 billion clocks in a second moving a substantial 320 Gigabytes in that second

Finally a pool of 4GB VRAM, wow :rockout:



Shaping up to be an amazing card, less TMU's then a GK110 based product though. Posted on Oct 14th 2013, 21:56 Reply

#16 TheHunter

EpicShweetness Shaping up to be an amazing card, less TMU's then a GK110 based product though. But more ROPs, which means higher pixel fillrate, apparently +64 gpixel/s.,

I noticed Crysis, Crysis Warhead really likes higher pixel fillrates.





And from what saw higher TMU's matter only at higher resolution or multi monitor setup But more ROPs, which means higher pixel fillrate, apparently +64 gpixel/s.,I noticed Crysis, Crysis Warhead really likes higher pixel fillrates.And from what saw higher TMU's matter only at higher resolution or multi monitor setup Posted on Oct 14th 2013, 22:43 Reply