

So I'm browsing Steam Summer fest games, and then I come across this:

https://store.steampowered.com/app/2230110/Fae_Farm/

£50... that is one hell of an expensive indie farm sim game, that's for sure.

I guess Square and Sony upping their prices is getting some daring indies to try the same, because that price seems beyond reasonable for what little the game offers compared to all the other indie farming sim titles out there.

Last edited by Chazore - on 10 June 2023

Step right up, come on in, feel the buzz in your veins, I'm like a chemical electrical right into your brain and I'm the one who killed the Radio, soon you'll all see

So pay up motherfuckers you belong to "V"

Pemalite said:

GPU chiplets are super super hard.

You can't turn the GPU cores into chiplets because you need low latency and high bandwidth... Both things that chiplets heavily impact.

CPU cores can get away with it... Because CPU cores aren't transferring upwards of several terabytes of data per second.

AMD's approach was to actually break up their memory interface and have lots of memory interfaces instead.

So instead of having a single 384bit interface connect directly with memory, AMD has made 12x 32bit interfaces.
Each chiplet thus houses 2x 32bit interfaces, which interface directly with DRAM to make a cumulative 384bit interface.

It also means instead of 1x fabric going from one large memory controller to the CCD like on a Ryzen CPU, there are 6x fabrics which can do 900GB/s of bi-directional traffic... Each.
Suddenly they have more than enough bandwidth to interface the GPU cores to the memory interface... But still not enough for multiple GPU core chiplets.

It also means that there is very little room to interface additional chips to AMD GPUs, as the memory controller chiplet approach takes up most of the area surrounding the cores themselves.

But I could see AMD integrating them into the memory controller chiplets at a later date... Sadly I don't think we will ever see the holy grail of multi-GPU chiplets due to the lack of bandwidth of the Infinity Links.

But I do see a future where we have stacked GPU chiplet dies, just like how we stack cache on CPUs now.

That's true for GPU cores, but upscaling and frame generation don't need to read anything from the GPU except for the frame buffer.

In theory, you could fit a ray-tracing solution there as well, although you'd be very limited in terms of shading (like other post-processing done in the ROPs), but maybe something clever could be worked out, such as path tracing on alternate frames and matching them in the back buffer.

I keep hoping we'll see smart stuff like that from AMD or Intel, but nope. They just want to fund stuff that lets them compete in HPC.

As for 3D stacked ICs, that feels like an even greater challenge than chiplets ('2.5D'), and most of the industry seems to agree.
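To put rough numbers on the split memory interface Pemalite describes above, here's a minimal Python sketch. The chiplet count, 2x 32bit interfaces and 900GB/s-per-fabric figures come straight from the quote; the 20 Gbps GDDR6 pin speed is an assumption for illustration only.

```python
# Rough sketch of the split memory-interface arithmetic described above.
# The GDDR6 pin speed is an assumed figure; chiplet count, bus split and
# per-link fabric bandwidth are taken from the post.

MEMORY_CHIPLETS = 6           # memory controller dies
BITS_PER_CHIPLET = 2 * 32     # each die carries 2x 32bit DRAM interfaces
GDDR6_GBPS_PER_PIN = 20       # assumed effective data rate per pin (Gbps)
FABRIC_GBS_PER_LINK = 900     # bi-directional GB/s per fabric link (from the post)

total_bus_width = MEMORY_CHIPLETS * BITS_PER_CHIPLET           # 384 bit
dram_bandwidth_gbs = total_bus_width * GDDR6_GBPS_PER_PIN / 8  # GB/s to DRAM
fabric_bandwidth_gbs = MEMORY_CHIPLETS * FABRIC_GBS_PER_LINK   # GB/s across all links

print(f"Cumulative bus width : {total_bus_width} bit")          # 384 bit
print(f"DRAM bandwidth       : {dram_bandwidth_gbs:.0f} GB/s")  # ~960 GB/s
print(f"Fabric bandwidth     : {fabric_bandwidth_gbs} GB/s")    # 5400 GB/s
```

The fabric links collectively offer far more bandwidth than the DRAM behind them, which is why feeding the GPU cores from chiplet memory controllers works, while splitting the shader array itself across chiplets would still be bandwidth-starved.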

Would you think this was a render if you didn't know beforehand, let alone a real time one?



haxxiy said:

That's true for GPU cores, but upscaling and frame generation don't need to read anything from the GPU except for the frame buffer.

In theory, you could fit a ray-tracing solution there as well, although you'd be very limited in terms of shading (like other post-processing done in the ROPs), but maybe something clever could be worked out, such as path tracing on alternate frames and matching them in the back buffer.

I keep hoping we'll see smart stuff like that from AMD or Intel, but nope. They just want to fund stuff that lets them compete in HPC.

As for 3D stacked ICs, that feels like an even greater challenge than chiplets ('2.5D'), and most of the industry seems to agree.

Up-scaling and frame generation do need a lot of fast caches to keep data buffered to work from, otherwise it does get expensive.

nVidia could in theory make all the Ray Tracing and Tensor/AI routines run on the shader engines/CUDA cores, as they are capable of INT4/INT8/INT16 and floating point to various degrees, but dedicating hardware and slimming down those units is more space-efficient than making each shader pipeline more flexible... Although nVidia did make its CUDA cores capable of performing integer and floating point operations concurrently with Turing, if I remember correctly.

Many simpler upscaling/anti-aliasing methods these days are actually performed on the shaders/CUDA cores rather than the ROPs, because it's cheap and fast... Often the ROPs are a bottleneck in a GPU design, hence why the GeForce 760 was such a massive improvement over the 660... It had fewer CUDA cores and teraflops, but much higher ROP throughput. (2.25 teraflops on the 760 vs 2.45 teraflops on the 660.)
In some scenarios where AA was done on the CUDA cores, the 660 would be faster than the 760, but once you start leveraging MSAA on the ROPs, the 760 would dwarf the 660. (This reaffirms the idea that teraflops are bullshit for determining gaming performance as well.)

So you could in theory run Ray Tracing operations on the Tensor cores rather than the RT cores if you wanted... But we can't couple those operations to the ROPs as they just don't have the INT/FP throughput.

3D chip stacking started out with stacked DRAM, then NAND. Now we are doing it with stacked cache on top of CPU cores.
The next jump will be stacked chiplets. Yeah, it will be a challenge, but so were chiplets once upon a time.

Stacked chips though can actually get around the limited latency and bandwidth issues of Fabric interconnects, so there are inherent benefits... Imagine current Ryzen chips, using the same fabrication, but consuming 50% LESS power than they do currently by stacking the chiplets. (That's how much energy is wasted on the fabric.)
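A tiny sketch of the teraflops-vs-ROP point: peak FP32 throughput is computed from shader count and clock, while pixel fillrate is computed from ROP count and clock, so a card can lose on paper teraflops and still win once the ROPs become the bottleneck. The two example cards below use illustrative figures, not verified spec-sheet numbers.

```python
# Why raw TFLOPS can mislead: shader throughput and ROP (pixel) throughput
# come from different units. Card figures below are illustrative assumptions.

def fp32_tflops(shader_cores: int, clock_ghz: float) -> float:
    """Peak FP32 throughput: 2 ops (one FMA) per core per clock."""
    return shader_cores * 2 * clock_ghz / 1000.0

def pixel_fillrate_gpx(rops: int, clock_ghz: float) -> float:
    """Peak pixel fillrate in Gpixels/s: one pixel per ROP per clock."""
    return rops * clock_ghz

# Hypothetical "older" card: more shaders, fewer ROPs.
card_a = {"cores": 1344, "rops": 24, "clock": 0.915}
# Hypothetical "newer" card: fewer shaders, more ROPs at a higher clock.
card_b = {"cores": 1152, "rops": 32, "clock": 0.980}

for name, c in (("card_a", card_a), ("card_b", card_b)):
    print(name,
          f"{fp32_tflops(c['cores'], c['clock']):.2f} TFLOPS,",
          f"{pixel_fillrate_gpx(c['rops'], c['clock']):.1f} Gpixel/s")
```

With these assumed figures, card_b ends up roughly 8% lower in TFLOPS but around 40% higher in pixel fillrate, which mirrors the 760-vs-660 behaviour described above.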



--::{PC Gaming Master Race}::--

TallSilhouette said:

Would you think this was a render if you didn't know beforehand, let alone a real time one?

Impressive. But I'd like to know the specs of his system.



Please excuse my bad English.

Currently gaming on a PC with an i5-4670k@stock (for now), 16GB of RAM at 1600 MHz and a GTX 1070

Steam / Live / NNID : jonxiquet    Add me if you want, but I'm a single player gamer.

Pemalite said:

3D chip stacking started out with stacked DRAM, then NAND. Now we are doing it with stacked cache on top of CPU cores.
The next jump will be stacked chiplets. Yeah, it will be a challenge, but so were chiplets once upon a time.

Stacked chips though can actually get around the limited latency and bandwidth issues of Fabric interconnects, so there are inherent benefits... Imagine current Ryzen chips, using the same fabrication, but consuming 50% LESS power than they do currently by stacking the chiplets. (That's how much energy is wasted on the fabric.)

I don't disagree that it will eventually be the way to go, just that chiplets will be less of a challenge and become viable first as a mid-step, perhaps even in consumer-grade GPUs. Infinity Link has a thousand times more bandwidth than either CrossFire or SLI, so I'm sure engineers can find a way around any lingering latency issues.

That, or most people will be streaming games within ten years or so, so multi-chip GPUs will become more desirable to the industry players anyway.
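For scale on the bandwidth comparison, here's a rough sketch; the legacy bridge and PCIe figures are ballpark assumptions for illustration rather than official numbers, and the Infinity fanout total reuses the 6x 900GB/s figure from the earlier post.

```python
# Ballpark comparison of multi-GPU link bandwidth (assumed figures, GB/s).
# These are illustrative orders of magnitude, not official specifications.
links_gbs = {
    "SLI bridge (assumed)": 2,
    "CrossFire over PCIe 3.0 x16 (assumed)": 16,
    "Infinity fanout links, total (assumed)": 5400,  # 6 links x 900 GB/s
}

baseline = links_gbs["SLI bridge (assumed)"]
for name, bw in links_gbs.items():
    print(f"{name:40s} {bw:6d} GB/s  ({bw / baseline:7.0f}x the bridge)")
```

Even with generous assumptions for the old bridges, the on-package links sit roughly three orders of magnitude ahead, which is the gap being pointed at here.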

Apple M2 Ultra SoC Isn’t Faster Than AMD & Intel Last Year Desktop CPUs, 50% Slower Than NVIDIA RTX 4080

https://wccftech.com/apple-m2-ultra-soc-isnt-faster-than-amd-intel-last-year-desktop-cpus-50-slower-than-nvidia-rtx-4080/

Performance per watt will be Apple's strong suit, but when you consider just how much the M2 Ultra costs vs a PC with, say, a 13900K, the question becomes how many years of extended use it will take to make up the difference you are paying, because a base model Mac Studio with the M2 Ultra starts at $4000 USD. You can get a 13900K + 4090 for $3000 USD quite easily, but a 4090 is over 2x faster. Realistically, the only way these products make sense is if you need to use Apple-specific software or need tons of memory for AI, since it's cheaper to spend $6600 USD on 192GB of unified memory on the M2 Ultra than $40,000 on a single Nvidia H100 with 80GB of VRAM.
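Using the prices quoted above (the post's figures, not independently re-checked), the memory-capacity argument is easy to put in per-GB terms with a quick sketch:

```python
# Cost per GB of accelerator-accessible memory, using the prices quoted above.
configs = {
    "Mac Studio M2 Ultra, 192GB unified": (6600, 192),
    "NVIDIA H100, 80GB": (40000, 80),
}

for name, (price_usd, mem_gb) in configs.items():
    print(f"{name:38s} ${price_usd / mem_gb:,.0f} per GB")
```

Roughly $34 per GB versus $500 per GB, which is why the unified-memory angle is the one place where the Mac Studio pricing starts to look defensible.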

Don't forget to tune into the Xbox Showcase here:

Last edited by Jizz_Beard_thePirate - on 11 June 2023

                  

PC Specs: CPU: 7800X3D || GPU: Strix 4090 || RAM: 32GB DDR5 6000 || Main SSD: WD 2TB SN850

Alright it was a good conference. I'll post some of the trailers I am interested in:



                  

PC Specs: CPU: 7800X3D || GPU: Strix 4090 || RAM: 32GB DDR5 6000 || Main SSD: WD 2TB SN850

Didn't watch from the beginning, but so many decent games and new IPs were shown there. Also pretty funny that Microsoft just embarrassed Sony with their presentation. And they were on a hiatus for a year.

The CP 2077 DLC I'm super excited for. That BioShock-looking game from inXile looked promising too.



Fuck me Starfield looks incredible



                  

PC Specs: CPU: 7800X3D || GPU: Strix 4090 || RAM: 32GB DDR5 6000 || Main SSD: WD 2TB SN850