VGLeaks: Durango Memry System Overview & Example

ethomaz

Banned

186,625

33999 posts since 02/02/07

Recent Badges:

A Badge Within A Badge Earned 20 badges.
3 Years Has been a VGChartz member for over 3 years.
One Piece at a Time Add your first game to your collection.
We're Still Watching You Received 50,000 profile views.
Mr. Punctual 10 game release dates added to the VGChartz database.
Everything's Falling Into Place Add a total of 100 games to your collection.

Currently Playing:

Metal Gear Solid HD Collection (PS3)

ethomaz on 13 March 2013

Memory

As you can see on the right side of the diagram, the Durango console has:

8 GB of DRAM.
32 MB of ESRAM.

DRAM

The maximum combined read and write bandwidth to DRAM is 68 GB/s (gigabytes per second). In other words, the sum of read and write bandwidth to DRAM cannot exceed 68 GB/s. You can realistically expect that about 80 – 85% of that bandwidth will be achievable (54.4 GB/s – 57.8 GB/s).

DRAM bandwidth is shared between the following components:

- CPU
- GPU
- Display scan out
- Move engines
- Audio system
- ESRAM

The maximum combined ESRAM read and write bandwidth is 102 GB/s. Having high bandwidth and lower latency makes ESRAM a really valuable memory resource for the GPU.

ESRAM bandwidth is shared between the following components:

- GPU
- Move engines

Video encode/decode engine. System coherency

There are two types of coherency in the Durango memory system:

Fully hardware coherent

I/O coherent

[...]

The CPU

The Durango console has two CPU modules, and each module has its own 2 MB L2 cache. Each module has four cores, and each of the four cores in each module also has its own 32 KB L1 cache.

When a local L2 miss occurs, the Durango console probes the adjacent L2 cache via the north bridge. Since there is no fast path between the two L2 caches, to avoid cache thrashing, it’s important that you maximize the sharing of data between cores in a module, and that you minimize the sharing between the two CPU modules.

Typical latencies for local and remote cache hits are shown in this table.

- Remote L2 hit approximately 100 cycles
- Remote L1 hit approximately 120 cycles
- Local L1 Hit 3 cycles for 64-bit values
- 5 cycles for 128-bit values
- Local L2 Hit approximately 30 cycles
- Each of the two CPU modules connects to the north bridge by a bus that can carry up to 20.8 GB/s in each direction.

From a program standpoint, normal x86 ordering applies to both reads and writes. Stores are strongly ordered (becoming visible in program order with no explicit memory barriers), and reads are out of order.

Keep in mind that if the CPU uses Write Combined memory writes, then a memory synchronization instruction (SFENCE) must follow to ensure that the writes are visible to the other client devices.

The GPU

The GPU can read at 170 GB/s and write at 102 GB/s through multiple combinations of its clients. Examples of GPU clients are the Color/Depth Blocks and the GPU L2 cache.

The GPU has a direct non-coherent connection to the DRAM memory controller and to ESRAM. The GPU also has a coherent read/write path to the CPU’s L2 caches and to DRAM.

For each read and write request from the GPU, the request uses one path depending on whether the accessed resource is located in “coherent” or “non-coherent” memory.

Some GPU functions share a lower-bandwidth (25.6 GB/s), bidirectional read/write path. Those GPU functions include:

- Command buffer and vertex index fetch
- Move engines
- Video encoding/decoding engines
- Front buffer scan out
- As the GPU is I/O coherent, data in the GPU caches must be flushed before that data is visible to other components of the system.

The available bandwidth and requirements of other memory clients limit the total read and write bandwidth of the GPU.

Move engines

The Durango console has 25.6 GB/s of read and 25.6 GB/s of write bandwidth shared between:

- Four move engines
- Display scan out and write-back
- Video encoding and decoding
- The display scan out consumes a maximum of 3.9 GB/s of read bandwidth (multiply 3 display planes × 4 bytes per pixel × HDMI limit of 300 megapixels per second), and display write-back consumes a maximum of 1.1 GB/s of write bandwidth (multiply 30 bits per pixel × 300 - megapixels per second).

You may wonder what happens when the GPU is busy copying data and a move engine is told to copy data from one type of memory to another. In this situation, the memory system of the GPU shares bandwidth fairly between source and destination clients. The maximum bandwidth can be calculated by using the peak-bandwidth diagram at the start of this article.

http://www.vgleaks.com/durango-memory-system-overview/
&
http://www.vgleaks.com/durango-memory-system-example/

Turkish

Currently Offline

124,463

11601 posts since 13/01/11

Recent Badges:

Genocide 5,000 posts on the gamrConnect forums.
Making Progress Earned 50,000 gamrPoints
1st Birthday Has been a VGChartz member for over 1 year.
8 Years Has been a VGChartz member for over 8 years.
13 Years Has been a VGChartz member for over 13 years.
Mirror Image Awarded for uploading an avatar.

Turkish on 13 March 2013

Whats the secret sauce?

ethomaz

Banned

186,625

33999 posts since 02/02/07

Recent Badges:

Completed Persona 100% completed profile.
Site Veteran Has been a VGChartz member for over 5 years.
Stealth Critic 100 critic reviews added to the VGChartz database.
2 Years Has been a VGChartz member for over 2 years.
Cleanse The Wilderness 250 comments posted on VGChartz news articles.
Drake's Diary 100 status updates.

Currently Playing:

Metal Gear Solid HD Collection (PS3)

ethomaz on 13 March 2013

Turkish said:
Whats the secret sauce?

Seems like the Move Engines... free up the CPU and GPU for these tasks...

But I'm confuse because the diagram just shows the GPU with direct access to eDRAM... the CPU not.

disolitude

Currently Offline

87,040

14997 posts since 13/01/08

Recent Badges:

6 Years Has been a VGChartz member for over 6 years.
Making Progress Earned 50,000 gamrPoints
Littlest Genocide 1,000 posts on the gamrConnect forums.
Ride Into the Sunset Managed to avoid being banned for 3 months.
Cleanse The Wilderness 250 comments posted on VGChartz news articles.
8 Years Has been a VGChartz member for over 8 years.

disolitude on 13 March 2013

Turkish said:
Whats the secret sauce?

Built in DVR and bundling with cable providers?

theprof00

Currently Offline

154,498

31184 posts since 02/04/08

Recent Badges:

Leaving Limbo 100 posts on the gamrConnect forums.
8 Years Has been a VGChartz member for over 8 years.
6 Years Has been a VGChartz member for over 6 years.
You're Popular... Author of at least 5 gamrConnect threads that has received over 1,000 replies.
Wanted Win VG$100 in total.
Painting A Picture 100 screenshots added to the VGChartz database.