Carzy Zarx’s PC Gaming Emporium - Catch Up on All the Latest PC Gaming Related News

sc94597 on 09 January 2025

CNNs are more efficient at low parameter counts, but vision transformers (ViTs) scale better at higher parameter counts. Given that the hardware used for this (Tensor cores for Nvidia, Matrix cores for AMD) is usually under-utilized when gaming, AMD will probably have to switch to ViTs as well to keep up, since there is plenty of room to scale for better quality.

Edit: The biggest difference will also be observable in motion, because that is probably where ViTs will shine over CNNs: temporal consistency, thanks to the attention mechanism.
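To make the attention point a bit more concrete, here is a rough toy sketch (not how any shipping upscaler actually works, and all the feature vectors are made up). It assumes each frame is split into patches with one small feature vector per patch, and it skips the learned query/key/value projections a real ViT would have. The idea is just that a patch in the current frame can match a patch in the previous frame no matter how far it moved, whereas a small convolution kernel only sees nearby pixels.

```python
import math

def softmax(xs):
    # Subtract the max for numerical stability before exponentiating.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention_weights(query, keys):
    """Scaled dot-product attention weights of one query over all keys."""
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    return softmax(scores)

# Hypothetical per-patch features, invented for illustration only.
# Previous frame: a distinctive "edge" patch sits at position 0.
prev_frame = [[4.0, 0.0], [0.0, 1.0], [0.0, 1.0], [0.0, 1.0]]
# Current frame: the same edge has moved to position 3.
cur_frame = [[0.0, 1.0], [0.0, 1.0], [0.0, 1.0], [4.0, 0.0]]

# Query = the edge patch in the current frame.
# Keys = every patch in the previous frame, regardless of distance.
weights = attention_weights(cur_frame[3], prev_frame)
best = max(range(len(weights)), key=weights.__getitem__)

print("previous-frame patch with the most attention:", best)
print("weights:", [round(w, 4) for w in weights])
```

In this toy case nearly all of the weight lands on the previous-frame patch at position 0, three positions away from where the edge is now. A 3-wide convolution centered on position 3 would never see it in a single layer.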

Last edited by sc94597 on 09 January 2025
