By using this site, you agree to our Privacy Policy and our Terms of Use. Close
Pemalite said:
sc94597 said:

TFLOPs are a function of clock-frequency and core-count. Both of those are knowns now.

There is more to it than that, you also need to include the number of instructions and the precision.

For example...
A GPU with 256 cores operating at 1,000Mhz with 1 instruction per clock with 32bit precision is 512Gflops.
A GPU with 256 cores operating at 1,000Mhz with 2 instructions per clock with 32bit precision is 1024Gflops.
A GPU with 256 cores operating at 1,000mhz with 2 instructions per clock but operating at 16bit precision double packed is 2048Gflops.

Same number of cores, same clockspeed... But there is a 4x difference.

Developers optimizing for mobile hardware tend to use 16bit precision whenever possible due to the inherent power saving and speed advantages.

sc94597 said:

The node isn't important anymore.

The node is important, it dictates the size, complexity and energy characteristics of the SoC.

In the conversation that was being had it is already known that we are talking about an Ampere chip (T239) and single-precision, by convention. The missing variables were frequencies and core-counts, with architecture and precision of interest being held constant. 

Node size is important for those things yes, but if we already know the frequency and core-counts (as we essentially now do), it is no longer important for calculating hypothetical single-precision floating point performance.