By using this site, you agree to our Privacy Policy and our Terms of Use. Close
fleischr said:
Ljink96 said:

The thing is, the document says 1024 Flops. So I can't really assume at this point. I kinda doubt switch would be .5TF, we've heard .75 from digital foundry. The document is confusing. It really doesn't specify if it's a Megaflop, Gigaflop or at FP16 or 32 but you are most likely right. 

It could be 1024Gflops and be 1TF FP16, or 1024Gflops FP32, aka 512TF. But do hardware manufacturers default to FP16 when noting specs?

I think the idea is to optimize software for FP16 so that you really do have 1TF worth of compute power.

If you look at spec sheets provided by nVidia for Tegra, they list both FP16 and FP32 tflops. We're used to seeing FP16 / 2 = FP32 flops, but that isn't always the case. The Tegra K1 only had ~350 gflops in either FP16 or FP32 modes because the Kepler architecture couldn't even leverage FP16. Starting only with Tegra X1 did they do anything to take advantage of FP16.

People have been reeeaaally quick to write off FP16, but seldom give you valid, scientific reasons why.

Not expert on subject, but few things to consider:

"To get an idea of what a difference in precision 16 bits can make, FP16 can represent 1024 values for each power of 2 between 2-14 and 215 (its exponent range). That’s 30,720 values. Contrast this to FP32, which can represent about 8 million values for each power of 2 between 2-126 and 2127. That’s about 2 billion values—a big difference."

https://devblogs.nvidia.com/parallelforall/mixed-precision-programming-cuda-8/ 

 

Current performance king in gaming GPUs, Titan X:

FP32: 10,157 GFLOPS

FP16: 159 GFLOPS

This is of course nVidia's way to prevent gaming cards being bought instead of Teslas for tasks that actually benefit from FP16...but just shows how little FP16 performance is important in games.

Honestly, not an expert on the subject, but last time I recall any talk about FP16 in gaming was some 15 or so years ago. Sure, mobiles have it, but degradation in quality seems to be quite noticeable.