By using this site, you agree to our Privacy Policy and our Terms of Use. Close
walsufnir said:
BlkPaladin said:

Actually not if you look in the post above you and my last post, it depends on the chip. You just cannot magically make a chip half percise to run faster. Depending on how they are made it may double the perfermance of FP16 instruction or it may run at the same speed. I use registers in my answer because that is how deep my knowledge goes about these things go, I'm sure there are other ways to speed of FP16 and FP32 instructions other ways. But a register for all intents and purposes of this explination can only run one instruction at a time. And depending on how the chip is made to run the FP32 instructions can influence if the chip experences a "speed boost" running thing at half-percision. For example some 32-bit instruction are run on two 16-bit registers. So if it is optimized to do so, if you put 16-bit instructions into this register you can put another instruction at the same time in the other register and thus "twice" the speed in this case. But there are 32-bit registers that will only do one instruction at a time no matter how small the instruction is. So just looking at terms of FLOPS and Full percision/Half percision doesn't tell the entire story.

FLOPS, like Hertz before it, is just a advertising go-to word that really has no real world inpact.

It's funny that back in the day people went crazy over higher bit consoles and now using less bits to store data is important :)

Not when it comes to OS.



If you demand respect or gratitude for your volunteer work, you're doing volunteering wrong.