By using this site, you agree to our Privacy Policy and our Terms of Use. Close
Pemalite said:
Bofferbrauer2 said:

You're really not. I got those 20-40% not from estimates, but from Benchmarks between the TX and the TX2, and that was the bracket of performance increase (in FPS) between the two with similar power consumption.

Gotta' disagree. Because... Bandwidth. 25.6GB/s isn't good for 1080P gaming... There would be a linear increase in performance with a doubling of bandwidth.

I did some testing on this back in the day and posted it on this very forum about how bandwidth scales with resolution. - Around 150-200GB/s is the ideal ballpark for 1080P... Otherwise you are fillrate limited.

So you are going from 25.6GB/s to 59.7GB/s... Which is a 133% increase in bandwidth... Maxwell to Pascal V4 Delta Colour compression gives Pascal a 20% best-case scenario bandwidth improvement on top of that.
https://www.anandtech.com/show/10325/the-nvidia-geforce-gtx-1080-and-1070-founders-edition-review/8

So that potentially makes the difference of 25.6GB/s to 71.64GB/s or 179% advantage to Tegra Pascal.

If you can point me to a point in history where a GPU of the same relative architecture but with a 150%-200% bandwidth advantage getting less than a 50% performance improvement at decent resolutions... I'll eat my hat.

Because I can point to many cases where nVidia/AMD have taken a GPU, but hamstrung bandwidth by including DDR4/DDR3 over GDDR5 and performance took a massive dive.
Like the Radeon 7750, Geforce 1030 etc'.

Here's the results I had in GoogleNet and AlexNet, two neural networks that make full use of the GPU. Both chips have the same settings:

BenchmarkJetson TX1TX2 Max-QTX2 Max-P
GoogLeNet (batch=2) 141 FPS137FPS176 FPS
GoogLeNet (batch=128) 202 FPS195 FPS252 FPS
AlexNet (batch=2) 163 FPS176 FPS222 FPS
AlexNet (batch=128) 507 FPS465 FPS603 FPS

As you can see, at no point do we get a 50% uplift at similar power profiles.

That being said, the TX2 can be fully unleashed, at which point the chip has an additional 10-20% uplift over Max-P - but then, it consumes also roughly 25% more power than the TX1. It's possible that you were referring to those results, as then we're getting around 50% uplift.

Of course, it's also dependent on the Benchmark. Jpeg compression for instance is almost 3 times as fast on the TX2. Here, the higher benchmark comes fully to play. But that simply ain't true for all the applications.