Deneidez said:
Ascended_Saiyan3 said: It's already been shown, through many tests, that 1 SPU can equal or surpass Intel's top single core processor in many tasks. The inter-core bandwidth has been proven at 197GB/s. The link from my previous post shows Intel Core i7 965XE's TOTAL bandwidth (cache, memory, and inter-core bandwidth) is 106GB/s. Plus, it only has 4 cores at 2 ops per clock. Also, I can't believe you talked about running code/apps not written for its architecture! That's a bit like saying a 2.4GHz PowerPC processor would demolish an Intel Core i7 965XE at Linux written specifically for PPCs. LOL. It's just that the code is written to take advantage of different architectures. |
I would like to see those tests. If you get me a PS3, I can also write a program for it that runs fine on the original Xbox and really doesn't run well on the PS3, even if you find an expert to optimize it in any way while still making it function like the original (of course).
Btw, parallelizing is very easy on a homogeneous platform.
|
http://www.mc.com/uploadedFiles/Cell-Perf-Simple.pdf
Against the fastest single cores (like Intel Xeon)...1 SPE can perform up to 7 times better than a single core 3.6GHz Intel Xeon.
http://www.research.ibm.com/cell/whitepapers/alias_cloth.pdf
Cloth Physics on 2.4GHz Cell vs single core 3.6GHz Intel Xeon
http://www.simbiosys.ca/science/white_papers/eHiTS_on_the_Cell.pdf
Performance comparison between Intel/AMD dual-core processors versus Cell...also partially explains that realism in games is not limited by the GPU anymore.
http://www.cs.berkeley.edu/~samw/research/papers/ipdps08.pdf
Cell beats Intel Quad-Core (Clovertown) at DP GFLOPS, which it's terrible at compared to its SP (just divide the chart number in half for Cell, since a Cell blade was used).
http://www.ibm.com/developerworks/power/library/pa-cellperf/
IBM document on Cell and per SPE performance
http://www.tomshardware.com/news/ibm-lead-architect-cell-cpu-ps3-gaming,1336.html
SPEs capable of "running single core scalar programs in their entirety."
http://lzhan.wikispaces.com/Cell+Programming?f=print
Individual programmer experiments on PS3...shows per SPE output at bottom.
http://arxiv.org/PS_cache/physics/pdf/0611/0611201v2.pdf
Cell almost 20x (over 1 order of magnitude) faster than Opteron processor at molecular dynamics
http://www.sintef.no/upload/IKT/9011/SimOslo/eVITA/2008/hagen2.pdf
Cell versus Intel Core2 Duo in power consumption and theoretical performance.
http://www.power.org/resources/devcorner/cellcorner/hpcspe.pdf
Cell versus Intel Quad Core (Clovertown) theoretical GFLOPS...and real world Multigrid Finite Element solver running at an unprecedented 52GFLOPS sustained performance.
http://www.power.org/devcon/07/Session_Downloads/PADC07_Bergmann_Sourcery_VSIPL.pdf
Cell almost 14x the sustained performance of single core 3.6GHz Intel Xeon at VSIPL++ fast convolution.
http://www.cis.udel.edu/~cavazos/cisc879/papers/cellFMwhitepaper.pdf
Cell (1GB RAM) 11x faster than Intel Quad-Core (Clovertown) w/16GB RAM at SP for financial market applications.
http://74.125.45.104/search?q=cache:lLEILf-tpPAJ:gametomorrow.com/blog/index.php/2005/07/26/beyond-polygons/+SPE+GFLOPS&hl=en&ct=clnk&cd=30&gl=us&client=opera
Cell ray-casts complex 720p scenes at greater than 30fps.
http://64.233.169.132/search?q=cache:XcMos60ZImsJ:gametomorrow.com/blog/index.php/2005/11/30/+gametomorrow+gpus-vs-cell&hl=en&ct=clnk&cd=6&gl=us&client=firefox-a
Cell beats Nvidia 7800 GT OC at ray-tracing quaternion Julia fractals by a factor of over 5x.
http://64.233.169.132/search?q=cache:F7MZrviYvQwJ:gametomorrow.com/blog/index.php/%3Fp%3D187+gametomorrow+Cell+vs+g80&hl=en&ct=clnk&cd=1&gl=us&client=firefox-a
Cell over 4x faster at ray-tracing Stanford Bunny than Nvidia 8800GTX.
The last 3 links were to GameTomorrow. That site is down, but a search will reveal that information on other sites.
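For anyone who wants to sanity-check the arithmetic behind the numbers quoted in this thread, here is a rough Python sketch. It only uses figures stated above (197GB/s vs 106GB/s, a 20x speedup, and the "divide by two for a Cell blade" rule); the function names are made up for illustration, and the assumption that a blade holds two Cell chips comes from the halving rule mentioned above, not from any of the linked papers.

```python
import math

def bandwidth_ratio(a_gbs, b_gbs):
    """How many times larger bandwidth a is than bandwidth b."""
    return a_gbs / b_gbs

def orders_of_magnitude(speedup):
    """Express a speedup factor as a power of ten (10x -> 1.0, 100x -> 2.0)."""
    return math.log10(speedup)

def per_chip(blade_value, chips_per_blade=2):
    """Scale a per-blade chart value down to one chip.
    chips_per_blade=2 is an assumption taken from the 'divide in half' rule."""
    return blade_value / chips_per_blade

# Figures quoted in the thread.
cell_inter_core_gbs = 197.0
i7_965xe_total_gbs = 106.0

print(round(bandwidth_ratio(cell_inter_core_gbs, i7_965xe_total_gbs), 2))
print(round(orders_of_magnitude(20), 2))
# Hypothetical chart reading of 30 GFLOPS per blade, just to show the halving.
print(per_chip(30.0))
```

This does not say anything about real-world performance; it just keeps the ratios and unit conversions straight when comparing the quoted figures.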