| Entroper said: , you have to think carefully about how the threads will use data |
Precisely. Each SPE has very high computationnal power.The fact that each SPE can put / store datas very quickly to the local store of another, and that with 6 of them you can organize them (ring , linear or hypercube, etc.) make me say this processor was designed towards distributed parallel programming.
The possible rewards are potentially tremendous in what you can perform as calculation.
Problem is reaching global efficiency has proven theorically and practically very difficult on such systems. Have you looked how the algorithms for "sort" or "matrix multiplication" look like in such environnements ?
You said this is "a strength of the design". I prefer to say this is a very ambitious one.
Edit : typos.







