New Server Chip Niagara 307
* * Beatles-Beatles writes "Sun recently announced their latest release in server technology. The UltraSparc T1 processor, code-named Niagara, has eight computing engines on a single chip, with each core capable of handling up to four tasks at once." With this new processor Sun hopes to get a leg up on the competition. The Niagra chip is being billed as an "eco-friendly" chip because of its low power requirements. From the article: " [...] removing the world's Web servers and replacing them with half the number of UltraSparc T1-based systems would have the same effect on carbon dioxide emissions as planting 1 million trees."
Better link (Score:5, Informative)
Comparisons to the cell? (Score:5, Informative)
Re:Apple need this (Score:5, Informative)
Niagra = 70 watts
G4 = 19 watts
Re:What about I/O? (Score:5, Informative)
The lowdown (Score:5, Informative)
Since the story is devoid of content:
- up to 8 cores, 4 threads per core
- integrated RSA
- 3MB L2 cache
- 90nm process
- 1.2 GHz
Re:Comparisons to the cell? (Score:5, Informative)
Re:What about I/O? (Score:4, Informative)
Re:nasty stuff (Score:1, Informative)
By the why, the green discoloration on potatoes that grow exposed to the sun is chlorophyll and solanine with small amounts of arsenic. You'd get a tummy ache from eating a green potato as opposed to the amounts used in chip-fab.
Re:Massively multi-core x86s (Score:5, Informative)
Similar the new ARM cores Cortex [arm.com] it takes roughly the same power at 1Ghz which gives it apparently 2000MIPS. The area is about the same as PPC 440. So in theory you could hook 4-8 of these up as well and get a killer chip too..
Point is Suns quotes of being "2 possibly 3 generations ahead" is totally bullshit. They're at most one generation ahead. It takes one multi-core ARM or PPC to totally destroy this.
Tom
Arbor Day Foundation (Score:3, Informative)
Re:nasty stuff (Score:5, Informative)
Are you implying that you can have useful throughput under no load? How do you measure this idle throughput advantage?
The Intel/AMD architectures are historically single-threaded desktop-centric where the most important thing usually is to run one thing really fast. Sun, however, was always in the HPC/workstation game where overall throughput matters most, latencies and single-thread performance be damned. These two groups were playing pretty different games up to recently.
But now, Intel/AMD have hit a GHz and complexity brick wall. They are forced to promote multi-threading multi-core at the desktop-level and optimize their future desktop chip designs for multi-threaded application throughput rather than single-threaded performance. Imagine what would happen if AMD and Intel could afford to quit competing on single-threaded performance overnight: goodbye complex deep out-of-order execution, goodbye branch-prediction and speculative execution - those transistors would be much better spent on implementing quad-threading cores to keep every pipeline filled with useful instructions that will retire cleanly on every clock.
Sacrificing single-thread performance for simultaneous multi-threaded throughput in the above-described way has been the name of Sun's game for the last few years.
Obsession with single-threaded performance is what costs current x86 CPUs the most power. Of course, in the P4/HT case, there is the added power and transistor costs of trying to be a jack-of-all-trades who predictably turned out as a master-of-none. (The P4's uOP replay engine is a neat idea... but re-executing the same stupid uOPs until they meet retirement conditions is woefully wasteful, whoever designed and bothered to patent this should be fired.)
Re:What about I/O? (Score:5, Informative)
Re:Or even easier.... (Score:5, Informative)
Re:Massively multi-core x86s (Score:3, Informative)
Compare a 2MB L2 cache on a P4 to a box with 1MB of L2 or 512K
What is also important is associativity. If you have a low-assoc cache, meaning a given address has few places in the cache it could reside you end up wasting more space. That's why [iirc] the AMD processors have high associvity L2 caches. They make good use of the 512K available.
At my previous job we built Gentoo distros on 128 and 256K semprons and the time to build wasn't really that different even though we were building 100s of packages.
So you could have a relatively high performance web server or file store [or whatever] without the 2MB of cache stuck on the back of the thing.
Tom