A Three-Way AMD Opteron Server 137
Abdul tips a thin little review up at The Inquirer of the Themis Slice. "The Slice is a three socket Opteron machine with two PCIe slots and two Infiniband 4x ports... Why would you want three sockets rather than four? Easy, latency. Any CPU in a 3S system is one hop away from any other CPU. In a 4S system, you can be two hops away. This adds latency, and more importantly, you take a big hit on cache coherency latency. This kills performance."
Weird (Score:1)
Re:Weird (Score:5, Informative)
Re: (Score:2)
Re: (Score:1)
Re: (Score:2)
Re: (Score:2)
Re:Weird (Score:5, Informative)
Re: (Score:2, Interesting)
This can be solved directly by creating chips with multi
Re: (Score:3, Informative)
Re: (Score:1)
Re: (Score:1)
Re: (Score:2)
Transistor planes, no.
A typical CPU by AMD or Intel is about 9-12 layers, only on of which (the bottom doped Si layer) has transistors. Everything else is poly or metal.
-nB
Re: (Score:2)
In Casinos, yes.
Wrong? (Score:1)
Re: (Score:1)
X---X
|\
| X |
\ |
X
Re: (Score:2)
on a side note t
Re: (Score:1)
Re: (Score:1)
Re: (Score:1)
Re: (Score:1)
Re: (Score:3, Interesting)
Problem 1)
Draw four circles on a piece of paper.
Now draw a line from every circle to every other circle without crossing any lines.
Problem 2)
Draw four circles on a piece of paper. Draw two "pins" on each.
Now draw a minimal path between any two circles such that you can only start and stop at a pin, and only one connection can go to a single pin.
You have the right idea for problem 1, that for low-N, you can just route connections through different layers
Re: (Score:2)
Re:Weird (Score:5, Insightful)
/ | \
/
x---------x
Re: (Score:2)
Re: (Score:2)
Very Carefully. (Score:2)
nothing new (Score:4, Informative)
IBM System x3755 (Score:5, Informative)
The IBM System x3755 [ibm.com] has offered this feature since it came out as well. Instead of the fourth processor card you install a pass through card and it turns it into a three way. We've done a few benchmarks [lionbridge.com] (warning pdf) with the Pass Through card and what it could do between 3CPU and 4CPU operations.
pretty cool ability for a few things.
Re:IBM System x3755 (Score:5, Funny)
You don't say... : p
Re: (Score:3, Interesting)
Visopsys, ReactOS, OpenSolaris, plan9, Minix, QNX, MMURTL, OpenVMS, Haiku, and some others could serve for utility and novelty in varying degrees, but I already have plenty of software for OS/2.
Yes, I'm an avid system collector. If you have hardware or sof
The article had me at 'three-way'. (Score:2)
Re: (Score:2)
Re: (Score:2)
I ran a BBS on it in the dying days of the BBS era. There are times I want to bring it back (ala, dialup to initiate a circuit connection, then DSL to DSLAM connection, but the TelCos won't allow that because they are asshats).
-nB
Time to fire up the old BBS server and serial console port into it for fun
Re: (Score:2)
Re: (Score:2)
Fairly recently.
I can select from proxy servers across the world as exit points and my local site has OC48.
Not telling where though.
-nB
Re: (Score:2)
Re: (Score:2)
They actually did address this in their benchmark document:
Configuration Exception
Due to backorder shipping delays from HP on the 144GB SAS 15K RPM hard drives the 72GB SAS drives
were deemed an acceptable substitute. The SPECjbb2005 workload tool does nothing to exercise the hard
drive and writes no data to it. As a result, this configuration exception was determined to be immaterial to the
performance results addressed in this st
Re: (Score:2)
Believe it or not, when my team is tasked with coming up with studies like this, we try to be as fair as possible and don't try to stack the deck. We know the people evaluating purchasing our stuff aren't that stu
Re: (Score:2)
Re: (Score:2)
What is this article about? (Score:2, Funny)
Too bad it was two other guys (Score:2)
Twice.
CoProcessors? (Score:5, Interesting)
There is some interesting potential in that realm.. Crypto accelerators for VPN, SAN, or other devices. Multimedia encode/decode accelerators (encode 1080P H.264 in real time?). Inevitable video game acceleration devices (physics co-processor, accelerated NIC chip, 3D GPU offload processor?).
Those would be even more interesting in home-user oriented Athlon64 boards. Multi-socket opteron boards are out of my price range.
Re: (Score:3, Insightful)
Re: (Score:2)
FPGAs are very popular so that you can create custom co-processors.
What would you do... (Score:5, Funny)
> Why would you want three sockets rather than four? Easy, latency. Any CPU in a 3S system is one hop away from any other CPU. In a 4S system, you can be two hops away. This adds latency, and more importantly, you take a big hit on cache coherency latency. This kills performance."
Lawrence: Three chips at the same time, man.
Peter: That's it? If you had a million dollars, you'd use three sockets at the same time?
Lawrence: Damn straight. I always wanted to do that, man. And I think if I worked at AMD I could hook that up, too; 'cause I hate motherboard layouts with latency.
Peter: Well, not all layouts.
Lawrence: Well, the type of chips that'd triple up on a board like that would.
Peter: Good point.
Lawrence: Well, what about you now? what would you do?
Peter: Besides three chips at the same time?
Lawrence: Well, yeah.
Peter: Idle.
Lawrence: Idle, huh? Peter: I would relax... I would sit on my ass all day... I would idle.
Lawrence: Well, you don't need a million dollars to idle, man. Take a look at that fourth chip: it's two hops away, don't do shit.
Re: (Score:2)
And they're not good thoughts.
Re: (Score:2)
Re: (Score:2)
Mac OS X on this machine... (Score:2, Funny)
Re: (Score:3, Funny)
Only if you use it to fry pancakes!
Where's the specs? (Score:2, Interesting)
Re:Where's the specs? (Score:5, Funny)
Re: (Score:1)
Threesome (Score:3, Funny)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re:Threesome (Score:5, Funny)
Re: (Score:2)
Same latency with 4 processors (Score:4, Interesting)
Re: (Score:2)
Ignoring the physical trace-routing issues, you can have N fully connected nodes as long as every one has a N-1 connections (ie, a dedicated link to every other node), plus you need at least one bus-drop somewhere.
In practice, all those connections need to physically connect somewhere, making more than a handful of fully-connected processors all but impossible.
Re: (Score:2)
Will have, yes, once both chip and socket support it. The current socket only supports 3 HT links.
Re:Same latency with 4 processors (Score:5, Informative)
Check it out here. [realworldtech.com]
Re: (Score:2)
6-way systems (Score:2)
ROOTER (Score:2)
Themis Computer has developed a breakthrough in distributed computing for mission-critical systems. By functionally disaggregating commercial computing resources and housing them in a standardized footprint, purpose-built enclosure, the Themis Slice Architecture provides resilience with superior thermal and kinetic management. This open and modular design allows for spiral technology refresh, extending computing infrastructure investments for complete lifecycle management.
I admit this article is probably just over my head technically, but did anyone else read this and think of ROOTER [mit.edu]? I mean, what is "kinetic management" in a computer? Maybe they spin the CPUs through the air instead of blowing air over them. That might explain "spiral refresh technology" as well.
Workarounds (Score:1)
hard to justify (Score:5, Funny)
Re: (Score:3, Funny)
Re: (Score:3, Insightful)
(sorry about this)
Re: (Score:2)
Re: (Score:2)
Multi core (Score:3, Interesting)
HT links (Score:1)
think three-dimensional (Score:1, Insightful)
How about a tetrahedron for four CPUs?
Re: (Score:1, Informative)
Not as good as it sounds (Score:2)
However, it doesn't work that well for large apps that get parallelized across multiple CPUs. It turns out that most code, and most compilers, are good at splitting tasks in two - or in powers of two - so having three CPUs is no faster than having two.
Re: (Score:1)
The third processor can run supporting thread(s) that control the "worker" threads. Let alone support processes such as network, I/O, or anything else in the OS - leaving the two CPUS (and their caches) wide(r) open for application crunching.
Re: (Score:3, Informative)
Hmmm, now that I think about it, a three way box might be really interesting for some HPC loads as well. The low latency is a really big issue for some codes, and the three way could be more scalable (with some hand coding
Re: (Score:2)
So with a 3-way box you'd just use something like -j3 or -j4 to distribute load. unless they're dual cores than -j6 or -j7 would do.
Tom
Re: (Score:2)
And even disregarding compilers, not all tasks warrant powers of twos. I don't know where he got that from. For instance, when you run Apache you can control the number of threads you want, doesn't have to be a power of two. The kernel will distribute them across available cores, just as easily with
Is this new? (Score:1)
Re: (Score:2)
It's interesting that (Score:2)
I thought IB was dead - replaced by 10gigE?
Re: (Score:2)
Re: (Score:2)
Doesn't IB have lower latency than that?
I've thought of this (Score:2)
Re: (Score:2)
Could be marketed to China (Score:2)
reminds me of engines (Score:2)
Tell it to a BMW or Jaguar driver (Score:3, Informative)
The smoothest piston automotive engines are in-line 6 cylinder engines or V-12 engines, which provide a power pulse with every 30 degrees of crankshaft rotation
Re: (Score:2)
so 3 is better than 4?
is this AMDs way of saying "oh look we cant make a proper quad core system like intel so we just make 3 the magic number! and everyone will buy our marketing technobable crap"
Anything is possible. The real question is, what is AMD capable of selling. Sure they can add 1 more hypertransport controllers as some of the others posters have mentioned, but what does that to the cost of the chip? Sometimes, you have to slower to go faster. Or, in this case, you need fewer to do more.
Re: (Score:1)
Re: (Score:2)
4 cores per chip (providing 3 unused HTs), by 4 chips.
Re: (Score:2)
Not with Intels design, no thanks. On Intel all cores share the same bus, making it more and more congested with additional CPUs. The usefull limit is somewhere between 4 and 8 cores with that design.
Re: (Score:1)
Re: (Score:1, Informative)