


Xeon vs. Opteron Performance Benchmarks 362
QuickSand writes "Anand got his hands on some of Intel and AMD's enterprise processors including 4MB L3 Xeons, and put them to the test. Results were a little varied as 4-way Opteron systems seemed to fare the best, although dual Xeon configurations almost always beat dual Opterons. The exact benchmarks are here."
IA-32e vs IA-32 (Score:5, Interesting)
Re:IA-32e vs IA-32 (Score:5, Informative)
Re:IA-32e vs IA-32 (Score:4, Insightful)
This makes me want to throw up. The last motherboard purchase I made, it was a chore finding one with the _least_ amount of features. Need an AMR riser slot? Fuck no, I'd rather have the PCI slot back. Need integrated sound? No, integrated sound makes my already bad speakers sound worse. It must've been tough figuring out how to make a decade's worth of improvements in technology amount to nothing. I have an ISA soundblaster from 10 years ago that sounds better than the onboard sound on my last motherboard. Need integrated video? I won't begrudge you this. Some people build clusters with their motherboards, and a video card is needed to boot, but if I have a choice I won't buy a mobo with integrated video.
In short, I want a motherboard with slots for RAM, an AGP slot, a socket/slot/hole for a CPU, PS/2 hookups, serial and USB connectors, and the rest of the board filled up with PCI (or PCI express) slots. That's the ticket.
Re:IA-32e vs IA-32 (Score:3, Interesting)
Re:IA-32e vs IA-32 (Score:5, Funny)
This makes me want to throw up. The last motherboard purchase I made, it was a chore finding one with the _least_ amount of features. Need an AMR riser slot? Fuck no, I'd rather have the PCI slot back.
You do realize it costs much less to put a AMR or CNR slot on a motherboard then a PCI slot right?
Need integrated sound? No, integrated sound makes my already bad speakers sound worse. It must've been tough figuring out how to make a decade's worth of improvements in technology amount to nothing. I have an ISA soundblaster from 10 years ago that sounds better than the onboard sound on my last motherboard.
Now its obvious you're trolling. Say what you want about AC97-based onboard sound (which nearly everythign is), but its good enough. Your speakers are much more likely the problem. The long and short is that all PC sounds cards equally suck until you get to professional grade gear.
Need integrated video? I won't begrudge you this. Some people build clusters with their motherboards, and a video card is needed to boot, but if I have a choice I won't buy a mobo with integrated video.
What does having this cost you? Its not like you have to use it, or that boards with onboard video cost signifcantly more.
Re:IA-32e vs IA-32 (Score:5, Informative)
You must be joking. Most of the integrated sound I've had the joy to listen to produced noticeable background-noise. The most obvious one was the Eden-M board. Most mb-producers don't give much about seperating the analogue part from the digital, so accompanied with an intergrated graphics card, you can practically hear how a window is restored. It is usually not the quality of the on-board sound, which sells it, but purely the capability of producing some sound.
Re:Why are there so many Opteron/Athlon64 sockets? (Score:5, Insightful)
It's pretty much the same story with SlotA/SocketA. They had an initial design that was quickly replaced. The second socket then stuck it out for the duration.
Intel did pretty much the same thing with their P4, initially releasing it on socket 423 and then quickly moving to socket 478 which has lasted for several years now (though it too will soon be replaced).
Markets change, technology changes, and sometimes sockets need to change with them. Remeber that the specification for Socket 754 and Socket 940 for current Athlon64 chips was set in stone about 3 years ago (before the first beta chips tapped out), and a lot has changed since then. AMD has gone to great lengths to minimize socket changes, but there's only so much that they can do.
Re:IA-32e vs IA-32 (Score:3, Informative)
As for a real IA-32e chip with the instructions enabled, Intel has stated that they arent coming out for while, and since Intel is moving the P4 to the new 775 chipset in a few months, I would bet that they would also be released under this new chipset. Heck, they might not even release their IA-32e chips within
Re:IA-32e vs IA-32 (Score:4, Insightful)
They're disabled in all socket 478 chips. The new Pentium 4e chips (Prescott core) supposedly have the extensions, but they remain disabled. Technically, it may be possible to gain access to those instructions through some sort of BIOS hack, but you would also need to use an Operating system that can detect, support, and make use of those new instructions. Also, you risk using unfinished or untested parts of the CPU if you do manage to gain access and use the extensions. There would be no benefit other than simple tinkering.
"If they really are just "extensions" then I don't see why anything special would need to be on the motherboard correct?"
You still need a CPU that supports the instructions, and which has them enabled. Technically, if Intel released Prescotts in S478 form with IA-32e enabled, it should work fine with an existing motherboard which would otherwise support the Prescott chip you're using. The probablility of Intel taking the time and effort to do this less than a quarter away from a whole new socket is virtually nil.
"The cpu should switch into 64bit mode whenever the OS tells it to right?"
That's not entirely accurate. Technically, what happens under AMD64 (the basis for IA-32e), is that specific instructions can be sent to the CPU to have it run code in what's called "Compatibility mode", which essentially allows it to behave as though it were a 32-bit CPU. The difference is that you're not 'switching' to 64-bit mode. You're either in 64-bit mode with the option for compatibility mode when needed (meaning you need a 64-bit capable OS), or you're in 32-bit and you're stuck in 32-bit.
If you're looking for 64-bitness, you may simply want to get an Athlon64. If you're waiting for 64-bitness on the Intel side of things, you'll be waiting until some time towards the end of this year. Good luck.
Re:IA-32e vs IA-32 (Score:3, Interesting)
The 32e means its an extended version of the machine/assebly code modified from IA32 to work on 64bit processors and still have backwards compatibility.
Pffft. These Intel vs. AMD flamewars are pointless (Score:4, Funny)
Re:Pffft. These Intel vs. AMD flamewars are pointl (Score:5, Informative)
What a bunch of crap! That's almost as big a lie as Intel makes of AMD cpu's. I didn't even bother reading the rest when you are obviously delusional.
BTW A64 chips can be had for the same prices as their A32 counterparts in the same speed rating, Motherbaords are almost equally expensive whether A32 or A64, & outperform them by up to 30%.
Ok well now I have read the rest of your comments & I have to reply to those as well:
"they're going to go to Intel because it has more upgrade possibilities"
What possibilities? They force upgrades way more often than AMD, & are known for being the best money sink for performance users.
"is cheaper than the Athlon 64 for the same level of computing power"
Uh not really, A64 costs the same or less than Intel for comparable performance as long as you don't follow the rule of 'Mhz/Ghz equals performance'. Take a look at pricewatch or Newegg and see for yourself...
"currently performs better"
Hogwash.
"So this is more of a plea for AMD to extend the Athlon "32" line a bit further. Please AMD, don't prematurely kill off 32-bit Athlon chip development!"
Uh they are building them at least through the end of this year, & probably next to some degree. The thing is their is no real future for A32, performance has been decreasing performance-wise in comparison & wasn't keeping up. AMD realized their best bet was to focus on A64 with it's integrated memory controller & higher IPC than A32 has. A64 is a real contender where as A32 wasn't keeping up & they knew it.
Re:Pffft. These Intel vs. AMD flamewars are pointl (Score:3, Insightful)
Since when tomshardware and "relevant" fit in same sentence? Besides, future upgrades or no, chips with gigantic cache will _ALWAYS_ be very, very expensive.
Re:Pffft. These Intel vs. AMD flamewars are pointl (Score:3, Insightful)
Re:Pffft. These Intel vs. AMD flamewars are pointl (Score:3, Insightful)
Out with the old, in with the new; sounds good to me.
"People who've made a choice in the past year to go with an AMD-compatible FSB400 mainboard are getting the shaft"
Those who made the choice to purchase a CPU which is at the upper end of an architecture's limits have only themselves to blame. If AMD spent another 20 years trying to make K7 faster, you'd be posting
Re:Pffft. These Intel vs. AMD flamewars are pointl (Score:5, Informative)
I recently upgraded my principle system and at the time, I was determined that the P4 2.8HT CPU was going to be the chip the machine was built around. It clearly trounced the Athlon XP 2800 in all tests for about $100 more. I was about to order when the AMD Atlhon 64 3000+ was released. It outperformed the Intel chip in most areas, was 64 bit, backwards compatible, and only $20 more. In my opinion, and Anandtech and Tom's Hardware agree, the AMD Athlon 64 3000+ is the best bang for the buck CPU out there.
Pricewatch's Lowest Prices are...
AMD Athlon 64 3000+ - $205
AMD Athlon 64 3200+ - $251
AMD Athlon 64 3400+ - $401
Intel Pentium 4 3.0 (800) - $209
Intel Pentium 4 3.2 (800) - $273
Intel Pentium 4 3.4 (800) - $420
AMD Athlon XP 3000+ - $158
AMD Athlon XP 3200+ - $194
Looking at these benchmarks here [anandtech.com] and on the following page here [anandtech.com]. You can see, with the exception of pure media encoding, the Athlon 64's perform better. The margin is slim in some areas, and quite large in others. All this and the chips are a little cheaper.
In my opinion, the XP line was dead. It had gone as far as it was going to go. I think AMD knew that the 3000 and 3200 were more like the 2850 and 2900 and they weren't going to get any more mileage out of the design. I definitely think AMD made the right call putting all their eggs in the Opteron/Athlon 64 basket.
Re:Pffft. These Intel vs. AMD flamewars are pointl (Score:3, Interesting)
The Mobile Athlon64 3000+-based eMachines M6807 [circuitcity.com] latpop is available at Circuit City and Best Buy (M6805).
The Athlon64 3200+-based Compaq s6900NX [circuitcity.com] is also available at Circuit City.
The Athlon64 3200+-based eMachines T6000 [bestbuy.com] is available a
xeons/opterons market share (Score:4, Insightful)
many people did not upgrade to Intel's Itanium, but rather were upgrading to their high end dualie xeon systems -- they run very reliably, and very fast. a few instances where we've put in dual 2.x ghz xeons for web/mail servers...and only a slashdotting could bring them down...(well, an exaggeration...but you get the point).
Re:xeons/opterons market share (Score:5, Interesting)
Folks were avoiding the Itanium because it was a disaster; slow and expensive. We've been looking at 64 bit computing for a while, because of the seamless > 4GB RAM capabilities. Intel's PAE extensions are OK, but they really didn't solve any of the problems we were having.
The net result was we went to 64 bit PPC architecture 3 years ago on those critical systems, And everything has been fine. AIX works great, and IBM's embrace of GNU/Linux means an easy learning curve for us Linux users.
Re:xeons/opterons market share (Score:5, Interesting)
I wish we could get by with cheap Xeons, but they just don't cut the mustard for our applications.
Re:xeons/opterons market share (Score:5, Insightful)
"I wish we could get by with cheap Xeons, but they just don't cut the mustard for our applications."
This is exactly why Opteron DOES compete with Itanium - if only indirectly. Opteron will never hit the big-tin niche, simply because it was never designed, nor intended to do so. What Opteron does is bring 64-bitness, and all the benefits therein to the mid-range crowd. This forces Intel to choose between giving up on Itanium as anything other than a big-tin chip, or losing half its mid-range customers to AMD. Losing such a lucrative market would be far worse for Intel in the long run than losing the 10 years of R&D sunk into Itanium, so they've chosen to bring the Xeon line to the 64-bit world. With the new Potomac core (Q1/H1 '05), the XeonMP will be the CPU of choice for Intelphile mid-range customers in need of Itanium's benefits, but conscious of cost. The result will be that Itanium's legs will finally be completely taken out from under it, and it will be resigned to little more than a handfull of extremely high-end big-tin servers each year.
Does this mean Intel should continue to develop Itanium, even if it becomes clear it can no longer sustain its own R&D? I don't know - I think that's a question for Intel's board to answer. What I do know is that AMD had it right in '98/'99 when they decided to help transition people to 64-bit CPUs without losing x86's incredible compatibility. The bottom line is that someone like you would have gladly gone with either Opterons or Xeons had the choice been given to you. Unfortunately for Intel's margins, you and those in your position now have that choice.
Re:xeons/opterons market share (Score:3, Interesting)
Heh, I guess the Cray Red Storm [amd.com] system kind of shoots down that theory... ;-)
Actually the design of Opteron beats Itanium for HPC, and the relative number of Opteron vs. Itanium HPC design wins bears that out nicely.
Re: (Score:3, Insightful)
Re:xeons/opterons market share (Score:5, Interesting)
Well, I think that people look at the x86 architecture, and they can see the many, many horrible hacks that have been used to sustain it. That much is pretty obvious if you spend even 10 minutes looking over things. You sit there scratching your head and going, "What the hell? Why'd they do that?", and then realize it's because something, somewhere, was broken until they did it. The reason people don't like to start looking into replacement architectures is exactly as you expressed; the must-have software. You can try running that software under emulation, but the best architecture in the world is always going to take a performance nosedive when running code under emulation. I can look at what IBM has been doing, or even at what Intel was doing with EPIC back in the day, and I can say, "wow, that's pretty cool". But what I can't do is put down the x86, toss all the old software, and hope that all the new software, written for a completely new architecture, is going to work in some sort of reliable fashion. What you really get with x86 is 20 years of experience, and thus, a measure of predictability. In essence, you're paying for predictable problems (much better than unpredictable ones) with old, poor architecture.
"The AMD solution doesn't do away with x86"
AMD64 actually does get rid of a lot of garbage in x86 that is no longer in use. Take a look at the presentation (link at Ace's [aceshardware.com]) by the guy who designed AMD64. He was actually pretty thrilled (well, as thrilled as this guy gets) about being able to dump a lot of the cruft x86 has accumulated. Unfortunately, many things had to remain intact, for the obvious reason of compatibility. I have to warn you though, the guy from the AMD presentation is a real ball of fire. (Although, the ex-Intel guy from the other presenation was pretty interesting and funny)
Re:xeons/opterons market share (Score:4, Informative)
Re:xeons/opterons market share (Score:3, Insightful)
The problem with itanium is not that they aren't a good technology, but rather that intel is trying to shove them into the high-end of the market, which is a difficult place to compete. sparc, power, pa-risc, alpha have all been around for years, have established customer bases, and lots of businesses
A point that isn't made in the artical (Score:4, Interesting)
Re:A point that isn't made in the artical (Score:2, Insightful)
Re:A point that isn't made in the artical (Score:5, Interesting)
OK, lets go over this again. There is nothing really special about Xeons vs a P4 except the P4 is crippled so that it cannot do SMP, and there may be more cache options on a Xeon. Performancewise they are the same @ the same clock speed. FWIW, I've been dissapointed with XP regardless of the hardware
Now, back to this benchark thingy. 1st, I would appreciate in the article writeup that it said that it was only doing a simple read/write database benchmark, and that was it, but we don't come to slashdot for the stories, right? Also, in my opinion there was no significant difference between the two platforms regarding their speed on this benchmark. The difference between 1st and 2nd place, regardless of who won that test, was between 5 and 12%. I don't start to get interested until there is at least 20% difference, and even then that would only determine my choice for an initial purchase, I would never upgrade a system unless there was at least 100% speedup, preferably 200 -> 400% is worthy of doing an upgrade.
It would have been interesting to see results like this for more platforms, because I have not seen any significant numbers from the Opteron yet. For example, the memory bandwidth of the Opteron is 1/2 that of the Itanium2's.
Re:A point that isn't made in the artical (Score:5, Insightful)
How about cost? The Xeons cost twice as much as the Opterons, and the Opterons give equivalent or better performance! Although you are correct that the performance difference may not be staggering (and between top of the line chips, who would expect it to be?), the price/performance ratio certainly is.
Re:A point that isn't made in the artical (Score:3, Informative)
In terms of bang for your buck Opterons rock.
Re:A point that isn't made in the artical (Score:3, Informative)
That "Xeons cost twice as much" statement is correct for the Xeon MP 3Ghz/4MB cache vs. Opteron 848, which was the primary focus of this article.
Re:A point that isn't made in the artical (Score:3, Informative)
No Xeon MP as fast as 3.0 GHz or with the 4MB cache is available via pricewatch.
The fastest Xeon MP processor available they list is 2.8 GHz with 2MB cache, for $3788 [pricewatch.com]
A single Opteron 848 processor costs $1469 [pricewatch.com]
Oh, lookie, the AMD processor is less than half the price of a Xeon product inferior to the one that article
Re:A point that isn't made in the artical (Score:4, Informative)
One 3Ghz XEON MP cost about 3500$
Re:A point that isn't made in the artical (Score:5, Interesting)
Good post. However the one comment that I didn't agree with was the above.
My guess is that you aren't involved with any applications where compute time = money. When you are running simulations (say large CFD runs for example) that can takes days or weeks per run, a 50% improvement in speed is a major breakthrough if you get it by not touching code, ie hardware upgrades. Optimzing code is great and all, but it can introduce bugs and other expected behavior. Plus, us development people are pricey. Hardware is relatively cheap. Add in the fact that you generally get charged for CPU time on these big machines (or clusters of little ones), then *any* speed that you get is a major breakthrough, ie you can run more simulations in the same time for the same money.
In your environment, it's probably okay for you to only upgrade every three years when you get a doubling or more of performance, but there are enviroments where any speed increase is sought after highly, even if it's 20%. I suspect this is true of the special effects industry too, guys like Pixar, ILM, etc. If they can render more frames in the same time or even render the frames in the same time at a higher level of detail, that's worth paying for. Perhaps someone who knows more would care to enlighten us, I'm curious if I'm interpreting that correctly.
Re:A point that isn't made in the artical (Score:3, Interesting)
Your right. I work with scientists that run programs up to 5 days over 10 to 20 processors. We get our money upfront, but everyone wants their answers quickly.
When you are running simulations (say large CFD runs for example) that can takes days or weeks per run, a 50% improvement in speed is a major breakthrough if you get it by not touching code, ie hardware upgrades.
So your saying that its more cost effective for
Re:A point that isn't made in the artical (Score:3, Interesting)
It depends, I'm not trying to make a blanket statement that this is always the case, but yes, I can certainly envision scenarios in that the benefit to customers is worth the price of the upgrade when you get less
Re:A point that isn't made in the artical (Score:5, Insightful)
What exactly would be our grounds for dissapointment?
That your company spent $3750x2x55= $412500 on processors alone (assuming you have the 1mb MP model Xeons), when you could have the same performance for a quarter of that price.
Re:A point that isn't made in the artical (Score:4, Informative)
$332x2x55=$36,520
Those are 2.8ghz Xeons, btw, so more expensive. They're also the price on their own, bought singularly. $37k for 55 developers CPUs is pretty good, and could undoubtedly be bought down to nearer $20k when bought bulk.
Seriously, price arguments about Intel don't work that well any more. They used to cost shedloads, now it's only their top-of-line processors (such as the extreme edition, and new models) that carry the weight (after all, if you need the fastest processor available, you can afford to pay for it :-P).
Re:A point that isn't made in the artical (Score:3, Informative)
As a general rule: Get rid of the consultant
Quick Link to Test Results (Score:4, Informative)
http://www.anandtech.com/IT/showdoc.html?i=1982&p
summer cottage? (Score:3, Funny)
Ah, so for all our college-student friends, that would be "the parents' house"?
Cache always help (Score:5, Informative)
Re:Cache always help (Score:3, Interesting)
The reason the 4mb Xeon's are significantly outperforming the 2mb Xeon's is due to the shared bandwidth architecture of the Xeon's. The cache makes up for the lack of access to data via the FSB and keeps the very deep pipeline of the P4 series processors full. The long pipeline is the reason that cache misses impact the speed of the P4s so much - despite Inte
I recommend Glasses (Score:5, Informative)
Re:I recommend Glasses (Score:5, Interesting)
Anandtech isn't biased. (Score:5, Insightful)
One of the purposes of the test was to show how the memory bandwidth bottleneck of the Xeons limits their effectiveness in 4-way configurations, which the Opterons do not have that problem. Doing this comparison with different memories would make things more complicated.
Additionally, you'll notice that Anand's final words recommend the Opteron for being at least equivalent and much cheaper than Xeon. This was also the selection process for their new forum servers, so you can bet that they aren't getting any kickback from Intel, or those would be Xeons.
If you still have doubts about the validity of Anandtech's testing, check out the benchmarks from their AMD vs. Intel web server test in December: http://www.anandtech.com/IT/showdoc.html?i=1935&p
Really, I think some people ought to think before they flame like this. The benchmarks are showing the Opterons to be equivalent or faster in 2-way configurations and definitely faster in 4-way configurations, so what is there to complain about? The fact that Anandtech has consistently recommended AMD's processors just makes it doubly silly.
Re:Anandtech isn't biased. (Score:4, Insightful)
Then they should not have done an memory intensive and disk intensive benchmark like a database, now should they?
NUMA means Opteron is Better (Score:5, Informative)
That is why Opteron is required for good performance with eight to sixteen processors, and you can even see the improvement on the four way tests that Anand ran.
Re:NUMA means Opteron is Better (Score:3, Informative)
Saying Opteron is better for 16 way means nothing as those systems do not exist.
NUMA architectures are not without issue. I emphasize the "NON UNIFORM" aspect of the acronym. Even if SGI wants to change it to "nearly uniform". Sounds like you've been reading too much of their properganda.
Re:NUMA means Opteron is Better (Score:4, Insightful)
I think the above poster had the correct idea about NUMA, but worded it in a misleading way. A NUMA design (either of opterons, or of Xeon-quads) will have to do some memory access through the memory controllers on other nodes. This increases the latency of memory access, and can clog up the inter-processor links if lots of memory loads/stores go to remote memory. Thus NUMA-aware operating systems and system libraries are necessary to maximize the amount of memory access that is local, and minimize the usage of the inter-processor links.
While the opteron design is elegant, and fast, it is not the only smart way to do things. It offers great aggregate memory bandwidth, but can slow things down in the worst case. Most large NUMA systems are created by linking 4-way SMP nodes. (Examples: Sunfire, HP alphaservers, Cray X1, NEC SX-6, Unisys 7000, IBM xseries 4xx xeon, IBM xseries 4xx itanium,...) Apart from opteron systems, the only systems I can think of that do NUMA per processor are the cray T3E, SGI origin, and intel paragon, all of which are Massively parallel supercomuters.
It is safe to say, however, that a shared bus system does not scale well beyond a few processors. This is best demonstrated by the 36 processor SGI-challengeXL, which was significantly bottle-necked at the memory bus.
food for thought.
Re:NUMA means Opteron is Better (Score:4, Informative)
NUMA is a tradeoff. Each processor has its own memory controller and its own bank of memory. Therefore each processor has preferential access to that bank of memory. If, however, a processor attempts to access memory in another processor's bank, its slower. This cost is mitigated by intelligent VMs that attempt to put the memory for processes in the memory bank controlled by the processor running that process. If this is done efficiently enough, the benifit of having 4 memory controllers far outweighs the cost of possibly having to get memory slowly from another processor.
Can someone please clear my ideas about this (Score:4, Interesting)
One thing I did not understand is how come the 3MB cache is helping with big database query ? I thought that will thrash the cache and there will be not much performance gain if you are working with bigger code/data set. Also, for the four CPU opteron, do they have hyper transport going from every cpu to every cpu ? Is it like a mesh or like a ring where every cpu has only two connections to it's next ones.
Another thing I did not get is how linux is handling ( not handling ) the local memory to the CPU. This thing looks like a mini-numa type system. Does linux actually try to keep the data in the RAM and process it with the cpu it is connected to ? how does this really work ?
May be you guys can help clear my ideas .
Re:Can someone please clear my ideas about this (Score:4, Interesting)
this is interesting, and i don't have an answer, except to say that SQL servers generally try to load all of the tables into available RAM. If the data is too large, then simply the indexes(??). If the server ever has to go back to the harddrive for data (which would make it bloody slow for a query) it will check recently cached stuff first - and larger caches means reduced time pulling data sets from a raid array or single harddrive.
that is atleast my take...it generally differs from server to server....MySQL does not run exactly the same way as MSSQL as Oracle. (which means I've generalized.)
Re:Can someone please clear my ideas about this (Score:4, Interesting)
As a page gets loaded from the harddisk it is loaded into the server's cache. Any read/writes are done to memory and not the disk. Background process write the pages to disk that are dirty. All transactions are written to the transaction log so if the server crashes before this happens recovery can happen when the db starts back up.
This means that a large portion of the data is already in memory. Servers usually pre-allocate gigs of memory for this purpose, the more the better and a big reason for 64 bit on large dbs.
Under x86 caching schemes, the CPU does speculative loads. It "guesses" what memory the processor is going to need and starts loading it into high speed cache. This is perfect for a db since most of the time the db pages you need are in sequential order. Especially when you are talking about pages that only include indexing data. The query usually does most of it's work using indexes, and then at the end will actually "lookup" the data.
So bigger caches mean that these big binary trees get loaded into cache and the algorythms can loop through them faster and pull off the cache.
Take this into the Itanium world and we can start to get even better performance. The thing people tend to forget about Itanium is that you can tell it to load you data into cache. So an optimized DB server can have it read this large section of code into the cache while it does calculations. Itanium allows 3 instructions to be loaded. Once hypertheading is put into Itanium you will see these DB apps really fly. Itanium is showing good promise in this arena, even at 1.5 GHZ. Clock that up to 2 to 3ghz with multiple hypertheaded cores and we are going to have one fast chip for dbs.
The big issue is the price, if you are going to spend that much money, go with the proven Sun/IBM. Itanium is set to replace the Xeon by 2007 ( I'm guessing before then because of scaling issue on the Xeon and x86 emulation software giving decent 32 bit for legacy apps ).
I really think Intel needs to push their Itanium. MS likes it, Linux likes it, a few db servers like it, and a slew of other high performance, server based things. I don't see how Intel is going to scale against the Oppie. Not unless they stick a on-die memory controller. Hypertheading and the new thread based instructions will help though. Should be an inresting battle. I'll be happy if AMD can get 10% to 20% marketshare, then we will see some true competition and innovation like we have on the desktop.
Importance of compilers (Score:5, Insightful)
So in any evaluation, the compiler and binaries that are used is an important question.
There was no mention of this in the article.
Re:Importance of compilers (Score:4, Informative)
Believe it or not, Intel's compiler generates very good code for the Opteron. Far better than GCC or generic IA32 compilers.
This is the experience I've had with the Intel Fortran compiler (ifort) on an Athlon XP. Codes compiled with ifort are around twice as fast as those compiled with GNU g77 (for Fortran 77), and around 1.5 times faster than those compiled with Lahey lf95 (for Fortran 95).
OS (Score:4, Interesting)
Back to Intel Fanboy (Score:3, Interesting)
Now... given this kind of statistics, as sad as it may sound I'd say I am willing to pay anything for an Intel just to avoid the headaches.
Re:Back to Intel Fanboy (Score:3, Insightful)
Re:Back to Intel Fanboy (Score:5, Insightful)
Now... given this kind of statistics, as sad as it may sound I'd say I am willing to pay anything for an Intel just to avoid the headaches.
That is an interesting use of the word statistics. In order to determine if your next processor is likely to break, you should look at thousands or hundereds of thousands of Intel procs and AMD procs. Your 7 processor study is inherently flawed.
Re:Back to Intel Fanboy (Score:5, Funny)
Re:Back to Intel Fanboy (Score:4, Insightful)
Re:Back to Intel Fanboy (Score:5, Funny)
Re:Back to Intel Fanboy (Score:3, Insightful)
Cooling is/was more important, especially for the older T-birds.
Cooling is VERY important for all current processors. It all becomes relative. When the Thunderbird was current, it used quite a bit more power than the PIII that it competed against, so cooling was very important for that chip. Now, the ~70W that the T-Bird used is not at all abnormal, this is the same basic power consumption of an AthlonXP (Barton or Thoroughbred), Athlon64/Opteron, P4 "Northwood" or even an IBM PowerPC 970 (aka Apple
New hammers on the way. (Score:5, Informative)
Some info here [techreport.com]. SSE3 is the big thing.
Re:New hammers on the way. (Score:3, Informative)
I doubt SSE3 will make a difference for very many applications. A quick overview of x86 vector instructions:
MMX: primarily vector integer instructions on 64-bit registers; main flaw is that they use the floating-point registers, so you can't mix MMX and FP code. Biggest win for image processing, which is usually 8-bit data and perfect for MMX.
3Dnow: adds vector floating-point instructions on 64-bit registers (introduced by AMD).
SSE: primarily vector floating-point
Basically... if you have more then 2 use AMD (Score:5, Insightful)
Price vs. Performance (Score:5, Insightful)
Re:Price vs. Performance (Score:5, Insightful)
Even if someone has money to burn, wouldn't it be better to get more performance anyway?
Re:Price vs. Performance (Score:3, Interesting)
Because even if the chip is a significant portion of the cost of building the computer, it is only a small fraction of the total cost over the useful lifetime of the cluster.
Because one has to benchmark for one's own problem set. It's possible that one set of instructions are be
memory controllerS? (Score:5, Interesting)
I'd like to see some test on servers like the IBM x445 [ibm.com] with NUMA.
The Usual Problem (Score:5, Interesting)
The tests in this article, involved running the same exact binaries (out-of-the-box Microsoft 386 stuff) on both types of CPUs, rather than the code being compiled to run natively. The Opterons were fighting with one hand tied behind their backs.
In other words, this benchmark is mainly only of interest to Microsofties. If that's what you run, then fine, the article may be useful to you and you may get something out of reading it.
If you are trying to maximize speed, though, then the software contraints that this test took place under, are totally contrary to what you'd actually be doing (running code that is appropriate for the hardware).
BTW, another weird thing I noticed about this article: these guys use flash for static images of bar graphs. WTF? Anandtech, your w3b d3$1gn3rz R S0 31337!!!1
-5, Clueless (Score:5, Insightful)
Secondly, you are not going to get MS to recompile an MS-SQL for Opteron. You're not going to get IBM to support a Linux installation, after you've rolled your own ueber-NUMA-patch-level-42 kernel.
The test was clear - out of the box, plug in servers, load OS, load app, run benchmark.
And the outcome was clear, the Opteron architecture is vastly superior, both performance and price-wise.
The MHz myth is over, at least in Slashdot and Anandtech circles.
Re:-5, Clueless (Score:3, Insightful)
2) IBM would probably support uber-numa patched kernels as you put it, since they are one of the main proponents of Linux-on-massively-parallel-supercomputers anyway.
Do some research.
Re:-5, Clueless (Score:3, Informative)
IBM supports native x86_64 distros (SLES8/AMD64 now, and looks like RHEL3/AMD64). So IBM supports you running 64-bit today.
Windows is working towards 64-bit for Opteron, so yes, MS is recompiling their crap for Opteron.
Simply because there is a logical reason for images being flash makes it no less annoying as hell.
Opteron is a fascinating platform, and very cool, *especially* with respect to 64 bit computing.
Article on one page (Score:4, Informative)
Hmm (Score:4, Funny)
Did they fare well, also?
Conclusion... (Score:5, Informative)
The comparison we've made here is a very important one; it identifies Intel's strengths and their weaknesses with Xeon, and it crowns Opteron a clear multiprocessor winner. An area that we didn't touch on is cost, which is where AMD truly shines. The Opteron 848 processors we tested are around 1/2 the price of Intel's 2MB L3 Xeon MPs and we have not seen retail data on how expensive the 4MB parts will be.
In a 4-way configuration AMD's Opteron cannot be beat, and thus it is our choice for the basis for our new Forums database server.
Apples, Oranges, and Efficiency (Score:5, Informative)
The Opteron clocked at 2.2ghz with 1MB of cache was very close in 2-way performance with the Xeon 3.0 and 3.2 ghz each with 4 and 2 mb of cache respectively. The 1.8ghz Xeon compared well with the Xeon 2.8ghz with 2MB of cache. The Opterons were typically within 3% or so of their Intel counterparts in 2-way benchmarks and closer to 10% ahead in 4-way.
If nothing else, this says a lot about the efficiency of the Opteron's design. Less silicon, and more importantly for AMD, less expensive silicon, manages to achieve very close results.
Comparing Prices (Score:5, Insightful)
4 Xeons (@Intel's announced pricing): $14768 ($3692 ea)
Did the quad Xeon system outperform the quad Opteron by a factor of 2.5:1? No. In fact, in some cases, the quad Opteron outperformed the quad Xeon. The Xeon had advantages of hyperthreading, 4x as much cache, and a clock speed 800mhz higher than the Opteron, ans still got beat.
Clock speed may sell in the consumer market ("Me want bigger!"), but in the server market, Opterons getting better performance for half the price are going to win more and more converts.
- Greg
Re:Comparing Prices (Score:3, Informative)
Order Entry Stress Test (Score:3, Insightful)
In our infinite desire to please everyone we worked very closely with a company that could provide us with a truly Enterprise Class SQL stress application. We cannot reveal the identity of the Corporation that provided us with the application because of non-disclosure agreements in place.
Okay... So we know what kind of hardware they're testing against, but not knowing what kind of software they're benchmarking? "We're using an enterprise scenario" isn't good enough.
It's nice to look at pretty charts and all, but I imagine anyone who is going to investigate enterprise level solutions is going to want to know EXACTLY what this is being benchmarked on.
Even though I typically tend to trust Anandtech's outlook on things, I'm still kind of so-so on this review. Their forum test is not really externally reproducible and their enterprise test is too vague. I doubt any IT person would weigh this review too heavily when making a decision.
Then again, I could be wrong.
Story submitter -5 Ray Charles (Score:5, Informative)
Perhaps the submitter's screen reader doesn't work well with flash, but in the 2-way benchmarks, Opteron was on top twice, and Xeon was on top 3 times. All the 2-way benchmarks were fairly close (within 5%), and the Xeons never beat the Opterons by a margin greater than 1.7%. I don't quite know where 40% wins translate into "almost always" loses. In other words, the story submitter is a moron, or simply didn't look at the article.
"Results were a little varied as 4-way Opteron systems seemed to fare the best,"
Seemed? Let's see, out of five 4-way benchmarks, Opteron won... all of them - performing about 10% better than Xeons each time.
Since when did we start letting Tom Pabst submit articles to
Note to editors: When the submission is non-sequitur, either reject it or edit it.
Not Surprising (Score:3, Interesting)
Varied, perhaps, but not surprising. AMD has integrated the memory controller on the CPU, which could explain their getting better when the number of CPUs increases (the Intels being held back by having to go through the same memory controller).
As for Intel winning out on the dual CPU systems, well, they are ahead of AMD in the CPU speed race, aren't they?
Dual Xeon's beat Dual Opteron's? I think not. (Score:5, Interesting)
Perhaps the benchmarks show the 2P Xeon's doing OK against 2P Opteron's, but for the price of two Xeon MP chips you can buy five Opteron 848's. Rounding that down, I wonder how well the 2P Xeon does against the 4P Opteron? Oops, Anand already though of that. He says "it would not be pretty." Indeed.
Re:Dual Xeon's beat Dual Opteron's? I think not. (Score:5, Interesting)
The graphs mean that Opterons with a "measly" 1 meg of cache are beating out Xeons that have (a) four times the cache, (b) 50% higher clock speed, and (c) a price tag that's three times higher.
Hats off to AMD. In times past (K2/K3), price was the only thing they had better than Intel. Now they've got both price and performance.
steve
Re:Why benchmark games? (Score:5, Insightful)
The last two articles on Xeons used their forum database as the workload for the benchmark. In the current article he even managed to use an unnamed enterprise order management system.
Then, if you have the games and the 64 bits systems at hand, why not do a quick benchmark?
Their review of windows64 highlighted some obvious problems, probably with drivers/PCI, that may be relevant for professional use (think of CAD).
Re:Why benchmark games? (Score:5, Interesting)
I'd like to see more "Consumer Reports" type tests to. Test hardware configuration X as a high-volume SQL server, and show me how it's held up after a month, 3 months, 6 months, and a year. Yes, maybe I'd upgrade before then, but not everyone would, and I'd like to see common failures and problems down the line - not a 1-2 day test.
Re:Why benchmark games? (Score:3, Insightful)
Re:Suck... (Score:5, Informative)
They DO. It's called an Opterion!
Guess you are refering to the Athlon64 which is a one way processor (just like the Pentium 4).
BTW the Opterion is made in 3 flavors, the 100 series is a SINGLE way cpu with NO smp support. It's very nearly the same as the Athlon64-FX. The Opterion 200 series is a 2 way (2 cpus), and the Opterion 800 series supports up to 8 cpus. AMD dropped plans for a 400 series, but you can use the 800 series chips to build servers with 1-8 cpu's.
Re:Suck... (Score:5, Funny)
Re:Suck... (Score:3, Funny)
Re:Suck... (Score:4, Informative)
Why can't AMD make a Dual AMD64?
They do - it's called an Opteron 2xx.
Re:Suck... (Score:3, Informative)
Actually, AMD came out with an entirely new "socket" with the original K7, the Athlon. It actually debuted in a slot form factor, but the resulty remains the same - AMD has been independent of Intel sockets since '99. AMD and Intel have numerous cross-licensing agreements, which is why Intel is offering an AMD64-compatible bunch of CPUs fairly soon. AMD could have used socket 423 or
Re:Suck... (Score:3, Informative)
Re:Why is this news, seriously? (Score:3, Interesting)
"In a 4-way configuration AMD's Opteron cannot be beat, and thus it is our choice for the basis for
Re:L3 cache (Score:3, Informative)
Re:No Surpises Here (Score:3, Funny)
because their processor cores are running at almost 50 per cent faster
Because everyone knows more of them megahurtzy things is good,right?
Re:Question For Geeks Intel Hyperthreading (Score:4, Insightful)
The idea behind hyperthreading is that the P4's long pipeline will often stall with only a single thread going through. With hyperthreading you run two threads at once, so when one thread stalls you just start up the other thread and go with that one for a while. In a way it's almost like a poor-mans dual-processor system, giving you two logical processors on a single chip.
Now, obviously there are a few things to consider here. First off, if ALL of your processing is being done in a single thread then you aren't going to see any benefit to hyperthreading, and in fact the extra overhead might even make things a bit slower (usually only 1-2% slower).
Games almost always do all their major processing in a single thread. Even if they have extra threads hanging around, you almost always spend 99%+ of your time in a single thread. For this reason, games see virtually no benefit to hyperthreading (they don't see much/any benefit from dual-processor setups either).
On the other end of the spectrum, some applications see up to a 25% performance boost when hyperthreading is enabled. The tests I've seen show the biggest improvement have been things like Photoshop and rendering applications. Some server applications should benefit as well.
The other boost that hyperthreading gives you, like with a real dual-processor setup, is that it makes multitasking a bit "snappier". This is by no means a night-and-day difference here, but it is there.