EM64T Xeon vs. Athlon 64 under Linux (AMD64) 313
legrimpeur writes "Anandtech has a nice performance comparison under Linux (AMD64) between the recently introduced 3.6GHz EM64T Xeon processor and an Athlon 64 3500+. It is disappointing to see how the Athlon gets trounced in FPU intensive benchmarks. No memory-bound benchmarks (where the Athlon is supposed to have an edge) are presented, though." Update: 08/09 23:34 GMT by T : As the Inquirer reports, many Anandtech readers take issue with the comparison.
Comment removed (Score:5, Informative)
Re:Math Co-Processor (Score:2, Informative)
2 things... (Score:4, Informative)
I would be nice to see more non-synthetic benchmarks.
Re:Opteron (Score:3, Informative)
Although the Athlon 64 3500+ and the Xeon 3.6GHz EM64T processors were not necessarily designed to compete against each other, we found that comparing the two CPUs was more appropriate than anticipated, particularly in the light of Intel's newest move to bring EM64T to the Pentium 4 line. Once we obtain a sample of the Pentium 4 3.6F, we expect our benchmarks to produce very similar results to the 3.6 Xeon tested for this review.
Without a doubt, the 3.6GHz Xeon trounces over the Athlon 64 in math-intensive benchmarks. Intel came ahead in every severe benchmark that we could throw at it, particularly during John the Ripper. Even though John uses several different optimizations to generate hashes, in every case, the Athlon chip found itself at least 40% behind. Much of this is likely attributed to the additional math tweaking in the Prescott family core.
That's not to say that the Xeon CPU necessarily deserves excessive praise just yet. At time of publication, our Xeon processor retails for $850 and the Athlon 3500+ retails for about $500 less. Also, keep in mind that the AMD processor is clocked 1400MHz slower than the 3.6GHz Xeon. With only a few exceptions, the 3.6GHz Xeon outperformed our Athlon 64 3500+, whether or not the cost and thermal issues between these two processors are justifiable.
We will benchmark some SMP 3.6GHz Xeons against a pair of Opterons in the near future, so check back regularly for new benchmarks!
Re:More Slashdot Flamebait? (Score:5, Informative)
Re:Math Co-Processor (Score:5, Informative)
http://contracts.corporate.findlaw.com/agreemen
So I don't see any problem fro AMD in licensing the cp-processor.
Re:Why Not Opteron? (Score:2, Informative)
Riiight (Score:3, Informative)
The 3500+ is a mainstream, desktop processor. For a more accurate comparison, the FX series, and the opteron line should have been used.
Re:The Comparison is not really fair... (Score:2, Informative)
Re:Opteron (Score:5, Informative)
Re:Opteron (Score:1, Informative)
30 fps is a slideshow (Score:3, Informative)
The reason our eyes don't have a problem with 24 fps film is because movies have lots of motion blur! Video games have no motion blur at all, unless you're playing a PS2, in which case everything is blurry.
Re:What about scientific code? (Score:1, Informative)
Stuff compiled without optimisations!! (Score:2, Informative)
Seriously, it makes a great difference what version of GCC they use.
I saw a great boost in benchmarks when I switched from gcc 3.3 to 3.4 on my AMD64.
-O3 -pipe -march=k8 -fomit-frame-pointer -ftracer
That's the way to go!
"We compiled the program using
Flawed benchmarks (Score:5, Informative)
this test they did was flawed in all respects.
Re:FPU intensive? (Score:5, Informative)
I looked at the code and played with it a little (I got it from http://cr.yp.to/primegen.html [cr.yp.to] and it seems the benchmark is mostly limited by the implementation of putchar().
My system was an dual AMD Opteron 1.8GHz running Win XP pro with Cygwin. I modified the benchmark to not use putchar() but instead just write the characters to a 1MB buffer, and it got 16 times faster! To be specific, "primes 1 100000000 > file" went from 24.2 seconds to 1.497. Note that it's generating 51MB of output for primes under 100 million. I didn't bother running it for the 100 billion max, but would expect it to be around 50GB.
This is a very poor benchmark since it's just measuring your stdc implementation of putchar and your system's ability to sink data to
Re:What about scientific code? (Score:1, Informative)
My experience has been the following. My personal Athlon XP2800 system configured as follows:
Asus cheapo motherboard
1GB ram
SCSI ultra wide hard drive (ancient!!)
Windows 2K
Lahey Fujitsu 5.7 Fortran compiler
No tweaks here, everything out of the box.
Six thousand dollar workstation bought by my research group (last year though):
Xeon 2.8Ghz
Configured by Dell, whatever mobos they use
1GB ram
Ultra 160 SCSI drives
Red Hat Linux
Fujitsu 6.0 Fortran compiler
Results: To be honest, I am not certain on whether that old LF95 6.0 compiler is 64 bit native. 6.2 is though. My self built system runs MY computational programs about 25-30% faster.
The benchmarks are WRONG. (Broken TSCP Makefile) (Score:3, Informative)
The major issue is that Anandtech does not know how to compile software.
The Makefile used for TSCP on the A64 is broken, and does not apply -O2 optimization at the right stage.
My A64 3200+ scores 290K n/s when -O2 is properly applied.
On "primegen" most of the time is spent in putchar(), instead of in computation, and they should comment out the putchar() loop instead of directing output to
Also, they should have edited conf-cc and turned on -O2 optimization.
ubench is known to be buggy, and the AMD64 results have been questioned on other sites as being implausibly bad.
They copied their data wrong on the first database test. The A64 3500+ times in at 215 in 64b mode, beating the 3.6 GHz Nocona.
Their encoding benchmarks are equally suspicious.
And gzip was a 32bit executable.
In short, this "review" is HORRENDOUS, and filled with errors. A64 3500+ vs. Opteron 150 is a distraction from the real problem:
These guys don't know how to compile, optimize, and benchmark software.
Re:More Slashdot Flamebait? (Score:2, Informative)
However, I'm thinking more along the lines of AMD's general strategy: More work-per-clock. The K7 was intended as a direct competitor to the Pentium III, and the design was great. (It even holds up pretty well against the P4.)
The Pentium Pro had two integer units and a single floating-point unit. The k7 had three of each.