Memory Timings Analysis 159
keefe007 writes "It's generally known that smaller and more aggressive memory timings combined with higher clock speeds leads to higher performance, but for the most part, the increase in performance from tweaking each individual setting is relatively unknown. Perhaps in a bit too ambitious move, I set out to examine the impact of each individual memory timing and clock speed on overall performance. Find out the results of the tests at Techware Labs."
I've always found... (Score:4, Funny)
Re:I've always found... (Score:4, Funny)
Combinations, more! More!!!! (Score:5, Funny)
Only one operating system? (Score:2, Interesting)
Testing just Windows is like test driving the Indy 500
Re:Only one operating system? (Score:2)
Funny you should mention that, I just played one of a pile of Indy 500 games on Windows XP. How ya doin on Linux over there?
Re:Only one operating system? (Score:1, Troll)
Could be worse. He could have just tested Linux. How many extra FPS are you going to squeeze out of Tux Racer? Might as well test a speedboat in an olympic sized swimming pool. "I can drive nowhere, but I'm getting there real
DURRRRRRRRRR (Score:1)
Re:Combinations, more! More!!!! (Score:2)
Hey, that's funny, but if a bunch of people did a handful of tests each, and someone was able to figure out the right statistical tools to compare those results, this might not be so far-fetched.
Re:Combinations, more! More!!!! (Score:2)
cas vs bus speed (Score:5, Insightful)
but people will always say they have their stuff at the most agressive timings just to say that they are there, even though the average performance increase is only 0-2%
Re:cas vs bus speed (Score:1)
You often get better performance when the memory and cpu are running the same clock speed as then you don't end up with the CPU waiting on odd-cycles for the memory to pump data.
You should run tests to verify, but hey, at least this is only two settings in stead of 289 or whatever.
Re:cas vs bus speed (Score:5, Insightful)
Yeah, that's a problem I have with this guy's test results. He's taken all the memory parameters which have very different purposes and effects, and rated them all on a scale of percentage increases in memory bandwidth.
The important effect of dropping CAS latency is that it improves memory response time on a small request - it's not meant to really give a big boost to bulk bandwidth.
You can think of the tradeoffs from spending a fixed dollar amount on clock speed boosts vs lowering CAS times as kinda like the difference between going RDRAM and going SDRAM. RDRAM had much higher bandwidth, but the latency sucked. SDRAM had lower bandwidth, but also had lower latency.
So wrap it up - this test is uninteresting because it rated all those parameters based on how they affected bandwidth, when really only the clock speed speed has a significant impact on bandwidth - a lot of the other parameters are really more about latency and responding well to certain patterns of access.
Re:cas vs bus speed (Score:5, Informative)
What makes this terrible is the fact that there are latency measuring tools out there, lmbench specifically. It really wouldn't take that long to measure both latency and bandwidth.
Considering the fact that this definitely would be interesting, it's a little annoying that he didn't do that.
There are much more intensive memory benchmarks than Sandra. That's why it's a little annoying that Sandra's become so popular. There are other, easy to automate benchmarks that do a much better job. Sandra's useful, but not for this kind of thing.
Just plain useless.
Re:cas vs bus speed (Score:2)
That's not entirely correct, either. He's talking about "bank interleaving" -- multiple banks on the same module all communicate through a single bus, so there's no benefit to having two banks read at the same time, as only one can return the data.
To achieve "RAID for memory", in performance sense, you need channel interleavin
Re:cas vs bus speed (Score:2)
Normal RAID works this way too. There's only one SCSI/IDE bus, so they can't read at the same time, because only one can return the data.
Here's how you get a benefit:
(Memory here has a latency of 2 clock cycles)
1: Controller issues Read A to Chip A - Chip A
Re:cas vs bus speed (Score:2)
Re:cas vs bus speed (Score:2)
Tom's did a BIOS tweak article article about 2 years ago on this:
http://www6.tomshardware.com/howto/2001072
Results = Waste of Time? (Score:4, Insightful)
Waste of Time? No Fricking Way! (Score:3, Funny)
Re:Results = Waste of Time? (Score:2)
Well, was it more of a waste of time than playing some online game for 6 hours straight? Ever done that?
While the outcome proves that there was no real increase, it was a worthy test. Would I have done it? No way. Was it a waste of time? Maybe, but was it a worthwhile waste of time, compared to other things? Probably. But the guy was right, until
Re:Results = Waste of Time? (Score:2)
Yeah, proving something doesn't work is useless (Score:2)
A few ticks backwards of the clock
One guy
A limited number of brain cells
One overgrown ego
A couple of feathers
Some glue
Running at a good clip
A decent sized cliff
Some nice jagged rocks
Everytime said guy runs off the cliff with a couple of feathers glued to his back he ends up a chunky wet spot on the jagged rocks.
But that won't stop some other idiot from saying He just needed-
More feathers
Straps instead of glue
To run faster
A taller cliff
Less pointy rocks
Let's not get into the point of how it
Re:Yeah, proving something doesn't work is useless (Score:1)
Re:Results = Waste of Time? (Score:5, Informative)
If you've heard the quote, "Never underestimate the bandwidth of a 747 full of DVDs" (updated for modern times), you can see that it is an example of why bandwidth is not the only important factor. On the benchmark he used, a 747 full of DVDs probably would have scored pretty well.
If you're going to play with latency settings, at least use a test that measures latency.
Re:Results = Waste of Time? (Score:2)
Now, if you're talking about a flying saucer DVD, that's another think, but you've got to be pretty precise when it comes to direct landing in the slot-in dvd reader.
Full text of article in case of slashdotting (Score:4, Informative)
Review by Harry Lam on 05.16.03
Test Ram provided by Crucial, MSRP: $26.00 (per stick)
Introduction:
The typical BIOS usually offers a varying number of settings directly related to memory: everything from timings to clock speeds. It's generally known that smaller and more aggressive timings combined with higher clock speeds leads to higher performance, but for the most part, the increase in performance from tweaking each individual setting is relatively unknown. Perhaps in a bit too ambitious move, I set out to examine the impact of each individual memory timing and clock speed on overall performance. The article that follows contains my experiences in this "memory benchmarking adventure" in conjunction with Crucial's PC2700 DDR RAM (and also gives a relatively good picture to the limits of this memory).
I would recommend that anyone interested in learning more about memory timings take a look at this site. It gives a pretty good technical intro to memory timings.
Testing/Methodology:
Motherboard Selection:
I decided to use the Soyo SY-P4X400 for testing, due to the flexibility of its BIOS in relation with memory timings, allowing me to change 10 different memory-related settings.
Benchmark:
To save on time and testing (all of the testing occurred over a 5 day period, with several hours of testing in each day), I picked only one benchmark: the memory test on SiSoft Sandra Professional 2003 v9.41 (SP1). I did notice that the initial few benchmarks on any configuration usually were significantly higher or lower than the "steady-state" score (the stabilized value that comes up after successive test runs of the benchmark in a row). To compensate for this, I selected the median score after the scores stabilized from successive benchmarks.
Depth:
I decided that 4,608 different combinations of memory timings on my particular test bench was a tad bit too much testing, and created a methodology which would get a look at the general increasing performance of memory timings but had the downside of having an uneven number of data points for timings that were deemed "less-significant (more on this later). VA Software is DEAD. This methodology simplified the number of combinations down to a mere 289 combinations (which actually still is extremely time consuming, considering that the test computer has to be reboot after testing each combination).
I established Memory Speed (100, 133, 166), CAS Latency (3.0, 2.5, 2.0, 1.5), and Bank Interleave (Disabled, 2 Bank, 4 Bank) as the primary criterion for my benchmarking (as these usually are the settings that are most emphasized). The "less significant" memory timings (Trp, Tras, Trcd, DRAM Command Rate, DRAM Burst Length, Write Recovery Time, and DRAM Access Time) as a result received a less thorough testing.
The general testing methodology is as follows:
All combinations of Memory Speed, CAS Latency, and Bank Interleave were tested at the least aggressive memory timings, and once that was complete, I changed the first of the "less significant" memory timings to a more aggressive value (Trp was changed from 3T to 2T). I then repeated benchmarks for all possible combinations of CAS Latencies, and Bank Interleaves based on this new timing (12 total combinations). Slashdot really licks my nads. Once this was complete, I changed the value of the next "less significant" memory timing (Tras), and repeated another set of 12 combinations (keep in mind, I left Trp at 2T, the most "optimized" value). This process was repeated for each "less significant" memory timing, and then the entire set (of 96 different combinations) was repeated at an increased clock speed (for a total of 289 different combinations).
As I stated earlier, this results in an uneven number of data points. For example Trp had 36 data points at 3T compared with 252 data points at 2T, and the reverse is true for DRAM Access time (252 to 36).
Test RAM:
Crucial was gracious enou
Re:nice troll. (Score:2, Funny)
Now if you really had a sense of humor, you would have changed the technical details. But then again, you're only an AC, right? Perhaps I'm expecting too much.
Re:Full text of article in case of slashdotting (Score:2)
Now I only wished you had posted the full text of the article, as you advertise, and not just the first page... And if you had to post only one page, the conclusions would have been more interesting. The above is pretty useless...
If you are going to post the content of an article, please do it right
You get a good point, though, for posting as an AC and not being an karma whore
Re:Full text of article in case of slashdotting (Score:3, Informative)
-----
Memory Timings Analysis
Review by Harry Lam on 05.16.03
Test Ram provided by Crucial, MSRP: $26.00 (per stick)
Observations:
page 2 and 3 mirror (Score:1)
<a href="users.domaindlx.com/leukhe/page2.htm">page2
<a href="users.domaindlx.com/leukhe/page3.htm">page3
By the way, the parent post is including items like "VA IS DEAD" in his text.
Re:page 2 and 3 mirror (Score:1)
That's what the preview button is for...
oops. (Score:1)
What beats me... (Score:5, Insightful)
Last year, I did a demo of a Via system with SDRAM and it did about 40% faster than a DDR-RAM board. The VP-Fin chap has become highly suspicious of any memory performance graphs or numbers, these days. And in true BOFH style, I've got decision-making rights on all PC purchases.
Thanks to all the confusion.
More details? (Score:2)
For programs that fit in CPU cache, or spend a lot of time on the cache contents before moving on to a new dataset, DDR vs. SDR doesn't matter as much as programs that are not cache-efficient. (FYI, older FPS games, esp. Quake I/II
this guy (Score:5, Insightful)
Speaking as an engineer, I do hate buying new stuff because its cheaper to do so rather than spending time tweaking the old stuff, but 100's of combinations, for a few % increase? Even I would be perfectly happy in paying the money rather than loosing 0.05% of my life!
Re:this guy (Score:1, Funny)
At least this guy's actually done the research now. And it bolsters what is common wisdom.
Re:this guy (Score:3, Funny)
- How much have you lost with this "Loose 20 pounds in 2 weeks" program?
- Two weeks.
Re:this guy (Score:1)
Re:this guy (Score:2)
What does that mean? You can save $10-15 a stick of memory if you just buy CAS 2.5/3 instead of CAS 2 RAM (unless you really want that extra 2% increase of speed)
I definitely think the write-up is on par with the earlier write-ups indicating that SDRAM kicked the snot out of RDRAM 3-4 years
except he fucked up..... (Score:3, Insightful)
2: In his final analysis he failed to mention that there must be another bottleneck in the system causing the sub standard performance increases.
3: He only tested memory transfer and not random access, page faults and all the other things that really slow your computer down.
If your after max performance then your going to buy the best anyhow, if you not then a PC still using PC133 memory will be fine.
Re:this guy (Score:3, Informative)
He only measured memory bandwidth...which does not exactly translate to real-world performance. He says there is no performance benefit from CAS2 as opposed to CAS2.5 or CAS3, but if you read Tom's Hardware [tomshardware.com] you'd know that CAS does have a drastic impact on overall performance. The benefit is just not in badwidth, it is in the time the processor has to wait from when it calls on the memory, to when it recieves the answer. The longer it is waiting for an answer, the longer it is sitting a
Re:this guy (Score:2)
As for a total waste of his time? Who cares? He was bored. We all have our hobbies. As long as he makes a living, and he comes out having had a good time, learning or what have you, be happy he did it. At least SOMEONE did.
Re:this guy (Score:2)
Re:this guy (Score:2)
If something takes in the long run 10 years from now, or 9 years from 2 years from now.... well you do the math. But no one knows what to expect. So making the best out of what you have is always good.
Re:this guy (Score:2)
Re:this guy (Score:1)
I doubt he knew in advance there would only be a few % increase. What if he had found, for instance, that changing the DRAM Burst Length from 4 to 8 netted a 40% boost in throughput?
His work is still valuable, if only to document (with numbers to back it up) that it's really not necessary to tweak these settings.
Re:this guy (Score:2)
According to you he has too much time on his hands. I myself enjoy doing this type of stuff, even if its only 0.01% increase its the satisfaction of achieving it rather than that extra 0.1 frame per second.
But then again I am no engineer I just a pathetic physicist who likes to tinkle around with stuff I know nothing about.
Re:this guy (Score:1)
Conclusion...UPDATED (Score:5, Funny)
Just bought some of that PC2700 myself (Score:3, Interesting)
Before I plunk down $$$ for PC3200, I wanted to know if it is worth it. I was hoping this article would help answer the question, but it looks like he is only testing various BIOS settings with a single DIMM, and not comparing memory with different access speeds. Any
Re:Just bought some of that PC2700 myself (Score:1, Informative)
Re:Just bought some of that PC2700 myself (Score:1)
Re:Just bought some of that PC2700 myself (Score:2)
Re:Just bought some of that PC2700 myself (Score:1, Interesting)
Re:Just bought some of that PC2700 myself (Score:2)
I've found.. (Score:4, Funny)
I've found that dumping beer onto my computers' silicon memory has the same effect as dumping beer down my throat does on my carbon-based memory.
So, CAS has no effect on bandwidth???? (Score:2, Interesting)
Sisoft Sandra memory bandwidth tests are good at exactly that, memory bandwidth tests.
What would be much more interesting is how programs that rely on small memory latencies (especially scientific programs) depend on changing the CAS.
Other Benchmarks (Score:3, Insightful)
Re:Other Benchmarks (Score:1)
Re:Other Benchmarks (Score:2)
THAT would potentially have been an interesting article.
totally flawed experiment (Score:2)
The first major problem I have is that he didn't test enough data points to prove that the differences in memory performance were actually due to the tweaks he did, and not thanks to some other environmental condition like sun spots, power surges, or what he ate for breakfast. His ENTIRE test suite only eaked out
Just for those who need more education. (Score:5, Informative)
GamersEd.com [gamersed.com]
What do Compaq et al quote for bank interleave? (Score:3, Interesting)
Does it really turn out that 4 way interleave is kind of bogus and only a 2-8% increase in performance? I suppose 8% might mean a lot, but on average it could be just 4% or so.
Re:What do Compaq et al quote for bank interleave? (Score:2)
No, the interleaving he's talking about is for page refreshing, not actually interleaving multiple memory sets.
steve
Wow, no shit (Score:2, Interesting)
He finally concludes that memory clock speed has a significant effect on bandwidth, while CAS and other settings hardly have an effect at all. Something I've known intuitively all along, and anyone with a rudimentary understanding would know.
In other words, when all those "super dooper case-moddin' overclockin' nothing-knowin' computer experts" payed an extra hundred bucks for stick of CAS 2 ram instead of CAS 2.5, they got ripped off. No surp
Re:Wow, no shit (Score:1)
Had he used a program that accessed many little bits of data throughout RAM (such as a scientific simulation) he might have found that CAS latency had a much larger role.
Re:Wow, no shit (Score:2)
I also underclock my athlon so it stays nice and cool, and run my hard drives in 'slow but quiet' mode.
I don't know why people are so obsessed with tweaking every 1% out of their PCs, it's a good way to ki
Re:Wow, no shit (Score:2)
I can understand slowing things down to make them quieter, but if you think running your ram slower is saving significantly on repair costs you're nuts.
Re:Wow, no shit (Score:2)
As for my RAM, I like to know that I can move my memory chips to any machine without problems, and getting the Kingmax m
Re:Wow, no shit (Score:2)
The other nice thing about buying overspecced components is that you have an upgrade path.
I've got one stick of quality PC3200 RAM that I'm effectively underclocking, even when I overclock my cheapie P4 1.8A. With this CPU, 2700 would be just fine.
But in a year or so, I'll get my money's worth when I hand the 1.8A to a friend and swap in a replacement CPU with a 166 or 2
Re:Wow, no shit (Score:1)
Re:Wow, no shit (Score:2)
As for keeping the machine, I intend to give it to future offspring or a significant other down the line. My current girlfriend is happy with her Celeron 500, which is several years old. I figure by the time I move up to 'The Next Big
Re:Wow, no shit (Score:2)
Memory performance affects applications in basically 2 ways. Bandwidth and latency. He measured bandwidth. CAS changes latency.
CAS 2 is better in many benchmarks than CAS 2.5, but almost certainly not in one that only measures bandwidth, since CAS primarily alters latency.
Look at the real application benchmarks out there, rather than something synthetic, and you can see as much as 25% increase going from CAS 3 to CAS 2.
You mean...?!?!? (Score:5, Informative)
Wow!!!!%#@
You mean reducing RAM latency doesn't increase bandwidth?!?!#%!1 d00d!
*sigh*
This benchmark would have been vastly more informative if the guy had gotten his tests and statistics right. First, he needs to learn the difference between a median and a mean, which are very different. Second, actually testing latency might have been nice, considering that one of his independent variables is CAS latency. Not to mention the fact that the hardcoded pixel widths in the stats table are horribly wrong on a high-DPI system. People! The em is your friend!
So basically what we have here is this:
Ah, well. I'll go back to my completely untweaked Athlon and be happy. :-)
Testing method (Score:3, Insightful)
Then he could have gotten the same information with many fewer test runs.
Also you could end up with interaction effects, which is nice. Maybe two settings have a greater or lesser effect.
I did my own tests... (Score:1, Interesting)
He only looks at bandwidth-LATENCY MATTERS (Score:1)
Memory speed vs FSB speed (Score:2, Interesting)
Anyways, what the article discovers is that you'll get BEST performance when memory speed == FSB speed. In benchmarks they find that a Athlon 3000+ (333Mhz FSB) with DDR333 is faster than the 3000+ with DDR400 (or DDR444). So, mental note, when shopping for a system, don't bother paying extra for that fas
actually, not completely true (Score:3, Insightful)
The big thing about underclocking the bus speed, though, is that you can now overclock the latencies. You can make that CAS2.5 pc3200 a CAS2 pc2700 and tweak other latency settings, too. It also mea
What about ECC? (Score:1)
At least I don't worry about cosmic rays; I use ECC!
Horrible test methods (Score:4, Informative)
First off, he tested all these super specialized memory timings using a stick of RAM that was rated CL2.5 So he was overclocking it and stressing it when he ran a lot of the low latency settings tests. A better setup would've been to get the best darned stick of RAM and THEN test how the timings affect performance.
Next, almost all of the timings he adjusted in the tests affect latency not bandwidth, but he used bandwidth as his ONLY benchmark. If a program is swapping small amounts of data, but VERY quickly and often, latency has more of an effect than bandwidth.
Finally, he doesn't address asynchronous bus speed issues or how well some of his unattainable settings would work (because of my first complaint, his memory was unstable at the aggressive timings).
I'm not a statistician, but it doesn't appear to me that he really understands some of the statistical methods for a good test. This is what I've garnered from reading other slashdot posts, at least.
L1 & L2 CPU cache? (Score:2)
Of course, I've never run these benches so I don't know. I'm just asking.
Generally known? (Score:1)
Inventive use of the term "generally known"...
Why didn't he automate everything? (Score:1)
Then also use a good benchmark program that can run from FreeDOS and you're ready way faster. Not to bash any (more) real OS, but DOS boots way fast, it's easier to 'deploy' than making your own memory-benchmark-OS.
Memory explanation ?!? (Score:2)
Could someone please post a link to a page with good overview of what PC2100/PC2700/... are, SDRAM/DDRAM/... On what kind of motherboards they should go ? What bus ? With which processors, etc...?
And also what all the little tweaks in the BIOS are (latency, ECC, scrubbing...) I tried searching google, but it's always vendors' hype.
Re:Memory explanation ?!? (Score:2)
Re:Sounds like an Ad... (Score:5, Insightful)
Sure it's advertising for whomever the vendor was, but its also a sponsorship of something that the author might have had to pay for himself. Or might have had to do without.
Re:Sounds like an Ad... (Score:2)
Re:Sounds like an Ad... (Score:1)
Re:Sounds like an Ad... (Score:1)
Re:I'm a /.-er! (Score:2, Funny)
Nonsense. A true slashdotter would demand the blueprints and die layouts of the RAM.
Re:I'm a /.-er! (Score:1)
A true slashdotter wouldn't call them blueprints!
Well, on second thought, maybe you're right.
Re:Sounds like an Ad... (Score:2, Funny)
obviously someone from across one of the ponds. here in the states, free crap will trump that there 'integrity' card each and every time.
and hey, now that i've spilled the beans, AMD, i could probably post some benchmarks of your new 64 bit processor if i only had a couple of them to play around with (wink, wink, smp, nudge, nudge).
Re:Sounds like an Ad... (Score:1)
ANd yes I understand that out of spec won't perform as well, but I'm talking about properly specced and tested RAM.
Re:WOW! (Score:1)
Re:WOW! (Score:2)
Re:WOW! (Score:1, Informative)
Re:this is news? (Score:2)
Re:this is news? (Score:1)
Re:this is news? (Score:2)
So both are very critical and only half the story was measured... most of those settings are to improve latency so measuring bandwidth is a tad pointless.