Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!

 



Forgot your password?
typodupeerror
×
IBM Hardware

More on the PowerPC 970 386

functor writes "Ars Technica's Jon Stokes has a treatise up covering the microarchitecture of the high-performance 64-bit PowerPC 970 microprocessor, due to be released by the end of the year, that goes over in detail how this chip is put together, and how we can expect it to perform. This is the follow-up to Stokes' article detailing the PPC 970's design philosophy. 'It appears to hold quite a bit of promise in bolstering Apple's currently almost obsolescent product line, and it appears to have been designed explictly to fulfil Apple's requirements. To say the least, the second half of this year looks to be pretty interesting as Apple's product line promises to become competitive performance-wise with IA-32 and x86-64-based PCs again.''
This discussion has been archived. No new comments can be posted.

More on the PowerPC 970

Comments Filter:
  • by eweiland ( 89563 ) on Wednesday May 14, 2003 @08:31AM (#5953557)
    This is still a PPC chip. No changes to programs are necessary for them to run on it. The only change that will have to be made is if a software vendor decides to run in 64-bit mode which many don't have to do. Performance of the new chip is not dependent on whether the program runs at 32 or 64 bits. This is not a migration like moving from the 680x0 line of processors to the PPC which was an overall change in architecture.
  • by functor ( 31042 ) on Wednesday May 14, 2003 @08:35AM (#5953578) Homepage
    The vast majority of applications ought to run without modifications, since the instruction set is backward compatible (i.e. there are instructions for 64-bit addressing and with 64-bit-wide operands, but as far as I can tell, that's all). CPU-intensive applications would benefit from a recompile using a compiler that is aware of the PPC 970's unique pipelining and queueing, and can order instructions in the instruction stream that will allow for maximum execution unit utilization.

    Some applications, e.g. large databases and applications that deal with very large integers, will benefit from being rebuilt with 64-bit addressing and 64-bit instructions, but for the vast majority of (desktop) applications that run on OS X, all 64-bit binaries will do is to increase the utilization of CPU instruction cache (and often data cache), and hence reduce performance as the cache miss rates go up.

    So, in the end, don't worry; your OS X applications will run fine (for the most part) on a PowerPC 970-equipped Macintosh. ;)
  • by Snart Barfunz ( 526615 ) on Wednesday May 14, 2003 @08:50AM (#5953673)
    If by PC you mean personal computer, a phrase in common currency some time before the arrival of IBM's PC, then the answer is - Yes, many. If you mean a personal computer capable of running a Microsoft operating system natively, then (discounting early PPC NT ports) the anser is - No. But so what?
  • by Trurl's Machine ( 651488 ) on Wednesday May 14, 2003 @08:54AM (#5953693) Journal
    it still seems weird to see IBM (creator of the PC) making chips for Apple

    It's not that weird right now - their cooperation on PowerPC started almost 20 years ago. But it was weird ineed back then. I heard that on their first date, pardon, meeting, engineers of both companies wore the other company's dress code. The IBM guys came in jeans and t-shirts, the Apple guys came in suits and ties. How desperate both sides were to show each other that they have no hard feelings about past!
  • by 11223 ( 201561 ) on Wednesday May 14, 2003 @08:56AM (#5953708)
    You're completely wrong. The maximum speed of the FSB and whether it supports DDR (or QDR) is determined by the processor, not by the chipset. For the G4e, the maximum known speed at which MaxBus can operate is 167MHz - precisely what Apple uses.

    They can't make the FSB DDR or QDR without appropriate support from the processor, and that's exactly what they haven't been getting from Moto.

  • by 11223 ( 201561 ) on Wednesday May 14, 2003 @08:59AM (#5953736)
    The PPC 970 is great news for Apple, but it is still a bone thrown to them while the x86 PC is feasting on the meat of the Intel and AMD processors.

    As Nethack would say, "Ugh! This meat is tainted!"

    The 970 is fundamentally a 64-bit processor, and its performance must be evaluated in that context. The fact that the 970 will pull off amazing speed in the 32-bit arena only shows how well-designed this processor is.

    Keep in mind, the Hammer is only shipping at 1.4, 1.6, and 1.8 GHz - the same speeds the 970 is targeted at. And the 970 has the advantage of an ISA that was designed from the beginning to do 32 and 64 bit addressing, versus one that's a 64-bit extension of a 32-bit extension of a 16-bit micro with full compatibility to an 8-bit redesign of a 4-bit processor.

  • by pchown ( 90777 ) on Wednesday May 14, 2003 @08:59AM (#5953739)
    Interesting, if you look at the pipeline design of the PowerPC [arstechnica.com] it is much closer to Intel than AMD [aceshardware.com]. The PowerPC pipeline has sixteen stages, the Pentium 4 twenty, and the Athlon ten.

    Presumably the P4 can reach higher clock speeds than the Athlon because there is less work to do at each pipeline stage. On the other hand a longer pipeline increases the probability of a stall, so the work done per clock cycle goes down.

    I'd speculate that the PowerPC ought, therefore, to be able to achieve clock rates approaching but not equalling the P4, since they are both comparatively "over-pipelined". At the same time, the PowerPC ought to deliver slightly more throughput per clock cycle because the pipeline is slightly shorter.

    Meanwhile, the Athlon will be running at a significantly lower clock rate, but delivering comparable throughput.
  • by functor ( 31042 ) on Wednesday May 14, 2003 @09:06AM (#5953778) Homepage
    No, but IA-32 motherboard manufacturers go a good number of steps further. ;) I recommend that you investigate Intel's Placer (E7505) chipset [intel.com] and motherboards based on it (several of Supermicro's offerings, as well as offerings from Tyan and other manufacturers, e.g. the Iwill DPL533 [iwillusa.com] and DP533 [iwillusa.com]. These motherboards support 133 MHz QDR system buses (coming to 533 million transfers a second), matched (quite well) with two channels of PC2100 DDR SDRAM (resulting in 4.267 GB/s of memory bandwidth that is actually utilizable by the processors, since the memory bandwidth matches the system bus bandwidth, unlike Apple's offering, which is bottlenecked by the system bus at just 1.333 GB/s, whether you have one processor or two). (And I'm certain that 200 MHz QDR Xeon chipsets are not far off in the future, since Intel in general appears to be headed in that direction.)
  • Re:Dual FPUs! (Score:2, Informative)

    by tortap-0 ( 306464 ) on Wednesday May 14, 2003 @09:18AM (#5953860)
    While dual FPUs are useful this is what SIMD operations like Intels SSE and Motorolas Altivec are used for. The dual FPUs might be good for doing other work but DSP filters and Photoshop filters will use SIMD operations wherever they can.

    That is very bandwidth intensive work, moving alot of floating point numbers from memory, and this is where the 970 will be superior to the G4e. But this is also the strong point of the Intel P4 running at super high frequencies. The AMD Athlon 64 will clock for clock be competitive with the P4 running genreal code, but doing SIMD operations they can all do 4 at a time. Then the higher clocked chip always wins. The Altivec unit of the 970 will have to be alot better than the SSE2 from Intel to beat it.
  • Re:Hehe (Score:3, Informative)

    by ZorinLynx ( 31751 ) on Wednesday May 14, 2003 @09:26AM (#5953927) Homepage
    Awww jeez, that's just a permutation of the old joke:

    "Why did Intel rename the 586 to Pentium?"

    "Because they added 100 to 486 and got 585.999999878787775555"

  • Re:64-bit Adobe apps (Score:3, Informative)

    by darc ( 532156 ) on Wednesday May 14, 2003 @09:42AM (#5954073) Journal
    Actually, the performance gain is because 3D applications tend to use floating point matrix transformations, which are helped along greatly by the vector units on the processors. It's not really 32 vs 64, so as much as the advantage of having hardware better suited to lots and lots of matrix transformations.
  • Re:Kitchen Sink (Score:5, Informative)

    by functor ( 31042 ) on Wednesday May 14, 2003 @10:13AM (#5954333) Homepage
    The Pentium 4 is, in fact, designed to scale to high clock speeds exactly so that it can tolerate lots of pipeline bubbles in flight without ending up stalling for too long.

    A lot of these tricks (high decode bandwidth, multiple instruction queues [really buffers meant for reordering the instruction stream], branch prediction, etc.) are meant to reduce hazards such as pipeline bubbles as far as possible, and the PPC 970 does these hazard-reducing operations rather well, too.

    And, yes, we're now in the post-RISC world where instruction complexity (particularly in the realm of SIMD and streaming/explicit cache manipulation instructions) is growing because simple instructions clearly aren't enough to allow for great throughput increments.

    (Read some of Stokes' older articles in the Ars Technopaedia; I'm sure you'll find them interesting.)
  • by Lebannen ( 626462 ) <slash&irowan,com> on Wednesday May 14, 2003 @10:14AM (#5954343) Homepage
    As well as the depth of the pipeline, I believe the article also says you need to look at the width of the pipeline; it points out the G4+ is wide and shallow, the Pentium 4 is narrow and deep, and the 970 is wide and deep. You will therefore get bubbles in the 970's pipeline, but their effect is minimised and you're far less likely to get stalls.

    Combine this with the more intelligent branch prediction, out-of-order execution etc in the 970, and you're probably looking at a chip which is slightly less efficient clock-for-clock than the G4+, but more efficient than the Pentium 4.

    Integer performance wise, it looks like the 970 will be about equal to a Pentium 4 of 25-50% higher clockspeed; FPU-wise, and of course Altivec-wise, it looks like a monster. So; it probably won't outperform the current Pentium 4s at a lot of tasks, but will kick it about on other more specialised tasks, which is a big step over the G4+. We're not looking at a Pentium-crusher, but we are looking at something that will be vaguely competitive.

    Just gotta see how well it scales, after that, and whether 64-bit will mean anything for average tasks... and when it actually happens, of course.
  • by TheRaven64 ( 641858 ) on Wednesday May 14, 2003 @10:20AM (#5954400) Journal
    I sold my G4 tower some time ago becuase it was not fast enough to compete with my winders boxes.

    I'm intrigued. What do you do that makes a 1.25GHz G4 feel slow? I'm still using a 1.33GHz Athlon and it feels quite fast. I keeps thinking about upgrading the CPU, but really can't see the point. I rarely use more than 20% of it as it is...

  • by murgee ( 615127 ) on Wednesday May 14, 2003 @10:39AM (#5954567) Homepage
    This argument is pointless, and one oft-repeated unfortunately. Different people find different things work for them. Whereas you and other people may not find OS X a particularily productive enviroment, other people (including myself) find it to be, and moreso find Windows and Linux (especially Linux, IMHO) provide them with a mediocre enviroment to work in.

    I'm not posting this to beat up on the parent but it's something that tends to come up often.
  • Re:Obsolete my a$$ (Score:2, Informative)

    by Alan Partridge ( 516639 ) on Wednesday May 14, 2003 @12:12PM (#5955407) Journal
    Webster's New Collegiate Dictionary:

    a: no longer in use
    b: of a kind or style no longer current

    therefore ALL computers are most certainly NOT obsolete as soon as they are bought, just as a perfectly functional computer that happens to be a couple of years old isn't. Obsolete - in computerese - generally means NO LONGER SUPPORTED.
  • Re:Dual FPUs! (Score:3, Informative)

    by Graymalkin ( 13732 ) on Wednesday May 14, 2003 @02:10PM (#5956542)
    Use Apple's vecLib framework, it supports single and double precision operations via the AltiVec unit. Linky linky [apple.com].

They are relatively good but absolutely terrible. -- Alan Kay, commenting on Apollos

Working...