Cray CTO Says Cray Computers Are Great
Jan Stafford writes "Linux clusters cannot offer the same price-performance as supercomputers, according to Paul Terry, chief technology officer of Burnaby, British Columbia-based Cray Canada. In this interview, Terry explains that assertion and describes Cray's new Linux-based XD1 system, which will be priced competitively with other types of high-end Linux clusters."
Theyyyyyy'rrrrrre Great! (Score:3, Funny)
I wonder how Cray computers are in milk...
Imagine... (Score:5, Funny)
The issues are progress and long-term usefulness (Score:5, Informative)
Given the difference in rate-of-evolution in the two camps, it can't be long before PC clusters, probably running Linux with PVM or BSP (that's bulk-synchronous parallel rather than the 3D-graphics kind), catch up with the dedicated supercomputers.
It's all very well to mock the I/O of PCI, but that's why we're all imminently moving to PCI Express, at a rather more respectable (current) maximum of 8+ GB/s rather than PCI's 133 MB/s... Run a few gigabit ethernets in a hypercube formation and you have some rapid data transfer...
I notice he hasn't quoted the data-transfer rate on these new super-duper chips. The whole article does rather look like a piece of advertising on the cheap. Speaking of which, the cluster solution is (relatively) CHEAP. Did I mention that IT'S CHEAP...
Simon.
Re:The issues are progress and long-term usefulnes (Score:5, Interesting)
For some applications, a cluster of slow PCs is OK. But if you want to do real time-intensive computation, you really can't beat a good internal bus.
Re:The issues are progress and long-term usefulnes (Score:5, Informative)
Re:The issues are progress and long-term usefulnes (Score:2)
Re:The issues are progress and long-term usefulnes (Score:4, Informative)
Re:agreed (Score:3, Interesting)
A NUMA machine is just a cluster where the wire is in the form of a bus rather than copper or fibre cabling. The communications protocol for the bus may be better optimized for "supercomputing". However, you can do the same thing with an MPP-optimized network protocol.
It's all ultimately just wires and protocols.
The total lack of process migration between nodes in a cluster might actually give clusters an edge over some NUMA implementations.
Watching a single process dance around
Re:The issues are progress and long-term usefulnes (Score:5, Informative)
Re:The issues are progress and long-term usefulnes (Score:5, Informative)
It's not just the speed of the data transfer, it's also the latency of the interconnect. A lot of scientific codes will pass around a lot of little messages, and GigE is fast for bulk transfer, but it's not so good for that. That's why there are companies like Quadrics, Myricom, etc... Infiniband should fix this, but you'll want a big infiniband switch.
His point is that building fast machines is hard, and the fastest machines are really hard. Too many folks think all you have to do is throw enough PCs and GigE NICs at the problem. You can build a machine that way, but the codes don't scale well. Some scientific code will in fact quickly show negative scaling (where the more processes you add, the *slower* your code will run). MPI codes do that all the time, which is one of the reasons you'll see people running their code at sizes smaller than the whole machine, and at different sizes on different machines.
Yeah, you can build a Linux based world-class supercomputer as a cluster, but you better be willing to sweat the details is all. Or buy a Cray, I guess. ;-)
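For the curious, the usual way to see this is the simple alpha-beta cost model: the time to send an n-byte message is t(n) = alpha + n/beta, where alpha is per-message latency and beta is bandwidth. A quick sketch in Python (the latency and bandwidth numbers below are illustrative assumptions, not measured GigE or Cray specs):

    # Alpha-beta model: time to move one n-byte message.
    def transfer_time(n_bytes, alpha_s, beta_bytes_per_s):
        return alpha_s + n_bytes / beta_bytes_per_s

    # Achieved bytes/s once per-message latency is included.
    def effective_bandwidth(n_bytes, alpha_s, beta_bytes_per_s):
        return n_bytes / transfer_time(n_bytes, alpha_s, beta_bytes_per_s)

    gige = (50e-6, 125e6)   # assume ~50 us latency, ~125 MB/s peak
    hpc  = (2e-6,  2e9)     # assume ~2 us latency, ~2 GB/s link

    for size in (64, 1024, 1_000_000):
        print(size,
              round(effective_bandwidth(size, *gige) / 1e6, 2),
              round(effective_bandwidth(size, *hpc) / 1e6, 2))

A 64-byte message on the GigE numbers gets barely 1% of peak bandwidth; the low-latency link moves the same message about 25x faster. That's the whole Quadrics/Myricom/Infiniband business case in three lines of arithmetic.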
Re:The issues are progress and long-term usefulnes (Score:5, Insightful)
But even then, there are legitimate needs for supercomputers. A traditional PC-based server solution will address probably 99% of all problems. An inexpensive cluster will get you 99.9%. But there's that remaining 0.1%, and that's the target audience for whom Cray and similar companies exist.
The fact that PCs can be used almost unmodified to create supercomputers and high-speed clusters is remarkable, and says tremendously good things about the flexibility and power of the architecture as a whole. But there are just places it can't go, not yet. For example, you know how you never get 99% efficiency with 100 megabit ethernet? You're lucky to get 70% with gigabit, and 50% is a pretty common figure. PCI Express, at least at the speeds we're talking about here, is so rare now that it's hardly cheaper than custom supercomputer-style solutions - effectively because it is a custom supercomputer-style solution. I don't think we'll ever see common systems, even midrange servers, with more than one x16 PCI Express slot.
I really think this is what Cray means here. Not that Linux-based clusters have no use, but that there is still a significant market for which they are suboptimal. And, in all probability, will always remain suboptimal. However fast PCs get, however popular PCI Express and similar high-speed interconnects become, supercomputers will just get faster to match... and computational problems will get harder to go along with them. I just don't see the need for supercomputers, at some level, ever going away.
(I hope people find my comment useful in some way. I elected to post it rather than mod down the idiot posting flamebait about Macs in reply to you. And here's hoping people don't interpret this as karma whoring, since usually if you say "This will get modded down" it doesn't. But... oh, hell. I don't even know which Slashdot rule of thumb applies to my post at this point.)
Re:The issues are progress and long-term usefulnes (Score:3, Informative)
Re:The issues are progress and long-term usefulnes (Score:5, Insightful)
The coolest thing about this IMHO is that Cray are using Linux for their single image systems.
Yep, the performance of computers is always on the increase, but there will always be demand for more compute; the question is where you want to be on the performance curve, not the absolute performance. People solve increasingly difficult problems in increasing detail, and there looks to be no slowdown. They buy what suits their budget and solve as rigorously as they can on their hardware, and as hardware improves they redefine the types of problem they want to solve.
Yup, clusters are cheap and they're on the Top500, but nobody actually buys a supercomputer to run LINPACK. They use them to solve real problems; the list is just for bragging rights.
Re:The issues are progress and long-term usefulnes (Score:3, Funny)
Or for that matter, a warezed copy of Unicos....
Latency (Score:3, Informative)
The main reason for supercomputers to exist is not the high bandwidth, it's the latency of the switch. The network hardware that is used in clusters as the interconnect medium (switch) can provide very high bandwidth, but the latency is far worse.
Re:Latency (Score:4, Insightful)
(BTW, I get 100 us ping time on my GigE network, but you're right that that's still 100x too slow for HPC.)
Re:The issues are progress and long-term usefulnes (Score:4, Interesting)
http://www.sec.gov/Archives/edgar/data/949158/000
Here they discuss the limitations of clusters and vector-based supercomputing.
Basically, they offer three types of supercomputers aimed at different markets: vector, massively parallel, and multithreaded. Not really sure what multithreaded means in this context (a microkernel capable of threading itself across many processors, i.e. UNICOS/mk?), but they do a decent job of explaining the whole thing:
Re:The issues are progress and long-term usefulnes (Score:3, Informative)
Unless I'm now out of date, the last figures I saw said the CrayLink Interconnect can do 102 GB/sec. That's just a tad more, don't you think? No messing with masses of gig ethernet to cross-connect them. It's just done.
NO WAY! (Score:5, Funny)
Next you'll tell me the CEO of SCO thinks the lawsuit is completely valid and fair!
Re:NO WAY! (Score:3, Funny)
Re:NO WAY! (Score:5, Insightful)
How about... (Score:3, Funny)
Linux vs. linux (Score:5, Funny)
The difference (Score:4, Insightful)
Re:The difference (Score:2)
Re:The difference (Score:3, Informative)
Hm, I haven't played with Infiniband, but I have access to a small Myrinet cluster, and it takes a hell of a lot of effort to write your application in such a way as to overcome the big disparity between CPU power and network throughput and get a decent speed-up.
Paul Terry is right - if they remove the PCI bottleneck it will be much easier to write scalable high-performance applications, and then the costs will decrease.
editor training (Score:3, Interesting)
Oh, by the way, everyone who has a Slashdot account should go to their preferences and set the "light" layout. You won't suffer with the bad color schemes anymore, and the results are more printer-friendly too.
Re:editor training (Score:3, Funny)
Well, if I make a particularly witty comment, of course I'd like to frame it and hang it on the wall behind my desk...
Slashdot Poster Says Comment Is Funny (Score:2, Funny)
A better angle would have been... (Score:4, Funny)
He's right!! (Score:2)
He's completely right, just not in the way he intended. You'd have a hard time making the cluster as expensive as the supercomputer....
Re:He's right!! (Score:2)
Let me see, we'll take a quarter mill and use that to purchase the switches and cabling needed to interconnect everything. Might have to spend a bit to upgrade the power to our facilities, and speaking of facilities, we will probably need a warehouse some place to keep all the systems we are
Re:He's right!! (Score:3, Insightful)
No, he's right in the way he intended.
He just leaves out a lot of information. The business environment determines what is or is not expensive. The computational environment determines what will or will not run fast; together, the two determine how expensive something really is.
If you are crunching a big continuous stream of numbers with multiple small results which are then loo
Dupe! (Score:5, Informative)
Re:Dupe! (Score:5, Funny)
Maybe "APPLE" will buy another Cray! (Score:2, Interesting)
However, it spawned a popular story about how "Cray designs on Apple and Apple designs on Cray" (see link.) [tafkac.org]
And now for the REST of the story:
Did you know that Macintoshes are designed on PCs!? That's right--PCs running WINDOWS. You see, nobody makes software to burn EPROMs or design printed circuit boards that runs on MacOS, so the hardware group has a bunch of Windows PCs!
So now you know the *rest* of the story.
Re:Maybe "APPLE" will buy another Cray! (Score:2)
Re:Maybe "APPLE" will buy another Cray! (Score:5, Interesting)
Apple was trying to design a new CPU. It would have had vector processing capabilities not all that different from the Cray's, so they bought the Cray both to do circuit simulations on the chip and as a model for their own design.
The chip was going to be a 100 MHz chip (an astonishing speed for the time) with a four-pipeline vector processing unit.
They considered hiring us (but eventually declined) to develop some kind of 3D desktop for the Mac. The idea was that this would distinguish the Mac further from other computing systems, and competitors wouldn't be able to emulate the interface because they didn't have the horsepower.
Anyway, that's the Apple-Cray story as I understand it. I'm sure that there is a lot more to the story than I know, of course.
Thad Beier
Re:-1 Informative (Score:2)
Who cares? This is /.! Nobody reads the article, and I got modded up instantly! All it takes is a few lines of text with a few links [jerkcity.com] in it. Why bother doing any more?
heh (Score:2)
(Not a verbatim quote.)
Or for an alternative press release (Score:2)
You could look to SGI. Their Altix range goes up to 1024 Itanium 2 processors in a single supercomputer, and they are putting twenty 512-processor nodes together in a cluster of Linux supercomputers for NASA [sgi.com], while also working on doubling the maximum single-machine CPU count to 2048.
Clusters don't scale, huh? (Score:3, Informative)
Has this guy ever heard of Google? I can see his point to an extent; in fact his whole Q&A session/blatant advert really boiled down to a single point: if you need to move a lot of data between processors, then a cluster will fare worse than one of Cray's supercomputers, which have (obviously) more bandwidth between the CPUs and shared memory. It really does depend on the application, but for him to suggest an HPC is always a more economical, or even better, option than a cluster of cheap x86 boxes is demonstrably false...
Re:Clusters don't scale, huh? (Score:5, Insightful)
It would be if he'd said it, so it's a good thing he didn't. He even commented that there are applications (embarrassingly parallel algorithms) that clusters do very well at. And Google is a perfect example of that.
Geez (Score:5, Informative)
Cray is a great company, but I really hate that they have to come out with things like this every now and then. Most people in need of a lot of computing power already know the difference between your products and linux clusters and really, they're going to choose whichever's most appropriate for their problem regardless of what your CTO says.
Re:Geez (Score:5, Informative)
Indeed. He actually made that point himself: "There are some applications where a well-designed Linux cluster can deliver good price/performance on a particular application; those embarrassingly parallel applications where processors spend little time exchanging data."
Correction (Score:3, Funny)
Actually, I think he said that "Cray computers rock, eh?" or perhaps it was "Cray computers kick ass, eh?" or something like that.
- Leo
Not quite so simple really is it? (Score:5, Informative)
For a 12-CPU Opteron unit the academic pricing (admittedly lower than commercial, but where most of their sales will go) is about $45K. That's not too shabby. Before you bounce up and down and say "I can build four times the cluster for that price," it should be noted that the XD1 gives you a single system image, which simplifies programming and makes shared-memory applications practical (increasingly important for areas such as bioinformatics).
We have a cluster with Dolphin's Wulfkit, and using distributed shared memory slows us down. It's not an end-of-the-world kind of slowdown, but it's a factor. Our cluster is sixteen nodes of dual 2.2GHz Xeons with Wulfkit 3D torus interconnects. It cost us, at academic prices, $50K. Admittedly more CPU power than the 12 Opterons, but we find ourselves using distributed shared memory a lot - Wulfkit is great here - and that would probably be much better on the XD1. Had the XD1 been available a year ago we might have bought one instead.
It really depends on your application. Are Crays cheaper than clusters in terms of harnessable compute power per dollar? Maybe. Depends on your application. Surely that's the correct answer.
Also, buying Cray is about getting access to their software technology too.
R-S
The argument (Score:5, Informative)
He basically said faster communications needed (Score:3, Insightful)
So, the answer is to recognize that in a cluster most of the machines don't need video cards. That means somebody can design a fiber-optic communications card that plugs into the AGP slot (or maybe a PCI Express slot). Then, Cray, look out!
Re:He basically said faster communications needed (Score:3, Interesting)
2 word summary of article (Score:2)
I for one ... (Score:3, Funny)
I, for one, welcome our new story-duplicating, supercomputer-mocking, Slashdot editor overlords ...
Why are Linux clusters' interconnects slow? (Score:2)
Why can't Linux clusters use the same high performance interconnects? Is it because of cable overhead (length, signal travel, insulation, etc...) or is it bec
Re:Why are Linux clusters' interconnects slow? (Score:2)
They can. It's just a matter of how much you want to spend, and the result wouldn't necessarily be a "cluster" any more. It's distance, bus overhead, network overhead, chipset architecture, everything you listed and more.
ALERT!!! BREAKING NEWS!!! (Score:2)
Seriously, this is news?
A little inaccurate... (Score:2)
Re:A little inaccurate... (Score:3, Insightful)
Something like SETI@home could scale almost infinitely. The data elements are completely unrelated.
But if every node needed access to the same chunk of data, then the more nodes you add, the more they "fight" over that chunk of data.
Ultimately, with a PC cluster solution, only one node at a time can be accessing any given section of "shared" memory.
That's what he means, and he's right.
Look at the slashbots who can't understand
In other news... (Score:5, Insightful)
It ain't religion. (Score:5, Insightful)
Yes, clusters are good for some stuff, but we should be rooting for Cray if they're creating interesting products that fill a need, and that's exactly what they do.
It is a fact that supercomputers have an architecture that clusters cannot compete with for some classes of problem. Get over it, live with it and enjoy the fact that supercomputers are running Linux too.
It's pretty darned cool that Cray survived until now and that they still have a market for large single image systems.
Comment removed (Score:5, Funny)
Let's do some bandwidth math... (Score:4, Interesting)
"A 96 GB per second, nonblocking, crossbar switching fabric in each chassis provides four 2 GB per second links to each two-way SMP and twenty-four 2 GB per second interchassis links."
-So for a dual-Opteron XD1 processor unit, there is 8 GB/s of total bandwidth available.
Total aggregate PCI bandwidths (accepted standards):
PCI32 33MHz = 133 MB/s
PCI32 66MHz = 266 MB/s
PCI64 33MHz = 266 MB/s
PCI64 66MHz = 533 MB/s
PCI-X 133MHz = 1066 MB/s
PCI Express = 200 MB/s (per x1 slot)
PCI Express x16 = 3000 MB/s (usable bandwidth)
-So for PCI Express x16 we're talking 3 GB/second.
An SMP Opteron with two PCI Express x16 slots can do 6 GB/second of aggregate bandwidth. A couple of Infiniband links can easily saturate that. I'm sure this all costs quite a bit less than Cray's proprietary stuff.
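If anyone wants to check the table, the arithmetic is straightforward; here's a small sketch (PCI-X clocks rounded, and note the table's PCI Express figures appear derated for protocol overhead, since the raw per-lane numbers work out higher):

    # Parallel PCI: aggregate bandwidth = bus width (bytes) x clock.
    def pci_mb_per_s(width_bits, clock_mhz):
        return width_bits / 8 * clock_mhz              # MB/s

    print(round(pci_mb_per_s(32, 33.33)))              # 133  -> PCI32 33MHz
    print(round(pci_mb_per_s(64, 133.33)))             # 1067 -> PCI-X 133MHz

    # PCI Express is serial: 2.5 GT/s per lane with 8b/10b encoding
    # leaves 250 MB/s usable per lane, per direction.
    def pcie_mb_per_s(lanes, gt_per_s=2.5, encoding=0.8):
        return lanes * gt_per_s * 1000 / 8 * encoding  # MB/s per direction

    print(round(pcie_mb_per_s(1)))                     # 250
    print(round(pcie_mb_per_s(16)))                    # 4000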
Taken a little out of context (Score:4, Insightful)
As always, it depends on the application (Score:5, Insightful)
I am currently running a model using legacy FORTRAN 90 code which was written before there were clusters. It does use OpenMP, but OpenMP sucks and is no substitute for code written with MPI in mind. The model as it currently stands requires big iron to do big runs, and it is inefficient, but it works, and sometimes I just need to do science and not model development. I am working on MPI-izing the code; no small feat, but the rewards would be quite worth the effort.
In summary, both clusters and big iron have their place. Folks have a habit of making a false dichotomy with regards to these two options. I wouldn't trade my cluster for the world (currently doing parallel POV-Ray rendering of my 3D thunderstorm data, see my web link and an upcoming [not sure what month] Linux Journal article if interested) as it is perfect for much of what I am doing right now and I don't have to share it with anyone. But I will also use big iron when necessary.
Doom III (Score:3, Funny)
Target audience... (Score:4, Insightful)
Most slashdotters are technical enough to realise this...but...we are not the target audience of the original article. Such articles are meant for high level executives and relatively non-specialist managers who don't always hear all sides of the story. Every day these people are seeing articles and news blurbs stating how the latest linux cluster is as good or better than a supercomputer, and gee isn't that swell! While such press is good, and important, not everyone hearing that implicitly understands that such reports only apply to SOME applications.
So the original article is really a message from one executive to other executives, trying to clarify the situation. Basically it's saying, "Hey, just because Wired ran a story that says Linux clusters are the best thing since sliced bread doesn't mean that this is the best solution for you. Now, let us talk about what you need."
I see nothing wrong with this. I read the article, and found nothing in it that was false.
It is good because sometimes an exec will listen to a fellow exec when they won't listen to the advice of their own techs, because of something said exec read in Scientific American.
Welcome to corporate America, boys and girls.
(Disclaimer: Wired and Scientific American were random examples. I know of no articles in either publication about Linux clusters. Both are fine publications.)
Re:*Shock* (Score:5, Insightful)
Re:*Shock* (Score:5, Insightful)
There are tasks that a cluster of Linux shitboxen will do well, and tasks where the cluster will not hold up so well against a real supercomputer. Google is an example of a perfect application for networked Linux servers. If you're simulating cloud physics one molecule at a time, though, you are a lot better off using the right tool for the job instead of 1,024 wrong ones.
Re:*Shock* (Score:5, Insightful)
In this case the right tool is a vector-based supercomputer like the SV1 (8 vector processors at 2 Gflops each . . . MMmmmmmmm). A cluster-based approach would waste more processing time on the message passing than anything else. Cheaper maybe, but grossly inefficient.
-nB
Re:However (Score:3, Informative)
Um, yes. The grandparent and ggp were (I think) implying, though, that for that particular application you actually won't be able to be both better and cheaper with a clustering solution.
i.e. if you throw enough Linux boxes into the cluster to be able to achieve the "better (faster)" solution, you will no longer be cheaper.
But I don't think anyone was arguing that even if a cluster is cheaper and faster you should still go with a super
Re:*Shock* (Score:2, Interesting)
If you want a Cray supercomputer, you have to buy it from Cray. If you want a Linux cluster, you can buy it (or build it) from anyone.
I'm sure there are applications for a supercomputer, but I see universities, production studios (Pixar!), and research labs moving toward clusters. The supercomputer companies will do anything it takes either to stop that from happening or to gain in that market.
Re:*Shock* (Score:2)
There are very few situations in which there isn't a single parallelizable task, and if there is one, a cluster is probably your best bet.
Re:*Shock* (Score:5, Insightful)
Re:*Shock* (Score:2)
That's the thing about clustering - you only need *one* parallelizable cpu-intensive task, and a cluster becomes worth it.
Re:*Shock* (Score:4, Insightful)
Of course, it really does depend on the problem you're facing. Most people who pay for results, though, want results as fast as possible, and that's why supercomputers win for problems that aren't "embarrassingly parallel".
Exploiting parallelism vs. efficient computation (Score:3, Interesting)
Re:*Shock* (Score:3, Insightful)
But if the supercomputer is more efficient per raw unit of power, then the price per unit doesn't matter.
I work in HPC for a living, both with clusters and with large SMP machines. The cluster is nice, but there are some things that can _only_ be run on a large SMP machine, or are much, much faster on one.
Re:*Shock* (Score:3, Insightful)
a) (google-like) jobs well suited to a high degree of parallel processing.
b) complicated problems that can't easily be broken down to make use of a large number of CPUs, but require a lot of operations to be completed in the proper sequence.
On the first, a cluster is a great idea.
On the second, a reaaaaaallly fast CPU is a great idea.
Re:*Shock* (Score:5, Informative)
However, if your supercomputer goes down... well, you're screwed
Cray supercomputers have built-in redundancies. All the subsystems are separate from the processors and memory, which are actually "clustered" (depending on the model). Even the OS has built-in means to survive the harshest hardware catastrophe by checkpointing the running jobs regularly to off-site disks.
1000 machines are more reliable than 1 big machine
Wrong again. With 1000 lousy cheap machines, you need an on-site team of technicians to keep them all up. Supercomputers (with built-in redundancy etc.) have equal or lower maintenance requirements.
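The checkpoint/restart idea itself is nothing exotic; stripped to its bones it's just a loop like this sketch (the file name, interval, and "work" are all made up for illustration):

    import os, pickle

    CHECKPOINT = "job.ckpt"   # hypothetical checkpoint file
    INTERVAL = 1000           # steps between checkpoints

    # Resume from the last checkpoint if one exists.
    if os.path.exists(CHECKPOINT):
        with open(CHECKPOINT, "rb") as f:
            step, state = pickle.load(f)
    else:
        step, state = 0, 0.0

    while step < 100_000:
        state += 0.5 * step          # stand-in for one unit of real work
        step += 1
        if step % INTERVAL == 0:
            # Write to a temp file and rename, so a crash mid-write
            # can't corrupt the last good checkpoint.
            with open(CHECKPOINT + ".tmp", "wb") as f:
                pickle.dump((step, state), f)
            os.replace(CHECKPOINT + ".tmp", CHECKPOINT)

The point of the OS doing this for you is that jobs don't have to be written this way to survive a failure.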
I dont have enough money.. (Score:4, Funny)
Re:*Shock* (Score:5, Insightful)
P.S. You are so l33t for using TT.
Re:*Shock* (Score:5, Informative)
He's saying that Linux-based *supercomputers* are faster than Linux-based *clusters*.
(although, you can probably cluster those supercomputers...)
My understanding of what Cray is actually saying (Score:3, Insightful)
Cray's competitors in the cluster markets include IBM, and their main competitor in the vector-based market is NEC.
Re:*Shock* (Score:2, Informative)
Re:*Shock* (Score:5, Informative)
Re:*Shock* (Score:2)
Unless you have a single monolithic entangled run, you don't need a supercomputer - hence, the surging popularity of clusters. Yes, not everything is suited for clusters... but most things are, because most have parallelizable components at least *somewhere* in the process.
Re:*Shock* (Score:2)
Depending on the particular Cray, the tech may or may not be significantly different from a Beowulf cluster. Let's take NUMA as an example. NUMA started at Cray, was acquired by SGI and then sold to Sun.
In those examples, the "supercomputer" is nothing more than what amounts to a fancy cluster. The interconnects are faster. However, you are still just tying together a bunch of big bricks that l
Re:Unfuglify (Score:2, Informative)
Not to nitpick, but a viola is a string instrument in the violin family; the word you want is voilà.
Re:Unfuglify (Score:2)
Found this a while back, and now have it in my Firefox Toolbar - works great.
No ... (Score:5, Informative)
It means it is so trivial to parallelize the problem and get gains from it (think SETI@Home) that it's a no-brainer.
Other computational problems don't just simply fan out to bazillions of nodes with tiny independent pieces of data.
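In code terms, the SETI@Home-style case is just a fan-out, something like this minimal sketch (the function and chunk sizes are made up for illustration):

    from multiprocessing import Pool

    def analyze(chunk_id):
        # Stand-in for crunching one independent slice of the data;
        # no chunk ever needs to see another chunk's results.
        return sum(i * i for i in range(chunk_id * 1000, (chunk_id + 1) * 1000))

    if __name__ == "__main__":
        with Pool(4) as pool:             # pretend each worker is a node
            results = pool.map(analyze, range(100))
        print(len(results), "chunks done with zero inter-node chatter")

The scheduler hands out chunks and collects answers; the workers never talk to each other. That's exactly the property most scientific codes don't have.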
Your assertion that the Cray CTO is talking FUD when he uses the actual term is just plain wrong and unfair to him. He actually knows what he's talking about.
Re:No ... (Score:2)
Re:Yes he's talking FUD (Score:4, Informative)
There's an entire class of parallel applications which are labeled "embarrassingly parallel". This description simply means that such programs are trivially parallelized and achieve as close to linear speedup as possible when scaled across many nodes, because of their low inter-node communications.
For "embarrassingly parallel" applications, a cluster is a really good tool. For programs that don't parallelize as nicely, a big vector or SMP machine will do better. Some code will run better on a small 20-CPU SMP machine than on a 1000-node cluster.
Re:Yes he's talking FUD (Score:2)
Re:Whee! (Score:3, Informative)
Hi, clueless Slashbot. This is a quick rundown of why your post was stupid, and why Cray supercomputers do, in fact, do some things better than a PC cluster regardless of price.
If you have a supercomputer, you have a very, very, very fast internal bus handling all necessary data transfer. Even with the advent of PCI Express, a cluster of PCs must run in a network model. Therefore, any data crunching that occurs must pass through a network layer, the bus, the physical medium, and back through those limiters
Re:Cool, competition (Score:2)
Re:Cool, competition (Score:2)
No kidding.
Even for geeks, this isn't really news.
Re:Cool, competition (Score:2)
Re:Two words: (Score:4, Insightful)
If your goal is to run simulations where each piece of the simulation depends on a large subset of the other pieces, then you will need ridiculous interconnect speeds, and you're likely to end up with something you could have bought from Cray or SGI or one of the other remaining supercomputer manufacturers for a fraction of the price.
Luckily for you and the rest of us many problems can be split into relatively independent pieces, in which case a Beowulf cluster or similar is more than adequate.
If you seriously believe that clusters can compete with supercomputers for every type of problem, you need to think again.
Re:Two words: (Score:3)
1. Cray is definitely pro-Linux. It's what their XD1 runs, though not their bigger computers.
2. There are some problems for which a cluster cannot even come close to achieving the performance of a supercomputer. For a lot of problems yes, for some maybe if you spend a fortune on fancy interconnects, and for some no.
3. If you're commercially building clusters, let me know which company it is. I'm in the market for a 128-CPU cluster and I want to know who not to buy from.
Re:Two words: (Score:5, Interesting)
Not quite true. First off, you get much higher bandwidth between processors using proprietary (NUMA-based) interconnects than you can with commodity hardware. Why? Because you can optimize for your situation. Second, you can exploit things like cache coherency between processors (even if they're in different "nodes") and therefore true shared memory. So a 1024-processor SGI Altix, or a 256-processor Cray, is one computer as far as the OS and user-land stuff is concerned.
There's another advantage Cray has on the SV and X series, and that's a vector unit on the processor. That allows you to conduct operations on whole arrays of numbers at once instead of having to cycle through the numbers in a loop. For example, the dot product of two small arrays might be accomplished with one or two instructions, as opposed to a loop. Apple's AltiVec is also a vector unit.
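As a toy illustration of the difference (NumPy's vectorized call standing in for a hardware vector instruction here; it's an analogy, not the real thing):

    import numpy as np

    a = np.random.rand(4096)
    b = np.random.rand(4096)

    # Scalar style: one multiply-add per loop iteration.
    total = 0.0
    for i in range(len(a)):
        total += a[i] * b[i]

    # Vector style: the whole dot product in a single call,
    # the way a vector unit chews through whole arrays at once.
    vec_total = np.dot(a, b)

    assert abs(total - vec_total) < 1e-6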
If you took money out of the picture it would be easier to deal with a big-honkin' super computer like an SGI or Cray rather than a cluster. One computer is easier to manage and you could always use threads and plain old heap memory (which is much faster than message passing over a network).
Add money back in, and $500,000 goes a lot farther in raw compute power when you're buying racks of Dells and Infiniband interconnects. However, depending on the application, you may be faster, slower, or even dog-slow compared to the Cray. If you need the answer today and money is not a factor, go to Cray or SGI with a blank check. If you have to balance cost and time, then a cluster might be better.
Essentially, it boils down to how much communication you do between nodes. Cray does it orders of magnitude faster than off-the-shelf stuff. If you hardly ever pass messages between nodes, clusters are fast. If you have to pass a lot of messages between nodes, one big computer will trounce lots of little ones.
So why did you say it was FUD? (Score:3, Insightful)