
Fastest Commercial Supercomputer To Be Built 106
Zeus305 writes: "Today NuTec Sciences, Inc. will be announcing its purchase of the world's fastest commercial supercomputer, second overall only to ASCI White. NuTec will use and lease time on the 1,250 clustered IBM servers to analyse genes decoded by the human genome project to try to better understand the causes of diseases like cancer by running month-long algorithms that analyse the relationships between different areas of the genome. This beast will have 2.5 terabytes of RAM and 50 TB of disk space."
1250 CPU's just the beginning (Score:1)
Re:Windows 2000, Linux or AIX? (Score:1)
Re:Protein folding? (Score:1)
Faster Commercial... (Score:1)
just a quick thought ... (Score:1)
What a ratio! (Score:1)
Look at that - 2.5 TB RAM, 50TB disk means that RAM is 5% of disk. Compare that to a reasonable system today, where you have 128MB, and 30GB disk, which is somewhere around 0.4% of disk...
What's the point of virtual memory when you've got RAM like that?
Re:What a ratio! (Score:1)
Ehmm. Very small disk bandwidth? I expect every node to have its own physical disk should there be a need to add swap. In that case you can expect every node to have about 40Mb per second worth of swapping potential. The machine would have a total of about 50Gb per second of disk throughput.
(Aren't you happy that your PC can do 800Mb per second to RAM? this thing does 60 times that to DISK!)
Roger.
Memory to Hardrive (Score:1)
For every one byte of memmory you have 20 btyes of harddrive space. Damn could you imagine the amount of data you can hold in memory to work your magic on. I wonder if that puppy needs any swap space.
Re:Ho-hum (Score:1)
If all you need is a place to temporarily stash some DNA input files (which are certainly duplicated somewhere else) than cheap-as-shit IDE
drives are just fine for what you're.
And I haven't forgotten about controllers, cabinets, powersupplies, etc. Assuming 1,200 machines, you can put 40 gig IDE drives in each machine and have 48 terabytes of disk. Use a 2U or even a 1U case and you don't need anything else, other than the few buck IDE cable.
Ho-hum (Score:1)
2. 50 Terabytes of disk isn't really that big of a deal. 80 Gig Drives are $250 off pricewatch, so we're talking roughly 3k per terabyte, or only $150,000 for the disk, which in the high-performance community really isn't all that much money. (And don't whine to me about "Well, super-duper-ultra-scsi7 isn't that cheap" because many many apps simply don't need fast disk - they may very well be CPU bound or maybe memory bandwidth bound.)
Re:Parallel computing & computer science... (Score:1)
Re:Don't make that mistake.. (Score:1)
Fast, not first. (Score:1)
wow! (Score:1)
_joshua_
Slashdot PR again! (Score:1)
I wish Slashdot was more interested in the real science of the genome and less PR orientated. Slashdot aint what it used to be...
Re:Protein folding? (Score:1)
http://www.pcworld.com/news/article.asp?aid=141
Basically, it's a million processor giant that will hopefully crank out a petaflop. I'm sure they're doing reasearch somewhere, but the only thing I've seen is bad marketing crap. First, "Blue Gene", which is just an obvious, stupid joke. Then the article says it will use a new type of architecure called SMASH...simple, many, and self-healing. How long did them spend coming up with words that fit "SMASH"?
Anyway, it will probably be rad when the build it. But then again, any supercomputer built four years from now will be pretty amazing by today's standards.
-B
Re:Ho-hum (Score:1)
Bottom line is that just becasue you can go to Best Buy and buy a 80GB drive doesn't mean that you can/should use that drive in server.
Metaverse (Score:1)
Cure fo Cancer (Score:1)
Re:This could be dangerous. (Score:1)
It really is true (if you can't upgrade the equipment whilst the calculation is underway). Say you've got a problem that would take 10 years to solve on todays computers, and computers double in power every 18 months. If you wait 36 months and then start you could finish the problem in only 5.5 years total! (the optimum time to wait with these figures is a little more, just under 40 months)
Re:just a quick thought ... (Score:1)
Re:Parallel computing & computer science... (Score:1)
(-;
cool page though! can you come up with a problem that must be solved in an entirely linear fashion? (that can't be memoized or divided into a search space)?
i'm sitting here trying to remember one from algorithms and i can't, and it's starting to drive me nuts. (:
Re:Parallel computing & computer science... (Score:1)
or maybe i missed something?
Re:I'm glad to see... (Score:1)
I mean, to me, a "traditional computer" is the C64, so I am really not talking about a lot of speed...
Re:Ho-hum (Score:1)
-Egon
Re:Good cause? What about the starving people? (Score:1)
2.5 terabytes of RAM? (Score:1)
Now we can replay all those rigidly linear levels spent stacking rubber boxes in rubber trenches - except now we can do it with a frame rate >20!
Re:Good cause? What about the starving people? (Score:1)
Actually, I think the more realistic exchange rate for people/people is 16:1.
Which means, Americans are contributing more to global population problem than africans.
Or more bluntly, 1 american baby born, 16 (or 3714) african babies must die to keep the planet in equilibrium. (as baby-specific political spin practiced by so many American politicians.)
It's ugly outta here.
Re:Unleased time. (Score:1)
Re:Unleased time--charity (Score:1)
On my desk? (Score:1)
Re:An even better idea... (Score:1)
For the curious... (Score:1)
Since the thing has 2.5 TB of RAM, my best guess is that each p640 has 2 Power 3's and 2 GB of RAM (1 GB per CPU). It is possible that each p640 has 4 cpus (and 500 MB RAM per cpu), but that seems like an odd RAM/CPU ratio for a hefty IBM machine.
dopp
Unleased time. (Score:1)
Re:Parallel computing & computer science... (Score:1)
Distributing this across a dozen or so processes wouldn't be that hard.
Re:I'm glad to see... (Score:1)
Curing of diseases is basically not a function of "speed of investigation" but of politics. We have known how to cure a simple ulcer for ~30 years and yet doctors STILL don't utilize this knowledge. Infact Dr's still prescribe antibiotics for viral based colds which has created huge problems with resistance.
Western medicine is great for acute conditions but totally rots for chronic conditions. After years of study we still can't cure much. How could this be? Looking in the wrong place or in the wrong way. Why can this go on? Human nature. We wont see what's going on until we want to (or have it rammed down our throat).
Answers come from asking a wise question with open ears and eyes.
Windows 2000, Linux or AIX? (Score:1)
How can I post some tasty bait w/o this essential information?
Re:Memory to Hardrive (Score:1)
is this a problem for quantum comps? (Score:1)
The msnbc article glances over this a bit, but I'm hazarding a guess that what they're trying to do is statistical analysis based on protein folding relating to the entire active genome. From what I understand, this is a lot like the traveling sales person problem and the whole genre of NP v. P problems that are more suited to quantum computing.
So as our society's tech progress in the quantum computing realm to the point where we can actually build these things, does that mean we get faster answers to questions like those posed of the genome map?
If anyone has any informed insights (not my bs guesswork) I'd like to hear them.
Thanks, -- RLJ
Re:Protein folding? (Score:1)
Re:I'm glad to see... (Score:1)
Re:Unleased time--charity (Score:1)
I'm sure that they would do it if the following conditions were met:
* There were no paying projects waiting.
* There were to be some sweet publicity for them.
* They could claim some huge tax deduction.
How every version of MICROS~1 Windows(TM) comes to exist.
Re:Ho-hum (Score:1)
Perhaps not, but they'll probably be using Fibre Channel anyway, if not solid-state storage.
Does this seem kind of boring to anyone else? (Score:1)
Re:Unleased time. (Score:1)
One more thing.. (Score:1)
Best thing in the world for a hangover. You can buy little packets of pills in gas stations that advertise increased energy.. etc.
Look on the back of the package to ensure that it is indeed B12 you're getting. Pop'em and wait half an hour.
I've heard of a lot of things to do to treat hangovers.. this is the only thing I've ever encountered that cures one. (Along with a glass or two of water.)
-Pip's drunken uncle
--
Re:useful? (Score:1)
Jesse Jackson will be there, claiming that the very *nature* of the computer is discriminatory: "Ones and Zeros? That's just another way of saying Black and White! We demand that the computer *always* have an equitable number of ones and zeros stored in memory! And if there's a register that has too many ones, by God, we'll send some zeroes *to* that register. On the *bus*!"
--
Re:I'm glad to see... (Score:1)
---
useful? (Score:1)
Re:I'm glad to see... (Score:1)
Re:I'm glad to see... (Score:1)
Slaps forehead!
Of course, DB2... I really don't care too much about what OS is running - Linux would be nice to extend The Cause(tm), but one thing I've always been curious about is the partitioning of the data in particular.
What really constitutes the data for this job - the entire DNA for some subject? Or is the data collected from several people with and without the disease(s) to determine which genes are likely candidates for leading to the disorder?
Just curious. If anyone has a link to such answers, it would be muchly appreciated.
Thanks
The applications dump state periodically... (Score:1)
trying to make some super fault tolerant system.
Flat5
Linpack is not the final word... (Score:1)
their actual codes on. The answer is the blue machines.
Flat5
Re:But could they.... (Score:1)
Flat5
But will it run Word smoothly? (Score:1)
Re:This could be dangerous. (Score:1)
---
Posting at -1 means never losing karma.
Other uses for Nutec's new box... (Score:1)
Are there Voodoo3 drivers for this thing yet?
Ridiculopathic Zombies From Outer Space [ridiculopathy.com]
The ultimate news server! (Score:1)
What...h0rr0rs! (Score:1)
There's no mention of NT? What's wrong with these people? NT is the single best OS out there, powerful and simple to use. Obviously for a top-notch supercomputer, NT is the only obvious choice for the system. I read about it in PC Magazine, just last week, and as everyone knows they are the defacto standard for everything related to computers.
I just can't believe they didn't mention NT.
</sarcasm>
Re:Unleased time. (Score:1)
Re:I'm glad to see... (Score:1)
Re:Fast, not first. (Score:1)
It isn't a matter of reading the article, the bloody twit couldn't even understand the title.
new UT server... for christmas... (Score:1)
My current UT server is just not fast enough....
--
Re:Ho-hum (Score:1)
firingsquad! (Score:1)
Re:Don't make that mistake.. (Score:1)
Re:I'm glad to see... (Score:1)
btw- anyone know if this thing will do quake3 better than a p4?
An even better idea... (Score:1)
It seems like every time I pick up a newspaper or turn on the tv, there's some idiot telling me that genetic researchers have found the gene that "causes" cancer or some other dreaded disease.
We already know the cause of cancer: carcinogens. We eat, drink, and breathe them, and have since the rise of consumer culture. The same marketroids who have pushed all of this crap on us are the same marketroids who are going to sell us the high-priced cures.
There is a much better use for a machine like this--use it to model human population negative growth from cancer, etc... Maybe then we can get all of the marketroids and GM researchers/investors into cattle cars and ship them off to Madagascar or somewhere...
I'd rather be a unix freak than a freaky eunuch
Better Uses (Score:1)
Re:Unleased time. (Score:1)
Program Failure (Score:1)
Reminds me of the Hitchhiker's guide.. instead of the Vogons, wouldn't it have been pathetic if the Earth (the most powerful supercomputer in the universe) had an Exception 0E error instead?
Re:Ho-hum (Score:2)
Of course, I'm not sure what type of storage requirements they're going to have in that field or if they're planning on keeping the majority of the archival storage on the main system's 50TB or if that's just a "scratch pad" for huge jobs that will eventually be moved to storage arrays.
:::shrug:::
Re:Ho-hum (Score:2)
If the data is really important, a redundant system is needed and that ofcourse can double the price. I have looked at good 10TB RAID systems and they are not cheap.
Protein folding? (Score:2)
Re:Parallel computing & computer science... (Score:2)
Or, transform the generic Fibbonaci recurrence into a dynamic programming problem an solve it in linear time
I guess my point is that many problems that appear to be "0% verctor-optimized" actually are not.
fastest COMMERCIAL computer (Score:2)
Labs. The ASCI series are sub-commercial proof-of-concept computers. That is, the mainstream makers are always bragging they can configure a teraflop computer, but no customer can afford them. So Uncle Sam kicks in a few bucks to call them at their word. Everyone wins. The government gets something really fast. The computer companies get an R&D test at government expense. The second customer, a commercial site, gets a more affordable computer.
Ah, but when will we have streamlined processors? (Score:2)
Using a similar method as outlined in the book, I suspect that chips that are custom built to understand DNA and how it acts (or at least the inherant algorithms that will be used in studying DNA)are possible to build. Ironically, they probably will not be built until we understand genes and DNA better than we currently do. Once machines like this start to be built, how far off will we be from the machines described in Michael Chritons book Jurassic Park? Some scientists have purposed that it will take 100 years to understand the human genome (or other genomes) perhaps, but I think we are closer than that.
How does this compare to projects like seti@home? (Score:2)
Compare this to Seti@home's 26.11 TeraFLOPs/sec.
Why wouldn't NuTec develope the software so every joe blow and there handheld could run a distributed client that does this. I personally have a hard time justifying time spent installing distributed.net or seti@home clients on all the machines I have access to, as I know my boss wouldn't understand the importance of cracking encryption, or searching for aliens on company time.
However searching for cures to human illnesses, who wouldn't want to do this? With a good piece of software and some proper advertising, theres no doubt they would surpass 30 or even 50 teraflops.
Though this may not be a possibility if huge amounts of data are required for the calculations. Anyone have some ideas about this?
Hardware specs of ASCI White (Score:2)
Here are the specs from a web page I found at LLNL [llnl.gov]:
Re:useful? (Score:2)
What do you want, to give the results of this effort to those who don't need it and can't use it? "Here are your food stamps and your array processing ration card."
Re:How does this compare to projects like seti@hom (Score:2)
doomed to fail (Score:2)
Sig:
Re:I'm glad to see... (Score:2)
Re:What a ratio! (Score:2)
That's the thing: I'm guessing that this machine doesn't use swap at all. There would be so much memory to swap with a (relativly) very small disk bandwidth. For that reason, if you want any performance, you need to put almost everything in memory. The disk is only there when you need permanant storage.
Re:Protein folding? (Score:2)
You're thinking of Blue Gene, which is supposed to hit 1 petaflop.
Unfortunately it isn't supposed to be operational until 2005.
Re:Ho-hum (Score:2)
finally a good cause (Score:2)
But could they.... (Score:2)
________
Re:But could they.... (Score:2)
Flat5
Re:Payment (Score:2)
Re:fastest COMMERCIAL computer (Score:2)
They're actually leased by the DoE, not owned, according to this CNet story [cnet.com].
ASCI = Accelerated Strategic Computing Initiative
Their site is here [llnl.gov]. More on ASCI White, including a picture, is here [llnl.gov].
This could be dangerous. (Score:2)
long time (Score:2)
I'm sure than in only 47 years not 447, that there will be pcs that can put ascii white to shame.
Finally! (Score:2)
Re:But could they.... (Score:3)
The switch could be a bottleneck. It doesn't have to be. It all depends on how much data has to be transfered.
I know that with a render farm, which is a NOW (network of workstations) the switch is not the limiting factor. The machine pulls in data, thinks and spits out data, in small chuncks (a couple K at a time). Any switch should be able to handle this.
Also, if more of an interaction is needed between machines, they can be networked in hypercube configs by adding a few more switches.
The backbone can get in the way, but if that is what the limiting factor is, then maybe smaller groups should be used, each working on different segments. Or, more stuff should be stored locally with some sort of smart push script.
Re:Parallel computing & computer science... (Score:3)
1.) Marketing - Computer manufacturers and systems integrators/consultancies purport to be able to solve bigger/more ambitious problems. Moreover, it makes good business sense to be able to do so. Business got a hit from the crack pipe of information and they got hooked. These problems now fall outside of the realm of national security. Furthermore, government work can be precarious for many companies, and by diversifying their wares and selling to public corporations, vendors spread the risk around.
2.) Homogeneous (relatively) dev/prod environments - Not too many people can claim knowing how to program for a Cray or Thinking Machines box, but a lot of intelligent people can move around in/administrate a UNIX environment, and some of them can code to a messaging interface. For that matter, some know tools like Ab Initio or Orchestrate and can create parallel applications very easily.
3.) Mathematics and methodology have changed - People now recognize the conceptual and practical challenges of parallel computing, and can tailor the algorithms, hardware, and OS to accommodate the challenges of that paradigm.
Seeing random poster's on Slashdot recognize that compartmentalized data and code is necessary for distributed computing to be effective is a tribute to how far this field has come. The engineering has come a long way, as has the marketing, and overall level of conciousness.
As for the new adjective, I would say that wider is o.k., but you have to recognize that a machine does not have to be uniformly wide. I think of parallel programming as a stream metaphore, with speed (CPU), width, (#of CPU's or units of work/data ways parallel) and depth (depth of queue/instructions between checkpoints). How about liters?
Re:I'm glad to see... (Score:3)
Sorry to be cynical, but. . . (Score:3)
I was surprised to read the responses and not see any discussion of these increasingly "super" computers' ability to crack strong encryption.
Believe me, I'm not an Area-51-head, but in the few short years after strong ecryption has been widely available to concerned citizens and terrorists alike there seems to be many more huge supercomputers getting built, each with a greater altruistic purpose attached. "It will allow us to test nuclear weapons without building them! It will cure cancer!"
It's wholly reasonable to assume that there are military initiatives to ensure that we can't be snuck up on by PGP-wielding bad guys. As someone not wanting to be blown up, I hope there are, anyway. It's also wholly reasonable to assume that the military couldn't amass the kind of hardware necessary to do this without lighting up some analyst's bat-computer.
But does anyone feel that the initiative could survive being entirely in the light of day? What would the /. response be to an announcement that says "Military announces super-computer initiative break strong encryption in real time, promises to leave private citizens alone!"
Of course I'm not saying that every computer faster than a Pentium 4 is part of an arms program, there are serious economic incentives to making progress in cancer treatment. I just think that we would expect to see the military arming itself with the weapons du jour, and my guess is that a few are probably sitting in plain view.
I'm glad to see... (Score:4)
That very powerfull distributed systems are starting to become more mainstream. It's about flipping time that companies made use of computing resources beyond their previously wildest dreams.
Estimates of 437 years compressed to 1 month timeframes are awesome! The next big issue will be how fast will they be churning out cures/treatments? If this helps speed this up, there will certainly be a great number of lives saved.
Hopefully though more companies will jumpt to the forefront, and try to outdo each other ( you know they will) and come up with more radical applications and solutions.
I was curious - it had been asked - what OS are these beasts running?
Parallel computing & computer science... (Score:5)
Remember the Cray Y-MP? Used to be the world's fastest computers were designed to be extremely fast CPU's, built as a sphere to shorten contact length and liquid-cooled. Parallel computing was possible then - the problem was that we couldn't break down the problems we wanted to solve into parallel events.
Today's brand of parallel supercomputers exist to solve a different kind of problem - a problem in which the "search space" can be compartmentalized and distributed- like the RSA challenge, fluid dynamics, chess, and -of course- the human genome.
The thing to remember when we read about ever-faster parallel computers is that, for all intents and purposes, when you have to solve a truly sequential problem (what the cray folks would have called a 0% vector-optimized problem) - today's supercomputers usually aren't any faster than the desktop computer you're sitting in front of. Compute the Fibbonaci sequence (without solving it for x) and race your PIII with this computer - and you might win.
Just something I wish they'd point out. We need a better adjective than faster for parallel computers. They're something else. Maybe... wider.
Suggestions?