Become a fan of Slashdot on Facebook

IBM Building 120PB Cluster Out of 200,000 Hard Disks 290

Posted by Soulskill on Friday August 26, 2011 @10:16AM from the go-big-or-go-home dept.

MrSeb writes "Smashing all known records by some margin, IBM Research Almaden, California, has developed hardware and software technologies that will allow it to strap together 200,000 hard drives to create a single storage cluster of 120 petabytes — 120 million gigabytes. The data repository, which currently has no name, is being developed for an unnamed customer, but with a capacity of 120PB, it's most likely use will be a storage device for a governmental (or Facebook) supercomputer. With IBM's GPFS (General Parallel File System), over 30,000 files can be created per second — and with massive parallelism, and no doubt thanks to the 200,000 individual drives in the array, single files can be read or written at several terabytes per second."

This discussion has been archived. No new comments can be posted.

IBM Building 120PB Cluster Out of 200,000 Hard Disks

Load All Comments

Search 290 Comments Log In/Create an Account

Comments Filter:

What's it for? (Score:2)

by yomammamia ( 1916736 ) writes:

A billionaire's porn collection?
- Re: (Score:2)
  
  by sgt scrub ( 869860 ) writes:
  
  Billionaire porn collections are stored in multiple locations. billionaire pr0n collection [google.com]
- Sounds like a data orgy.. (Score:2)
  
  by katz ( 36161 ) writes:
  
  ...for hoarding whorecookies.
- Re: (Score:2)
  
  by Scareduck ( 177470 ) writes:
  
  What's it for? No surprise, domestic spying.
  - Re:What's it for? (Score:5, Funny)
    
    by Given M. Sur ( 870067 ) writes: on Friday August 26, 2011 @10:43AM (#37218864)
    
    What's it for? No surprise, domestic spying.
    I think you mean "protecting your freedoms, fellow patriot."
    
    Parent Share
    twitter facebook
  - Re: (Score:2)
    
    by CharlyFoxtrot ( 1607527 ) writes:
    
    What's it for? No surprise, domestic spying.
    Well the butler always did it, this way they'll have proof.
  - Re: (Score:2)
    
    by Just Brew It! ( 636086 ) writes:
    
    All your bits are belong to us!
- Re: (Score:2)
  
  by Hatta ( 162192 ) writes:
  
  Why would a billionaire need porn?
  - Re: (Score:2)
    
    by Z00L00K ( 682162 ) writes:
    
    Being a billionaire would attract a lot of women regardless of how you look.
    - Re: (Score:2)
      
      by Rob Riggs ( 6418 ) writes:
      
      Being a billionaire would attract a lot of women regardless of how you look. [Emphasis mine]
      I don't think that gender matters much here.
  - - Re: (Score:2)
      
      by Hatta ( 162192 ) writes:
      
      I'm just suggesting that billionaires would have a better option than porn. Why collect porn when you could collect porn stars?
- Re:What's it for? (Score:4, Funny)
  
  by erroneus ( 253617 ) writes: on Friday August 26, 2011 @10:55AM (#37218994) Homepage
  
  Yes, he's an admitted petaphyle.
  
  Parent Share
  twitter facebook
- Re: (Score:2)
  
  by ObsessiveMathsFreak ( 773371 ) writes:
  
  China's internet surveillance records?
- Re: (Score:2, Informative)
  
  by GodfatherofSoul ( 174979 ) writes:
  
  A billionaire's porn collection is called a "harem".
- - Re: (Score:2)
    
    by yomammamia ( 1916736 ) writes:
    
    Could be a company that intends to rent out space to such agencies and for such uses or for cloud computing (amazon).
  - Re: (Score:2)
    
    by AJH16 ( 940784 ) writes:
    
    Could be a 4 and a half day buffer of raw data from the LHC (ok, unlikely). The data rate that thing generates blows my mind.
Not done yet (Score:2)

by TheVidiot ( 549995 ) writes:

Do they back up to tape or external USB drive?
- Re:Not done yet (Score:5, Funny)
  
  by S.O.B. ( 136083 ) writes: on Friday August 26, 2011 @10:28AM (#37218686)
  
  Punch cards.
  
  Parent Share
  twitter facebook
- Re: (Score:3)
  
  by Hatta ( 162192 ) writes:
  
  Imagine a Beowulf cluster of these!
I wonder.. (Score:3)

by eexaa ( 1252378 ) writes: on Friday August 26, 2011 @10:21AM (#37218586) Homepage

...about the sound and torque generated when all these disks start to spin-up.

Share
twitter facebook
- Re: (Score:3)
  
  by jhoegl ( 638955 ) writes:
  
  It may very well alter time as we know it!
  - Re: (Score:3)
    
    by rrohbeck ( 944847 ) writes:
    
    Yup. Don't mount them all in the same orientation as the Earth's axis or you can probably measure the change in the day's length.
- Re: (Score:2)
  
  by crow ( 16139 ) writes:
  
  If the torque were an issue (which it's not), you could mount the drives in alternating directions to balance them out.
  - Re: (Score:3)
    
    by eexaa ( 1252378 ) writes:
    
    My geek nature disapproves such torque-negating behavior. Instead, it totally wants to see the petabytes spin at some insane RPM, cancelling the gravity and possibly crushing some enemies.
  - Re: (Score:2)
    
    by rubycodez ( 864176 ) writes:
    
    mounting in alternating directions? I saw some twin girl porns like that.....
  - Re: (Score:2)
    
    by turbidostato ( 878842 ) writes:
    
    Alternating directions you say? How exactly do you expect that to cancel torque?
    Upside-down.
    - Re: (Score:3)
      
      by crow ( 16139 ) writes:
      
      Yes, alternating directions. That assumes the drives are mounted vertically. If they're mounted horizontally, then yes, upside-down.
      If they're using SSDs, then they need special leveling algorithms to keep the accesses spread out so that they don't get out of balance. If you access the left side of all your SSDs in the rack, the rack might fall over. :)
- imagine the brown out! (Score:2)
  
  by Lead Butthead ( 321013 ) writes:
  
  Can you just imagine the brown up when they power up the drive farm?
  In practice they would be doing sequential spin up. I do however, wonder how long that would take to sequentially spin up 200k drives.
- - Re: (Score:2)
    
    by ELCouz ( 1338259 ) writes:
    
    Obviously, they are forming a deathstar ;)
  - Re: (Score:2)
    
    by ae1294 ( 1547521 ) writes:
    
    And the heat, assuming they're using all their old Hitachi Deskstar drives.
    That sounds like a plot to a disaster movie... "Sir, the cluster won't shut down! We're looking at a full melt down!"
Finally... (Score:2)

by TheAngryArmadillo ( 158896 ) * writes:

Somewhere I can store _all_ my porn in one spot.
- Re: (Score:2)
  
  by hot soldering iron ( 800102 ) writes:
  
  I think you mean "store _all_ THE porn".
Paranoid much? (Score:2)

by skids ( 119237 ) writes:

it's most likely use will be a storage device for a governmental (or Facebook) supercomputer.
Actually, given the explosion of data storage needs in the bio-informatics area, it's most likely use would be in storing DNA sequences for research purposes.
- Re: (Score:2)
  
  by ByOhTek ( 1181381 ) writes:
  
  The human genome can effectively be stored in about 750MB (each base being only 2 bits). The largest genomes are only abut 10x that size. IIRC the FASTA files for it take only about 3GB uncompressed.
  Even with specific protein sequences, etc. I think that's a bit excessive the bio-informatics field.
  Also, I'm not sure if even the NIH could afford that kind of storage cluster.
  - Re: (Score:3, Informative)
    
    by Anonymous Coward writes:
    
    modern gernome compression techniques only store the edits needed to convert the reference genome to your genome. And the diff file is just around 24 MB per person. I am an ex-bioinformatician.
    - Re: (Score:2)
      
      by ByOhTek ( 1181381 ) writes:
      
      So am I. I was just talking about the base genome, not the diffs.
    - Re: (Score:2)
      
      by Beorytis ( 1014777 ) writes:
      
      ...the diff file is just around 24 MB per person.
      OK, so 120 petabytes will store the genomes for about 5 billion people, not accounting for the further compression that could probably happen. Maybe this is for everyone's genome.
  - Re: (Score:2)
    
    by tomknight ( 190939 ) writes:
    
    Data requirements are doubling faster than disk storage capabilities. We're needing to find ways of dealing with this, but ideally without simply asking for more money for more disks. I've just been told a new academic here will need about 200TB in a few months. I can see my (fairly small set) of Bioinformatics researchers needing a PB before the end of next year.
  - Re: (Score:3)
    
    by biodata ( 1981610 ) writes:
    
    Our modest lab turns out roughly 100GB a week of finished sequence, from a single sequencer, which is only a very small fraction of the temporary disk storage needed along the way to get to finished sequence. Genome centres with many machines will turn out an order of magnitude (or two) more, and believe me, these machines are kept busy week after week. Once we have finished sequences, the assembly process adds a multiple to this. Yes, a genome is only XMB, but when you have to effectively sequence it 40
  - Re: (Score:2)
    
    by skids ( 119237 ) writes:
    
    My understanding is that that amount of data is post-processed information, and that there are reasons not to be throwing out some of the the intermediate data (it could be re-analyzed by better algorithms in the future), but it gets thrown out anyway just because there is no space to store it.
Fill 'er up (Score:5, Funny)

by mmarlett ( 520340 ) writes: on Friday August 26, 2011 @10:22AM (#37218596)

All I know is that if you put it on my computer, I'll have it filled in two years and have no idea what's actually on it.

Share
twitter facebook
- Re: (Score:2)
  
  by rrossman2 ( 844318 ) writes:
  
  Sadly... that would apply to me as well :)
Finally! (Score:3)

by AngryDeuce ( 2205124 ) writes: on Friday August 26, 2011 @10:23AM (#37218604)

Woot! Torrent all the things!

Share
twitter facebook
Would be a good fit for CERN LHC (Score:3)

by Tynin ( 634655 ) writes: on Friday August 26, 2011 @10:30AM (#37218714)

My understanding is that the LHC generates so much data, that most of it is discarded immediately without going to disk. Seems like this would be a good solution to there data problems.

Share
twitter facebook
- Re: (Score:2)
  
  by rubycodez ( 864176 ) writes:
  
  they discard the common uninteresting decays, no point in storing it
- - Re: (Score:2)
    
    by Tynin ( 634655 ) writes:
    
    A good solution to THEIR data problems.
    Irregardless, grammer and spelin ain't no science, its a art form... and for all intensive porpoises, I lost power last night do to the slight'est bit of wind from whether system Irene (I live in south florida) from about ~6:30PM till ~3:00AM... at lease my generator worked, and FPL was on my road buy 8PM. Not sure why I came into work... I'm so tierd.
Not the government. (Score:5, Interesting)

by girlintraining ( 1395911 ) writes: on Friday August 26, 2011 @10:31AM (#37218724)

It's not the government guys, at least not the cloak and dagger kind. They're too paranoid to let you know how much data they can store. They also don't want you to know that even with all that data, they're still only able to utilize a fraction of it. People are still going through WWII wire intercepts *today*. No, the problem in the intelligence community is making the data useful and organized as efficiently as possible, not collecting it.
That leaves only one real option: Scientific research. Look at how much data the Hadron Supercollider produces in a day. ..

Share
twitter facebook
- Re: (Score:2)
  
  by Yvan256 ( 722131 ) writes:
  
  http://www.youtube.com/watch?v=-tNMGev1t9M [youtube.com]
- Re:Not the government. (Score:5, Insightful)
  
  by DrgnDancer ( 137700 ) writes: on Friday August 26, 2011 @10:50AM (#37218946) Homepage
  
  This is generally something I have a hard time convincing people of. I've worked for spooky organizations. Not at the highest levels or on the most secret projects, but in the general vicinity. The government is not monitoring you. Not because they lack the legal capability (though they do, and that is mostly, but not always, respected), but because they lack the technical ability. There are only so many analysts, only so much computer time, only so much storage. Except in cases of explicit corruption or misuse of resource, those analysts, that computer time, and that storage is not being wasted on monitoring Joe and Jane average.
  I'm not going to say that there aren't abuses by the people who have access to some of this stuff; they are human and weak like the rest of us and are often tempted to take advantage of their situation I'm sure. In general however, unless you've done something that got a warrant issued for your information, the government doesn't care. They just don't have the resources to be big brother, even if they want to be.
  
  Parent Share
  twitter facebook
  - Re: (Score:3)
    
    by AmiMoJo ( 196126 ) writes:
    
    There are only so many analysts, only so much computer time, only so much storage.
    The government has found a solution to that problem. Distribute the computing and storage requirements.
    These days if you want a license to sell alcohol in your shop you have to get agreement from the police, and they usually require you to have extensive CCTV systems covering the area outside your shop as well as inside it. They shift the burden of installing and maintaining the system to the shop owner and can access the video any time they like. If a crime is reported the shop owner gets a demand for CCTV
  - Re:big brother (Score:2)
    
    by TaoPhoenix ( 980487 ) writes:
    
    I'll give you credit for "this used to be true" back in the day when a computer was a 486 on a modem. It's absolutely not true any more.
    Govt is Big Brother, and they Like it. And they absolutely have the resources to do it.
    Why? Because all they need to do is a Red Flag system. Joe Average doesn't really produce that much data per day all by himself, and .gov isn't trying to perfectly reproduce the entire activity. They just need to know if something is getting juicy.
    "Look! Here's a 12 Gig file of Joe's acti
  - - Re: (Score:2)
      
      by DrgnDancer ( 137700 ) writes:
      
      It's not besides the point, it's the practical side of the point. This doesn't mean we should ignore questions of morality, how much power is too much, how much monitoring is appropriate, etc... It just mean that while these philosophical questions are both interesting and relevant you don't really need to worried about the practical implications day by day. Practically, the government *can't* watch you all the time, or really at all, unless you are the subject of some investigation worth those resources
      - Re: (Score:2)
        
        by m50d ( 797211 ) writes:
        
        Practically, the government *can't* watch you all the time, or really at all, unless you are the subject of some investigation worth those resources
        
        Trouble is, if the government does something I don't like, and I start taking (perfectly legal) political action against that, I become someone "worth" watching. So surveillance capability is something to worry about now; otherwise, when something directly problematic comes up, you're a dissident and it's too late.
      - Re: (Score:2)
        
        by mlts ( 1038732 ) * writes:
        
        Things can change though. For example right now, monitoring by the USG is not on my list of worries, because I'm sure i'd bore to tears any people watching.
        However, governments can change; the LEOs who are looking for felonies being committed and are abiding by their oath have a possibly of being replaced by people more interested in getting rid of any opposition.
        Take a system for figuring out if someone gets an intensive or routine search at customs. That same technology can be used to data mine social n
      - Re: (Score:2)
        
        by misexistentialist ( 1537887 ) writes:
        
        don't really need to worried about the practical implications day by day
        Normal activities like traveling or opening a bank account are quite noticeably affected by government surveillance of the financial and transport systems. Any practical limitation of government capabilities can be made up for by requisitioning private resources or by simply blocking events that are difficult to monitor from happening at all.
      - Re: (Score:2)
        
        by triffid_98 ( 899609 ) writes:
        
        Practically, the government *can't* watch you all the time, or really at all, unless you are the subject of some investigation worth those resource
        Do you own a cell phone? Your carrier knows where you are, right now and has records of where you've been this week. They know who you've talked to, and they're more than happy to share that information with 'interested parties' in the government, no warrant required.
        Given that the data sizes are small, there's no reason they can't store everyone's location/ph
    - Re: (Score:2)
      
      by afabbro ( 33948 ) writes:
      
      Entirely besides the point. They could, therefore it can be abused, therefore there must be EXTREME oversight. Period.
      Yeah, we should setup a government board or require government courts to monitor and oversee the government...er wait...
    - Re: (Score:2)
      
      by LWATCDR ( 28044 ) writes:
      
      1. Get back on the meds, really.
      2. If they did get ride of the "organs of state secrecy" how would you know? They are secret after all.
      Really if you got rid of the CIA, NSA, and NRO they would still be there. No nation can survive with out intelligence gathering organisations. So what would happen is they would be hidden and secret. Being pubic means that there is oversite.
      So really get back on the meds and the voices will stop.
I propose a name for it ... (Score:2)

by tomhudson ( 43916 ) writes:

FTFS:
The data repository, which currently has no name, is being developed for an unnamed customer,

It's the tech equivalent of Prince - it's "the data repository with no name." We can denote it with some sort of unicode glyph that slashdot will mangle.
And of course it has amazingly fast read speeds - if each drive has a 32 meg cache, that's 6.4 terabytes just for the cache.
BTW, it's for the ^@#%^&^+++NO CARRIER
Loading times (Score:2)

by ifrag ( 984323 ) writes:

Perhaps this cluster can load Deus Ex : Human Revolution levels in a reasonable amount of time!
Good job for a HS kid... (Score:2, Interesting)

by spagthorpe ( 111133 ) writes:

Run around with a shopping cart and swap out drives as they fail. Kind of like they did back in first computer days with vacuum tubes.
Constant failures? (Score:2)

by LordNimon ( 85072 ) writes:

With 200,000 hard drives, won't there always be at least one hard drive that is failing? You'll need an IT guy 24/7 swapping out the failed drives. As soon as he swaps out one drive, another one will fail. It just seems kinda ridiculous.
- Re: (Score:3)
  
  by SuperQ ( 431 ) * writes:
  
  This is what MTBF is all about. "Enterprise" drives are rated at 1.2 million hours MTBF. 1,200,200 hours / 200,000 drives = 6 hours per drive failure. Not too bad, only 4 a day.
  - Re: (Score:3)
    
    by Marc Desrochers ( 606563 ) writes:
    
    How long does it take for the cluster to rebuild after a drive fails, and does this involve downtime?
    - Re: (Score:2)
      
      by h4rr4r ( 612664 ) writes:
      
      Even ancient RAID5 implementations are not that bad. Most likely this is really some sort of RAID over RAID over RAID, or some sort of RAID like software that does similar actions. This means no downtime and most likely nearly no speed costs for a single drive.
  - - Re: (Score:2)
      
      by Rockoon ( 1252108 ) writes:
      
      It will probably take longer than 45-minutes to find, verify, and replace a drive in that vast sea of 200000 water cooled drives.
      
      Also, that 1% figure is bullshit. Expect 6% to die in the first year (nearly 3% in the first 3 months)
- Re: (Score:2)
  
  by bigredradio ( 631970 ) writes:
  
  Since they can't backup to tape, maybe they will convert their old tape library to swap out hard drives.
- Re: (Score:2)
  
  by Jeng ( 926980 ) writes:
  
  I would guess that would be the reason for the water cooling, to increase the drives reliability.
  Also from the article it sounds like they may have more than 200,000 hard drives hooked up, but only use 200,000 at a time so the computer can automatically begin recreating the dead drive as soon as it occurs.
  - Re: (Score:2)
    
    by fuzzyfuzzyfungus ( 1223518 ) writes:
    
    I'm assuming that IBM has better plumbers than I do; because "reliability" is not the first word that comes to mind when somebody suggests water-cooling 200,000 hard drives...
    - Re: (Score:2)
      
      by account_deleted ( 4530225 ) writes:
      
      Comment removed based on user account deletion
- Re: (Score:2)
  
  by Manfre ( 631065 ) writes:
  
  http://blog.backblaze.com/2011/07/20/petabytes-on-a-budget-v2-0revealing-more-secrets/ [backblaze.com]
  Backblaze provides some metrics about their drive failure rates. It's surprisingly low (1-5% per year). If they had 200k drives, they would need to replace 39-192 per week. I'm sure the cluster is built with lots of redundancy that doesn't require a person to immediately replace a failed drive. They'll probably need a full time staff of at least 3 to maintain it.
Google? (Score:2)

by JustAnotherIdiot ( 1980292 ) writes:

This just kinda strikes me as who would need this. Backing up the entire internet has to take up some space.
Not so impressive as a floppy RAID (Score:2, Informative)

by erroneus ( 253617 ) writes:

If they could make a 120PB cluster using floppy disks, I would be much more entertained by this.
- Re: (Score:2)
  
  by rrossman2 ( 844318 ) writes:
  
  Man.. and make sure its the 5.25" drives that love to chatter... kind of like a commodore 64 drive loading up flight simulator ][
- Re: (Score:2)
  
  by jandrese ( 485 ) writes:
  
  Just for the heck of it, I worked out the math on this. Assuming 1.44MB 3.5" floppies, you will need 83,333,333,333 disks to store all of that data. Not even accounting for the drives, the disks alone would fill a volume of 2,240,418.91 m^3 (591,856,062 US gallons). I don't know for sure, but I suspect that number exceeds the number of floppies that have ever existed, although it is only about 12 floppies for every man, woman, and child on the Earth.
lots of aluminum (Score:2)

by buback ( 144189 ) writes:

Someone should manufacture industrial sized hard drives for this type of application. Like full height x2, so you could cram 30 platters in there.
- Re: (Score:2)
  
  by BetterSense ( 1398915 ) writes:
  
  It's not as straightforward as that, because current multi-platter hard drives have all the read heads attached to the same "tonearm" (I don't know the proper term). So even with a 4-platter drive, you can only read 1 platter at a time, I assume, unless they somehow sync the platters together. With a 30-platter drive, your throughput would be much worse than with 10 3-platter drives, because you would have 10 times the usable read-heads at any given moment.
That's only 400MB for every US American (Score:2)

by Dr. Spork ( 142693 ) writes:

If this were for an American spy agency, maybe that would be enough. But when I think about how I have ten times this much data in my Gmail, and that Gmail isn't limited to only the US, I suspect that Google has a lot more storage space than this. Of course it's probably all very decentralized.
- Re: (Score:2)
  
  by account_deleted ( 4530225 ) writes:
  
  Comment removed based on user account deletion
Re: (Score:2)

by account_deleted ( 4530225 ) writes:

Comment removed based on user account deletion
Someday, we will carry these in our pockets (Score:2)

by NicknamesAreStupid ( 1040118 ) writes:

Hard to image? Yes. But forty years ago, the largest computing center on earth had 57GB of disc storage.
Failure rate? (Score:2)

by erice ( 13380 ) writes:

We know the capacity. We know the transfer rate. But how quickly do disks need to be moved in and out of the system in order to keep it running?
200,000 is a lot of disks. I assume they are all hot swap with a great deal of redundancy because I would expect multiple drive failures every day. A raid0 with that many disks might never boot.
MTBF Question (Score:2)

by skelly33 ( 891182 ) writes:

Just curious if anyone has experience managing large, mechanical disk arrays, if you installed an array of such a size using identical hard drives and bringing everything online relatively at the same time, would there be an increased likelihood of ALL the drives dying at roughly the same time? Could failure statistics bite you with enough simultaneous failures to negate redundancy?
- Re: (Score:2)
  
  by SuperQ ( 431 ) * writes:
  
  I do manage large storage farms in the petabytes range. There is a curve to the rate at which disks die. It mostly seems kinda obvious.
  #1 - Infant mortality. I see a bunch of drives fail within the first few months of a new install.
  #2 - Increased death rate as the drives age. Usually when the drives start to reach the warranty age. This can be accelerated depending on the IO load of the system.
  There's a lot of great info out there. Here's one good whitepaper:
  http://static.googleusercontent.com/externa [googleusercontent.com]
- Re: (Score:2)
  
  by m50d ( 797211 ) writes:
  
  I can see the likes of the LHC or the AEA using something like this - they generate enough data. But if it were a "good guy" why would they keep it secret?
  - Re: (Score:2)
    
    by BitZtream ( 692029 ) writes:
    
    Because it's a target regardless of who owns it. God could own it and call it the garden of Eden and people would still blow it up
- Re:Depressing (Score:5, Insightful)
  
  by PPH ( 736903 ) writes: on Friday August 26, 2011 @10:32AM (#37218736)
  
  Facebook and presumably a spy agency?
  You're repeating yourself.
  
  Parent Share
  twitter facebook
- emo? (Score:2)
  
  by luis_a_espinal ( 1810296 ) writes:
  
  Anyone else find it depressing that the two top suspects for the use of this system are Facebook and presumably a spy agency?
  Can humanity come up with no better use for the biggest iron than a bunch of frivolous, narcissistic ad profiling and covert spying on people living in an allegedly free country?
  No wonder F@H doesn't post more progress. Our hardware is going towards people sharing their naked bong photos and government spooks cataloging your naked bong photos.
  You are trying too hard looking for something to be upset about (in a very attention-whorish manner to boot.)
  - Re: (Score:2)
    
    by luis_a_espinal ( 1810296 ) writes:
    And just to prove my point.
    Can humanity come up with no better use for the biggest iron than a bunch of frivolous, narcissistic ad profiling and covert spying on people living in an allegedly free country?
    Yes. It ain't that hard to come to that answer, you know? The slashdot's story half-seriously hints at either a government agency (NSA) or somebody like Facebook. And obviously in Emo fashion, you took it as an statement about humanity. It's more a statement about you.
    I find these type of opinions rather simplistic as other opportunities in large-scale application engineering abound:
    
    Data collection and simulation done by the DoE or DoT (not necessarily just a DoE-related agenc
    - Re: (Score:2)
      
      by GooberToo ( 74388 ) writes:
      
      Don't forget the IRS.
- Re: (Score:2)
  
  by Abstrackt ( 609015 ) writes:
  
  I'd go with the flux capacitor personally. Then you can go back in time, invest in IBM, Microsoft, Google, and Apple when shares are still cheap and buy the 120PB cluster. Assuming you drive a DeLorean anyway.
- Re: (Score:2)
  
  by owlstead ( 636356 ) writes:
  
  What do you mean: can big disk arrays be build so that replacements can be automated? Of course they can be build, it would not even be that hard. Well, as long as you don't put drive/server production and delivery of the components or auto assembly in the automated system. I could not find one on google, I guess on such a large drive array, you can afford a human to replace some disks now and then. Humans are more flexible and more prone to see other problems occuring as well.
- Re: (Score:2)
  
  by rubycodez ( 864176 ) writes:
  
  even in "small" disk arrays the replacements are automated with hot spares. of course you periodically replenish the hot spare pool, but one doesn't need to go running every time a disk fails
- Re: (Score:2)
  
  by fuzzyfuzzyfungus ( 1223518 ) writes:
  
  Pulling and replacing drives from hot-swap slots in a drive shelf would only be a slight change from the long-available robotic tape silo systems; but I've never heard of a situation where rigging such a thing up made economic sense...
  
  Your hot-spares provide immediate 'replacement', which allows you to make physical replacement less time-critical just by adding more drives to the system, and most big-huge-storage systems have front mounted indicator lights for drive health.
  
  Having a human on duty who g
  - Re: (Score:2)
    
    by mlts ( 1038732 ) * writes:
    
    I looked into making a hard drive silo as a business. Even dropped the business proposal by some vendors. You would put bare SATA or SAS drives in a load port and they would be dropped into place in groups for reading/writing. Critical data would have four HDDs writing at a time (three way mirror, plus one HDD that would go offsite.) Non critical would get 5-8 HDDs writing in a RAID 6 configuration. It would have been nice to have because disks can be erased faster than tapes for security (just do an A
    - Re: (Score:2)
      
      by fuzzyfuzzyfungus ( 1223518 ) writes:
      
      Out of curiosity: I've seen a number of systems that use screwless drive rails that take advantage of the fact that the 3 screw holes on each side, and 4 on the bottom(for 3.5", 2.5" has something slightly different; but also fairly standard) by having pegs, just slightly smaller than the screws would be, that slide in to the holes. As long as modest pressure is applied to keep them in the holes(they aren't threaded or tight enough to be friction fit), the setup is pretty solid. Could a gripper mechanism em
- - - Re: (Score:2)
      
      by rubycodez ( 864176 ) writes:
      
      the government is too busy with its War on Terrabytes to worry about the petafiles
- Re: (Score:2)
  
  by tenco ( 773732 ) writes:
  
  3x 1 GW PSUs?
- 600GB drives? (Score:2)
  
  by Lead Butthead ( 321013 ) writes:
  
  based on 120 million GB and 200k drives, the per-drive capacity works out to 600GB a piece. Sounded like they're stringing together a bunch of WD Velociraptors.
- - Re: (Score:2)
    
    by rrossman2 ( 844318 ) writes:
    
    So how long would a consumer grade "raid chipset" take to rebuild that raid if it was a raid 5 setup (with the drives split into 3 different raid 0 setups)?

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

What's it for? (Score:2)

Re: (Score:2)

Sounds like a data orgy.. (Score:2)

Re: (Score:2)

Re:What's it for? (Score:5, Funny)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re:What's it for? (Score:4, Funny)

Re: (Score:2)

Re: (Score:2, Informative)

Re: (Score:2)

Re: (Score:2)

Not done yet (Score:2)

Re:Not done yet (Score:5, Funny)

Re: (Score:3)

I wonder.. (Score:3)

Re: (Score:3)

Re: (Score:3)

Re: (Score:2)

Re: (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

imagine the brown out! (Score:2)

Re: (Score:2)

Re: (Score:2)

Finally... (Score:2)

Re: (Score:2)

Paranoid much? (Score:2)

Re: (Score:2)

Re: (Score:3, Informative)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

Re: (Score:2)

Fill 'er up (Score:5, Funny)

Re: (Score:2)

Finally! (Score:3)

Would be a good fit for CERN LHC (Score:3)

Re: (Score:2)

Re: (Score:2)

Not the government. (Score:5, Interesting)

Re: (Score:2)

Re:Not the government. (Score:5, Insightful)

Re: (Score:3)

Re:big brother (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

I propose a name for it ... (Score:2)

Loading times (Score:2)

Good job for a HS kid... (Score:2, Interesting)

Constant failures? (Score:2)

Re: (Score:3)

Re: (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Google? (Score:2)

Not so impressive as a floppy RAID (Score:2, Informative)

Re: (Score:2)

Re: (Score:2)

lots of aluminum (Score:2)

Re: (Score:2)

That's only 400MB for every US American (Score:2)

Re: (Score:2)

Re: (Score:2)