

The Ultimate All-In-One Storage Solution 387
karnifex writes "Filled up your LaCie Bigger Disk already, and looking for a little more storage space? Good news! The Petabox is ready! 'The petabox by the Internet Archive is a machine designed to safely store and process one petabyte of information (a petabyte is a million gigabytes).' And luckily, as the Internet Archive notes, it's shipping-container friendly (20' x 8' x 8'). So save on delivery costs and order two!"
Finally (Score:5, Funny)
But the question is, do my monkeys use VI or Emacs? That shall remain a mystery.
Re:Finally (Score:5, Funny)
vi would seem like a perfect fit, yes?
Re:Finally (Score:5, Funny)
Re:Finally (Score:4, Funny)
Re:Finally (Score:5, Interesting)
Monkey Shakespeare Simulator [tninet.se]
Maybe not as much fun, but without the faeces
I've noticed that Mozilla Firefox seems to give better results than IE
actually... (Score:5, Funny)
I have been using emacs for nearly 10 years now and I swear sometimes I have been seriously considering adding a foot pedal or 2 to my setup (besides control, shift and meta I also routinely use Super and Hyper, xmodmap is great!)
colossal... (Score:2, Funny)
Re:colossal... (Score:5, Funny)
Re:colossal... (Score:2, Interesting)
Re:colossal... (Score:4, Funny)
Re:colossal... (Score:5, Informative)
The current genome [nih.gov] build has a size of 3,020,300,000 bp, at 2 bits per bp and 5(?) spice girls, that's about 3.5 GB (uncompressed).
Of course with a mostly static database like that you only want to store the diffs, not the whole thing. The bulk of the diff would be SNPs, roughly 1 per 1000 bp: 3,020,300,000 / 1000 / 4 / 1048576 that's about 0.72MB per spice girl. An if you only store the ones actually different from wildtype you probably don't need more than 20% of that.
You can fit a Spice Girl on a floppy.
Re:colossal... (Score:4, Funny)
Or, you can fit two floppies on a Spice Girl
"Sock it to me."
I'd buy one.... (Score:5, Funny)
...drumroll
Peta-files
Re:I'd buy one.... (Score:5, Funny)
Exa-Files.
=simdge=
Business idea (Score:2, Interesting)
Re:Business idea (Score:3, Insightful)
Price? (Score:2)
Re:Price? (Score:3, Insightful)
Re:Price? (Score:3, Informative)
From the forum [archive.org]:
Rack materials cost is currently estimated to be $121K for 96TB. Node materials are a just under $1450. This price does not include markup, assembly or burn-in from the system integrator and thus will increase by another 5-7% to approximately $130K/rack.
So, about $1.3M (10 racks)
cLive ;-)
Re:Price? (Score:3, Insightful)
What would be interesting is to know the estimated maintenance costs as well. With than many drives, I imagine you'd be changing them like light bulbs, especially as time passes and the probability of each drive failing get's higher and higher.
If one was really clever, they could use the failure rate of a typical hard disk and Moore's Law to estimate monthly replacement costs for the next 100 years or so. I would expect them to rise in the short term as the drives age, but
Re:Price? (Score:5, Informative)
Rack materials cost is currently estimated to be $121K for 96TB. Node materials are a just under $1450. This price does not include markup, assembly or burn-in from the system integrator and thus will increase by another 5-7% to approximately $130K/rack.
The weight of a fully-loaded rack is estimated to be 1500 lbs. That figure may rise depending on what hardware is required for rack cooling.
Power is estimated to be 5500 watts. This too will depend on rack level cooling equipment.
These figures assume no external 1G Ethernet NICs.
For a breakdown of all the above, see the attached spreadsheet.
One question... (Score:3, Funny)
Petabox is ready! (Score:5, Informative)
From the article:
PILOT STATUS 5/2004
* The first 100TB Rack is up and running!
* The second 100TB Rack will be up by the end of May
Apparently this is some new use of the word "ready" with which I am not familiar. Neat technology, no doubt, but it doesn't really look like it's ready for prime time just yet.
Re:Petabox is ready! (Score:2)
In other news... (Score:3, Funny)
In 10 years ... (Score:5, Insightful)
Comment removed (Score:5, Insightful)
Re:In 10 years ... (Score:3, Insightful)
The storage problems I have these days are almost entirely organisational.
Don't get too excited (Score:4, Informative)
Re:Don't get too excited (Score:4, Funny)
Re:Don't get too excited (Score:3, Funny)
To give you an idea of how much that truly is: (Score:5, Interesting)
Re:To give you an idea of how much that truly is: (Score:5, Informative)
WTF? (Score:5, Funny)
yeah, but if you looked closer, it's the same 6 gigs over and over again.
Potential customers (Score:5, Funny)
Can we say, Goooooooooooooooogle?
themselves, not google... (Score:4, Funny)
I'm guessing they were referring to themselves, not google.
Re:Potential customers (Score:5, Interesting)
So don't laugh!
(I'm sure there are PLENTY of organizations which could use this type of storage. The IRS and NASA being among them)
Re:Potential customers (Score:5, Funny)
Re: (Score:2)
I am NEVER going to get caught up... (Score:2)
Ah well, all I'd do is fill it up with Simpsons episodes....
Ummmm (Score:2, Funny)
Re:Ummmm (Score:3, Funny)
In 2010 it better be 1 by 4 by 9.
Sooooo.... (Score:2, Interesting)
Ahhh... (Score:2, Funny)
Ohh Good, now I can buy Longhorn (Score:5, Funny)
Uber-huge! (Score:2)
Jesus, that's more volume than the average room in a house! What year is it, 1984?
Re:Uber-huge! (Score:2)
Re:Uber-huge! (Score:2)
Re:Uber-huge! (Score:2)
two words (Score:4, Interesting)
or alternatively
What for?
At least as far as the next year or two is concerned. RIAA has all but outlawed music on the computer and even so, a petabyte of $1.25 songs would cost you more than bill gates makes in a year. If you have a petabyte of home movies, you must be making porno films.. If you have a petabyte of DVD's ripped, you have several life sentences coming, even if you own all the dvd's somehow (more bill gates salary multiples). And if you have text files, then holy grapes batman, youll never read all that in 10 lifetimes.
I can see uses in the comercial realm, buying multiple units in order to backup. But if this is in anyway marketed toward the consumer, only the biggest 'mine has to be bigger than yours' geek would buy something like that right now. I'll probably have one of those on my desk/floor about 5 to 7 years from now when its affordable/realisitic for me.
Scientific Data (Score:5, Interesting)
Over the couse of the survey we expect to take about 1 PB of data. We're still trying to figure out exactly how we will process and store it all.
For more info, you can poke around here [naic.edu].
Re:two words (Score:5, Insightful)
100 disk -> 1 TB
15000 disks -> 150 TB.
Netflix has a "mere" collection of 15000 disks. Your patebyte disk is only 1/6th full.
You upload all music CDs: 1 GB per disk (feeling generous).
How many CDs can be in print? Maybe a 500,000?
That is only 500 TB. Now your disk is 2/3rd full.
Lets upload all printed material. May or may not fit in the rest.
Then again, if you want to archive the internet: ~6G pages. 10kB each. 60 TB. each run. Store the last 16 versions -> 1TB.
Not really a Petabyte...yet (Score:4, Insightful)
PILOT STATUS 5/2004
* The first 100TB Rack is up and running!
* The second 100TB Rack will be up by the end of May
* Thermal Targets have been met
* Systems Booted from USB Dongle
* Reiser FS running
* PC-based Router running
Maybe I'm missing something but this looks to me like they don't really have a Petabyte of storage working but plans to incorporate a Petabyte of storage with only 100 TB up and running now. Not that 100 TB is anything to brush off.
one petabyte? (Score:5, Funny)
To bad it won't last... (Score:5, Insightful)
Hard drive lifespans (Score:3, Interesting)
If you know of such a site, tell me.
Re:Hard drive lifespans (Score:2)
Re:To bad it won't last... (Score:5, Funny)
What tiny fraction of our history is actually preserved in a useful manner will be misinterpreted and spun in ways you can't possibly imagine. 400 years from now you will be known as an ignorant fool guilty of untold crimes against lord knows whom. This will be true regardless of the quality of the archive used to figure you out.
So don't worry about it.
Ozymandias (Score:4, Insightful)
You're complaining that these hard drives won't run forever and you're right. Neither will CD's. However, I would also like to point out that the vast majority of ancient egyptian papyrus isn't around today. Also, don't start goign off on using clay or stone tablets, because they break (even the Rosetta stone is broken).
Honestly, computers are still far superior to what we were using before. It's not like we've got Homer's original version of the Illiad sitting in a museum somewhere; we just have many duplicated copies that have been reproduced over the years. You're right that hard drives fail and CDs break, but we can keep updating onto new media. Besides, when a monk drops an iota when transcribing the Bible, Jesus goes from being God to godlike. When a computer adds an iota, the checkbit fails and the data is resent.
Somebody is also going to point out that, as systems change, data can become unreadable. Heck, I had a professor who couldn't update his lab instructions because the software that read the lab printouts wouldn't run on new machines and the fileformat wasn't understood by any other software. So, want to stop our data from becoming unreadable? Well, let's just do what the Etruscans did! Of course, we don't have a clue what they did because nobody can read Etruscan. For a more familiar example, think of heiroglyphics before the Rosetta stone. It's pretty common for data to become lost and unreadable. Also, this bring us back to the solution. Along with the data, include the source code for the software that can read it. If you really want to be anal, you could even include the source to an emulator for the machien it was designed to run on.
Still, you might point out, 400 years from now, we'll still lose 99% of that do to failures of whatever nature. Once again, you would be be right. However, do you honestly believe that we have 1% of all the data that was collected in 1604? Hell, most of the people couldn't even right, so we don't know ANYTHING about their lives. I'm sorry that we can't digitally preserve our wonderous society for all of eternity, but it's completely blind to believe that this makes us in ANY way different to any other culture. Read Percy Shelley's Ozymandias before complaining about how people in the future won't know what our lives were like.
Yeah but... (Score:2)
Backups: That's a big stack of DVD-R disks (Score:2)
I don't even want to think about backing this up on a million some-odd CD-Rs. I suspect that the first CD-R would have rotted long before the last CD-R was written.
Re: (Score:2, Interesting)
Aha! (Score:3, Funny)
See? Those Longhorn specs are quite easy to achieve... Now let's sit back and wait for Intel or AMD to come up with a 1x1x1m slab of silicon that can melt graphite and run Longhorn at the same time!
Useless Statistics! (Score:5, Funny)
Re:Useless Statistics! (Score:5, Informative)
50
How many 128kbps MP3s can you store on it.
250-300 million depending on song length
And most importantly, how many floppy disks is this equivalent too?!
700 Million - nearly 40,000 miles when laid end on end, or about 1500 miles when stacked on top of each other.
Re:Useless Statistics! (Score:4, Informative)
> 50
According to this article, a Library of Congress is approximately 10 TB (who knew--this obtuse metric actually has a measurement!!!)
http://articles.findarticles.com/p/articles/mi_
So the device actually can contain 100 Libraries of Congress.
Re:Useless Statistics! (Score:4, Interesting)
50
Oh, it isn't, either. Will you people knock it off already with the Library of Congress == 20TB comparison? It's some sort of inane computation made as if the collection were only books, and all the books were represented as ASCII text only. Well, guess what? It's not, and they're not.
American Memory [loc.gov] alone is a good bunch of terabytes, and that's just a wee digitized slice, just several million objects, of all the stuff in the Library. There's a lot. Of Stuff. A lot a lot a lot. Pictures. Maps. Movies. Big ol' stuff.
Well, I feel better. Thanks!
Re:Useless Statistics! (Score:3, Funny)
What the hell does that mean? How many times around the earth is that? How many times to the moon and back?
START MAKING SENSE!
Re:Useless Statistics! (Score:5, Informative)
Glad you asked. Assuming that we have a 10^15 byte disk (which is how those decimal-loving hard disk manufacturers would define it), and your MP3s are encoded at 128kbps (where 1 kb = 1024 b = 128 B, as the binary folks would have it), then you could listen to MP3s nonstop for:
10^15B/(128kb/s * 128B/kb) = 61035156250 seconds
Cheers,
IT
I can see the office manager now... (Score:5, Funny)
I surreptitiously conceal the firewire cable going out the side door.
Adam: "No, John, I haven't the foggiest."
OM: "Ok, well I'll ask Kim when I talk to her about the strange shipping container outside. Thanks."
-Adam
A petabyte here, a petabyte there... (Score:2, Funny)
Immortality? (Score:5, Funny)
In his novel 3001 Arthur C. Clarke asserted/speculated that one petabyte would be sufficient space to store a lifetime's memories. (He didn't say if this was compressed.)
So, assuming you can handle the trivial exercise of transferring your memories (the implementation of which is left as an exercise for the reader), immortality is yours for the buying!
Re:Immortality? (Score:5, Funny)
Don't encrypt! Ye gods man!
I don't know about you, but I have a few things I wouldn't like people to know about, even long after I'm dead.
I mean, let's face it, not all our memories are that flattering.
And anyways, I'm pretty sure some of the memories from my college years are already "encrypted".
The cost... (Score:2, Informative)
http://www.archive.org/iathreads/post-view.php?id
read the docs (Score:2, Interesting)
If you look down in the message list, you see a reference to pdf + ppt docs. Here's another related project Planet Ten Modular Data Centers [planetten.biz].
Yes, it's a petabyte once you fill the shipping container. Honestly, I thought of this idea last year (using stock shipping containers), and now I'm fascinated that they've made it happen.
My only suggestion is that this is prototype: the eventual production systems (say, a couple of years time) should have custom shipping containers for:
* any of the side panels
Microsoft (Score:2)
Interesting link... (Score:5, Interesting)
http://capricorn-tech.com/ [capricorn-tech.com]
The site is rather empty right now, but it seems this is the company that will market this petabyte machine... er... box... er... whatever the name is.
Nice design (Score:2)
It appears that the nodes are half sized, allowing for 40 systems per side, or 80 systems total. The null modem console cables connecting node pairs together is clever... if any one machine fails, you can restart it as long as its neighbor is still alive.
Mark
There's an easier way to do this... (Score:3, Insightful)
Petabox? BAH - GMAIL! (Score:5, Funny)
If you're a big fan of this much storage? (Score:5, Funny)
[massive karma burn detected]
Replacing HDs could be a pain (Score:4, Insightful)
Re:Replacing HDs could be a pain (Score:3, Insightful)
So assuming 3 failures a day, at most 3 RAID's would be running slower a day. Assuming 4 disks per RAID that's at most 12 disks at reduced performance, or 0.3% of the total data set that isn't available at full speed. If that is an issue, you duplicate any data that MUST be available on multiple nodes.
That's Nice But... (Score:4, Funny)
Is there enough disk space left to do anything useful after installing WinXP on it?
Can you get one... (Score:2, Funny)
Re:wrong (Score:2)
Re:wrong (Score:3, Funny)
Wrong
One PetaByte is 1,000 TeraBytes which are 1,000,000 GigaBytes wich are 1,000,000,000,000,000 Bytes wich are 3.35 LotsOfPr0n.
Re:wrong (Score:4, Funny)
In case you were wondering.
It's my favorite unit of measure.
Coincidentally, I have a buttload of porn.
Re:wrong (Score:5, Interesting)
Re:wrong (Score:2)
Specifically, 1,048,576 GB
=Smidge=
Re:wrong (Score:2, Informative)
Re:wrong (Score:2)
No, Specifically, 1,000,000 GB. GB == Gigabyte. Giga == 10^9, always. No "in computing it doesn't" crap. Giga is an SI prefix.
Re:wrong (Score:2)
a byte is from the computing world. it's a computing term. it was invented for computers. before people were sending bits over networks and before hard-drives existed, a kilobyte had been defined as 1024 bytes. that's how it works in the computer world.
Re:wrong (Score:3, Informative)
Actually the SI defines the prefixes irrelevant of units used. Think of the mil ('milli-inch'); how many do you think there are in the inch? If I had a thousand cats I could refer to the set as one kilocat, and hence if I had 1024 cats I could refer to it as a kibicat, Tweety-pie style; note that a cat is not an SI metrological term. Try playing around with the units(1) command sometime; to get a feel for these SI prefixes.
Re:Dupe (Score:2)
Re:"a million gigabytes"... (Score:3, Informative)
As always, wikipedia [wikipedia.org] has the answer(s):
Damn! Ambiguity!
Re:"a million gigabytes"... (Score:2)
Re:LaCie disk (Score:2)
Re:Degaussing Cannon Ready (Score:2)
Re:You think they could spare a boot disk. (Score:3, Insightful)
The last thing you want with a setup like this is having to haul hardware around or disconnect stuff if you for any reason can't boot of the disks anymore. And you certainly don't want to reduce density by wasting spa