The Sum Total of the World's Knowledge: 250 Exabytes 168
arkenian writes "The BBC reports on an article in Science about scientists who calculate that the sum of all the world's stored data is 250 exabytes. Perhaps more interestingly, the total amount of data broadcast is 2 zettabytes (1000 exabytes) annually. In theory this means that the sum of the world's knowledge is broadcast 8 times a year, but I bet mostly that's just a lot of American Idol reruns."
And a lot of it is free (Score:4, Informative)
http://en.wikipedia.org/wiki/Free-to-air [wikipedia.org] - "Free-to-air (FTA) describes television (TV) and radio services broadcast in clear (unencrypted) form, allowing any person with the appropriate receiving equipment to receive the signal and view or listen to the content without requiring a subscription (or other ongoing cost)"
http://www.hulu.com/ [hulu.com] (free tv)
http://www.youtube.com/ [youtube.com] (free music vids and tv)
http://www.piratebay.org/ [piratebay.org]
Re:And a lot of it is free (Score:4, Insightful)
Re:And a lot of it is free (Score:4, Funny)
How many Library of Congresses is that? I just have no perspective without it being expressed in LOC units.
Re:And a lot of it is free (Score:4, Informative)
Well, according to the Library of Congress' website [loc.gov], they have collected "over 200 terabytes of data". But since they don't specify an exact number, let's call it at 200 TB.
295 exabytes / 200 terabytes = 1,546,649.6 Libraries of Congress [google.com].
Just so long as they don't... (Score:2)
Just so long as they don't keep it all in one place.
------
MS Fnd in a Lbry
HAL DRAPER
From: Report of the Commander, Seventh Expeditionary Force,
Andromedan Paleoanthropological Mission
What puzzled our research teams was the suddenness of collapse
and the speed of reversion to barbarism, in this multigalactic
civilization of the biped race. Obvious causes like war, destruction,
plague, or invasion were speedily eliminated. Now the outlines of the
picture emerge, and the answer makes me apprehensive.
Part of the sto
Re:And a lot of it is free (Score:5, Interesting)
Re: (Score:2)
Never say never... because that's what they said when they used two char for the year field in old banking databases and programs. And so many database that overflowed because they got to the point that should never be encountered.
Re: (Score:2)
Assuming one bit per atom, and assuming carbon atoms being used for storage, a memory of 2^128 bytes (assuming 8 bit/byte) would have about the same mass as the world oil production of 14 years (using the data for 2001 from Wikipedia [wikipedia.org] as estimate of the average yearly oil production). A silicon-based storage (again, assuming one atom per bit) of that size would have more than twice that mass.
Something I'd like to know is... (Score:4, Funny)
How much of that is pornographic "knowledge"?
Re:Something I'd like to know is... (Score:5, Funny)
Re: (Score:1)
I remember when porn was hard to get. I'd download 4000-color nudie pics or SI Swimsuit scans to my 1985 Amiga, and treasure them like rare gold. (The floppies were hidden with creative names like "Image XXX part 1".)
But now twenty-five years later, there's so much porn I couldn't keep-up even with Viagra.
Re: (Score:2)
Random thinking out loud : : 4000-color Amiga photos were 704x240x5bits per pixel == 845 kilobits. My ZMODEM protocol transferred 2 kbit/s or 7 minutes just to view one photo! I'd forgotten. No wonder I used to leave the computer downloading by itself.
Of course back then you could only fit 8 photos per floppy, so you had to pause the download every hour, change floppies, and then resume.
Good thing the Amiga multitasked (so you could view photos and download at the same time). All. Good times.
Re: (Score:2)
Of course back then you could only fit 8 photos per floppy, so you had to pause the download every hour, change floppies, and then resume.
I suspect there were other reasons you needed to occasionally change floppies.
Re: (Score:2)
I didn't know that that floppy object was replaceable.
Re: (Score:2)
You didn't find the right FTP site back then, but it was like Fight Club - Nobody talks about it.
Re: (Score:2)
well needless to say, that doesn't include porn. my collection alone is 500 esabytes
Re: (Score:2)
How much of that is pornographic "knowledge"?
approximately 250 sexabytes.
Re: (Score:3)
You mean carnal knowledge?
absolute value? (Score:4, Insightful)
Re:absolute value? (Score:4, Interesting)
Perhaps some of the knowledge broadcast has a negative value, so the absolute value of the knowledge broadcast is high, but the net information distributed is much smaller?
Carl Sagan addressed this in Cosmos. He said there was more data broadcast in TV programs every day than the combined written works of all of history.
But, as he said, "not all bits have equal value."
A quote I had laser engraved on the back of my Nexus One. :)
-Taylor
Re: (Score:2)
Information theory is interesting stuff. I think that the information content can in some way be measured by what the size of the maximum compressed version of the object is. Things get tricky though when you realize that you could compress a tv signal by transmitting just the script and some instructions on how to re-film it. Worse still the average news broadcast repeats the same sentence at least 10 times, so the text ends up 1/10 the size by trivial compression. So we end up with the unfortunate discove
Re: (Score:2)
You just reinvented Kolmogorov complexity.
Re: (Score:2)
It's a good feeling to learn that something I figured out on my own was already invented by someone else and is famous. That's vindication of my thought processes.
I did that with the automatic transmission (I was like 10), the toroidal supercomputer layout used by the early Crays, and variable-bit-rate encoding.
Inventing something already well-known is not a bad thing. It's a very good thing.
"Stored Data" does not equal "Knowledge" (Score:5, Insightful)
Nice way to conflate terms for a sensational headline. What a bogus metric. A good chunk of that "stored data" is junk. Probably most of it. Not to mention duplication. (Duplication? I told you not to mention duplication :-)
Re:"Stored Data" does not equal "Knowledge" (Score:4, Funny)
Re: (Score:2)
Wanted to mod that '+1 Knowledge' but then I realized '++Knowledge' might be more accurate. Slashdot provides neither as an option. :(
Re: (Score:3)
Re: (Score:2)
Doesn't stop folks from trying...
Re:"Stored Data" does not equal "Knowledge" (Score:5, Funny)
Nice way to conflate terms for a sensational headline. What a bogus metric. A good chunk of that "stored data" is junk. Probably most of it. Not to mention duplication. (Duplication? I told you not to mention duplication :-)
Sorry, i'm just increasing world's knowledge database at the moment.
Re: (Score:2)
Two comments for this one...
1) Heh heh... I'm not X, I'm increasing the world knowledge database. Where X = whatever annoying internet trope is being used against us at the moment.
2) And you thought there was no useful purpose for rickrolling.
Re: (Score:2)
Re: (Score:2)
The world's knowledge is 250 exabytes.
(Checks my own comment)
Wait, now it'll be 250 exabytes + 61 bytes.
(checks again)
Wait, now it'll be 250 exabytes + 61 bytes + 48 bytes.
(checks again)
Wait...
Re: (Score:2)
At this point I'm convinced /. does it on purpose, whether for more hits, or more comments - there is no other reasonable explanation.
From a WoW comic:
[chat] Noob : Hey, how do I get to the blacksmith?
*crickets*
To assist this noob simply give the wrong directions.
[chat] Player A : Take a left by the boat house.
[chat] Players B, C, D : No it's not you idiot, you take a right by the mailbox. What a noob.
Conclusion:
You now have 4 active participants instead of just 1.
a little more filtering needed (Score:2)
Re: (Score:2)
It's all knowledge, and virtually all of it is worthwhile to someone. The subjective value of any piece is just that, subjective. Calling it junk just reveals a bias.
Re: (Score:2)
I ran a little freeware product called "double killer on me Windows partition, there were thousands of individual "dupes" about 3Gigs total.
Re: (Score:2)
Re: (Score:2)
Well, if we're not deduplicating, I'm sure a billion teaspoons/day of baby batter creams the blogobytes.
Re: (Score:2)
It gets even better.
There is a lot of knowledge that is not stored on any physical media (besides our brains). For instance, I "know" that I went to the grocery store yesterday and spend three minutes looking at candy without buying any. This is something that very few people have knowledge of and I guarantee it was not stored anywhere (until now). However, it does remain knowledge.
There is also a lot of unrecorded meta data associated with stored data. Consider this post. It records the words I type,
So, then, get the backlog done. (Score:5, Interesting)
So, then, get the backlog done.
It is about time we have high definition copies of all old texts, like the all hieroglyphs ever documented, all Babylonian texts, all Sanskrit texts, the Dead Sea scrolls, all Medieval hand writings, etc.
I guess all these together could not muster 1% of all the crap that is out there today. I wouldn't be surprised if all the foolish blabber-blobber-blubber on Facebook a single day outcompete all pre-1700 texts combined.
So, back to work. Get the backlog done.
Re: (Score:3)
Much of what you ask for is already on line in one form or another. Often its in the form of on-line books, either from Google or other Libraries.
See this example for Hieroglyphs [archive.org].
The rest is there if you google hard enough, some times in image form, some times translated.
However, TFA is about All the data we have stored, not All the data we have.
The huge amount of bitching that flared up when Google wanted to scan all old books and make them available on line shows that there are deeply entrenched, and lar
Re: (Score:2)
Climb down before you hurt yourself.
The parent was asking for the moon.
The parent wasn't offering to help, or asking where s/he could volunteer time.
The parent was making a petulant demand that other people do the grunt work, apparently for free, and then you come storming in in support.
How much of your "significant amount of archival research" have you made available on line?
What exactly counts as "knowledge"? (Score:2, Insightful)
E=mc^2 represents a lot more knowledge to me than the entire 3,000 episode run of "The View" or similiar programs -- even though it's a lot more concise.
I could take a yottapixel photo of dirt and it sure won't tell me a lot.
Re: (Score:1)
e=mc^2 tells me nothing, its a concept, but it means nothing w/o understanding how many people died from a few pounds of nuclear mineral
Re: (Score:2)
e=mc^2 tells me nothing, its a concept, but it means nothing w/o understanding how many people died from a few pounds of nuclear mineral
A little radiation never hurt anybody.
Re: (Score:2)
A little radiation never hurt anybody.
True, but a lot of it will burn you to a crisp!
Re: (Score:2)
A little radiation never hurt anybody.
True, but a lot of it will burn you to a crisp!
Moderation is the key to all fun, god and clean or bad and nasty.
Re: (Score:2)
Nothing?
"Holy shit, mass and energy are equivalent" doesn't tell you anything?
It should tell you:
Whenever I compress a spring that spring must increase in mass.
A spinning top has more mass than a non-spinning top.
And numerous other amazing implications.
Re: (Score:2)
Re: (Score:2)
Damn, I never thought humans could get pregnant from dolph....
Re: (Score:3)
You're revealing a pretty heavy bias there. I'd guess a geologist would find the dirt photo much more valuable than either the view or the mc^2, and a bored housewife whose life has been closed down to the point where her only social outlet is tv would find the view more valuable than the other two.
Re: (Score:2)
A yottapixel picture of dirt would tell you a lot. A _lot_.
Editors, please edit (Score:5, Informative)
Re: (Score:1)
Maybe the submitter thought it would be helpful to convert the figure to exibytes and call it exabytes. 295 / 1.024^6 = 255.87... ~= 250.
Re: (Score:2)
What I found much more interesting in the article is that in 2002 we had for the first time more information stored digitally than in other formats, and in 2007, 94% of all information in the world was stored digitally.
Re: (Score:2)
The submitter messed up two of the basic details of this story - the number is actually 295, not 250, and this value is as of 2007, rather than the implied present day.
(I know, I must be new here.)
Maybe it was 250 and after all of the meaningless comments on slashdot about it, it actually increased to 295?
You have my permission to count this as one of the meaningless comments.
Re: (Score:2)
You think either of those numbers is close to the actual value? It's just someone's crude estimate, one could say 400 or 310 or 240 EB and be just as correct.
Re: (Score:2)
Well three actually: Data != Knowledge
Data processed may turn into information.
Information when consumed by an individual may turn into knowledge.
The sum of the world's knowledge is therefore not measurable since it resides in the minds of individuals, not in books or other recorded material.
295 exabytes (Score:3, Interesting)
Re: (Score:2)
Re: (Score:2)
Well, its certainly a number. (Score:4, Insightful)
...not meaningful in terms of the headline. The number is just addressing storage capacity potential available, not as unique meaningful data. All its saying is that the average person has access to x terrabyes of digital storage. That number is just taking manufacturing numbers for electronic hardware, and dividing by number of people.
It's not addressing the actual complexity generated or used by people. It's not actually addressing any actual people or what they do.
There is, however an interesting deeper meaning behind a number like this - the more this number multiplies, the harder it is going to be to control information, as people have more and more diverse options for storing and transferring data.
This means that even as processing power multiplies - it becomes even more impossible to police all the data of the world for improper uses.
That's the more interesting aspect of this number.
Ryan Fenton
Zero-sum game (Score:3, Funny)
Re: (Score:3)
This PHD Comics issue is particularly appropriate here:
http://www.phdcomics.com/comics.php?f=878 [phdcomics.com]
Wrong! Its infinite! (Score:3)
Let me increase that (Score:1)
1 zetabyte = 1024 exabytes (Score:1)
not 1000 exabytes
Re:1 zetabyte = 1024 exabytes (Score:4, Informative)
Wouldn't that be 1 zebabyte=1024 exbabytes?
*ducks*
Re: (Score:2)
Re: (Score:2)
Only to people who don't understand SI units or the meaning of the word "approximation."
No, to anyone other than SI fanatics. Metric megabytes are hopelessly painful in the IT world where everything is measured in powers of two: saying my laptop has 6 binary gigabytes of RAM is far more useful than saying it has 6.442450944 metric gigabytes.
Of course (Score:2)
What about brains? (Score:4, Interesting)
It's my understanding that each human brain can store roughly 4-5 PentaBytes (entheogen.com [entheogen.com]). So if the human population* is about 6,775,235,741 (Google Public Data [google.com]) then I think this would blow the 250 exabytes estimate out of the water.
*Excluding Gwyneth Paltrow
Re: (Score:2)
'It's my understanding that each human brain can store roughly 4-5 PentaBytes (entheogen.com).'
So, that's like five bytes per brain? Or does this have something to do with diesel engines?
Re: (Score:2)
Re: (Score:2)
You're a slashdot reader.
And you start talking about.. what? "penta"bytes?
It's called _PETA_bytes, dumbass. Go see the fucking SI-prefixes. Then think at least 20 times before ever posting again. This is just too stupid.
Yes, I know the fucking article you're talking is just as dumb, but that doesn't excuse you for being a dimwit.
Sheeez.
Re: (Score:2)
... all of which information takes an area of ... (Score:2)
I learned a while back that for reasons having to do with the event horizon of a black hole and the conservation of entropy/information, a bit does not have mass but it does have area. One bit requires an area of 2 Planck lengths [wikipedia.org] on a side, which is 4 * 16.163e36 m = 6.4652e35 m^2
So 'all the information in the world', multiplied by 1,000, would require an area about 2 femtometers on a side. :D
Re: (Score:2)
Replying to self - yes, I know I'm playing fast and loose with the terminology. IANA physicist. But the concept stands. See black hole entropy.
Gee, thanks Slashdot. (Score:2)
Re: (Score:2)
You're giving Slashdot too much credit. Many submissions here are knowledge neutral - and a fair number appear to remove knowledge from the universe.
But wait! (Score:2)
But wait, now that we know this hasn't the sum of stored knowledge increased? And now that we know it has increased, doesn't that make it increase again? And wait, now that we know it increased again, doesn't that make it increase again? When will it ever end?
Re: (Score:2)
When will it ever end?
When we reach the end of the internet.
Now I can have it all without the internets (Score:2)
"American Idol reruns" (Score:2)
Wouldn't that count as negative information?
2 zetabytes =1000 exabytes (Score:2)
Yay!
Yeah, that bugged me too.$ (Score:2)
n/t
Compressed or Uncompressed? (Score:3)
Pretty narrow definition of "knowledge" (Score:2)
Another good chunk of what would be considered "knowledge" ... my suspicion is "chunk" is woefully inadequate and perhaps this 295 excabytes pales in comparison ... is what humans know but have not committed to another recorded form.
Call me crazy, but "the sum total of the world's knowledge" doesn't imply just "some form" of it; it pretty much states boldly that it's the works.
From TFA: ... The researchers calculated the figure by estimating the amount of data held on 60 technologies from PCs and and DVDs
"
295? is that all? (Score:2)
HAH! exabytes, my _ss (Score:2)
Data, not knowledge. big difference. mostly anti-data. distracting from reality. lies. bs (see politicians).
Pure cr_p. un-analyzed photos and movies. pron. dups.
totally bogus figures (see above)
cool number (Score:2)
Nice to see we can fit all of the info on one piece of paper, as a number and say ok, if we need to back up the internet, this is how much space we need.
Re: (Score:1)
no i NEED MOAR
Re: (Score:2)
Q: Deep Thought, what is the ratio of the number of porn files to atoms in the universe?
Re: (Score:2)
Re:In conclusion (Score:5, Insightful)
38 Gigabytes per person is enough? I don't think so.
Re: (Score:2)
38 Gigabytes per person is enough? I don't think so.
How much data storage do people in rain forests need? How about nomads, bedouins and bushmen?
There are a lot of people in the world who have lower storage requirements than you.
LK
Re: (Score:2)
One byte per person on an average.
Because some of them have been registered through scientists.
Re: (Score:2)
Technically I don't need any storage either. The real question is how many bytes would a bush tribesman want if he could get them.
Re: (Score:3)
Say no to XKCD! [abstrusegoose.com]
I figure so long as Lrrr is further away than Altair, we're safe... for now.
Re: (Score:2)
I think they'll invade earth to enact the end of Single Female Lawyer.