Follow Slashdot blog updates by subscribing to our blog RSS feed

 



Forgot your password?
typodupeerror
×
Data Storage Facebook Social Networks

Where Facebook Stores 900 Million New Photos Per Day 121

1sockchuck writes: Facebook faces unique storage challenges. Its users upload 900 million new images daily, most of which are only viewed for a couple of days. The social network has built specialized cold storage facilities to manage these rarely-accessed photos. Data Center Frontier goes inside this facility, providing a closer look at Facebook's newest strategy: Using thousands of Blu-Ray disks to store images, complete with a robotic retrieval system (see video demo). Others are interested as well. Sony recently acquired a Blu-Ray storage startup founded by Open Compute chairman Frank Frankovsky, which hopes to drive enterprise adoption of optical data storage.
This discussion has been archived. No new comments can be posted.

Where Facebook Stores 900 Million New Photos Per Day

Comments Filter:
  • by NotQuiteReal ( 608241 ) on Wednesday July 01, 2015 @10:04AM (#50025741) Journal
    They could just delete most of the photos after they age a bit, analyzing it with some of their AI whiz-bang software.

    If anyone ever asks to see the image again, they can just show one that is "close enough" and nobody would ever know the difference.

    I personally, have never posted a photo to Facebook, so I'd be OK with that.
    • by Anonymous Coward

      It's a bit like memory that way. You have some short term memory, and those become long term memory, which you can never really recall exactly. In some ways, this might be the solution to the "problem" of the internet never forgetting.

    • by QuietLagoon ( 813062 ) on Wednesday July 01, 2015 @10:30AM (#50025937)

      They could just delete most of the photos after they age a bit, analyzing it with some of their AI whiz-bang software....

      More than a few of my [real world] friends use facebook as their archive for photos, eschewing local or cloud-based storage for their historical family photos. They would be unhappy if facebook were to randomly start deleting photos just because they've been on facebook for a period of time.

      .
      Of course, I've told those friends that facebook may not have the same photo-preservation goals as they do, but they seem to be unconcerned.

      • by Anonymous Coward

        More than a few of my [real world] friends use facebook as their archive for photos, eschewing local or cloud-based storage for their historical family photos.

        . If they are storing their photos on facebook, they ARE storing them in the cloud.

        Of course, I've told those friends that facebook may not have the same photo-preservation goals as they do, but they seem to be unconcerned.

        Why would they be concerned? Ignorance is bliss.

        • .... If they are storing their photos on facebook, they ARE storing them in the cloud....

          In a general sense, correct.

          .
          However, when I said "cloud-based storage" I meant the cloud service was a storage service, not a social media service. If I had meant facebook, I would have said cloud-based social media service.

        • by bondsbw ( 888959 )

          If they are storing their photos on facebook, they ARE storing them in the cloud.

          It depends on what you mean "store". Dictionary.com [reference.com] provides this as a definition: "to accumulate or put away, for future use". (emphasis mine)

          I don't think Facebook guarantees future retrieval, so it is probably not proper to classify it as storage.

        • by ncc74656 ( 45571 ) *

          If they are storing their photos on facebook, they are doing it wrong.

          FTFY. I can kinda understand posting stuff to Farcebook so others can view it, but using it as your primary storage medium? That's at least a dozen different kinds of wrong.

      • by lgw ( 121541 )

        Facebook seems to have your friends in mind, at least for now. They have a system where old photos are store quite cheaply, because they simply fail to display the first time you try to view them. By giving up on storing them in a way that can serve a web page hit, Facebook can be quite cheap (though I hear they use powered-down HDDs, not optical - and Western Digital has a new line of HDDs just for this purpose).

      • More than a few of my [real world] friends use facebook as their archive for photos

        hahahahahahahahahahaha

        Of course, I've told those friends that facebook may not have the same photo-preservation goals as they do, but they seem to be unconcerned.

        So what makes you think they would be unhappy if facebook started deleting their photos? Apparently they don't care :p

      • Facebook practices severe compression on uploaded photos, so it's not a great option for archival. I upload family photos there for my wife's friends/family to see, but use CrashPlan for actual archival. These days most people view photos as ephemera, given that so many people can take thousands of good-enough-for-casual-use photos with their phones. Back in the day when taking / printing a photo was an event, people valued them much more.
  • After 3 months of no views, just replace them with a goatse image.
    That way, you only need to store one image which replaces 99.999% of all pics uploaded. No need for complex storage solutions!
    Another advantage would be that you can serve it really, really fast. No wait time!

  • What happens when a user wants to delete an image permanently. If it's stored on an optical disc are they going to destroy the whole disc and burn it again?
    • Re:Delete? (Score:5, Insightful)

      by bill_mcgonigle ( 4333 ) * on Wednesday July 01, 2015 @10:11AM (#50025797) Homepage Journal

      What happens when a user wants to delete an image permanently.

      What gave you the idea that's a service Facebook offers?

    • Re:Delete? (Score:4, Informative)

      by NatasRevol ( 731260 ) on Wednesday July 01, 2015 @10:16AM (#50025829) Journal

      I see you haven't read Facebook's terms of service.

      There is no delete.

    • The ultimate truth of the internet... there is no delete. Just some things are a bit harder to find.
    • by dave420 ( 699308 )
      They remove the image from the catalogue. The image still exists on disc, but won't be copied to new media or be available for retrieval. This is a solved problem, and has been for decades.
  • by neilo_1701D ( 2765337 ) on Wednesday July 01, 2015 @10:07AM (#50025763)

    Should have read:

    You won't believe this one weird trick [datacenterfrontier.com] Facebook uses to store data!

    Other than that, fascinating look at how all that data is being stored and retrieved.

  • is that their monthly AWS fees must be ENORMOUS!

    • An interesting question is at what point does it become viable for FB to follow Amazon's model to scale its own system as a business unit...

      As in, when will FB conclude that it again needs to widen its revenue stream portfolio, and it therefore makes sense to offer its own version of AWS?

      Any predictions on FBWS?

      And there's the FB hardware development division, a business unit that so far has also remained in-house but has its own revenue potential. I think people tend to underestimate MZ's am
      • It might be that using Blu-Ray autochangers may be a very useful thing to have, especially for something that can fill the gap between HDDs and LTO tapes for backups [1].

        The pathetic thing is that this technology isn't new. We used to have 100, 200, even 400 disk CD and DVD carousels. By replacing the CD reader with a burner, and using 128 GB BDXL media, that means tens of terabytes of tamper-resistant (important with all the ransomware out there) WORM storage.

        The trick is getting BD media into the teraby

        • I was actually thinking of the potential of FB's networking hardware initiative, though you make a good point about this storage angle.
        • by ncc74656 ( 45571 ) *

          The trick is getting BD media into the terabytes and getting it at a price point where it is decently affordable. For example, a 100 GB BDXL disk is $65, but it should be about 10% of that price in order to be a viable backup medium.

          My last spindle of 25 GB BD-Rs cost me maybe $0.60 each or so. I could drive down to Fry's right now and pick up a spindle for about $0.80 each. A 4x increase in storage density isn't worth a two-order-of-magnitude increase in price. I would be surprised if Farcebook didn't

  • is it cold in here or what?

  • Going on for a while (Score:4, Informative)

    by Alomex ( 148003 ) on Wednesday July 01, 2015 @10:23AM (#50025889) Homepage

    I've noticed large latency for rarely used pictures in FB for over eight months now, and by large latency I mean visit the page, then come back the next day to see the next batch of > 5 year old pictures and wait another day for the final batch of ~10 years ago pictures.

  • People upload the same memes all the time. Just hash and store the common images and you'll reduce the unique photos to one or two unique images per day. :)

  • Amazing (Score:4, Insightful)

    by bws111 ( 1216812 ) on Wednesday July 01, 2015 @10:56AM (#50026081)

    Wow, they discovered HSM [wikipedia.org] only 40 years after it was introduced. Amazing.

    • Re: (Score:3, Insightful)

      by Ravaldy ( 2621787 )

      Pointless arrogant comment.

      Nobody claimed it was new or that they had reinvented anything. They just applied modern technology to a well know strategy to solve a known problem. In the modern age of storage and data centers I have yet to see this (not to say that nobody has done it).

      When someone shows you an electric car do you tell them cars have had 4 wheels since before 1903? I assume you do.

  • Is facebook still a thing? People still use it after all the security problems and personal information screw-ups?
  • How is using blu ray cheaper than hard drives? Not only is it slower, but the medium + the hardware to burn them + the robotic retrieval system..
    Seems like there could be an easier solution to this: hard drives in racks. No robots, no optical drives, and no blu ray discs.
    One 500gb hard drive already has 10x the amount of storage as a dual layer bluray. In fact, a 10 pack of dual layer blu ray discs on amazon costs twice as much as a 500gb 3.5" drive. Am I missing something?
    • by jp10558 ( 748604 )

      Electricity use - did you even watch the video? Of course not. Also, the data survives a drive failure.

      What I wonder is why they think this is better than LTO6, which already has robots etc COTS solution. It's possible, maybe, that it takes less space. It is resilient to stray magnets in a way tapes maybe wouldn't be - but is that a common issue with LTO?

      • No I didn't watch the video. I'm at work. But I did skim the article, and apparently I missed some details, because the comment police are crawling all over the place.
    • by ncc74656 ( 45571 ) *

      How is using blu ray cheaper than hard drives?

      3 TB will fit on 120 25-GB BD-Rs. At 40 cents each [newegg.com], that's $48 in media costs. If you do like I do and reserve 20% for dvdisaster error-recovery data, you're still only looking at $60.

      A 3 TB WD Green will set you back $95 [newegg.com]. (Want to spring for the NAS-rated Red drives instead? That'll be $119 [newegg.com]. Their absolute cheapest 3 TB hard drives are a couple of models from Seagate and Toshiba at $90 each.)

    • by Whiteox ( 919863 )

      You are right. There's no reason for why you can't 'spin down' a rack of cheap server grade HDs to save power.
      What happened to Bernoulli disks anyway?

  • First thing I did was open facebook and look to see what my oldest picture was. I don't have that many and it came up pretty quickly but I'm sure lots of other people had the same impulse.

  • Didn't we see a story about this last year?
  • by wonkey_monkey ( 2592601 ) on Wednesday July 01, 2015 @01:14PM (#50027141) Homepage

    In the cloud, obvs.

  • The nsa built that huge data center in Utah for nothing?

    Now if the nsa would just open an api to retrieve it....

Passwords are implemented as a result of insecurity.

Working...