Catch up on stories from the past week (and beyond) at the Slashdot story archive

 



Forgot your password?
typodupeerror
×
Data Storage The Internet

Lost Something? Search Through 91.7 Million Files From the 80s, 90s, and 2000s (arstechnica.com) 57

An anonymous reader quotes a report from Ars Technica: Today, tech archivist Jason Scott announced a new website called Discmaster that lets anyone search through 91.7 million vintage computer files pulled from CD-ROM releases and floppy disks. The files include images, text documents, music, games, shareware, videos, and much more. The files on Discmaster come from the Internet Archive, uploaded by thousands of people over the years. The new site pulls them together behind a search engine with the ability to perform detailed searches by file type, format, source, file size, file date, and many other options.

Discmaster is the work of a group of anonymous history-loving programmers who approached Scott to host it for them. Scott says that Discmaster is "99.999 percent" the work of that anonymous group, right down to the vintage gray theme that is compatible with web browsers for older machines. Scott says he slapped a name on it and volunteered to host it on his site. And while Scott is an employee of the Internet Archive, he says that Discmaster is "100 percent unaffiliated" with that organization.

One of the highlights of Discmaster is that it has already done a lot of file format conversion on the back end, making the vintage files more accessible. For example, you can search for vintage music files -- such as MIDI or even digitized Amiga sounds -- and listen to them directly in your browser without any extra tools necessary. The same thing goes for early-90s low-resolution video files, images in obscure formats, and various types of documents. "It's got all the conversion to enable you to preview things immediately," says Scott. "So there's no additional external installation. That, to me, is the fundamental power of what we're dealing with here."
"The value proposition is the value proposition of any freely accessible research database," Scott told Ars Technica. "People are enabled to do deep dives into more history, reference their findings, and encourage others to look in the same place."

"[Discmaster] is probably, to me, one of the most important computer history research project opportunities that we've had in 10 years," says Scott. "It's not done. They've analyzed 7,000 and some-odd CD-ROMs. And they're about to do another 8,000."
This discussion has been archived. No new comments can be posted.

Lost Something? Search Through 91.7 Million Files From the 80s, 90s, and 2000s

Comments Filter:
  • by drinkypoo ( 153816 ) <drink@hyperlogos.org> on Wednesday October 19, 2022 @08:08AM (#62979787) Homepage Journal

    Search has been down since 8PM last night. Good thing this site isn't popular any more, a slashdotting would probably not help

  • by Gibgezr ( 2025238 ) on Wednesday October 19, 2022 @08:30AM (#62979841)

    This is a cool project. I hope they keep expanding it even beyond their current plans.

  • I had to say it (Score:5, Informative)

    by war4peace ( 1628283 ) on Wednesday October 19, 2022 @08:39AM (#62979863)

    There's loads of uncensored porn. I give the website two weeks before it's brought down by a load of Cease-and-Desist requests, together with angry mobs of "think-of-the-children" people with torches and pitchforks.

    • I was basically thinking the same thing, though for slightly different reasons. There have been a lot of legal changes to who can upload porn of whom over the last two decades. I can see a lot of lawsuits happening over old porn from people that didn't authorize it's existence happening. Precursors to modern revenge porn and so forth.
      • It's a rather low chance for someone to recognize someone else from 20+ or 30+ years ago in a rather obscure corner of the Interwebz and say something about it.

  • by LondoMollari ( 172563 ) on Wednesday October 19, 2022 @08:48AM (#62979879) Homepage

    The only thing that brings a new technology to the forefront is porn.

  • Just not the same... (Score:3, Interesting)

    by Shaitan ( 22585 ) on Wednesday October 19, 2022 @09:09AM (#62979955)

    Nothing is going to bring back those print shop pro 5 and other cd's that exploded in drives with high read speeds (56x if I recall).

    I remember when mythbusters declared the exploding CD in high speed drives myth 'Busted' and laughing. I think their testing failed because the issue was a combination of flaws in the discs from some manufacturers which only resulted in a fault at the high rpm rate after enough internal stresses had accumulated over time. I probably swapped about a hundred drives filled with shattered disc fragments. So that is no myth.

    Of course most people probably didn't see that many. At the time I worked for a local pc shop in the midwest and we'd bundled paint shop pro 5 along with new systems and this disc being the most common culprit quickly became obvious to us. They also didn't fail immediately... those calls came in sporadically over the next couple years. Incidentally, if you disassembled the drive and shook out all the fragments the drives typically worked fine. It was cheaper to just swap them, they didn't qualify for warranty replacement, so I think everyone in the shop had drives they'd fixed after the fact in their home systems. ;)

  • by Thelasko ( 1196535 ) on Wednesday October 19, 2022 @09:31AM (#62980037) Journal
    Hopefully someone found old copies of Iomega's 1-step backup utility. They released it with their Zip and Jazz drives as an archival tool, and then dropped support for it. [computerhope.com] If you didn't keep the disk it came on, your backups are useless.

    I still have some backups from the 90's that I can't access.
    • Re:Iomega 1-step (Score:4, Informative)

      by EvilSS ( 557649 ) on Wednesday October 19, 2022 @09:38AM (#62980063)
      • That appears to be it! Thanks!
      • It has some sort of primitive DRM! It must check the registry keys for the correct drivers. I'll have to find a windows 98 or NT dist to get it to work. Ahhh! What a headache!
    • IIRC, Iomega was purchased by HP who promptly dropped support to eliminate competition for their own products. I got burned at the time. I'll keep an eye out for my old Zip Drive cartridges which may still be here somewhere.
      • Nothing about HP here:

        "Iomega&rsquo;s fortunes were tied to its Zip drive and when sales began to wane, its stock went from $100 per share in the 1990s, and plummeted to just $2 in the mid 2000s."
        [...]
        "In 2008 Iomega was acquired by EMC and it became a division of the storage giant offering a range of storage servers."

        HP did buy and kill Palm though...

        https://www.silicon.co.uk/data-storage/storage/tales-tech-history-iomega-zip-drive-209499
        • I'm apparently thinking of a different backup device. It was a high-density mag tape cartridge, not a disk. I'm sure none of this would do me any good as I'm sure the device was disposed of when rendered unusable when support was dropped. What I'm remembering is a similar scenario with a different device an HP. I have a distant familial connection with David Packard, so it stung a bit at the time.
          • It was a high-density mag tape cartridge, not a disk.

            The Iomega one was the "ditto" serie.
            The first models (named simply "Tape") used the same kind of QIC tape cartridge as other manufacturer in the field.
            Later models (including the LPT-connected "ditto 800" I got second hand) used Travan tapes (slightly larger cartridge and much more capacity, the tape drive itself is still backward compatible with classic QIC).

            A quick glance at Wikipedia reveals that "Tecmar" (never heard of them) bought ditto from Iomega.

            Most of the industry has moved on.
            - On the p

    • I have copies of a bunch of floppy backups made with Norton Backup 3.0. I can't believe there's not something that can decode them. I supposed I could try running it in a VM (or DOSBox), but I haven't bothered yet.

      • https://superuser.com/questions/1615820/how-to-restore-an-old-windows-7-norton-backup-from-2008-or-2009-on-a-modern-pc
    • by Reziac ( 43301 ) *

      Back in the era of hacked FTP servers hosting all manner of hidden files, one could come across peculiar and odd things.

      One that I tripped over was the source code for Colorado Backup for DOS.

      Your plaint reminded me... good luck. Best bet might be someone's old ZIP drive being sold intact on eBay. (I'm pretty sure I have what you need in a box somewhere, but finding it... unlikely.)

  • by Applehu Akbar ( 2968043 ) on Wednesday October 19, 2022 @09:32AM (#62980045)

    Betcha that the original reason this team bothered to start this project is to look for some of those legendary troves left behind by early Bitcoiners, in the days when people could mine thousands of them on an idle PC and leave them on a hard drive that would later be abandoned. After a specialized search of their data trove to grab any such files, why not release all the miscellaneous junk to the general public?

  • History loving? (Score:5, Insightful)

    by The Evil Atheist ( 2484676 ) on Wednesday October 19, 2022 @09:45AM (#62980097)
    Unfortunately, copyright law cares not about your love of history. Copyright is basically long enough to cover things that would be considered historical rather than contemporary.

    No files from the 80s through to the 2000s is out of copyright, so basically this would count as 91.7 million cases of copyright infringement.
    • I'm basically a Techie Downer.
    • Unfortunately, copyright law cares not about your love of history. Copyright is basically long enough to cover things that would be considered historical rather than contemporary.

      No files from the 80s through to the 2000s is out of copyright, so basically this would count as 91.7 million cases of copyright infringement.

      Half of Archive.org is copyrighted files. It's just a matter of whether the copyright holder cares enough to go searching for 20 year old files. A lot of them don't.

    • by lsllll ( 830002 )

      I agree, and I question why they didn't just build the index and link all the media to archive.org.

      Take this CD [textfiles.com] for instance. They have a link to the CD's page on archive.org, which includes all the same media: the .iso, the images, but Discmaster has copied the files onto its own web server. I would rather have them work something out with archive.org so that they can use the images from archive.org on their site and have all links to download go to archive.org. That way they can shield themselves a lit

    • No, there are a few files I created which are definitely not infringements. So that should be 91,699,994.

  • by haruchai ( 17472 ) on Wednesday October 19, 2022 @09:53AM (#62980123)

    SEARCH IS DOWN
    Oct 19 8:16AM EST
    Good morning.
    The database restore is taking longer than we had hoped. Sorry :(
    New ETA: Today, Oct 19 around Noon

    Oct 18 9:24PM EST
    We identified 2 different issues that caused the search to crash.
    The search database is being recovered, which will take several hours.
    We hope to have search back online tomorrow, Oct 19 by 9AM EST.
    Once again, sorry about the search downtime.

    Oct 18 8:04PM EST
    Sorry, but search is currently DOWN. We are super bummed about it.
    We are ACTIVELY working on it.
    No ETA at this time.

  • Tried to go to the site and Malwarebytes blocked it, claiming it harbored a trojan. ????
    • malwarebytes hasn't been relevant since at least 2011
    • Malwarebytes does this a lot.

      Note that it's not saying a Trojan was found, just that the site *might* contain one.

      Their heuristic seems to be "anything not mainstream is dangerous so don't go there".

    • by guruevi ( 827432 )

      Probably accurate, given the amount of shareware on the site, that stuff was full with viruses.

    • MBAM is pretty good, except for their web stuff. If you have pro so you have real time scanning, you might as well just disable the web crap. If you're going to sleazy parts of the web, you should use noscript anyway.

  • Gotta let these young whippersnappers feel the pain of trying to uninstall that bastage.

    • Gotta let these young whippersnappers feel the pain of trying to uninstall that bastage.

      Or get it to work consistently.

  • 91.6 million hits
  • Where do these files come from?
        All files were uploaded to archive.org by thousands of different users.
  • I have been looking for a game I played as a kid on an old apple 2e. It was distributed in a magazine that was on two 5 1/4 floppies. I have no idea which magazine it was unfortunately, but remember the game was called something like “Sylven Idol” or “Silven Idol”

    It was a kind of top down adventure RPG, where the player moved around a pretty large map, shooting fireballs at bad guys such as walking trees, giants, and other fantasy creatures. One had to collect keys to open magic g
    • Sylvan Idyll is from the Apple II disk magazine "Softdisk". Specifically, issue #127. It is available (and playable in-browser!) here: https://archive.org/details/21... [archive.org]

      It is a sequel to / new levels for "Catacombs" which was written by John Carmack (from issue #114), and there is third game Ether Quest (issue #134)

      • Holy shit wow!
        I had no idea it was a John Carmack work! Like I said, its been close to 30 years since I’d played it! Thank you so much!
  • I searched for 'PDP-11' and got a total of 5 hits.
    It needs to incorporate bitsavers 'bits' for paper tape images and listings.
    Also listings in BYTE, Kilobaud, Creative Computing etc.
    Yes I am completely serious.

Utility is when you have one telephone, luxury is when you have two, opulence is when you have three -- and paradise is when you have none. -- Doug Larson

Working...