Follow Slashdot blog updates by subscribing to our blog RSS feed

 



Forgot your password?
typodupeerror
×
Data Storage Software IT Linux

StorageTek Announces Linux Based Storage Solution 10

njcoder writes "TechWorld has a report on StorageTek's new content-based disk and tape storage product. 'The software runs on a cut-down Linux kernel on dual Xeon nodes in a meshed network. The content-addressable store (CAS) makes the system more efficient than standard CIFS and NFS access, but supports all three.' The disk based storage uses SATA drives."
This discussion has been archived. No new comments can be posted.

StorageTek Announces Linux Based Storage Solution

Comments Filter:
  • IntelliStore uses the NIST-developed SHA 256 algorithm

    A stored object can have its hash re-created and if it differs from the original hash then the data has changed. It can be regenerated from the hash.

    It seems that they have developed in inverse function for the SHA 256 algorithm. Nice!

    Seriously though, I would be interested to see how this works. Some sort of brute force attack? Or do they store some extra error-correction bits?
  • by freality ( 324306 ) on Wednesday June 08, 2005 @04:00PM (#12761150) Homepage Journal
    "A 4TB starter IntelliStore costs around $75,000 (£42,000) with each additional terabyte costing $9,000 (£5,000) A set of additional compliance functions adds $15,000 to the cost (£8,400)."

    Better:

    - Storage: SuperMicro 15-bay disk array with 2 RAID-core controllers (2 RAIDs), SCSI-attached, for a total raw space of 12 disks (-1 for each RAID, -1 for OS). 2 fast processors and a bunch of RAM. Mount all data under /mnt filesystem.
    Size: 12*400GB=4.8TB
    Approximate cost: $5-10K depending on disks, processors and RAM.

    - File-server: mathopd static web-server pointing at /mnt filesystem. 10-20k transactions per second for small files. Fast transfer of large files.

    - Database: Nutch open-source search engine capable of indexing 40M pages per 1GB/RAM. Like the article says "millions of objects" now, "billions in the future". Point nutch at mathopd and watch your "content-based" storage come online. You can even get an RSS feed of newly added items.

    - Offline: Dell PowerVault 8 tape changer, SCSI attached + mtx for automatic tape changing. + 5k

    Now, a web search engine isn't a database, at least not off the shelf. But with this configuration you can afford a l33t programmer for half a year and still come in under the price of the StorageTek solution. Plus, once you've customized it, your capacity upgrades are much cheaper. And I bet it's faster. Dunno though, I bet you can't evaluate most of the relevant system parts from StorageTek before you make a buy decision (unlike the system above). ;)
  • Data is accessed by content rather than a hierarchical filesystem based on unique filenames. Has anyone used a system like this and can explain advantages/disadvantages? Would this ever be used on home PCs?
    • CAS products typically identify objects by a hash of the contents. They're usually accessed via an API: you give it the stuff to store (and, optionally, a retention policy) and you get an opaque ID in retrun. You can get the content back if you have the ID, so IDs normally are capabilities (in the authorisation sense).

      These things often sit behind document or other content-management systems. The advantage is that you only store one copy of the data; the disadvantage? Well, mostly, it's just a different wa
      • Meant to add: "Content Addressable Storage" does NOT mean you can look up objects by their metadata attributes, normally. That kind of functionality is usually a part of the application that uses CAS and sits on top of it (although IIRC the Celerra which I mentioned earlier does have some kind of metadata search facility).

Our OS who art in CPU, UNIX be thy name. Thy programs run, thy syscalls done, In kernel as it is in user!

Working...