Rethinking the Nature of Files 369
An anonymous reader writes "Two recent papers, one from Microsoft Research and one from University of Wisconsin (PDF), are providing a refreshing take on rethinking 'what a file is.' This could have major implications for the next-gen file system design, and will probably cause a stir among Slashdotters, given that it will affect the programmatic interface. The first paper has some hints as to what went wrong with the previous WinFS approach. Quoting the first paper: 'For over 40 years the notion of the file, as devised by pioneers in the field of computing, has proved robust and has remained unchallenged. Yet this concept is not a given, but serves as a boundary object between users and engineers. In the current landscape, this boundary is showing signs of slippage, and we propose the boundary object be reconstituted. New abstractions of file are needed, which reflect what users seek to do with their digital data, and which allow engineers to solve the networking, storage and data management problems that ensue when files move from the PC on to the networked world of today. We suggest that one aspect of this adaptation is to encompass metadata within a file abstraction; another has to do what such a shift would mean for enduring user actions such as "copy" and "delete" applicable to the deriving file types. We finish by arguing that there is an especial need to support the notion of "ownership" that adequately serves both users and engineers as they engage with the world of networked sociality. '"
There is no "issue." *I* own my files and data (Score:2, Insightful)
I'm sorry, but MS issuing a paper on the "issues of file ownership" and the cloud sends a little chill up my spine. Makes me think that engineering may not be the only impetus behind their paper. It also makes me wonder if someone isn't looking to take a little more "ownership" of what has traditionally been considered *my* data.
It's bad enough I'm already forced into "buying" software and media that I can never resell. Now they want my fucking Word files too I guess.
Re:There is no "issue." *I* own my files and data (Score:5, Insightful)
We just want to install some robust Technological Protection Measures to preserve your ownership of those files across all devices and platforms and legal systems aligned with international norms... Totally harmless, nothing to worry about.
Re:There is no "issue." *I* own my files and data (Score:2, Insightful)
Microsoft: All your files^h^h^h^h^hdata are belong to us!
Are they confusing form with function? (Score:2, Insightful)
A file is essentially just a collection of data - no more and no less. To try and add attributes to that makes little sense and seems as futile as trying to say that each collection of molecules should have a tag saying what it is, who it belongs to and what it's for. Sure, you can add abstractions and structure on top of the basic form, but when you do that you are adding a layer - not redefining the basic building block.
Re:Are they confusing form with function? (Score:2, Insightful)
To be honest, this sounds like MS is inventing something that Apple already invented. Apple has had forked files for how many years now? With one fork for the data and a resource fork for the icon and a few related pieces of information.
Personally, I don't like it, it's non-standard and requires special steps to work with at times, and I'm don't really understand why it's needed in the first place. If it's really that big of a problem you can always zip up the meta data file and the data file and call it a day, but for most purposes I'd rather than the data not get corrupted when the meta data does.
Re:There is no "issue." *I* own my files and data (Score:4, Insightful)
You should read the article, you are illustrating their point. They talk about how users associate ownership with having a file on a known physical location and how in order for people to feel comfortable with cloud storage the definition of file needs to be redefined in a way that people feel they have ownership over data that exists "out there".
"[...] ownership is what we are thinking of, when ownership stands as proxy for what used to be knowledge of location and responsibility for that location. What was once a relationship between a user and a physical thing now needs to stand as a relationship between a user and a digital thing. Just what this ownership might be and how it might function in terms of what is specified in this new entity we are thinking of, one that somehow has the properties we have described above and which also allows this new characteristic, we have begun to outline but a beginning is all it is."
Part of this is the ability to be able to delete their data even when it has been put out there in the wild.
"A boundary object needs to be developed that can bridge the abstraction of the user and the one of the engineer, who needs to worry about where this thing that keeps growing and changing, and where the locale of storage changes too, such that when a user says ‘delete’, the thing whatever it is and wherever the entities constitutive of it are, are indeed, done away with."
This is a paper talking about your concerns and how to address them.
Re:There is no "issue." *I* own my files and data (Score:3, Insightful)
A quote from the conclusion of the article:
A boundary object needs to be developed that can bridge the abstraction of the user and the one of the engineer, who needs to worry about where this thing that keeps growing and changing, and where the locale of storage changes too, such that when a user says ‘delete’, the thing whatever it is and wherever the entities constitutive of it are, are indeed, done away with.
I'm sorry, but that sounds a *lot* like DRMing every file to me, with a central service controlling every file (how else could you implement such a system?). The authors even admit as much a few sentences later:
At first reading one might think they are alluding to digital rights management.
Of course, they seem to deny that this is DRM. But that's sure what it sounds like to me. And DRM needs some sort of central service to work, which I'm sure MS will be happy to provide of course.
Re:There is no "issue." *I* own my files and data (Score:2, Insightful)
Of COURSE they are. They are trying to find a different way to market it - since DRM has no user benefits and users actively dislike it, they 'need' to redefine the issue so users have no choice.
This is marketing.
Oh dear god, please, please, please.... (Score:4, Insightful)
Do NOT "improve" the file. I'd like to continue to be able to use my computer and other devices.
POSIX xattrs (Score:4, Insightful)
Look them up. They already allow you to attach arbitrary metadata to a file. Most modern filesystems and user-level utilities support them already. They're even used as the underpinnings for security mechanisms such as POSIX ACLs and SELinux. Sure, there are issues with performance when you have *lots* of xattrs on a file, and that's a fruitful area of research, but we sure don't need some brand-new Microsoft-invented thing to deal with metadata.
Re:There is no "issue." *I* own my files and data (Score:4, Insightful)
Poetic justice would have Apple purchase Microsoft and break it into divisions.
Re:There is no "issue." *I* own my files and data (Score:5, Insightful)