Vendor Neutral File Formats? 83
timmyv asks: "I have recently been tasked with developing a corporate wide policy that will standardize all employee created documents on vendor neutral file formats. OASIS is good in theory, but I haven't been able to locate enough concrete examples of policies or implementation schemes that work at a corporate level. Does anyone work at a company where documents can only be saved as RTF, HTML, etc. or have any experience with this type of problem?"
RTF (Score:2, Informative)
OpenOffice (Score:3, Informative)
Re:Not sure what the question is limited to (Score:3, Informative)
also don't forget that it may be made an ISO standard [slashdot.org].
Re:PDF (Score:3, Informative)
The only issue with PDF is the tendancy to be one-way. But there are programs out there designed to convert PDF documents to other formats.
Re:PDF (Score:2, Informative)
My company is standardized (at least for production work) on PDF format, which everything can make. The problem is getting things back out or editing such documents...
It seems that the only truly accurate interpreter is Adobe's Acrobat Software, but it 'just works' for the final output. Converting it to anything else useable doesn't seem to work vey well or reliable.
Editing these things is a bit of a pain, but it can be done, and we do for a chunk of the production----> but this is definitely beyond the capabilities of any of the PHB's or the multitudes of Customer reps/etc, so the 90% paperwork and other miscellaneous office/corporate documentation never sees PDF format. A lot of that gets done in MS Word/Excel. Oh well... I find myself idly wondering/dreading what the next version of Acrobat is introducing...
Re:"Vendor Neutral"???!!! (Score:2, Informative)
RTF does contain, in theory, sufficient control words to describe everything that Word 2000 can do, but it's hardly a direct translation and things get lost a lot. Furthermore, RTF contains a few control words that Microsoft didn't put there: such as \collapsed (added by NeXT to describe paragraphs that had been hidden by the user).
There is a huge difference. RTF is a formally published, open specification [microsoft.com] and Microsoft openly encourages [microsoft.com] third-party implementations. It's been stable for 5 years now. WordRe:You're asking the wrong questions (Score:2, Informative)
Re:You're asking the wrong questions (Score:3, Informative)
For free access to documents by citizens, PDF is pretty good. There are viewers for most platforms (I don't know about BSD or Solaris, but Mac/PC/Linux all are OK); and there are non-Acrobat print-to-PDF knockoffs at economical prices. Requiring PDF publication of all publicly available printed documents in, say, PDFv1.2, PDFv1.3 or PDFv1.4 would be a useful and not overly onerous step. (Adding forms-completion ability to the PDF requirement might well be too much.) The PDF standards are public, although copyrighted. .DOC under Linux. =)
M$ Office has free viewers for older versions on Windows, but the Mac version isn't native on the current Apple OS, and OpenOffice is the only viewer I know for
As far as permanence of data, nothing beats the long term unkillability of a bare TXT file; it also allows improved handicapped accessibility to the data in the process. For databases (w/o queries) and single-page spreadsheets, CSV comma-separated text format is similarly hard to destroy. Most Office Suites will read in such applications. For charts and other pictures, JPG may eventually be replaced, but will probably be readable for a long time. Of course, data corruption is always a risk (especially for JPG), so backups should be made redundantly, and be prepared for at least one major media format migration (EG: CD to DVD-Blue, or whatever). Requiring that any software be able to import from and export to these as relevant would be a reasonable and not overly onerous step.
Security is a more problematic issue. Some documents are meant to be kept non-public, barring (or even given) FOIA requests. Were it in my desmene to do so, I would still require the creation of the files for archive purposes, but storage off-site at a secure abandoned-salt-mine-type facility. Given that Security is oft diametrically opposed to Accesibility and/or Permanence, this may be a problem.
Oh... and PDF has some built-in security features. Requiring them to be used only when such security is mandatory might be worth thinking about.
what are you talking about? (Score:1, Informative)
It is restricted by patents, see..
http://news.com.com/Microsoft+seeks+XML-re
Re:PDF (Score:3, Informative)
There's also -
pdf2txt@adobe.com
pdf2html@adobe.com
Re:PDF and the Things That Turn Into It (Score:2, Informative)
I don't know of converters that turn XML,SGML->HTML, but they probably exist.
The tool to convert from domain-specific XML to XHTML is called XSLT. For more info, Ask Google [google.com].
I always try and use portable files (Score:4, Informative)
Well, for CAD, its a screwed up world. The best/most portable format is probably IGES, except its such a huge specification that nobody's IGES file is compatible with anybody else's. I'm an engineer and for myself I use Turbocad 10 professional at home. It reads/writes AutoCAD files and numerous other formats, and is somewhere in between AutoCAD and Pro/Engineer in terms of its capabilities. You'll have a tough time convincing any corporation to use TurboCAD though.
For text documents, HTML would be good, except MS products tend to produce the most screwed up HTML files I've ever seen. All I can recommend is to use PDF files for important and official documents because they are essentially immutable and tend to produce consistent hardcopies from any computer.
OpenOffice formats are nice, and if I were starting up a new business I would of course set up Linux workstations to use OO exclusively, and put a Windows machine down in the IT room so the IT staff could convert any troublesome documents that come through the email.
For Visio, there is no equivalent, other than exporting the visio file as a DXF or maybe a WMF. Windows MetaFiles never seem to load right in other apps though so thats something to think about. SVG files will probably be the future here if Dia starts using them.
Re:OpenOffice (Score:1, Informative)
So... the format is pretty much vendor neutral.
Cheers,
Daniel Carrera.
OpenOffice.org volunteer.
The Shot Heard Round the World (Score:2, Informative)
There are no "StarOfficeisms" in the OASIS XML Open Document file format specification. Least ways not any we know of. By December of 2004, when the OASIS TC submitted the XML file format specification to ISO, all known references and anachronisms that might be called starisms were changed. Neutralizing changes were even made to such things as the file format extensions and mime type registrations. We even changed the name from OASIS Open Office to OASIS/ISO Open Document.
Separating the file format from any particular application or applications suite is a big deal. Especially if there is a rising demand from enterprise level end users for an applications independent universal structured file format solution. tty. Separating the file format from any particular application or applications suite is a big deal. Especially if there is a rising demand from enterprise level end users for an applications independent universal structured file format solution.
So the OASIS/ISO TC chose to keep that most powerful of technology terms, the word "Open", but lose the direct reference and/or suggestion to OpenOffice.org.
The second reason for changing the name to "OASIS Open Document" is far more interesting, and directly relates to the European Union "TAC/IDA" task force recommendations based on the infamous Valoris Report. You will recall that by September of 2004, the EU had evaluated responses from both Sun and Microsoft regarding the Valoris recommendation that all EU information system purchases be required to support an open standards based XML file format specification.
Microsoft's open XML proposal was determined by the EU to be "not open enough". This criticism was in the original Valoris Report, and not altered by subsequent Microsoft arguments. After much squealing, squawking, finger pointing, complaining and outrageous misrepresentation, in mid November of 2004 Microsoft finally conceded and agreed to meet EU requirements. More about this in a moment, but for now the important thing to note is that the EU held firm. A remarkable feat even though there is currently a range of cross platform alternative solutions that meet EU requirements, including the open and free OpenOffice.org, Sun's StarOffice, IBM's WorkPlace, and Novell's Open Office. And if Microsoft had not sold their share in Corel to a vulture investor outfit for pennies on the dollar, an investor who then proceeded to cut XML out of Corel, WordPerfect Office would also be OASIS/ISO XML compliant.
Meanwhile, the EU was also not entirely satisfied with the OASIS XML specification as explained in Sun's response to the EU requirements recommendation. Three things in particular concerned the EU.
First, that OASIS submit the file format specification to ISO. In September of 2004, OASIS management and the OASIS TC came to agreement with ISO that the file format specification would be submitted to ISO before years end, but maintenance and improvement would remain with the current OASIS TC. Hence the combo moniker "OASIS/ISO".
Second, there was a great deal of concern about "custom-defined schemas". Sometimes this issue is also referred to as "user-defined schemas". Others just call it a "forms" or "template" issue. Basically it refers to an applications ability to load (or consume) an externally defined schema template that might include specific user interfaces (forms), business - workgroup logic (routing), meta data interfaces, and other things related to the emerging world of collaborative computing.
Microsoft of course champions the auxiliary Office productivity application, InfoPath. However, in September of 2004, the OASIS TC finished work on extending the specification to include XForms, SVG, and SMiL. Current OOo -v.2 builds fully demonstrate the powerful capabilities of these extensions, including the binding of web services and data to graphical objects and forms/template widgets. Move over InfoPath. Hello OASIS UBL!
The third issue involves EU concerns fo