Follow Slashdot blog updates by subscribing to our blog RSS feed

 



Forgot your password?
typodupeerror
×
Microsoft Software Linux

Open XML Translator for Microsoft Word Available 96

narramissic writes "The first phase of a Microsoft-funded project to create software that can convert Microsoft Word documents between Open XML and Open Document Format (ODF) has been completed. As a result, the Open XML Translator is now available for download in version 1.0 from SourceForge.net. A ComputerWorld article details the history of the project, discussing the work of companies like CleverAge and AztecSoft, as well as community efforts to bring this project to realization."
This discussion has been archived. No new comments can be posted.

Open XML Translator for Microsoft Word Available

Comments Filter:
  • Aw... (Score:5, Funny)

    by darkhitman ( 939662 ) on Friday February 02, 2007 @06:11PM (#17866186)
    Please no clippy, please no clippy...

    "It looks like you're trying to convert to a non-Microsoft proprietary format. I can't let you do that, Dave"
  • Relation to Linux? (Score:4, Insightful)

    by Anonymous Coward on Friday February 02, 2007 @06:13PM (#17866204)
    And how's this related to Linux? It is just a ODF - OpenXML convertor for Windows.
    • by mtenhagen ( 450608 ) on Friday February 02, 2007 @06:29PM (#17866486) Homepage
      Software Requirements

      Before installing the add-in, make sure you have one of the followings...

              * Microsoft Word XP
              * Office Compatibility Pack
              * .NET framework 2.0

      or

              * Microsoft Word 2003
              * Office Compatibility Pack
              * .NET framework 2.0*

      or

              * Word 2007 with .NET Programmability Support activated
              * .NET framework 2.0*

      Minimum Software Requirements

      To compile the source distribution, you will need Microsoft Visual Studio 2005.
      • It's under the BSD license, so I'm sure someone can redo it so it can be compiled on Linux. After that, surely the Word hooks aren't that difficult to emulate? (IANAMP (not a mono programmer), so I don't know how difficult it'd be. Assuming it works better than what the OO.org people have, though, I suppose they'd rather adapt well-working code than write it from scratch...)

      • It didn't mention clippy anywhere??

        Bah!

      • by albalbo ( 33890 ) on Saturday February 03, 2007 @05:36AM (#17871372) Homepage
        Michael Meeks made a version of this converter available which compiles using mono, see entry 2007-01-29 on http://www.gnome.org/~michael/ [gnome.org] .

        Realistically, there's no reason it even needs to be in C# - the various bits of wrapper could be rewritten into other languages, and the main work is done by an XSLT. The OpenDocument Fellowship might include a similar tool in future tool sets, translated to be a bit more native.
  • Why is this such a big thing considering that OpenOffice has the ability to import from and export to MS Word format? It even allows you to email the document in Word format without having to explicitly save it in that format.
    • The idea is that it would let Microsoft Word users do the conversion, and save their documents in ODF, rather than leaving them in DOC and requiring OpenOffice users to do the conversion.

      The big difference is which format the documents get stored in. If they're being stored in DOC, then you're still mostly at the mercy of Microsoft; it's easy for someone to open the document in some new version of Word, save it, and silently move it into some new MS-created "binary blob" format, breaking backwards compatibility.

      So basically, a converter would let states like Massachusetts start to move away from DOC as the de facto standard format for electronic documents. They'd probably still use it as an editing format, because I don't see them tossing Word for OO.org anytime soon, but it would help get rid of the huge "silos" of DOC stuff that's sitting around, getting silent migrated from one version of Microsoft's formats to the next.
    • IIRC (correct me if i'm wrong), but i think OpenOffice can only handle the word-processor part of Open XML, not the rest of it (spreadsheet, presentation, etc).

      and, since many large organizations/governments have already switched to ODF, those groups wouldn't really be able to switch back to Microsoft without a conversion tool, preferably built into Office. this could be MS's attempt to get them to switch back.
      • IIRC (correct me if i'm wrong), but i think OpenOffice can only handle the word-processor part of Open XML, not the rest of it (spreadsheet, presentation, etc).

        OpenOffice.org is a complete office suite, comprising Word Processor (Writer), Presentations (Impress), Spreadsheets (Calc) and Vector Graphics/Diagrams (Draw). The Open Document Format (ODF) is able to encapsulate all these document types.

        Whether the Office Open XML (OOXML) to ODF convertor can handle all of these transformations, I don't know. I'm not holding my breath for a complete converter from OOXML to ODF either - 6000+ pages of OOXML spec is going to be hard to read, let alone code all the diff

        • yeah, i'm an idiot. in my head "Microsoft Word" and "Microsoft Office" are interchangeable. didnt occur to me that they were just talking about Word, not all of Office.
    • Why is this such a big thing considering that OpenOffice has the ability to import from and export to MS Word format?

      Some governments have conflicting directives including support for ODF and a contract to buy MS Word. Many tools designed to allow the blind to use computers work only with specific products, like MS Word. As a result, some governments asked for a converter that would move documents back and forth between these formats and for some reason they asked that MS not contribute or control the co

      • Re: (Score:1, Informative)

        by Anonymous Coward
        //As a result, some governments asked for a converter that would move documents back and forth between these formats and for some reason they asked that MS not contribute or control the code, just fund it.//

        1. They didn't ask for a converter, they asked for the ability to save OpenDocument (ODF) as the default file format. That is something that the CleverAge plugin specifically cannot do.

        2. They said nothing whatsoever about Microsoft controlling it or funding it.

        3. Due to its sever limitations, the Clever
    • When OpenOffice imports the Word document and it does not look exactly like it did in Word, people will blame OpenOffice. You can cry till you are hoarse that it was because of bugs in word, and no one would even listen to you.

      Now you export word doc to ODF using Microsoft Ceritified Export tool, and if the user complains that it does not look exactly as it did in Word, we can 1. Blame Microsoft for insincere export. 2. Read it into Word and see if the document survives round-tripping. If it does not, we h

  • by User 956 ( 568564 ) on Friday February 02, 2007 @06:13PM (#17866214) Homepage
    The first phase of a Microsoft-funded project to create software that can convert Microsoft Word documents between Open XML and Open Document Format (ODF) has been completed.

    Unfortunately, when you run it, it starts off with, "Hi! It looks like you're trying to convert a Microsoft Word Document! Would you like some help?"
  • by mandelbr0t ( 1015855 ) on Friday February 02, 2007 @06:13PM (#17866220) Journal
    Anyone else feel chills? Remember how good the Import/Export of .WPD files was in Word? I'm guessing that this will be of similar quality. At least it's OSS. But I wouldn't hold my breath waiting for this to bridge the gap between ODF and OpenXML. Best is to use OpenOffice and save as .DOC if you have to. Here's the Microsoft Press Release [microsoft.com] about it.
    • At least it's OSS
      Maybe I'm reading too much between the lines here, but it said the project was OSS *and* either sponsored or blessed by Microsoft. Again, maybe it's nothing, but I see MS setting a trap here so that anybody with DOC and ODT conversion becomes a target for future lawsuits by MS (kinda like the SCO deal - "Oh, you must have our code, so we're gonna sue the crap outta you")...
    • This MS-sponsored converter converts between ODF and OOXML, both of which are publicly spec'ed. So it can be *perfect* in theory.
      OO.o's conversion converts between ODF and OO.o's best guess as to what the binary .DOC format is. And OO.o's best guess is pretty poor for anything but the simplest of documents.
  • by Ace905 ( 163071 ) on Friday February 02, 2007 @06:17PM (#17866266) Homepage
    Can I ask, since the article doesn't seem to really explain -- what good is this? I know converting to XML is supremely important _in theory_ so that your documents can be easily parsed and used among other software applications - but say for example:

    I have a document
    I convert it to XML

    then what? Is this excellent news in theory, or is there a demand for this?

    I honestly don't know, I'm not claiming there isn't. Please tell me.

    ---
    this isn't xml [douginadress.com]
    • by Iphtashu Fitz ( 263795 ) on Friday February 02, 2007 @06:21PM (#17866330)
      I have a document
      I convert it to XML

      then what?


      The latest and greatest(?) versions of the MS Office programs save natively in XML. This converter lets you convert to ODF, which lets you read the files into OpenOffice on any operating system, or any other application that supports ODF. It basically lets you get out from under the MS proprietary format and into an open standard.
      • by Ace905 ( 163071 )
        Ok, my mistake - it converts TO ODF from XML ; but aside from the incredible sense of freedom you experience getting out from under the crushing weight of MS proprietary format (haha) -- does this actually, at the moment, open up any new avenues for anybody?

        Like someone else commented, OpenOffice and already import and export to microsoft word. So is this really a practical utility, or does it just make everybody happy that hates Microsoft but still actually uses them?

        ---
        Open What? [douginadress.com]
        • The problem is that OO will screw the formatting for anything that's a little more complex. If whenever you open something, everything is out of place, or you can't be sure that somebody will be able to open the document how you saved it, it's best just to use MS Office.

          It means that now, I will be able to send ODT's to people who seem to think OO is somehow inferior to the "real" MS Office.
          • Re: (Score:3, Interesting)

            by Steve001 ( 955086 )

            LordVader717 wrote as part of a post:

            The problem is that OO will screw the formatting for anything that's a little more complex. If whenever you open something, everything is out of place, or you can't be sure that somebody will be able to open the document how you saved it, it's best just to use MS Office.

            The problem is, this is not even viable in a pure MS Word environment. An often-heard complaint is that MS Word documents will look different on different computers, even if both users are using MS W

            • by Bert64 ( 520050 )
              Even other microsoft products will fail badly...
              The mac versions of word have many compatibility problems with the windows versions, and just try loading/saving word documents with ms publisher (publisher even comes bundled with the more expensive versions of office)
              • Bert64 wrote:

                Even other microsoft products will fail badly...
                The mac versions of word have many compatibility problems with the windows versions, and just try loading/saving word documents with ms publisher (publisher even comes bundled with the more expensive versions of office)

                Maybe the solution is to move away from the entire WYSIWYG-while-editing method of document preparation, to an instruction-based system like LaTeX where the document will print on paper exactly as instructed regardless of the

    • by gvc ( 167165 )
      XML (and open for that matter) is just a synonym for "good." So read "OOXML format" simply as "(MS) Office format." The fact that it is sort of based on XML is irrelevant.
    • by kfg ( 145172 )
      Is this excellent news in theory. . .

      No.

      . . . , or is there a demand for this?

      Yes.

      Demand trumps excellence.

      KFG
    • Re: (Score:1, Troll)

      by woodhouse ( 625329 )
      Since when is completely missing the point of the article +3 interesting? I think there should be some kind of basic IQ test before you're allowed to moderate...
      • by Ace905 ( 163071 )
        At first I felt kind of stupid posting the question, but after reading the responses I think it was mod'd up for being a valid question. I completely missed the point, yes. It converts XML to ODF - but the question is still valid, no? what's the point. Where's the actual practical use. Heck, I should get +5. Nobody can really tell me aside from,

        "You're NO LONGER GOING TO BE CRUSHED UNDER THE WEIGHT OF MICROSOFTS PROPRIETARY XML.... which isn't proprietary...."

        ---
        speaking of a big crush [douginadress.com]
        • by asills ( 230118 )
          The practical use in this case is a business need. Microsoft has customers saying "we want the option to leave you" and competitors saying "hey we can convert their documents! Move to OpenOffice!". The customer might not actually want to leave Microsoft, but doesn't want to be forced to stick with them forever so they want that option. Microsoft made the right move and sponsored a way to convert their document format to ODF to make customers believe they can move away from Office if necessary.
        • by asills ( 230118 )
          And furthermore, while the Office XML format may not be proprietary due to it being published to a standards body, it's not something one should expect anyone other than Microsoft to implement.

          Microsoft published it as a standard in response to external pressures to shut some critics up. There are things in that standard that are nowhere near spelled out how to implement (for instance I think there's a Word95 compatibility flag but no definition of what changes when it's on other than "renders in Word95 com
    • by kabocox ( 199019 )
      Can I ask, since the article doesn't seem to really explain -- what good is this? I know converting to XML is supremely important _in theory_ so that your documents can be easily parsed and used among other software applications - but say for example:

      I have a document I convert it to XML then what? Is this excellent news in theory, or is there a demand for this? I honestly don't know, I'm not claiming there isn't. Please tell me.


      Can some one please correct me if I'm wrong, but I thought MS's new format was
      • by asills ( 230118 )
        That's more the party line of XML rather than the real world of XML. While translating XML from one format to another may be trivial in some cases, translating the content the XML contains (in this case the content is document formatting) may be incredibly difficult when the two XML formats are incredibly different (which they are, considering the 6000 page doc versus the 700 page doc).
    • step one: convert to xml
      step two: <shrug>
      step three:collect underpants
    • You save the document in XML. 6 months down the road, you think, gee, bob edited a document I worked on, but I don't remember which one. Because XML is basically a text flat file, you do a search for edits to your documents done by Bob. It checks all documents, including the files done in word, wordperfect that the legal team uses, OO that the engineers use, etc. Or, again, because its not a proprietary format, you send the same document out to 10 people to edit.. they all email back changes.. very, ver
      • What are you talking about?

        Say I create a document in Word 2007. Say it's my resume. I would save it "resume.docx". Say I create it on OpenOffice. I would name it "resume.odt". Six months from now, I would see one of those files and know what it is based on the name, extension, and icon.

        It's not any different than going by doc, xls, ppt, etc now. Why would it being a zipped XML make a difference? To a user the only difference is the extension and what programs can open/save it.

        Open Office Extensions:
      1. I have a document
      2. I convert it to XML
      then what? Is this excellent news in theory, or is there a demand for this?

      3 Profit!!
  • by Anonymous Coward
    Surprised? Seems Microsoft just see this as another way to infect the better platforms with their CLR, an attempt to start the countdown on the patent timebomb.

    If you're writing cross platform code at least have the decency to use C, C++ or Java, requiring a CLR is insulting.
    • Surprised? Seems Microsoft just see this as another way to infect the better platforms with their CLR, an attempt to start the countdown on the patent timebomb.

      If you're writing cross platform code at least have the decency to use C, C++ or Java, requiring a CLR is insulting.


      OMG! Mono & Rotor! WTFBBQLOL!?!?!
    • by Allador ( 537449 )

      If you're writing cross platform code at least have the decency to use C, C++ or Java, requiring a CLR is insulting.

      Then why is using Java okay?

  • by schwaang ( 667808 ) on Friday February 02, 2007 @06:25PM (#17866412)
    From the Microsoft press release:

    The second phase of the translator project, including translators for Spreadsheet (Microsoft Office Excel®) and Presentation (Microsoft Office PowerPoint®), will begin in February. Regular customer technology previews will be posted to SourceForge.net beginning in May 2007, and the final versions are scheduled to be available for customers in November 2007.
    One thing I'm wondering is how to automatically keep the OpenXML translator up to date on windows. If you install it from the MS Office Downloads [microsoft.com] site, will WindowsUpdate just keep it updated for you?
    • No. Normally, 'office' downloads (if not 'critical' for security) are request-only. An that means 'genuine advantage check' too..
    • by Bert64 ( 520050 )
      No...
      windows update only really updates what comes bundled with the os, and not any additional apps, even if those additional apps came from microsoft
  • WTF?! (Score:3, Funny)

    by TheWoozle ( 984500 ) on Friday February 02, 2007 @06:27PM (#17866448)
    I just tried to use it, and here's what I got:

          This is not a winning document. Better luck next time.
     
    • What happened next time? :)

      Oddly, that text isn't in the source code anywhere, so I wonder what's causing it..
  • What's the point? (Score:4, Insightful)

    by protactin ( 206817 ) on Friday February 02, 2007 @06:28PM (#17866468) Homepage

    What's the point if the add-in doesn't allow ODF to be set as the default file type, or even used via the Save As menu [robweir.com]?

    Hopefully the Word "interop" API actually allows for this sort of thing to be properly integrated.

    • Re: (Score:2, Informative)

      by blowdart ( 31458 )
      Actually if you look at where it appears it's right off the root of the File menu. So it stands out more than Save As, which needs to be chosen; then subtype chosen. It looks (to my mind) to be more important in the menu structure.
    • by juhaz ( 110830 )
      The point? Being able to pretend to support ODF while not actually doing so in any meaningful sense, of course.
      • by bbtom ( 581232 )
        Yes, it's a masterful move on the part of Microsoft's PR department.

        If you want to satisfy large government and corporate buyers who want the data security that XML/(your favourite open data interchange standard) brings, you have three options:

        1. Make a sensible, well-designed XML standard to facilitate interop.
        2. Don't bother.
        3. Make an insanely complex XML standard to facilitate interop in theory while hindering it in practice.

        Microsoft have chosen the last option. Anybody surprised?
    • Thanks to this Microsoft funded and approved converter, I won't have any problems working exclusively in ODF now will I? OK, you can stop laughing but the last time I tried to open the crappy new .DOCX with Open Office, it did not work. The save as dialog was difficult to find on Word and confusing when you did find it. Chances are, this converter will be just another trap like Windoze itself.

      • the last time I tried to open the crappy new .DOCX with Open Office, it did not work.

        Cos OpenOffice doesn't support it yet? Hardly Microsoft's fault.

        (Nor, on the other hand, is it OpenOffice's fault. It's still a new format, for christ's sake.)

  • A while back a state IT Department (I think Massachusetts) decided to only use open-source document formats and talked back and forth with Microsoft. The head of the IT Department (or something similar) privatly asked some of Word's programmers, who said an odf/xml feature would be trivial to add, but MS flatly refused to make a plugin for Office to convert to odf/xml, even though it meant losing the state's patronage.

    Microsoft is really determined to strangle open formats.
    • by Allador ( 537449 )

      ... an odf/xml feature would be trivial to add, but MS flatly refused to make a plugin for Office to convert to odf/xml, even though it meant losing the state's patronage. Microsoft is really determined to strangle open formats.

      Yes, so determined that they're actually paying for someone else to do it and host it on sourceforge.

      http://odf-converter.sourceforge.net/ [sourceforge.net]

  • by LibertineR ( 591918 ) on Friday February 02, 2007 @07:04PM (#17866942)
    IT Admin = "Boss, we can move to Open Office now, we have an XML converter for MS Word!"


    CIO = "What is this 'ribbon' thing I keep hearing about?"

    IT Admin = "Boss, we dont need the ribbon, its just Microsoft hype."

    CIO = "Have you seen the ribbon? Bring me the ribbon!"

    IT Admin = "Khaaaaaaaannnnn!"

  • I can't wait to see what Vista does with this. "Premium content detected, doc files are proprietary content I have notified Microsoft about your attempt to circumvent the DMCA. Please stay where you are and make no sudden moves".
  • It's XML, but... (Score:3, Interesting)

    by bbtom ( 581232 ) on Friday February 02, 2007 @08:15PM (#17867776) Homepage Journal
    Try reading Microsoft's documentation for OOXML. It's 6,000 pages long. Seriously. This is a great Microsoft PR stunt - yes, you've gotten your data in to XML format, but the XML format is so complicated that only the Microsoft programmers who wrote it can actually understand it. Part of the point about XML is interoperability. There's no way that sane people are going to read a 6,000 page Microsoft specification and write an XSLT to convert Microsoft OOXML in to a simpler and saner format. In short, this will not mean any competition with Microsoft. They buy PR in the geek community by saying "Office is going XML! Open data! Whee!" and making an XML format that's so complicated that nobody would ever use it. That's a pretty smart move. And it's a pretty dumb move on the part of ECMA. Congratulations on just giving your dignity away by signing off on a specification that's about nineteen times longer than War and Peace...

    No document in living history is ever going to be so complicated that it needs to be in a format that's specification is 6,000 pages long. Part of the point about XML was that we should be setting up simple, domain-specific markup languages and extending already existing markup languages. OOXML is bad because it's needlessly complicated and obscure. Having visited the OOXML website, I'm missing a lot of things I expect. First, I'm missing schema. If these guys are serious about XML, where are the XSD/RNG schema? Secondly, where are the cross-platform translators - ie. XSLs? I'm missing some kind of high-level summary of how I'm supposed to parse the XML. If the only way of doing anything with OOXML is a closed, black-box Microsoft converter, then we still haven't really got anywhere.

    Well, I'm breaking the cycle. All my documents are going to be either ASCII or a standard, non-obscure XML format like XHTML. Or something home-brewed and simple that can be easily transformed using XSL and XSL-FO. Screw Microsoft's phony attempt at interoperability. The Internet is interoperable by design. (X)HTML is interoperable by design. Let's prove to them that we mean interoperability by sticking to simple, sensible, semantically-based and scalable principles.
  • Word suuports "art borders". Users can choose from a wide selection of images selected from 1980's western culture. Does this get exported properly?

    If I can't get the candycorn borders, no deal!
  • I installed it and tried in in Word 2003 on a small, simple documents. I noticed that when the result file was loaded into Writer, it looked at first glance the same as if Writer had loaded a .doc file - e.g. white page background turns red, numbered section headers lost the numbering. I'll have a closer look and do the round-trip test, but it seems that the Windows team has exactly the same difficulties that OOo has in the conversion? Someone(TM) should perhaps compare some code here - not to point any bl

For God's sake, stop researching for a while and begin to think!

Working...