XMLmind Word To XML
|Convert icons
Convert to PDF iconConvert to RTF (Word 2000+) iconConvert to WordprocessingML (Word 2003+) iconConvert to Office Open XML (.docx, Word 2007+) iconConvert to OpenDocument (.odt, OpenOffice/LibreOffice 2+) icon

What is XMLmind Word To XML?

Microsoft® Word is an amazingly popular writing tool. However its main drawback is that, once your document is complete, you cannot do much with it: print it, convert it to PDF or send it as is by email.

XMLmind Word To XML aims no less than to suppress Microsoft® Word main drawback. This 100% Java™ software component allows to automate the publishing —in its widest sense— of contents created using Microsoft® Word 2007+.

More precisely, XMLmind Word To XML (w2x for short) allows to automatically convert DOCX files to:

item Clean, styled, valid HTML looking very much like the source DOCX document.
More precisely, the generated file contains clean, styled, valid XHTML+CSS, parsed as HTML by web browsers. Because the generated XHTML+CSS file is clean and valid, you can easily restyle it, extract metadata or an abstract from it before publishing it.
Examples: manual.docx converted to manual.html (single page styled HTML), frameset/manual.html (multi-page styled HTML), webhelp/manual.html (Web Help), manual.epub (EPUB).
item Unstyled, valid DITA map+topics, DITA topic, DocBook, XHTML or XML conforming to your custom schema.
In this case, most MS-Word styles are converted to semantic tags. For example, numbered paragraphs are converted to proper ordered lists.
Generating semantic XML out of DOCX files is useful for interchange reasons (e.g. implement open data) or because you want to port your existing documentation to a structured document format where form and content are completely separated (e.g. implement single source publishing).
Examples: manual.docx converted to manual_topic.dita (DITA topic), manual_dbk5.xml (DocBook 5), manual_5.xhtml (XHTML 5).
XMLmind Word To XML at a glance

w2x at a glance

Of course, deploying w2x does not require installing MS-Word on the machines hosting the software. Also note that w2x does not require the authors to change their habits while using MS-Word: no strict writing discipline, no specific styles, no specific document templates, no specific macros, etc.

XMLmind Word To XML is available as a desktop application, as a command-line utility, as a Java™ library and as a web application.

You'll like this Try free online DOCX conversion services: DOCX to Styled (X)HTML, Convert DOCX to Multi-page (X)HTML, Convert DOCX to Web Help, Convert DOCX to EPUB, DOCX to DITA map and topics, DOCX to DocBook V4 or V5, DOCX to unstyled, valid, “semantic” XHTML 1.0, 1.1 or 5.0.

© 2003-2017 Pixware SARL. Updated on 2017/4/12.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. Acrobat and PostScript are trademarks of Adobe Systems Incorporated.