Parsing Office Open XML documents

With the release of the Office 2007 suite, Microsoft finally started using an open format for Word documents named Office Open XML (sometimes referred to as OOXML). Basically, a .docx file is just a ZIP file in disguise with the following basic contents: [Content_Types].xml /_rels   .rels /docProps   app.xml   core.xml /word   document.xml   fontTable.xml   settings.xml   styles.xml   webSettings.xml […]