Short Description: Savvy Repair for Microsoft Office, tries four methods for repair or recovery from corruption of Word DOCX, Excel XLSX and PowerPoint PPTX files. Each method starts with the repair of the zip structure.
Long Description 1: This program tries four methods of repair of corrupt of DOCX, XLSX and PPTX files. DOCX, XLSX and PPTX files are collections of mostly XML sub-files. All four methods first try to repair the zip structure. The first 3 methods involve attempts to repair all the xml sub-files. The first method with strict xml validation, the 2nd with lax, the 3rd adding missing subfiles. The last method is a data/text extraction one. No formatting is recovered.
Long Description 2: Savvy Repair for Microsoft Office tries four methods of repair of corrupt of Word DOCX, Excel XLSX and PowerPoint PPTX files. DOCX, XLSX and PPTX files are collections of mostly XML sub-files. All four methods first try to repair the zip structure. The first 3 methods involve attempts to repair all the xml sub-files. The repair is done by finding the first XML error, truncating the file there and then using xmllint to add the right tags to the end of the now shorter file.
The first method is done with strict xml validation, the 2nd with lax, the 3rd adding missing subfiles from a blank file of the appropriate extension. The last method is a data/text extraction one using SilverCoder's DocxToText. No formatting is recovered.
|