Short Description: PPTX files are zipped collections of XML files. This program is a simple GUI app for recovering text from corrupt PPTX files. It uses 7zip, which tolerates zip corruption well, as the PPTX unzipper and, unlike PowerPoint, does not insist on well-formed XML.
Long Description 1: PPTX files are, in reality, zipped collections of XML files. This program is a simple GUI app for recovering text from corrupt PPTX files. It uses 7zip, which tolerates zip corruption well, as the unzipper. Additionally, the regular expressions used to extract text from the constituent slide XML files don't depend on the strict XML well-formedness rules that PowerPoint, oddly, seems to apply when reading XML files even during data recovery.
Long Description 2: The biggest cause of PPTX corruption appears to be zip problems. This GUI uses 7zip, a somewhat corruption-immune unzipper. 7zip sometimes succeeds in extracting the slide XML files that contain the text from corrupt PPTX files where PowerPoint 2007 - 2013 fails with its built-in unzipper.
Furthermore, Corrupt PPTX Salvager uses regular expressions to extract the text from these slide XML files rather than getting hung up on correct XML structure, as PowerPoint seems to do during recovery attempts.
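The regular expression approach can be illustrated with a minimal Python sketch. This is not the program's actual code: the function name extract_slide_text and the patterns are assumptions made for illustration. It relies only on the fact that slide text sits in <a:t> elements grouped into <a:p> paragraphs, and it never parses the XML as a whole, so a truncated or malformed slide file still yields every complete text run.

```python
import html
import re

# A DrawingML text run: <a:t>...</a:t>. The tag may carry attributes, but the
# pattern deliberately does not match other tags such as <a:tab/> or <a:tbl>.
TEXT_RUN = re.compile(r"<a:t(?:\s[^>]*)?>(.*?)</a:t>", re.DOTALL)
# Paragraph boundaries are used only to decide where line breaks go.
PARAGRAPH_END = re.compile(r"</a:p>")


def extract_slide_text(xml: str) -> str:
    """Return the text of one slide, one line per paragraph.

    No XML parser is involved, so unclosed tags or a file that ends
    mid-element do not stop the extraction of earlier, complete runs.
    """
    lines = []
    for paragraph in PARAGRAPH_END.split(xml):
        runs = TEXT_RUN.findall(paragraph)
        if runs:
            # Join the runs of a paragraph and decode entities such as &amp;.
            lines.append(html.unescape("".join(runs)))
    return "\n".join(lines)


if __name__ == "__main__":
    # A slide fragment that is cut off in the middle of the last text run.
    damaged = (
        "<p:sld><a:p><a:r><a:t>Hello </a:t></a:r>"
        "<a:r><a:t>World</a:t></a:r></a:p>"
        "<a:p><a:r><a:t>Q&amp;A</a:t></a:r></a:p>"
        "<a:p><a:r><a:t>cut of"
    )
    print(extract_slide_text(damaged))
```

A strict XML parser would reject the damaged fragment outright, while the sketch returns the two complete paragraphs and only loses the run that was cut off.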
A recent improvement adds a zip repair pretreatment using InfoZip's zip.exe -FF command. Another improvement is an 'Alternatives menu' with additional PPT and PPTX resources.
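The repair-then-unzip sequence might look roughly like the following Python sketch. It is an illustration under stated assumptions, not the program's own code: the helper names are hypothetical, the executables are assumed to be on the PATH as zip and 7z, and the --out option assumes Info-ZIP Zip 3.0, which writes the repaired archive to a new file. Depending on the zip version, -FF may also ask questions interactively, which this sketch does not handle.

```python
import subprocess


def repair_command(damaged, repaired, zip_exe="zip"):
    """Build the Info-ZIP call that tries to rebuild a damaged archive.

    Assumes Zip 3.0, where -FF needs --out to name the repaired copy.
    """
    return [zip_exe, "-FF", str(damaged), "--out", str(repaired)]


def extract_command(archive, out_dir, sevenzip_exe="7z"):
    """Build the 7zip call: x = extract with paths, -o = output directory,
    -y = answer yes to prompts."""
    return [sevenzip_exe, "x", str(archive), f"-o{out_dir}", "-y"]


def salvage(damaged, repaired, out_dir):
    """Run the repair pretreatment, then unzip (hypothetical helper).

    Exit codes are not treated as fatal: with a corrupt archive either tool
    may report errors and still leave usable slide XML files behind.
    """
    subprocess.run(repair_command(damaged, repaired), check=False)
    subprocess.run(extract_command(repaired, out_dir), check=False)


if __name__ == "__main__":
    print(repair_command("broken.pptx", "repaired.zip"))
    print(extract_command("repaired.zip", "extracted"))
```

Whatever slide XML files end up in the output directory can then be passed to the text extraction step.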
Corrupt PPTX Salvager is based on PPTX to Text converter by Sopan Shewale. His project is hosted on Sourceforge. Sopan's project is further based on Sandeep Kumar's docx2txt which is also found on Sourceforge.
This program was formerly known as PPTX Recovery and Corrupt PPTX2TXT.