This tool converts PDF files and images into editable PowerPoint presentations (.pptx) by leveraging structured data from the MinerU PDF Extractor. It accurately reconstructs text, images, and layout, providing a high-fidelity, editable version of the original document.
The application features a user-friendly graphical interface (GUI) and is designed for easy use.
As a user, you only need the standalone executable file (e.g., MinerU2PPT.exe). You do not need to install Python or any libraries.
-
Download the Application: Get the latest executable from the project's Releases page.
-
Get the MinerU JSON File:
- Go to the MinerU PDF/Image Extractor.
- Upload your PDF or image file and let it process.
- Download the resulting JSON file. This file contains the structural information that our tool needs for the conversion.

-
Run the Converter:
- Double-click the executable to start the application.
- Select Input File: Drag and drop your PDF or image file onto the first input field, or use the "Browse..." button.
- Select JSON File: Drag and drop the JSON file you downloaded from MinerU onto the second input field.
- Output Path: The output path for your new PowerPoint file will be automatically filled in. You can change it by typing directly or using the "Save As..." button.
- Options:
- Remove Watermark: Check this box to automatically erase elements like page numbers or footers.
- Generate Debug Images: Keep this unchecked unless you are troubleshooting.
- Click Start Conversion.
-
Open Your File: Once the conversion is complete, click the "Open Output Folder" button to find your new
.pptxfile.
The application also supports converting multiple files at once in Batch Mode.
- Switch to Batch Mode: Click the "Batch Mode" button in the top-right corner of the application. The interface will switch to the batch processing view.
- Add Tasks:
- Click the "Add Task" button. A new window will pop up.
- In the popup, select the Input File, the corresponding MinerU JSON File, and specify the Output Path.
- Set the Remove Watermark option for this specific task.
- Click "OK" to add the task to the list.
- Manage Tasks: You can add multiple tasks to the list. If you need to remove a task, select it from the list and click "Delete Task".
- Start Batch Conversion: Once all your tasks are added, click "Start Batch Conversion". The application will process each task sequentially. A log will show the progress for each file.
This section provides instructions for running the application from source and packaging it for distribution.
- Clone the repository.
- It is recommended to use a virtual environment:
python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate
- Install the required dependencies from
requirements.txt.pip install -r requirements.txt
- To run the GUI application:
python gui.py
- To use the CLI:
python main.py --json <path_to_json> --pdf <path_to_pdf> --output <path_to_ppt> [OPTIONS]
You can package the GUI application into a single .exe file for easy distribution.
-
Install PyInstaller:
pip install pyinstaller
-
Build the Executable: Run the
pyinstallercommand from the project's root directory. Use the--nameflag to specify a professional name for your application.--windowed: Prevents a console window from appearing in the background.--onefile: Bundles everything into a single executable file.--name: Sets the name of the final executable.
pyinstaller --windowed --onefile --name MinerU2PPT gui.py
-
Find the Executable: Your standalone application,
MinerU2PPT.exe, will be located in thedistfolder.
