pdf_to_excel - this is a convenient way to transfer data from a Pdf file to an Excel table in a couple of minutes, completely free of charge.
PdfToExcel uses the best tools on the market to convert Pdf files. You can read about it here.
API documentation - https://developer.adobe.com/document-services
-
Your first step is to download this repository and build it using Visual Studio.
-
After installation, you need to go to the official Adobe developers website and create an account.
-
Next you need to go to the console tab -> projects -> create a new project.
-
Add an API to your project and configure it (pay attention to the photo).
-
Rebuild the application by pressing Ctrl+Alt+F7 and go to the path
\bin\Debug\net6.0-windowsand transfer the private.key installed in advance there. -
In the same directory create a
file.jsonfile like this:
{
"client_credentials": {
"client_id": "your client_id from Adobe project",
"client_secret": "your client_secret from Adobe project"
},
"service_account_credentials": {
"organization_id": "your organization_id from Adobe project",
"account_id": "your account_id from Adobe project",
"private_key_file": "\\private.key"
}
}Supply your data from Adobe project
- Finally, save the changes and run the application)
To use pdf_to_excel you need to enter the path to the pdf file and come up with a name to save the output in excel. If you do not specify an output file name, it will be saved as output.xlsx.
The output is saved to the desktop.
Adobe API allows you to use your product for free about 500 times per project. After this period expires you will have to fetch new data and replace the file.json & private.key files.
It is also necessary to remember that to create an output file, the name must be unique and contain the .xlsx suffix, otherwise an error will be displayed.
It is also necessary to specify the full path to the pdf file: C:\project\pdf\1.pdf, no need to use any separating characters.
In any case, in case of errors you will be notified.

