Skip to content

digitization: proposal for refactor PDF check script#28

Merged
PascalEgn merged 5 commits intocern-sis:mainfrom
namollayo:22-check-pdf-script
Apr 15, 2026
Merged

digitization: proposal for refactor PDF check script#28
PascalEgn merged 5 commits intocern-sis:mainfrom
namollayo:22-check-pdf-script

Conversation

@namollayo
Copy link
Copy Markdown
Contributor

Ref: #22

Copy link
Copy Markdown
Collaborator

@PascalEgn PascalEgn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Lets also add the functionallity to read a direcotry from CERNBox in order to check the Boite File numbers and then match them with the ones on S3, so we only check for the pdfs of the matched ones on S3

Comment thread refactory/storage_connection.py Outdated
Comment thread .gitignore Outdated
Comment thread refactory/main.py Outdated
Comment thread refactory/main.py
Comment thread refactory/main.py Outdated
Comment thread refactory/main.py Outdated
Comment thread refactory/main.py Outdated
@namollayo namollayo force-pushed the 22-check-pdf-script branch from e01efd3 to 80d7da2 Compare April 1, 2026 09:55
Copy link
Copy Markdown
Collaborator

@PascalEgn PascalEgn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just some small things, otherwise looks really nice!

Comment thread requirements.txt
Comment thread refactory/main.py Outdated
Comment thread refactory/storage_connection.py Outdated
Comment thread refactory/test_connections.py Outdated
Comment thread refactory/test_connections.py Outdated
Comment thread refactory/main.py
@PascalEgn PascalEgn merged commit 5c45442 into cern-sis:main Apr 15, 2026
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants