Add utility to get PDF info for proper titles on PDF entries #168

@benoit74

The content of PDF documents is not indexed for suggestions, even though in some ZIMs these documents are the "core" of the content.

Having a utility in scraperlib to extract PDF info and get the document title would probably help.
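
To make the idea concrete, here is a minimal sketch of what such a helper could look like. It is not an existing scraperlib API: the names `get_pdf_title` and `fallback_title` are hypothetical, the choice of the third-party `pypdf` package is an assumption, and the fallback to a title derived from the entry path is an illustrative design choice, not something decided in this issue.

```python
from __future__ import annotations

import io
from pathlib import PurePosixPath


def fallback_title(path: str) -> str:
    """Derive a readable title from the entry path (hypothetical helper)."""
    stem = PurePosixPath(path).stem
    return stem.replace("_", " ").replace("-", " ").strip() or path


def get_pdf_title(content: bytes, path: str) -> str:
    """Return the title from the PDF document info, or a path-based fallback.

    Hypothetical helper: assumes pypdf is available; if it is not, or if the
    document cannot be parsed or has no title, the fallback is used.
    """
    try:
        from pypdf import PdfReader  # third-party dependency (assumption)
    except ImportError:
        return fallback_title(path)

    try:
        metadata = PdfReader(io.BytesIO(content)).metadata
        title = metadata.title if metadata is not None else None
    except Exception:
        # Broad catch on purpose in this sketch: a malformed PDF should never
        # break the scraper, it should only cost us the nicer title.
        return fallback_title(path)

    if title and title.strip():
        return title.strip()
    return fallback_title(path)
```

A scraper could then call `get_pdf_title(payload, entry_path)` when adding a PDF entry and pass the result as the entry title, so that the suggestion index at least contains the document title.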

See openzim/warc2zim#290 for one use-case.

Labels

enhancement (New feature or request)
