Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is it possible to scrape metadata from internal comicinfo.xml files? #705

Closed
RedRain77 opened this issue Oct 24, 2022 · 4 comments · Fixed by #761
Closed

Is it possible to scrape metadata from internal comicinfo.xml files? #705

RedRain77 opened this issue Oct 24, 2022 · 4 comments · Fixed by #761

Comments

@RedRain77
Copy link

ComicInfo.xml seems to be the most common metadata format among the comic community, originating from the ComicRack software but now widely supported by the majority of the apps of this type
(Komga,Kavita,Codex.Ubooqity,Comictagger).

I figured it would be an option in the Hdoujin plugin as Hdoujin also supports downloading metadata into ComicInfo.xml.

Is this a feature that already exists in some third party pm plugin?

I tried messing around with one myself but have zero Perl knowledge and seemed to fail when trying to get a script to use XML::LibXML.

Any information would be greatly appreciated :)

@Difegue
Copy link
Owner

Difegue commented Oct 24, 2022

👋 There's no plugin to import ComicInfo files atm, but it's certainly within the realm of possibility!

I'd recommend you use Mojo::DOM for this; While it's technically an HTML parser there's nothing stopping you from using it for regular xml.
It's also already used within the codebase, so you shouldn't have to do anything extra to get it to work. 👍

@Putarku
Copy link

Putarku commented Jan 30, 2023

Is it possible to scrape metadata from zip files comments?
Calibre's Embedded Comic Metadata plug-in can write metadata to comicinfo.xml files and zip files comments.
It would be much easier to read metadata from these two aspects.
just like
image

@Difegue Difegue added this to the Volunteers welcome milestone Jan 30, 2023
@Difegue
Copy link
Owner

Difegue commented Jan 30, 2023

Sure, just needs someone to write it 😅

@Difegue Difegue linked a pull request Feb 21, 2023 that will close this issue
@Difegue
Copy link
Owner

Difegue commented Feb 26, 2023

ComicInfo support is now available as a built-in plugin! 👍 This only does embedded comicinfo.xml files, but that's like 98% of the usecase.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants