-
-
Notifications
You must be signed in to change notification settings - Fork 240
This issue was moved to a discussion.
You can continue the conversation there. Go to discussion →
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Retrieve metadata from online sources #11
Comments
I use https://leagueofcomicgeeks.com/ for series information and also for tracking comics (like trakt for comics) but unfortunately, no API provided 😞 |
Indeed, the ajax methods return It also doesn't have the completeness information about a series (whether a series is ongoing, finished, abandoned, or in hiatus), which is one of the most important metadata i am looking after! |
Oh, Just read the last Checkbox, You can disregaurd the rest of this then... This is some Pie-in-The sky stuff, brace yourself.... As long as you give us a template and location to write the data. That would let the user leverage any data-scraping (or elbow-grease copy-pasting) to write metadata not covered by direct functionality (and hopefully mildly future-proofing the feature). Another example of a pre-set metadata format would be ComicRack's comicinfo.xml file within .cbz. I am only suggesting this one because "I" Use it alongside a pickier .json (both provided by Hdoujin Down-loader) Just throwing out some ideas, no pressure. If I had any idea how making a plugin woks and coding plugins, I'd have made a plugin already (probably leveraging HPX, my other database I'm running alongside this) |
Hi all, I would be interested to know a bit more about the metadata you are after, and how you are using it, so i get a better idea of what to implement and how to implement it. Could you give some insights about:
I'll throw in some that are of particular interest for me:
|
For me, if we follow the plex analogy, given a filename/folder hierarchy komga should be able to retrieve automatically series informations and book informations in series. Ideally komga should provide everything a client would need to satisfy at least basic needs. Considering tracking (like trakt for videos or anilist and kitsu for manga/anime), I'm not sure if it is the responsibility of komga or of the client. Actually I use yacreader to manage comics library and reading -> but no information about books or series, no metadata, no tracking So if I could use komga on desktop AND mobile (with the tachiyomi plugin), for comics AND manga, that would be nice. |
Thanks for the detailed answer! For the thumbnail, at the moment Komga generates one from the first page of each book. For a series, it's the thumbnail of the first book in the series. Do you think there would be a need to have a thumbnail coming from external sources ?
I will add tracking (read status) in Komga at some point. It might be manual to start with, because the implementation in the clients is not in my hands. For example in Tachiyomi it is not managed as an extension, but in the main app. There is also some questions on how to track, should the tracking be done on the matched series/books, so with recognized IDs like ComicVine, or using the internal Komga ID (but those can change if you move your files on disk for example). |
No the thumbnails seem fine. |
I'm not sure where you're currently pulling information from, but there are two fairly common sources to check for, which I would suggest honoring before trying to pull from ComicVine (which would require each installation to get a CV API key, and should be throttled so you don't try to pull metadata for over 200 books in an hour): ComicInfo.xml may exist inside an archive, especially if Mylar or ComicRack have been involved in the process of curating the books. Additionally, ComicBookLover tags may exist in the zipfile comments (for CBZ only) |
Mylar plus a viewer such as Ubooquity or one of the ComicStreamer forks is a common usage scenario for many of us, so our collections come with metadata embedded in the file. I'd love to replace Ubooquity with Komga as Komga is open source and has a seemingly much friendlier developer. 😉 I've got both running side by side for now, and I'm excited to see Komga get even better. |
Yeah, if we get some control over how the OPDS is presented to the client (like with custom filters, or reading lists, etc) then this will be the perfect complement to Mylar. :) |
Opds is quite flexible, and I plan to add reading lists to it later 😊 |
Personally, I would really like to see ComicInfo.xml support. I prefer having metadata embedded in the files. |
Planned in #54 |
Excellent, most importantly, I would really like to have some staging going on. What I mean is: ComicInfo.xml should always override the online source and maybe the finding of the scanned source should be written into ComicInfo.xml (if there is none) into the folder/zip/etc... Why? Because it would be really important to be able to move that meta data out of Komga easily. This is one of my biggest gripes with Plex. That way should a source ever return false information your earlier scrapes are always safe. I usually like to manually verify the metadata, knowing that the information that is shown will not be altered unless I explicitly force override would be nice. With Plex I'm always a bit skeptical. Another feat: even if I have to rebuild the entire library with a new database, the previous scrapes' metadata will be transferred. BONUS: another key metadata source is doujinshi.org for all the doujinshi collectors out there. :) (this would also mean another media "kind". Doujinshi typically aren't published by companies, but "circles" and that's pretty much an important nomenclature. Also very important: usually doujinshis are released at conventions like Comiket for example (most famous example) and not only is there a certain numbering system derived from that (C50 for example would be an identifier usually found in the beginning of a doujinshi's digital file name) but also the importance to have a field for which convention it was released at. If you look at doujinshi.org at sample entries (NSFW warning, it's very mixed content there and there is definitely no setting to view the site in SFW mode :D) you'll find which metadata is important to include. |
Just wanted to put forward a "better" metadata source for manga, MangaUpdates. It has more up to date and detatiled information about manga than Kitsu or MyAnimeList. |
No db "has it all". (I realize you didn't try to imply this) |
I use Mylar for my comic post-processing, but it uses a modified ComicTagger script internally. I like CT a lot on it's own because it allows more flexibility in data, I'd wonder if you could incorporate that tool into Komga somehow and give us some customization on metadata field mapping. Have a basic default, but let us muck about if we wanted to. I'd love to see the ability to grab story arc data, including upcoming and previous storyarcs that are pulled from ComicVine and it collects all issues into one - or a collection based on character appearances, etc. CT seems to fill in all of those details (I don't know if Mylar does). |
It would be great to add a source "doujinshi.org" for " Manga/Doujinshi" |
Indeed! Two more really good sources are nhentai and exhentai. For reference for those two one can look at the HappyPandaX project, specificall in the plugins repo: https://github.com/happypandax/plugins There's also a plugin that reads metadata files created by two very popular downloaders. ("File Metadata" plugin) Overall a pretty nifty project for any doujinshi lover and I've been using it for a few weeks now. Right now I'm having a few issues with importing, but I think I borked something. I'll sit down for that issue later. So far my strategy is to use both Komga and HPX, but to have feature overlap would be terrific, there's a lot each project can learn from the other. I'm very happy both exist! <3 |
Hi! Could you add the site manga-news as a source of metadata in French about mangas ? It's really complete and has a lot of information about manga sold in French. Unfortunately, the site does not seem to have an API. Thank you. |
Read metadata for comics with an ISBN from goolge books.
Google Books API: |
Hi, could you add anilist to the list for manga metadata please, in practice anilist has been more complete in manga and mahwa than myanimelist, from oneshots to independent artists. Here is the documentation and the api of the site. Thanks for your attention. |
Would this feature in embed the data into the cbz like comic tagger? Or just exist only in komga? (or an option for either) |
My wish is to add it to Komga only. |
Personally id like the opposite, and would like to embed if possible, especially when his new comic metadata standard is ready. But both should be options. |
Easiest approach from a developer stand point would probably be to start with it being added to Komga only first as the first phase, second phase would then probably be to add an export function, that would populate formats like ComicInfo.xml into the book files, third would be to add the ability to automatically export to book files when metadata is changed. |
Sounds about right, normally I would prefer just komga data, but I feel like books are a format that should be embedded, much like how music should be. |
Already requested here: #82 |
Hi. As Komga is getting better and better with each update, with a nice metadata feature, I'm curious if this feature is still being considered (I hope 🤞 ). Thank you for this marvelous software |
Hello, In any case, the work you are doing seems incredible to me. Thanks a lot!!! |
As there is no integration in Komga for an Anime Metadata provider, I made my own using the metadata from Anisearch. |
Great work! If there's the same sort of metadata for comics and BD (french/belgium comics), i would love this! |
Komga is in Kotlin 😉 The metadata retrieval is much more than hitting an api and mapping fields though. That bit is probably only 10% of what I envision for metadata retrieval. |
Thanks for your reply gotson :) Can you share with us the main components or behavior you envision for metadata retrieval? |
Hi @gotson. Happy new year !! :) |
Bangumi metadata scraper for Komga👉https://github.com/chu-shen/BangumiKomga Inspired by https://github.com/Pfuenzle/AnisearchKomga Thanks❤️ |
In the meantime, for our french friends who wish to refresh their BD metadata from Bedetheque, here is a small metadata scrapper i've written 👉 https://github.com/Inervo/BedethequeKomga Inspired from chu-shen/BangumiKomga and aubustou/bedetheque_scraper. Thanks a lot ❤️ NB: it's been ages since i've written some code, so it's far from perfect. Don't hesitate to raise any issue or to contribute :) |
Don't let "perfect" be the enemy of "good". ;) I'm sure if you start something others will contribute. |
I consider this the most crucial feature still missing in Komga. Besides the obvious benefits of metadata scraping this would also greatly assist in tracking the current status of a series. |
This comment was marked as off-topic.
This comment was marked as off-topic.
Having this feature built-in would be very nice. I currently use the Komf server and userscript to give me the possibility to identify and import metadata for a series. It can also auto-identify an entire library, but I am much too scared to use that functionality, so I prefer the manual single-identify personally :) Edit: It would be nice if this feature also has the possibility to merge info from multiple sources when importing metadata. No metadata provider has all wanted information when it comes to manga. While a preferred provider has the best descriptions, it may not present tags for that series, while other providers do. So merging in information like tags and authors/artists from other providers is really helpful. This is something Komf supports right now, and is something Komga would need to be able to do if it wants to shine in this area too :) |
This issue was moved to a discussion.
You can continue the conversation there. Go to discussion →
Mimicking Plex, Komga could manage metadata for series and books, and retrieve metadata from online providers.
See also #48 for manual metadata edition.
Potential providers:
In addition Komga should be able to:
The text was updated successfully, but these errors were encountered: