Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to collect/download such large scale anime videos ? #62

Closed
zhfkt opened this issue Oct 24, 2020 · 3 comments
Closed

How to collect/download such large scale anime videos ? #62

zhfkt opened this issue Oct 24, 2020 · 3 comments

Comments

@zhfkt
Copy link

zhfkt commented Oct 24, 2020

Hi Soruly,

First, thank you for developing the trace.moe.

I would like to create an anime videos repository in my project. So I am interested in the method or open source library to download large scale anime videos according to the anime list (mal? anidb?) . Could you have a chance to share the experience on how to collect/download such large scale anime videos in your project trace.moe ? Did you manually download the anime videos and organize the video files ? Did you use some open source automation tools/script to download the videos in regular ? I checked your project anilist-crawler but which seems only crawl the metadata of anime. I also checked your 2019 slides but it didn't mention as well.

Thank you !

@zhfkt zhfkt changed the title How to collect such large scale anime videos ? How to collect/download such large scale anime videos ? Oct 24, 2020
@soruly
Copy link
Owner

soruly commented Oct 25, 2020

A bit sensitive to talk about tools that can download videos on GitHub. You know youtube-dl has just been taken down by GitHub. I'll answer your questions as much as I can.
You can take a look at how to download stuff using RSS. Such feature is usually built-in or achievable via plug-ins, so I don't have to program anything for that. The only thing I need is to manually curate a list of regex keywords that maps to anilist IDs every season. Matching entries would be stored in different folders according to the anilist ID defined in the list. And then the file system watcher in sola can do the rest.

@zhfkt
Copy link
Author

zhfkt commented Oct 26, 2020

@soruly Thank you for replying.

I searched the keyword "RSS" and found some RSS anime feeds website such as "shana project". It seems that there are serveral websites providing the RSS feed for each season. I think it could be a solution for collecting the animes in recent seasons.

However, I found there is a limitation for the RSS feed that it could not provide the "historic information" - https://stackoverflow.com/questions/576552/how-do-i-fetch-all-old-items-on-an-rss-feed . So currently it seems that we only could retreive some recent animes instead of all animes (some animes may be from 10 - 20 years ago). Not sure whether I understand it correctly.

@soruly
Copy link
Owner

soruly commented Oct 26, 2020

Ya that's true. Finding old anime is very hard now.

@soruly soruly closed this as completed Oct 26, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants