Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upGitHub is where the world builds software
Millions of developers and companies build, ship, and maintain their software on GitHub — the largest and most advanced development platform in the world.
--download-archive processing order. #22497
Comments
|
This depends on concrete extractor and may not be technically possible if video id is not available from playlist page. |
Checklist
Description
Ok. I will try to be as concise in describing this as possible. Hopefully it will be understood.
Anyways.
I have enabled a --download-archive option which writes a list of every video id to a file.
I would like to see the program check and gather all video ids in a playlist and then match that with a list stored locally and only try to download the missing videos. This should happen before each page is downloaded and checked.
This would skip the "Downloading pc webpage" message for each and every id in the playlist.
There would be a huge speedup in processing large playlists.
This would lighten the load on target servers, some of which I'm sure are already overloaded.
This would let me add a crontab with more peace of mind, knowing I'm not programmatically hammering the target server on a schedule. I don't run a cron now because of this. I feel doing so is irresponsible and hard on server resources. If it was more efficient I would just be pulling down new videos on a playlist without hammering the server as hard. So this would be good for everyone.
Basically...
If anyone has any questions or clarifications to what I mean. Just ask and I'll try to clarify my thoughts on what I'd like to see. I'm not familiar enough with the programming / algorithms in this project, or I would fork and try to implement this myself.
Thanks for your time.