--download-archive processing order. #22497

Closed
clem16 opened this issue Sep 24, 2019 · 1 comment

@clem16 clem16 commented Sep 24, 2019

Checklist

  • I'm reporting a feature request
  • I've verified that I'm running youtube-dl version 2019.09.12.1
  • I've searched the bugtracker for similar feature requests including closed ones

Description

Ok. I will try to be as concise in describing this as possible. Hopefully it will be understood.

  • When I download a playlist, it seems to download the webpage of every video in the playlist. This is quite slow and inefficient in my opinion. If there is a valid programming reason why this is needed, I get it. I'm not criticizing anyone, just asking about / suggesting an alternative way of processing that could potentially speed up the process and save bandwidth.

Anyways.

  • I have enabled a --download-archive option which writes a list of every video id to a file.

  • I would like to see the program check and gather all video ids in a playlist and then match that with a list stored locally and only try to download the missing videos. This should happen before each page is downloaded and checked.

  • This would skip the "Downloading pc webpage" message for each and every id in the playlist.

  • There would be a huge speedup in processing large playlists.

  • This would lighten the load on target servers, some of which I'm sure are already overloaded.

  • This would let me add a crontab with more peace of mind, knowing I'm not programmatically hammering the target server on a schedule. I don't run a cron now because of this. I feel doing so is irresponsible and hard on server resources. If it was more efficient I would just be pulling down new videos on a playlist without hammering the server as hard. So this would be good for everyone.

Basically...

  • Grab all video ids in the playlist and get the list of video ids in the archive file. Check each playlist id for a match in the archive. If there is no match, it's a new video and the download can begin.
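
To illustrate the idea, here is a rough Python sketch of the check described above. It is not youtube-dl's actual code. It assumes the playlist ids are already known without fetching each video page, and that each archive line has the form `<extractor> <video id>` (for example `youtube aaa111`). The function names and the sample ids are made up for the example.

```python
import os
import tempfile


def load_archive(path):
    """Return the set of entries already recorded in the archive file."""
    if not os.path.exists(path):
        return set()
    with open(path, encoding="utf-8") as f:
        # Assumption: one "<extractor> <video id>" entry per line.
        return {line.strip() for line in f if line.strip()}


def missing_videos(playlist_ids, archive, extractor="youtube"):
    """Keep playlist order and drop ids that are already in the archive."""
    return [vid for vid in playlist_ids if f"{extractor} {vid}" not in archive]


# Demo with made-up ids: two are already archived, one is new.
with tempfile.TemporaryDirectory() as tmp:
    archive_path = os.path.join(tmp, "archive.txt")
    with open(archive_path, "w", encoding="utf-8") as f:
        f.write("youtube aaa111\nyoutube bbb222\n")

    archive = load_archive(archive_path)
    playlist_ids = ["aaa111", "bbb222", "ccc333"]
    to_download = missing_videos(playlist_ids, archive)
    print(to_download)  # only the id that is not in the archive
```

Only the ids left in `to_download` would need their video page fetched. Loading the archive into a set keeps each lookup cheap even for large playlists.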

If anyone has questions or needs clarification on what I mean, just ask and I'll try to clarify my thoughts on what I'd like to see. I'm not familiar enough with the programming / algorithms in this project, or I would fork it and try to implement this myself.

Thanks for your time.

@clem16 clem16 added the request label Sep 24, 2019
Collaborator

@dstftw dstftw commented Sep 24, 2019

This depends on the specific extractor and may not be technically possible if the video id is not available from the playlist page.

@dstftw dstftw closed this Sep 24, 2019
@dstftw dstftw added the incomplete label Sep 24, 2019