Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upGitHub is where the world builds software
Millions of developers and companies build, ship, and maintain their software on GitHub — the largest and most advanced development platform in the world.
[cbc.ca] Video ID issues when using Archive tag #21761
Comments
|
Any updates on this? |
Checklist
Description
WRITE DESCRIPTION HERE
I have been having issues with the https://watch.cbc.ca (or the recently updated https://gem.cbc.ca) site whenever the
--download-archivetag is used.It appears that what is happening is that when the program is sent either a URL for a season playlist or a batch file containing the URLs for multiple episodes in a season and the
--download-archivetag is also used, then only half of the episodes get downloaded, with every other one being skipped. Using the file output formatting-o "%(series)s - %(season_number)sx%(episode_number)02d - %(title)s [%(id)s].%(ext)s"it seems to show that the Video IDs are somehow getting shifted and mixed up at some point. Therefore, if a file archive is being used (and checked against) then every other file is being skipped because its ID is already in the database.This is what I believe is currently going on:
As a result, you can see that if you start with the first episode, then only the odd numbered episodes get downloaded. (Similar results for the even numbers if your first episode is evenly numbered.) This problem only occurs though when you are using an archive to avoid downloading the same file multiple times.
Regarding Video IDs, I have noticed that what is being saved into the finished file corresponds to the ID listed for the next episode's URL. Is there something unique with this site that the IDs are somehow getting shifted in such a way that it is causing this behaviour?
Here is an example using the following URLs for reference:
Season 1 Playlist: https://gem.cbc.ca/season/back-in-time-for-dinner-uk/season-1/1a9bb35b-f429-4b0a-9647-53ef460a3f1c
Episode 1: https://gem.cbc.ca/media/back-in-time-for-dinner-uk/season-1/episode-1/38e815a-010e6f4b76c
Episode 2: https://gem.cbc.ca/media/back-in-time-for-dinner-uk/season-1/episode-2/38e815a-010e6ecc82d
Episode 3: https://gem.cbc.ca/media/back-in-time-for-dinner-uk/season-1/episode-3/38e815a-010e70354a2
Episode 4: https://gem.cbc.ca/media/back-in-time-for-dinner-uk/season-1/episode-4/38e815a-010e749392c
Episode 1's ID should be
38e815a-010e6f4b76cEpisode 2's ID should be
38e815a-010e6ecc82dHowever, episode 1 gets saved as "Back in Time for Dinner (UK) - 1x01 - 1950s [38e815a-010e6ecc82d].mp4" (using the previously mentioned output format), and
38e815a-010e6ecc82dgets added to the archive after episode 1 is finished downloading rather than38e815a-010e6f4b76c. As a result, when episode 2 starts, its ID is already in the archive and gets skipped, with the program moving on to episode 3.Any assistance in figuring out what is going on so we can confidently continue to use an archive file with a batch file or playlist URL (without having to go back and edit out half of the IDs) would be appreciated. Thank you.
So... any thoughts?