Sometimes a webpage will redirect not because it's a real redirect, but because it wants the user to log in for paywall-type reasons. The New York Times does this (see redirect chain below), e.g. for https://www.nytimes.com/2017/04/05/arts/television/louis-ck-young-stephen-colbert.html?_r=0
That page has four YouTube embeds, but because `NYTimesArticleIE` matches it, it grabs it and tries to find an NYT video that's not there:
The obvious workaround is `--force-generic-extractor`, but that fails:
And of course, the reason it fails is this redirect chain:
Commenting out the redirect following like so
Resolves this just fine:
I have no idea what the right solution is :)
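One possible direction, purely as a sketch and not taken from the project's code: treat a redirect as "not real" when the chain ends on the same host and path it started from, so the generic extractor could keep working on the page it already has. The helper name `is_same_page_redirect` is hypothetical, and the example final URL (same article with `_r=1`) is an assumption about where the login bounce ends up, not something confirmed here.

```python
from urllib.parse import urlsplit


def is_same_page_redirect(original_url, final_url):
    """Return True when a redirect chain ends on the same host and path.

    Hypothetical heuristic: the scheme, query string and fragment are
    ignored, so a login bounce that only changes a counter parameter
    is not treated as a move to a different page.
    """
    a, b = urlsplit(original_url), urlsplit(final_url)
    # .hostname is lower-cased by urlsplit; trailing slashes are ignored.
    return (a.hostname, a.path.rstrip('/')) == (b.hostname, b.path.rstrip('/'))


original = ('https://www.nytimes.com/2017/04/05/arts/television/'
            'louis-ck-young-stephen-colbert.html?_r=0')
# Assumed final URL after the login bounce (not verified).
assumed_final = ('https://www.nytimes.com/2017/04/05/arts/television/'
                 'louis-ck-young-stephen-colbert.html?_r=1')

print(is_same_page_redirect(original, assumed_final))
print(is_same_page_redirect(original, 'https://example.com/login'))
```

This would obviously misfire on sites that use the query string to select content, so it might only make sense as an opt-in or as a fallback.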
It's certainly not a very pressing problem. And I guess maaaybe it has some connection to #12501 (but maybe not).