Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LAZR json decoding alexanderstreet extraction #7321

Open
kevin-vilbig opened this issue Oct 30, 2015 · 1 comment
Open

LAZR json decoding alexanderstreet extraction #7321

kevin-vilbig opened this issue Oct 30, 2015 · 1 comment

Comments

@kevin-vilbig
Copy link

@kevin-vilbig kevin-vilbig commented Oct 30, 2015

So... My university got a subscription to this thing and I've been playing with it to see if youtube-dl could dump it. I found some information that suggests that it might be using this nonstandard method to serialize json data or something? Anyway... I'll play with this more when I have some free (yeah right) time, but I thought y'all should know about these things anyway. I'll probably actually have the time to play with it in December. One thing, the content is "too long", and it throws a content too short exeception. The content that it downloads is the just the page again ( I diffed the outputs), which suggests that their player requires some data that needs to be scraped out and passed into the request. Idk what that is yet.

http://bazaar.launchpad.net/~lazr-developers/lazr.json/trunk/view/head:/README.txt

E:>youtube-dl.exe -v http://search.alexanderstreet.com/view/work/2725242
[debug] System config: []
[debug] User config: []
[debug] Command-line args: [u'-v', u'http://search.alexanderstreet.com/view/work
/2725242']
[debug] Encodings: locale cp1252, fs mbcs, out cp437, pref cp1252
[debug] youtube-dl version 2015.10.06.2
[debug] Python version 2.7.8 - Windows-7-6.1.7601-SP1
[debug] exe versions: none
[debug] Proxy map: {}
[generic] 2725242: Requesting header
WARNING: Falling back on generic information extractor.
[generic] 2725242: Downloading webpage
[generic] 2725242: Extracting information
[download] Downloading playlist: None
[generic] playlist None: Collected 2 video ids (downloading 2 of them)
[download] Downloading video 1 of 2
[debug] Invoking downloader on u'http://search.alexanderstreet.com/view/work/\\/
/alexstreet.vo.llnwd.net/o25/muco/1006835xxx/1006835573/1006835573-d
isc001-file001-400kbps-400pixels.m4v?e=1446197306\u0026h=749e220b03ae69f6bc6d43
3f9287e47f'
[download] Destination: A Hard Day's Night _ Alexander Street (1)-u0026h=749e220
b03ae69f6bc6d433f9287e47f.m4v
[download] 348.1% of 8.93KiB at Unknown speed ETA Unknown ETAERROR: content too
short (expected 9142 bytes and served 31824)
Traceback (most recent call last):
File "youtube_dl\YoutubeDL.pyo", line 1597, in process_info
File "youtube_dl\YoutubeDL.pyo", line 1539, in dl
File "youtube_dl\downloader\common.pyo", line 342, in download
File "youtube_dl\downloader\http.pyo", line 238, in real_download
ContentTooShortError

E:>youtube-dl.exe -j http://search.alexanderstreet.com/view/work/2725242
WARNING: Falling back on generic information extractor.
{"display_id": "u0026h=749e220b03ae69f6bc6d433f9287e47f", "extractor": "generic"
, "format": "0 - unknown", "requested_subtitles": null, "uploader": "search.alex
anderstreet.com", "format_id": "0", "playlist_index": 1, "playlist_title": null,
"playlist": null, "http_headers": {"Accept-Language": "en-us,en;q=0.5", "Accept
-Encoding": "gzip, deflate", "Accept": "text/html,application/xhtml+xml,applicat
ion/xml;q=0.9,/;q=0.8", "User-Agent": "Mozilla/5.0 (X11; Linux x86_64; rv:10.0
) Gecko/20150101 Firefox/20.0 (Chrome)", "Accept-Charset": "ISO-8859-1,utf-8;q=0
.7,;q=0.7", "Cookie": "SESS7beeaee852f6adcf8b7816afcd99defe=Sr4F9GJ0otaCiyLhve1
JXLbsQbGgqL5ry6S5dLzYZ20; cookied_customer_id=17"}, "url": "http://search.alexan
derstreet.com/view/work///alexstreet.vo.llnwd.net/o25/muco/1006835xxx
/1006835573/1006835573-disc001-file001-400kbps-400pixels.m4v?e=1446197306\u0
026h=749e220b03ae69f6bc6d433f9287e47f", "extractor_key": "Generic", "title": "A
Hard Day's Night | Alexander Street (1)", "id": "u0026h=749e220b03ae69f6bc6d433f
9287e47f", "playlist_id": null, "ext": "m4v", "webpage_url_basename": "2725242",
"webpage_url": "http://search.alexanderstreet.com/view/work/2725242", "filenam
e": "A Hard Day's Night _ Alexander Street (1)-u0026h=749e220b03ae69f6bc6d433f92
87e47f.m4v", "fulltitle": "A Hard Day's Night | Alexander Street (1)", "age_limi
t": 0, "n_entries": 2}
{"display_id": "u0026h=2cf5527087b04ea40775894be8b092c9", "extractor": "generic"
, "format": "0 - unknown", "requested_subtitles": null, "uploader": "search.alex
anderstreet.com", "format_id": "0", "playlist_index": 2, "playlist_title": null,
"playlist": null, "http_headers": {"Accept-Language": "en-us,en;q=0.5", "Accept
-Encoding": "gzip, deflate", "Accept": "text/html,application/xhtml+xml,applicat
ion/xml;q=0.9,
/
;q=0.8", "User-Agent": "Mozilla/5.0 (X11; Linux x86_64; rv:10.0
) Gecko/20150101 Firefox/20.0 (Chrome)", "Accept-Charset": "ISO-8859-1,utf-8;q=0
.7,_;q=0.7", "Cookie": "SESS7beeaee852f6adcf8b7816afcd99defe=Sr4F9GJ0otaCiyLhve1
JXLbsQbGgqL5ry6S5dLzYZ20; cookied_customer_id=17"}, "url": "http://search.alexan
derstreet.com/view/work///alexstreet.vo.llnwd.net/o25/muco/1006835xxx
/1006835573/1006835573-disc001-file001-800kbps-640pixels.m4v?e=1446197306\u0
026h=2cf5527087b04ea40775894be8b092c9", "extractor_key": "Generic", "title": "A
Hard Day's Night | Alexander Street (2)", "id": "u0026h=2cf5527087b04ea40775894b
e8b092c9", "playlist_id": null, "ext": "m4v", "webpage_url_basename": "2725242",
"webpage_url": "http://search.alexanderstreet.com/view/work/2725242", "_filenam
e": "A Hard Day's Night _ Alexander Street (2)-u0026h=2cf5527087b04ea40775894be8
b092c9.m4v", "fulltitle": "A Hard Day's Night | Alexander Street (2)", "age_limi
t": 0, "n_entries": 2}

E:>

@xthursdayx
Copy link

@xthursdayx xthursdayx commented Mar 16, 2020

@kevin-vilbig Did you ever figure this out? I'm trying to sort out the same thing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

None yet
2 participants
You can’t perform that action at this time.