Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[openload] re-open #15118 #15166

Closed
360up opened this issue Jan 4, 2018 · 13 comments
Closed

[openload] re-open #15118 #15166

360up opened this issue Jan 4, 2018 · 13 comments

Comments

@360up
Copy link

@360up 360up commented Jan 4, 2018

Fix 620ee87 worked for a couple of days. Now broken again.

Please re-open #15118

@IDerr
Copy link

@IDerr IDerr commented Jan 4, 2018

PR incoming : #15161
seems to be related but not sure :D

@bagud
Copy link

@bagud bagud commented Jan 4, 2018

Looks like span element holding stream URL changed ID from "streamurl" to "streamurj".

@yan12125
Copy link
Collaborator

@yan12125 yan12125 commented Jan 4, 2018

Thanks @bagud. They are really boring LOL

@yan12125 yan12125 closed this in de329f6 Jan 4, 2018
@steebchen
Copy link
Contributor

@steebchen steebchen commented Jan 9, 2018

Why not simply check #video[src]? It should always work, regardless of what funny ids they think of, right?

@bagud
Copy link

@bagud bagud commented Jan 16, 2018

Aaargh. I'm working on PHP/PhantomJS implementation of openload link extractor, but phantomjs returns completely useless streamurl, like: "Ø�ヒV�QRS®ᄍ�ð´￘SX�ヒA?õルÑyl│eZ0� Ó¡'�±�￟�DÔ-". What could I be doing wrong? I understand this is not the place to ask this but any help would be really appreciated.

@yan12125
Copy link
Collaborator

@yan12125 yan12125 commented Jan 16, 2018

@bagud: Possibly phantomjs is too old. Version 2.1.1 is known to work.

@bagud
Copy link

@bagud bagud commented Jan 16, 2018

@yan12125 Thanks for replying. It's not that, I'm using the latest version 2.1.1.

It's frustrating because you can easily see the url with standard browser using developer tools, but when the page is rendered with phantomjs the url gets obfuscated?

I'm not python guy, but as much as I can see in your openload extractor code you don't use any additional decoding of the stream url, you just output it as plain text, right?

Also I'm setting only user-agent header, no additonal headers are required in request?

I'll be really grateful for your answer on this or anybody who would be kind enough to help out. :)

@Hrxn
Copy link

@Hrxn Hrxn commented Jan 16, 2018

Maybe @Tithen-Firion knows the answer?

@yan12125
Copy link
Collaborator

@yan12125 yan12125 commented Jan 16, 2018

Also I'm setting only user-agent header, no additonal headers are required in request?

A point. openload extraction is sensitive to user-agent. It might be better to avoid another user-agent than the default one.

Anyway pasting full verbose log helps debugging.

@Tithen-Firion
Copy link
Contributor

@Tithen-Firion Tithen-Firion commented Jan 16, 2018

@steebchen no because it's not in video's src until you click on specific element. Their JS code is encoded with aaencode and some obfuscator. I've got some dirty-sketch of deobfuscator but even if we use it they will change it to something else.

@bagud take a look at this. Keep in mind that video urls work only on the same IP that accessed the site.

It's the same thing over again. In past they changed how the video url is encoded. Now they change where that url is stored. Of course you can scan all elements searching for specific pattern but it's easy to add dummy ones. What we need is to simulate click event. Tried that with PhantomJS but it failed.

@steebchen
Copy link
Contributor

@steebchen steebchen commented Jan 16, 2018

@Tithen-Firion In my browser, it's directly filled in without clicking anything. And even if, maybe you could just click on the play button and wait for it to be filled in. (EDIT: yep, that's what you said, didn't even read your comment till the end)

@Tithen-Firion
Copy link
Contributor

@Tithen-Firion Tithen-Firion commented Jan 16, 2018

Tested https://openload.co/f/*/ and https://openload.co/embed/*/ in Firefox, Chrome and PhantomJS. Src not set.

As for the click part: read my previous comment till the end.

@bagud
Copy link

@bagud bagud commented Jan 17, 2018

Thank you all for your suggestions. I managed to solve my problem checking into @Tithen-Firion code.

page.onInitialized = function() { page.evaluate(function() { delete window._phantom; delete window.callPhantom; }); };
I was missing this part in my code.

I naively assumed setting up custom user agent header would be enough to hide PhantomJS browser, but it's not. You have to manually delete those two references also (which I didn't know existed), otherwise you can be still easily identified as PhantomJS and denied, what openload obviously does check.

@Tithen-Firion I have tried to simulate click event and it fires, eg. I can see element class changes, but there must be something else preventing generation of src attribute, but I assume that you already know that :). If I come up with something I'll post it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

None yet
7 participants
You can’t perform that action at this time.