Skip to content

ExtractorHTML when a/@href links include the attribute data-remote="true...#80

Merged
vonrosen merged 1 commit intointernetarchive:masterfrom
nlevitt:ari-3617-data-remote
Sep 4, 2014
Merged

ExtractorHTML when a/@href links include the attribute data-remote="true...#80
vonrosen merged 1 commit intointernetarchive:masterfrom
nlevitt:ari-3617-data-remote

Conversation

@nlevitt
Copy link
Copy Markdown
Contributor

@nlevitt nlevitt commented Jul 19, 2014

...", include that info the viaContext; and in FetchHTTPRequest, when a CrawlURI has data-remote="true", add the http header "X-Requested-With: XMLHttpRequest"; data-remote is a jquery thing that means the url can be requested via ajax in a browser... some urls work wrong without the X-Requested-With header... this is not a great place for a non-generic thing like this, but where else could it go...

don't merge yet

…rue", include that info the viaContext; and in FetchHTTPRequest, when a CrawlURI has data-remote="true", add the http header "X-Requested-With: XMLHttpRequest"; data-remote is a jquery thing that means the url can be requested via ajax in a browser... some urls work wrong without the X-Requested-With header... this is not a great place for a non-generic thing like this, but where else could it go...
vonrosen pushed a commit that referenced this pull request Sep 4, 2014
ExtractorHTML when a/@href links include the attribute data-remote="true...
@vonrosen vonrosen merged commit 035a7b0 into internetarchive:master Sep 4, 2014
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants