Internationalized pages being scraped with wrong locale. #207

Open
kulikalov opened this issue Aug 19, 2016 · 2 comments

@kulikalov

Subject of the issue

For example, I want to scrape a book page from Google Play. Google Play is available in multiple languages, with no language marker in the URL. How can I scrape the version in the language I need instead of a random one?

Your environment

"x-ray": "^2.2.0"
node --version: 5.9.0
npm --version: 3.7.3

Steps to reproduce

Try to scrape any page from Google Play, for example:
https://play.google.com/store/books/details/Walter_Isaacson_Steve_Jobs?id=I6R8MXStPXgC

Expected behaviour

I need to be able to specify the locale I want to get.

Actual behaviour

It scrapes the page in a random language (Ukrainian, Spanish, Portuguese), even though I'm always running x-ray from the same VPS.

@kulikalov

Any guesses how to fix that?

@kulikalov

The solution is to allow users to specify request headers. If headers could be passed from x-ray through x-ray-crawler to superagent, that would solve the problem.
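
Until something like that exists, a possible workaround is to fetch the page yourself with an explicit `Accept-Language` header and only then hand the HTML to the scraper. The sketch below uses only Node's built-in `https` and `url` modules. The helper names `localizedRequestOptions` and `fetchLocalized` are hypothetical, and it is an assumption that Google Play honors `Accept-Language`; I have not verified that the server picks the language from this header alone. The sketch also does not follow redirects.

```javascript
// Sketch only: request a page with an explicit Accept-Language header.
// Helper names are hypothetical; they are not part of x-ray or x-ray-crawler.
const https = require('https');
const { URL } = require('url');

// Build options for https.get() that ask the server for a single locale.
function localizedRequestOptions(pageUrl, locale) {
  const { hostname, pathname, search } = new URL(pageUrl);
  return {
    hostname,
    path: pathname + search,
    // Assumption: the server uses this header to choose the page language.
    headers: { 'Accept-Language': locale },
  };
}

// Resolve with the response body as a UTF-8 string (redirects are not followed).
function fetchLocalized(pageUrl, locale) {
  return new Promise((resolve, reject) => {
    https
      .get(localizedRequestOptions(pageUrl, locale), (res) => {
        let body = '';
        res.setEncoding('utf8');
        res.on('data', (chunk) => { body += chunk; });
        res.on('end', () => resolve(body));
      })
      .on('error', reject);
  });
}

// Usage (not run here because it needs network access):
// fetchLocalized('https://play.google.com/store/books/details/Walter_Isaacson_Steve_Jobs?id=I6R8MXStPXgC', 'en-US')
//   .then((html) => { /* pass `html` to the scraper */ });
```

If your x-ray version accepts a raw HTML string in place of a URL as the source, the resolved `html` can be passed to it directly; otherwise any HTML parser will do for the extraction step.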
