Crawl nested html documents loaded dynamically #185
Comments
Maybe you could use the
@ab14p
@yujiosaka @BubuAnabelas
@ab14p
Closing this issue because no information has been provided for a month.
What is the current behavior?
#150
With the robots.txt option set to false.
The crawler waits using waitFor with a timeout of 10 seconds, but it is still unable to crawl nested documents. For example, I want to extract the src of an <iframe> that contains an ad (display advertisement).
I enabled the screenshot option to check whether the ad iframe had loaded before the evaluatePage function was executed. I can see the ad in the screenshot, but the function does not return the src of the <iframe>.
What is the expected behavior?
Be able to crawl nested HTML documents, such as ads, that are loaded dynamically by JavaScript.
Could you please provide an example or a solution for this?
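One possible direction, offered as a rough sketch rather than a confirmed fix: a function evaluated in the top-level document only sees that document's DOM, so an iframe nested inside another iframe would not show up there. Puppeteer exposes the frame tree through Frame objects with url() and childFrames(), so the nested frames can be walked from the outside. The helper name collectNestedFrameUrls, the fakeFrame stand-in, and the example URLs below are all hypothetical; the stand-in only exists so the sketch runs without a browser.

```javascript
// Recursively collect the URL of every frame nested under `frame`.
// `frame` is assumed to look like a Puppeteer Frame: it exposes
// url() and childFrames(). The helper name is hypothetical.
function collectNestedFrameUrls(frame, depth = 0) {
  const results = [];
  for (const child of frame.childFrames()) {
    results.push({ url: child.url(), depth: depth + 1 });
    // Descend into frames nested inside this child frame.
    results.push(...collectNestedFrameUrls(child, depth + 1));
  }
  return results;
}

// Stand-in for a real frame tree (hypothetical URLs), used only so
// this sketch can run without launching a browser.
function fakeFrame(url, children = []) {
  return { url: () => url, childFrames: () => children };
}

const mainFrame = fakeFrame('https://example.com/', [
  fakeFrame('https://ads.example.net/outer', [
    fakeFrame('https://ads.example.net/creative'),
  ]),
]);

const nested = collectNestedFrameUrls(mainFrame);
console.log(nested);
```

With a real Puppeteer page, the same helper would be called as collectNestedFrameUrls(page.mainFrame()) once the ad has had time to load. Getting at the page object from the crawler (for example through a hook that exposes Puppeteer's raw API) is an assumption here. Note also that url() reports the document a frame has navigated to, which can differ from the src attribute: an ad iframe that is written by script without a src may report about:blank.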
Please tell us about your environment: