Skip to content
This repository has been archived by the owner on Dec 9, 2018. It is now read-only.

Is it possible to get the raw HTML from a Session or Node? #17

Closed
arne-cl opened this issue Mar 29, 2013 · 1 comment
Closed

Is it possible to get the raw HTML from a Session or Node? #17

arne-cl opened this issue Mar 29, 2013 · 1 comment

Comments

@arne-cl
Copy link

arne-cl commented Mar 29, 2013

Dear Niklas,

I am trying to parse parts of a weirdly formatted website, where .at_xpath() and .at_css() don't help much. Is it somehow possible to retrieve the raw HTML that a Node or Session instance represent?

Kind regards,
Arne

@niklasb
Copy link
Owner

niklasb commented Mar 29, 2013

Hello Arne,

if I am not mistaken, you should be able to use session.body() to get the HTML and session.document() to get a parsed version of the document (using the lxml library).

Greetings,
Niklas

@niklasb niklasb closed this as completed Apr 2, 2013
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants