Is it possible to get the raw HTML from a Session or Node? #17

arne-cl · 2013-03-29T10:22:21Z

Dear Niklas,

I am trying to parse parts of a weirdly formatted website, where .at_xpath() and .at_css() don't help much. Is it somehow possible to retrieve the raw HTML that a Node or Session instance represent?

Kind regards,
Arne

niklasb · 2013-03-29T15:12:00Z

Hello Arne,

if I am not mistaken, you should be able to use session.body() to get the HTML and session.document() to get a parsed version of the document (using the lxml library).

Greetings,
Niklas

niklasb closed this as completed Apr 2, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is it possible to get the raw HTML from a Session or Node? #17

Is it possible to get the raw HTML from a Session or Node? #17

arne-cl commented Mar 29, 2013

niklasb commented Mar 29, 2013

Is it possible to get the raw HTML from a Session or Node? #17

Is it possible to get the raw HTML from a Session or Node? #17

Comments

arne-cl commented Mar 29, 2013

niklasb commented Mar 29, 2013