Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
Browse files

more words

  • Loading branch information...
commit 6563a5d14594c0b49cd5fe99c06715bfca4d02bc 1 parent 24b68b6
@tenderlove authored
Showing with 4 additions and 0 deletions.
  1. +4 −0 nokogiri.markdown
View
4 nokogiri.markdown
@@ -116,6 +116,10 @@ will be represented in memory with a tree that looks like this:
![HTML Tree](html_tree.png)
+Any data extraction technique we will use is simply a way for traversing this
+in-memory tree. If we keep this structure in mind while trying to do data
+extraction, we can truly enter parsing nirvana!
+
## Data Extraction
### Basic XPath
Please sign in to comment.
Something went wrong with that request. Please try again.