streamlined html tag parser (deprecated in favor of tagstream-conduit)
Haskell
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
Text/HTML
tests
.gitignore
FilterUrl.hs
Highlight.hs
LICENSE
Parse.hs
README.rst
Setup.hs
TestTagSoup.hs
tag-stream.cabal

README.rst

What's this

The purpose of tag-stream is to process html in a streamlined fasion, it can tolerate some bad htmls, but it don't handle bad html structure.

Tag-stream parse HTML/XML into a token stream. It also provides an Enumeratee named tokenStream which runs in constant memory.

You can start from tests/Tests.hs to see what it can do.