Automate parser generator for HTML documents
Show Tokenizing Result
-l : show all tree nodes.
--search : search specific token
--trace : trace selected token
--select : select token
--rule : find rule for linear approach
Regex, XPath and BeautifulSoup
--xpath-to-soup-rule : translate xpath to soup style path rule
--soup-rule-to-xpath : translate soup style path rule to xpath
--regex-to-xpath : translate regex format to xpath list or xpath linear approach
--xpath : run xpath
--soup : run soup
--regex : run regex
HTML Formatting
--format : formatting html
Tools
--find-media : find media download links
--transform : transform video format