Skip to content

HTML documents automate parser generator for High-level languages

License

Notifications You must be signed in to change notification settings

rollrat/automate-html

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

automate-html

Automate parser generator for HTML documents

GUI

Functions TODO

 Show Tokenizing Result
 -l       : show all tree nodes.
 --search : search specific token
 --trace  : trace selected token
 --select : select token
 --rule   : find rule for linear approach
 
 Regex, XPath and BeautifulSoup
 --xpath-to-soup-rule : translate xpath to soup style path rule
 --soup-rule-to-xpath : translate soup style path rule to xpath
 --regex-to-xpath : translate regex format to xpath list or xpath linear approach
 --xpath : run xpath
 --soup  : run soup
 --regex : run regex
 
 HTML Formatting
 --format : formatting html
 
 Tools
 --find-media : find media download links
 --transform  : transform video format

About

HTML documents automate parser generator for High-level languages

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages