Skip to content
HTML form parser for humans.
Python
Branch: master
Clone or download

Latest commit

Fetching latest commit…
Cannot retrieve the latest commit at this time.

Files

Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
zetanize
.whitesource
LICENSE
README.md
setup.py

README.md


zetanize
zetanize

HTML Form Parser For Humans

Introduction

It's very easy to make HTTP requests in python, thanks to urllib and requests. However, there was no way to submit HTML forms on the go, well now there is.

Documentation

from zetanize import zetanize
forms = zetanize(html)

Well that's it! Just feed zetanize a HTML document and it will give you a dict of actionable form data.
Let's parse https://google.com for getting familiar:

from requests import get
from zetanize import zetanize

html = get('https://google.com').text
forms = zetanize(html)

Here's the output:

{
  "0": {
    "action": "/search", 
    "inputs": [
      {
        "type": "hidden", 
        "name": "ie", 
        "value": "ISO-8859-1"
      }, 
      {
        "type": "hidden", 
        "name": "hl", 
        "value": "en-IN"
      }, 
      {
        "type": "hidden", 
        "name": "source", 
        "value": "hp"
      }, 
      {
        "type": "hidden", 
        "name": "biw", 
        "value": ""
      }, 
      {
        "type": "hidden", 
        "name": "bih", 
        "value": ""
      }, 
      {
        "type": "", 
        "name": "q", 
        "value": ""
      }, 
      {
        "type": "submit", 
        "name": "btnG", 
        "value": "Google Search"
      }, 
      {
        "type": "submit", 
        "name": "btnI", 
        "value": "I"
      }, 
      {
        "type": "hidden", 
        "name": "gbv", 
        "value": "1"
      }
    ], 
    "method": "get"
  }
}
You can’t perform that action at this time.