A portable version of Franz's Opensource HTML Parser


Introduction

CL-HTML-PARSE is a slash-and-burn port of Franz's phtml HTML parser.

I made a few small changes to the source, mainly involving the call to excl:intern* in collect-tag. In particular, all tag names are now string-upcased before being interned. There is probably a better solution, but I'm not very Allegro-savvy.
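As a rough illustration of what upcasing before interning means, here is a minimal sketch in portable Common Lisp. The helper name tag-keyword is hypothetical and is not part of this library. The sketch interns into the KEYWORD package because the parser's output uses keyword tags; it is not the actual code in collect-tag.

```lisp
;; Hypothetical helper, for illustration only; not part of cl-html-parse.
(defun tag-keyword (name)
  "Upcase the string NAME and intern it in the KEYWORD package."
  (intern (string-upcase name) :keyword))

;; Differently cased spellings of a tag name map to the same symbol.
(tag-keyword "title")                             ; => :TITLE
(eq (tag-keyword "Title") (tag-keyword "TITLE"))  ; => T
```

Upcasing first matches the default reader case, so the resulting symbols print as plain :TITLE rather than :|title|.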

Any questions or concerns should be directed to me, Gary Warren King at gwking@metabang.com.

Example #1

(html-parse:parse-html "
<html>
<head>
<title>Parsing HTML is Phun</title>
</head>
<body>
<h1>Why is it phun?</h1>
<p>Parsing HTML is phun because angle brackets are better than parentheses.</p>
</body>
</html>
")

==>

((:HTML (:HEAD (:TITLE "Parsing HTML is Phun")) (:BODY (:H1 "Why is it phun?") (:P "Parsing HTML is phun because angle brackets are better than parentheses."))))
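Because the result is ordinary nested lists with keyword heads, it can be walked with standard list functions. The following is a minimal sketch; find-tag is a hypothetical helper and not part of this library. It only recognizes elements whose first item is the bare tag keyword, as in the output above, and does not handle any other element representation (for example, elements carrying attributes).

```lisp
;; Hypothetical helper, for illustration only; not part of cl-html-parse.
(defun find-tag (tag tree)
  "Return the first sub-form of TREE whose first element is TAG, or NIL."
  (cond ((atom tree) nil)
        ((eq (car tree) tag) tree)
        (t (some (lambda (child) (find-tag tag child)) tree))))

;; The parse result from Example #1, written out as a literal.
(defparameter *parsed*
  '((:html (:head (:title "Parsing HTML is Phun"))
           (:body (:h1 "Why is it phun?")
                  (:p "Parsing HTML is phun because angle brackets are better than parentheses.")))))

(second (find-tag :title *parsed*))  ; => "Parsing HTML is Phun"
(second (find-tag :h1 *parsed*))     ; => "Why is it phun?"
```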

Example #2

(html-parse:parse-html )

==> ((:HTML ...))
