Skip to content

Parsers for HTML that convert possibly malformed HTML into well-formed XML

License

Notifications You must be signed in to change notification settings

pharo-contributions/XML-XMLParserHTML

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

XML-XMLParserHTML

Build Status Coverage Status

XMLParserHTML provides SAX and DOM parsers in Pharo for HTML that convert possibly malformed HTML into well-formed XML.

Installation

Metacello new
	baseline: 'XMLParserHTML';
	repository: 'github://pharo-contributions/XML-XMLParserHTML/src';
	load.

Usage

A simple example on how to use the XML parser for HTML:

...

results in the following XML output

...

Scrapping

This library together with XPath enables you to do web scrapping from the confort of the Pharo toolset.

You can learn more about how to do it reading the Scrapping with XPath booklet.

LICENSE

MIT License

History

This project was migrated from http://smalltalkhub.com/#!/~PharoExtras/XMLParserHTML

About

Parsers for HTML that convert possibly malformed HTML into well-formed XML

Resources

License

Stars

Watchers

Forks

Packages

No packages published