Skip to content

shannah/CN1HTMLParser

master
Switch branches/tags
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
bin
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

CN1HTMLParser

An HTML parser for Codename One

License

Apache 2.0

Installation

Install through Codename One settings.

If you haven’t activated any cn1libs before in your Codename One projects, see this tutorial which explains the process.

Usage

HTMLParser parser = new HTMLParser();

// Parse HTML string into an Element
// Note: parse() is async, but calling get() will block until the result is available.
Element root = parser.parse(htmlString).get();

// Async version:
// parser.parse(htmlString).ready(root->{...});

// Now just showing some typical stuff you might do with the results
// using existing CN1 tools since the resulting Element is
// the same as returned by XMLParser
Result r = Result.fromContent(root);
// Update images
List<Element> images = r.getAsArray("//img");
int index = 0;
List<String> toLoad = new ArrayList<>();
if (images != null) {
	for (Element img : images) {
		String src = img.getAttribute("src");
		if (src.startsWith("http://*/") || (!src.startsWith("http://") && !src.startsWith("data:") && !src.startsWith("https"))) {
			img.setAttribute("id", "nt-image-"+index);
			toLoad.add(src);
			img.setAttribute("src", "");
			index++;
		}
	}
}

// Print out document as well-formed XML.
XMLWriter writer = new XMLWriter(true);
String pageContent = writer.toXML(root);

How it Works

This library uses an off-screen BrowserComponent to parse the HTML, then serialize it to XML. It then passes the XML to the Codename One XMLParser to parse.

Building from Source

git clone https://github.com/shannah/CN1HTMLParser
cd CN1HTMLParser
./antw jar

Credits

  1. Created by Steve Hannah

About

HTML Parser for Codename One

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published