Skip to content

HTML parser that intend to be fast, but wasn't benchmarked or optimized yet

License

Notifications You must be signed in to change notification settings

azazar/uncaring-html-parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Uncaring HTML Parser

HTML parser that intend to be fast, but I didn't benchmark it yet. And it's yet to be documented. And it doesn't parse all HTML. It searches given HTML for matching tags without parsing whole HTML or building it's DOM.

Example usage

String html = "<a href='http://example.org/' class='link'>";

new Html(html).css("a.link").filter(a -> a.attr("href").contains("http:")).forEach(e -> {
    System.out.println(e.getTextContent() + " : " + e.attr("href"));
});

Repositories

I'm not planning to upload it to any public Maven repositories, but it can be linked using JitPack repository.

About

HTML parser that intend to be fast, but wasn't benchmarked or optimized yet

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages