Skip to content

Automatically extracts structured information from webpages

License

Notifications You must be signed in to change notification settings

dscherr/web-auto-extractor

 
 

Repository files navigation

This is my fork of raine/web-auto-extractor where I've fixed some small bugs like null pointer exceptions, because it seem's that the origin project is no longer actively developed.

raine/web-auto-extractor is very very useful for a web crawler project in which i am involved and solved a lot of problems much better than other libs.

The functionality ist 100% the same as in raine/web-auto-extractor, so you find the documentation there.

About

Automatically extracts structured information from webpages

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • HTML 55.2%
  • JavaScript 44.0%
  • Shell 0.8%