Skip to content
Web data extraction tool implemented as chrome extension
JavaScript HTML CSS
Branch: master
Clone or download
Pull request Compare This branch is 11 commits ahead of martinsbalodis:master.
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.settings
.vs
docs
extension
jasmine-standalone/lib/jasmine-1.3.1
playgrounds
tests
.gitignore
.gitmodules
.project
LICENSE
README.md

README.md

Idra Sitemap Creator

Idra Sitemap Creator is a chrome browser extension, forked from the WebScaper.io project, built for Sitemap Creation, to be used in the Idra - Open Data Federation Platform, in order to federate through scraping a generic Open Data Web Catalogue. Using this extension you can create a plan (sitemap) how a web site should be traversed and what should be extracted. Using these sitemaps the Idra Platform will navigate the site accordingly and extract all data to be mapped to DCAT-AP Datasets.

Features

  1. Create Sitemap
  2. Sitemaps are stored in browsers local storage
  3. Multiple data selection types
  4. Import, Export sitemaps (to be imported in Idra)

Help

The instruction to install and the Scraping Guide of Idra can be found at the corresponding section of Read The Docs.

Submit bugs and suggest features on [bug tracker] github-issues

License

LGPLv3

You can’t perform that action at this time.