coveo-labs/web-scraper-helper

web-scraper-helper

Now available from the Chrome Web Store

⚠️ Experimental, use at your own risk.

This repository is shared for informational purposes. It is not officially supported and is not intended for production use.
For assistance, open a GitHub issue instead of contacting Coveo support.

The Coveo Cloud V2 Web and Sitemap source types can use a web scraping configuration to exclude web page sections, extract metadata, and create sub-items for web pages to index (see Web Scraping Configuration).
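
To make the three capabilities above concrete, here is a minimal sketch that builds a scraping configuration as a Python structure and serializes it to JSON. The key names (`for`, `exclude`, `metadata`, `subItems`, `type`, `path`) reflect our reading of the Web Scraping Configuration documentation and should be checked against it. The selectors, the `author` metadata name, and the `comment` sub-item name are hypothetical values chosen for illustration only.

```python
import json


def build_scraping_config():
    """Return a sample web scraping configuration (illustrative values only)."""
    return [
        {
            # Assumed: URL patterns this rule applies to (here, every page).
            "for": {"urls": [".*"]},
            # Page sections to leave out of the indexed content.
            "exclude": [
                {"type": "CSS", "path": "header"},
                {"type": "CSS", "path": "footer"},
            ],
            # Metadata to extract; "author" and its selector are hypothetical.
            "metadata": {
                "author": {"type": "CSS", "path": ".author::text"},
            },
            # Sub-items to create; "comment" and its selector are hypothetical.
            "subItems": {
                "comment": {"type": "CSS", "path": "div.comment"},
            },
        }
    ]


# Serialize to JSON text that could be pasted into a source configuration.
config_json = json.dumps(build_scraping_config(), indent=2)
print(config_json)
```

The extension is meant to produce and test this kind of configuration interactively, so a script like this is mainly useful for seeing the overall shape of the result.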

The web-scraper-helper runs directly in your browser, so you can create and test a web scraping configuration while visiting the pages you want to scrape.

Description

Information about the project is in the README.

The How-To Guide is also useful.

The web-scraper-helper uses Amplitude to understand user behavior and to improve the extension based on these insights. We respect user privacy and only track non-personally identifiable information. This document contains a detailed overview of the specific events that are tracked.

Contribution

If you want to contribute:

  1. Fork the repo.
  2. Make the desired changes in your fork.
  3. Open a pull request from your fork.