Skip to content
@scrapinghub

Scrapinghub

Turn web content into useful data

splash

Lightweight, scriptable browser as a service with an HTTP API

Updated May 24, 2016

Python 12 3

flatson

Tool to flatten stream of JSON-like objects, configured via schema

Updated May 24, 2016

Python 505 61

dateparser

python parser for human readable dates

Updated May 24, 2016

kafka-docker

forked from wurstmeister/kafka-docker

Updated May 24, 2016

Python 205 49

frontera

A scalable frontier for web crawlers

Updated May 23, 2016

JavaScript 4,199 611

portia

Visual scraping for Scrapy

Updated May 23, 2016

Python 2 6

doc.scrapinghub.com

Scrapinghub Documentation

Updated May 20, 2016

Python 0 10

pgcontents

forked from quantopian/pgcontents

A Postgres-backed ContentsManager implementation for IPython

Updated May 20, 2016

Python 21 24

shub

Scrapinghub Command Line Client

Updated May 19, 2016

Python 7 0

scrapy-mosquitera

Restrict crawl and scraping scope using matchers.

Updated May 19, 2016

Python 0 0

scrapinghub-stack-portia

Software stack used to run Portia spiders in Scrapinghub cloud

Updated May 19, 2016

Python 188 34

scrapyrt

Scrapy realtime

Updated May 17, 2016

Python 5 2

scrapinghub-entrypoint-scrapy

Scrapy entrypoint for Scrapinghub job runner

Updated May 16, 2016

Shell 9 4

docker-images

Updated May 16, 2016

drone

forked from drone/drone

Drone is a Continuous Integration platform built on Docker, written in Go

Updated May 13, 2016

Python 5 1

kafka-scanner

High Level Kafka Scanner

Updated May 12, 2016

Shell 1 2

scrapinghub-conda-recipes

Conda packages for scrapinghub channel

Updated May 12, 2016

Python 226 79

scrapylib

Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)

Updated May 11, 2016

Python 15 19

python-hubstorage

HubStorage client library

Updated May 11, 2016

Something went wrong with that request. Please try again.