Scrapinghub

Turn web content into useful data

Python 182 32

scrapyrt

Scrapy realtime

Updated May 5, 2016

splash

Lightweight, scriptable browser as a service with an HTTP API

Updated May 5, 2016

Python 184 42

frontera

A scalable frontier for web crawlers

Updated May 5, 2016

Python 0 0

scrapinghub-stack-portia

Software stack used to run Portia spiders in the Scrapinghub cloud

Updated May 5, 2016

JavaScript 4,158 602

portia

Visual scraping for Scrapy

Updated May 5, 2016

Python 2 5

doc.scrapinghub.com

Scrapinghub Documentation

Updated May 4, 2016

Python 0 0

shub-image

Client-side tool to prepare Docker images to run crawlers in Scrapinghub

Updated May 4, 2016

Python 15 19

python-hubstorage

HubStorage client library

Updated May 4, 2016

Python 1 1

scrapinghub-stack-hworker

Updated May 4, 2016

extruct

Extract embedded metadata from HTML markup

Updated Apr 29, 2016

Python 4 2

scrapinghub-entrypoint-scrapy

Scrapy entrypoint for Scrapinghub job runner

Updated Apr 28, 2016

Python 492 61

dateparser

Python parser for human-readable dates

Updated Apr 27, 2016

Python 20 23

shub

Scrapinghub Command Line Client

Updated Apr 22, 2016

kafka-docker

forked from wurstmeister/kafka-docker

Updated Apr 22, 2016

Python 5 1

kafka-scanner

High Level Kafka Scanner

Updated Apr 21, 2016

spark

forked from apache/spark

Mirror of Apache Spark

Updated Apr 20, 2016

Shell 9 4

docker-images

Updated Apr 14, 2016

Python 1 202

dulwich

forked from duendex/dulwich

Pure-Python Git implementation

Updated Apr 13, 2016

Python 88 50

testspiders

Useful test spiders for Scrapy

Updated Apr 12, 2016
