Deprecated. Yet another feed aggregator. Implemented in Python using Hbase as datastore.
Python
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
aggregator
doc
static/images
templates
tests
.gitignore
Hbase.thrift
INSTALL
LICENSE
README
TODO
aggregator.py
schema.txt
settings.py
test.py

README

Hbase powered feed aggregator
-----------------------------

WARNING: This is deprecated. It works only with HBase 0.19 and an older version of Apache Thrift. I will keep the repository here because I believe it's still useful as a proof-of-concept. 

Install guide
-------------

See INSTALL for detailed install steps.

Hbase schema
------------

The aggregator is using three tables:
 - `Feeds` - for storing raw feed and some limited metadata
 - `Urls` - for storing the extracted urls from the feeds
 - `UrlsIndex` - an index table used to generate the aggregated feed