Skip to content

AWS framework to stream, classify, index and store data

Notifications You must be signed in to change notification settings

Aminoid/garfield

Repository files navigation

Garfield

AWS framework to stream, classify, index and store it using Kinesis, S3, ElasticSearch and Lambda.

Install Instructions

  1. Since, we are using boto3 to access AWS components, follow all the instructions on this page.

  2. Install all python dependencies. We are using python 2.7.

pip install requirements.txt
  1. Update the config in config.py with your s3_host, kinesis_stream and Elastic Search endpoint.
  2. Upload models/nb.model to your s3 bucket.

Usage

  1. Run the get_kinesis script and wait for it to be ready. This will classify, index and store the records in kinesis stream. Sometimes, it takes a little while before it starts consuming the stream, so please be patient.
python get_kinesis.py
  1. To create a stream of data into kinesis, run put_kinesis script in another terminal.
python put_kinesis.py

Results

You can check the indexing into Elastic Search using curl command like an example below.

curl -XGET https://search-news-group-pcagpupl573mnu3scbh3wp63vu.us-east-1.es.amazonaws.com/news/news/_search?pretty\&q=category:comp.graphics

About

AWS framework to stream, classify, index and store data

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages