Skip to content

TruffleX/restaurants

Repository files navigation

restaurants

Overview

overview

Setup

Get docker. Then:

make setup
make build
make run

After this, you can set up a cron to run our ingest code:

env EDITOR=nano crontab -e
0 23 * * * cd $TRUFFLEX_PATH && make update_db && make yelp_ingest

If you get errors, call Aaron. make setup may require sudo depending on your local permissions.

Commands:

  • make run: Launch Environment
  • make jupyter: Launch jupyter notebook in environment
  • make update: Update DB with new RSS entries (from environment)
  • make app-dev: Run the web app in dev mode (localhost:8000).

Tools

  • etl/dbpedia.py: sparql client for grabbing dbpedia entries.
  • etl/rss.py: executable script for updating mongo db with latest from known RSS feeds.
  • db/dbclient: Client for interacting with mongo db

Internal

dbpedia:

RSS

  • Within the container, run python src/scripts/update_db.py to collect new RSS entries and write them to db.
  • To add a new RSS entry, update the update_db.py script (RSS_URLS) (to do: come up with a better way of doing this)

Reviews

Will keep each dataset compressed on s3. I'm keeping them public for now. We'll coordinate on sharing AWS resources and make it private.

  • Yelp. Run data/yelp/get_data.py to grab it..

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors