THIS DATA SET IS BEING REWORKED BASED ON NEW DATA BY THE COMPTROLLER ON DATA.TEXAS.GOV
The comptroller now stores 10 years worth of Mixed Beverage Gross Receipts data on the Texas' Socrata portal.
- I'll first rework this on the new data layout using downloaded data.
- I'll then try to use the Socrata API to pull the data we need.
This repository is a lesson in a repeatable, transparent data pipeline using Jupyter Notebooks and the Python package called agate.
It uses a conda
virtual environment.
(Note: If you are using a different virtual environment, you can use mixbev-pip.txt as a requirements file.)
This should only have to be done once on your machine. Assumes you are on Mac, but this should work with Windows.
- run
conda create -n mixbev --file mixbev-env.txt
to create the mixbev environment that has the pyton packages you need - run
source activate mixbev
to load the environment so we can add theagate
package that is not in conda. - run
pip install agate
to install the agate package, which is not included in the conda system.
cd
into themixbev-receipts
- run
source activate mixbev
to enter the virtual environment - run
jupyter notebook
to start the notebook - Your browser will open jupyter
- Go inside the
notebooks
directory and chooseMixed-beverages-agate.ipynb
The rest of the directions are in the Mixed beverages agate notebook.