End to End Data Science Without Leaving the GPU

Source materials from Randy Zwitch's talk 'End to End Data Science Without Leaving the GPU' at PyData NYC 2018

Build Environment

To get all of the packages working in harmony, I built libGDF/pygdf from source after building pymapd. I've included the conda environment file (spec-file.txt), which you can use to recreate the environment:

conda create --name pydatanyc2018 --file spec-file.txt

For the best chance of replicating this build environment, use Docker with Ubuntu 16.04 and nvidia-docker2.

Data

I've included the PJM RTO data created by the df query in cell 3 of the notebook. To follow along without running that query, load the example data (exampledata.csv.gz) into pandas, then run the code examples that come after the query step.
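As a minimal sketch of that loading step, the snippet below reads the compressed CSV into a pandas DataFrame. It assumes exampledata.csv.gz sits in the current working directory; the helper name load_example_data is hypothetical and does not come from the notebook, and no column names are assumed.

```python
from pathlib import Path

import pandas as pd

# Assumption: the archive is in the current working directory (the repository root).
EXAMPLE_PATH = Path("exampledata.csv.gz")


def load_example_data(path=EXAMPLE_PATH):
    """Read the gzip-compressed CSV into a pandas DataFrame.

    Hypothetical helper for illustration; it is not part of the notebook.
    """
    # compression="infer" selects gzip based on the ".gz" suffix.
    return pd.read_csv(path, compression="infer")


# Only attempt the load when the file is actually present.
if EXAMPLE_PATH.exists():
    df = load_example_data()
    print(df.shape)   # number of rows and columns
    print(df.dtypes)  # column types as inferred by pandas
```

The resulting df can stand in for the output of the query in cell 3, so the later cells of the notebook can be run against it.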

