Tutorials on Python Data Containers for no-so-BigData
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
movielens-1m
.gitignore
0-Intro.ipynb
1-Memory-Profilers.ipynb
2-A-Reduction-Tale.ipynb
3-In-Memory-Tables.ipynb
4-On-Disk-Tables.ipynb
LICENSE
README.rst

README.rst

Notebooks and other materials for the Data Containers tutorial

You can get the latest release of the materials here:

https://github.com/FrancescAlted/DataContainers/releases

Also, make sure that you have the next Python packages installed:

  • numpy
  • numexpr
  • pandas
  • bcolz
  • tables (pytables)
  • matplotlib
  • psutil
  • memory_profiler
  • ipython_memwatcher

I recommend to use Anaconda to install most of the packages above, and for the software that is not in anaconda.org repos, just use pip, e.g.:

$ pip install ipython_memwatcher

and start by the different tutorials following the numerical order.

** Enjoy data! **