Skip to content

ravi2k1/Data-Mining-Python

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

67 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Python

Basic Python setup for data analysis

The IPython notebook runs in the browser, and works best in Google Chrome. The Anaconda Python distribution is an easily-installable bundle of Python and many commonly used libraries. Download the appropriate version of Anaconda here.

Extra libraries that are useful for python:

  1. BeautifulSoup
  2. Pattern
  3. Seaborn
  4. MrJob

The recommended way to install these packages is to run pip install BeautifulSoup mrjob pattern seaborn on the command line. If this doesn't work, you can download the source code, and run python setup.py install from the source code directory. On Unix machines, either of these commands may require sudo (i.e. sudo pip install... or sudo python)

References

  1. Learning Python (5th Edition) by Mark Lutz
  2. Python for Data Analysis by Wes McKinney
  3. Introduction to Python for Econometrics, Statistics and Data Analysis by Kevin Sheppard

About

Sheng's python codes for data manipulation and data mining

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Jupyter Notebook 95.8%
  • HTML 3.5%
  • Python 0.7%