Skip to content

acmauth/mlintro

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

A Machine Learning Intro by ACM AUTH Student Chapter sfhmmy_logo

Getting started with Machine Learning

ℹ️ This is a a WIP project, join our team or contribute freely

📝 Setting up your Dev environment

- Python - Libraries - Editor

- Working online & offline

Python Installation

Version: Python 3.6.+ (not 2.7 or 3.7)

Manual installation (Not Recommended)

Windows - Download

Linux Based - Already Installed

  • Check version: python --version or python3 --version
  • Update/ Install: sudo apt-get install python3
  • Can also use Anaconda (see below)

macOS - Follow this guide or use Anaconda (see below)

Anaconda Installation (Recommended)

Anaconda is a downloadable, free, open source, high-performance and optimized Python distribution.

Anaconda includes:

  • conda & conda build
  • Python (choose version, change versions easily)
  • 100+ automatically installed, open source scientific packages and their dependencies that have been tested to work well together, including SciPy, NumPy and many others.

-- Includes everything that you need and is the best for new users--

Available for Windows, macOS and Linux, all versions of Anaconda are supported by the community.


Conda environments

With conda, you can create, export, list, remove, and update environments that have different versions of Python and/or packages installed in them. Switching or moving between environments is called activating the environment. You can also share an environment file.

Available Python is 3.7. You can downgrade it to 3.6.6 with conda install python=3.6

Using the command line:

  • Verify conda is installed, check version number: conda info
  • Install package: pip install package_name or conda install package_name
  • Create new empty environment named ENVNAME: conda create --name ENVNAME or clone an existing environment conda create --name ENVNAME --clone EXISTING_ENVIRONMENT
  • Activate a named conda environment conda activate ENVNAME
  • Deactivate the current environment conda deactivate
  • List all packages and versions in the active environment conda list
  • More commands on environments here

Important Libraries for Machine Learning

WILL BE UPDATED ACCORDINGLY

NumPy -> N-dimensional array for numerical computation.

Pandas -> Powerful Python data structures and data analysis toolkit.

scikit-learn -> Python modules for machine learning and data mining

Matplotlib Seaborn Bokeh -> Plotting libraries for Python

NLTK Gensim -> The leading platforms for building Python programs to work with human language data.

TensorFlow Keras PyTorch -> Deep learning frameworks

🍩 IDEs and Notebooks

All in one IDEs

  • Go for the ultimate version as a student
  • Has everything you'll ever need (editor, auto-compeltion, debugger, database connect, a ton of plugins)
  • Perfect if you already use JetBrains products
  • Works with Anaconda (can create environments), remote interpreters and native python.
  • Version Control (Git etc.) & Deployment
  • Also cool, and open source
  • Better for begginers and those coming from R Studio and MatLab...
  • Variable Explorer

Notebooks

The notebook extends the console-based approach to interactive computing in a qualitatively new direction, providing a web-based application suitable for capturing the whole computation process: developing, documenting, and executing code, as well as communicating the results.

  • Forget all the above, just a notebook - just for scripting
  • Easy to use, comes with Anaconda
  • A web application, combine explanatory text, mathematics, computations and their rich media output.
  • Work from your browser, lightweight, further installs
  • Execute code in pieces and in any way you want.

Command Cheatsheet - Applies to Colab also

Google Colaboratory or just Colab!

A jupyter notebook online

  • No installation, both CPU and GPU/TPU usage
  • Aims to collaboration and sharing
  • Added support - LaTex Mathematics
  • Can also run on your local Python Installation
  • Interaction with GitHub and Google Drive

🔥 You first machine learning code

Presentations

sfhmmy_logo

ΣΦΗΜΜΥ11 2019

Notebooks

ΣΦΗΜΜΥ11 2019

Datasets

California Housing Dataset (with ocean proximity)

Other Resources

Other resources that can help you on your first steps with machine learning

📖 Books

  • Géron, A. (2017). Hands-on machine learning with Scikit-Learn and TensorFlow: concepts, tools, and techniques to build intelligent systems. " O'Reilly Media, Inc.".
  • Flach, P. (2012). Machine learning: the art and science of algorithms that make sense of data. Cambridge University Press.
  • Mitchell, T., Buchanan, B., DeJong, G., Dietterich, T., Rosenbloom, P., & Waibel, A. (1990). Machine learning. Annual review of computer science, 4(1), 417-433.

💻 Online Courses

:octocat: Repositories

About

Getting started with Machine Learning

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published