hadoop
Here are 18 public repositories matching this topic...
Recently updated with 50 new notebooks! Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
-
Updated
Apr 10, 2017 - Python
A spark cluster configuration with the Apache Toree notebook
-
Updated
Jan 9, 2018 - Shell
Hadoop beginner exercise in analyzing European football teams' statistics over the last 20 years. The goal is to determine which team had the highest win percentage-rate.
-
Updated
Oct 29, 2022 - Makefile
Exercise of using the Streaming API with Hadoop to determine the word count of Wikipedia articles.
-
Updated
Oct 29, 2022 - Jupyter Notebook
📓 [Active] Portafolio of data science projects. Using: Python, PyTorch, Spark, Tensorflow, Scikit, Keras. Includes Classification, Regression, Time series, NLP, Deep learning, among others.
-
Updated
Feb 27, 2018 - Jupyter Notebook
Hadoop environment with HDFS, Spark, Hue, Jupyter Notebooks, etc. all installed in docker-compose
-
Updated
Mar 25, 2022 - Jupyter Notebook
This is the final project I had to do to finish my Big Data Expert Program in U-TAD in September 2017. It uses the following technologies: Apache Spark v2.2.0, Python v2.7.3, Jupyter Notebook (PySpark), HDFS, Hive, Cloudera Impala, Cloudera HUE and Tableau.
-
Updated
May 4, 2018 - Jupyter Notebook
This project walks through how you can create recommendations using Apache Spark machine learning. There are a number of jupyter notebooks that you can run on IBM Data Science Experience, and there a live demo of a movie recommendation web application you can interact with. The demo also uses IBM Message Hub (kafka) to push application events to…
-
Updated
Apr 17, 2023 - Jupyter Notebook
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
-
Updated
Mar 20, 2024 - Python
Improve this page
Add a description, image, and links to the hadoop topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hadoop topic, visit your repo's landing page and select "manage topics."