# analytics

Here are 90 public repositories matching this topic...

Pyhaystack is a module that allows Python programs to connect to a haystack server (project-haystack.org). Connections can be established with the Niagara Platform running nhaystack, Skyspark, and Widesky. To use it with Anaconda IPython Notebook on Windows, install it by running "python setup.py install" from the Anaconda Command Prompt.…

  • Updated Feb 26, 2021
  • Python

Build a movie recommendation data pipeline using Azure services for efficient data ingestion, transformation, and orchestration. Utilize Azure Blob Storage, Azure Databricks, and Azure Data Factory to implement collaborative filtering and PySpark ML for accurate movie recommendations.

  • Updated Sep 30, 2023
  • Jupyter Notebook

This is the final project I completed for the Big Data Expert Program at U-TAD in September 2017. It uses the following technologies: Apache Spark v2.2.0, Python v2.7.3, Jupyter Notebook (PySpark), HDFS, Hive, Cloudera Impala, Cloudera HUE, and Tableau.

  • Updated May 4, 2018
  • Jupyter Notebook

This repository contains Python code for smart meter analytics. A building consumption dataset was obtained from Pecan Street Dataport, along with temperature and irradiance data. The dataset was used to build machine learning models using linear regression, random forest decision trees, neural networks, and support vector…

  • Updated Dec 22, 2018
  • Jupyter Notebook
