Azure Databricks Notebook that assigs team members to customers based on a set of criteria
-
Updated
Jul 1, 2018 - Python
Azure Databricks Notebook that assigs team members to customers based on a set of criteria
Feature Engineering, Spark ML Random Forest Model, Log MLFlow, Streaming Data Source
Big data processing of news with Text Mining in Apache Spark through 3 fundamental processes: data preparation, searching based on the inverted index and grouping of news by similarity.
A solution for on-demand training and serving of Machine Learning models, using Azure Databricks and MLflow
Continuous Delivery tool for PySpark Notebooks based jobs on Databricks
Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and more.
Jupybricks is a python package that allows databricks developers to switch easily between local development in jupyter notebooks and databricks notebooks. 🔨 convert databricks .py files into jupyter notebooks . 🔨 convert local jupyter notebooks to databricks .py files
Accelerator code for an anomaly detection module leveraging Databricks for use as part of a Network Threat Detection System
Códigos em spark utilizados no dia a dia para manipulação de dados desde a ingestão até o refinamento.
Repository to develop databricks notebooks suitable for use in a Azure DevOps environment, and space for refactoring existing ETL design
An introduction to PySpark, Creating a simple multi regression ML model and hosting it on a databricks cluster
Accelerator code for advertising spend optimization, leveraging Databricks
End-To-End-Solution-DataEngineering-FinalProject
Databricks provides a unified, open platform for all your data. It empowers data scientists, data engineers and data analysts with a simple collaborative environment to run interactive and scheduled data analysis workloads.
Databricks notebook that integrates data from Microsoft Dataverse to Databricks Delta table, including the schema inference
Code accelerator to migrate data from Snowflake tables into Databricks Delta Live Tables
A simple commandline application to keep in sync between databricks and your local filesystem.
Introducing Delta-Buddy: Your ultimate Delta Lake companion! 🚀 Streamline your data journey with an AI-powered chatbot. Ask Delta-Buddy anything about your Delta Lake.
A bilingual ChatBot created to answer to FAQs asked to the Colombian Institute for Family Welfare (ICBF).
The project harnessed an ETL multi-hop architecture, ingesting data from the Ergast API into a storage backed by Azure Data Lake. The process involved weekly ingestion of bronze layer data as cutover and delta files. Raw data, in varied formats, was transformed using Azure Databricks PySpark notebooks into enriched Silver and Gold layers.
Add a description, image, and links to the databricks-notebooks topic page so that developers can more easily learn about it.
To associate your repository with the databricks-notebooks topic, visit your repo's landing page and select "manage topics."