Module 22 challenge: Using Google Colab to work on Big Data queries with PySpark SQL, parquet, and cache partitions
-
Updated
Jun 1, 2024 - Jupyter Notebook
Module 22 challenge: Using Google Colab to work on Big Data queries with PySpark SQL, parquet, and cache partitions
Certified training by iTrainAsia on Big Data Analytics, EDA, Data Storytelling, Machine Learning
Ce dépôt GitHub regroupe tous les cours, TP, TD, projets, et exercices de ma formation en master en mathématiques appliquées pour la science des données. Parcourez-le pour une vue complète de mon parcours académique, offrant une perspective détaillée de mon apprentissage dans ce domaine.
💾 Welcome to the Big Data Analytics Repository! 📚✨ Immerse yourself in a carefully curated reservoir of knowledge on Big Data Analytics. 🌐💡 Explore the intricacies of deriving insights from vast datasets and navigating powerful analytics tools. 🚀🔍
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
The binary build of LEO CDP Free Edition for training purposes
A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀
A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀
Visual, interactive queries against big databases
vineyard (v6d): an in-memory immutable data manager. (Project under CNCF, TAG-Storage)
📘 FIWARE 305: Real-time Processing of Context Data using Apache Flink
📘 FIWARE 306: Real-time Processing of Context Data using Apache Spark
a suite of benchmark applications for distributed data stream processing systems
This repository contains the final project for the Rakamin Big Data Analytics Internship. It include a complete dashboard of Kimia Farma's sales performance analysis from 2020 to 2023.
open source tools for interaction with IBM PAIRS:
TIL(=Today I learned.)
Big Data Project - CMP2024 - Computer Engineering - Cairo University
The project aimed to explore and visualize the daily variation of "ECON_STOCKMARKET" topics distributed worldwide throughout the year. By examining the data, the objective was to gain insights into how discussions and events related to the stock market were dispersed across different countries on a day-to-day basis.
O objetivo deste trabalho é explorar as capacidades de arquiteturas de bancos de dados distribuídos para lidar com conjuntos de dados complexos, em particular, o "Relatório de Saldo Mensal da Conta", que apresenta todos os Saldos Mensais das Contas dos clientes entre Jan/2020 e Dez/2020.
SSVC Ore Miner - www.rapticore.com
Add a description, image, and links to the big-data-analytics topic page so that developers can more easily learn about it.
To associate your repository with the big-data-analytics topic, visit your repo's landing page and select "manage topics."