1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
-
Updated
Jun 20, 2024 - Python
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
A multi-cloud framework for big data analytics and embarrassingly parallel jobs, that provides an universal API for building parallel applications in the cloud ☁️🚀
Graph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
ARAKAT - Big Data Analysis and Business Intelligence Application Development Platform
IBM Environmental Intelligence Geospatial python SDK
The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on writing pyspark code.
This project analyses and correlates student performance with different attributes. Then at last, it determines most suitable algorithm from bunch of them.
EpiData IoT Data Science Platform - Community Edition
Material de apoyo para cursos, Facultad de Minas, Universidad Nacional de Colombia
Plugin offering views, operators, sensors, and more developed at Pandora Media.
This is a repository containing my code samples that helped me understand the concepts of distributed storage and processing of Big data using Apache spark and Python.
Repositório criado para versionar o conteúdo das atividades práticas da disciplina de Projeto Interdisciplinar para Sistemas de Informação III (PISI III), ofertada pelo curso de Bacharelado em Sistemas de Informação da UFRPE.
Iot,Big Data Analytics using Apache-kafka,spark and other aws services
A model to recommend movies based on collaborative filtering (using ALS algorithm) and perform various analysis on the data.
This repository analyzes the Multivariate workload data of Google Cluster machines.
Gemini-Web Vulnerability Detection (G-WVD) detecting web application vulnerabilities with deep learning
Convert excel to parquet for quick loading into Hive table.
Add a description, image, and links to the big-data-analytics topic page so that developers can more easily learn about it.
To associate your repository with the big-data-analytics topic, visit your repo's landing page and select "manage topics."