Supporting material for courses, Facultad de Minas, Universidad Nacional de Colombia
Updated Nov 4, 2024 - Python
One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
Contains the code for my article on Medium, which provides a comprehensive guide to setting up, packaging, and running PySpark projects.
A multi-cloud framework for big data analytics and embarrassingly parallel jobs that provides a universal API for building parallel applications in the cloud ☁️🚀
Real-time YouTube comment sentiment analysis using Kafka, Spark, and a Streamlit dashboard.
YiraBot: Simplifying Web Scraping for All. A user-friendly tool for developers and enthusiasts, offering command-line ease and Python integration. Ideal for research, SEO, and data collection.
This project builds a scalable log analytics pipeline using the Lambda architecture for real-time and batch processing of NASA server logs.
SPOTIFY - Big Data Analysis w/ Spark
SocialSituSecu is a project exploring social network security, computing, and intelligence based on social situational metadata. It is sponsored by National Natural Science Foundation of China Grant No.61972133, and Project of Leading Talents in Science and Technology Innovation for Thousands of People Plan in Henan Province Grant No.204200…
This project utilizes big data analytics, machine learning, and statistical methods to identify and classify adverse effects of COVID-19 vaccinations. By analyzing large datasets, it aims to uncover patterns and correlations, providing valuable insights into vaccine safety and efficacy.
This repository contains big data analytics projects using Apache Spark, SQL, and machine learning models.
Logistic regression modeling of swing state voter turnout to support U.S. political campaign proposals
A credit card fraud detection system that sends transaction data to a Kafka topic and processes it to detect fraud using predefined rules or a machine learning model, triggering alerts for fraudulent transactions.
⏱ Real-Time Sentiment Analysis using PySpark and simulation of Twitter/X API using FastAPI
SSVC Ore Miner - www.rapticore.com
Gemini-Web Vulnerability Detection (G-WVD): detecting web application vulnerabilities with deep learning
Return to Using Python
Repository for the Big Data Specialization from University of California San Diego on Coursera