Big data training material
-
Updated
Jun 29, 2023 - Python
Big data training material
Simple service for bigquery streaming using google pub-sub
Research is fun!
Big Data project for ATS subject, basic parallel implementation of map-reduce paradigm with a test to count words in text files
This repository contains mapreduce extractors to preprocess and extract websites from the common crawl corpus.
A big data project to obtain real time relevant news from multiple websites.
Real-time retrieval of tweets and periodic updates of their feedback. Used for analysis of the effects of feedback on Spanish politicians.
Extracted data from Twitter using "tweepy" API. Explored both searching and streaming APIs. Finally, performed data cleaning and transformation.
Machine learning project on bike sharing demand for the Artificial Intelligence course of my bachelor.
The main focus of this thesis is to understand and predict the success of an actor, defining the success of an actors as the actor having been featured in a motion picture. The project was carried out using machine learning, data from IMDb website and python
Distributed Real Time Spam Classification using Apache Spark
Add a description, image, and links to the big-data topic page so that developers can more easily learn about it.
To associate your repository with the big-data topic, visit your repo's landing page and select "manage topics."