Basic sentiment analysis of real-time tweets with the help of Spark’s streaming APIs
-
Updated
Feb 24, 2018 - Python
Basic sentiment analysis of real-time tweets with the help of Spark’s streaming APIs
A Real-Time Tweet Streaming Pipeline (Using Flume, Kafka & Spark-Streaming) with Deep Learning Sentiment Analysis Model for instant scoring.
Spark Streaming using Flume (pushed based Approach)
Apache Spark Streaming application which reads web log data and monitors for 404 errors.
Analyzing real-time data using Apache Kafka & Spark Streaming
Engineered a data pipeline on GCP for a mock game development company, to track player activity in guilds and in-game purchases, using Docker and streaming events from a Flask app through Kafka, PySpark filtering, Cloudera storage, and Presto queries.
Health Information Application
Innovative movie recommendation project using MovieLens data. This project aims to integrate Big Data analysis and machine learning to improve movie suggestions, using Apache Spark, Elasticsearch and a Flask API, to deliver a personalized and dynamic user experience.
Smart City Realtime Data Engineering Project
Exercise Solution for the course big and linked data(BLD)
This repository represents several projects completed in IE HST's MS in Business Analytics and Big Data's Stream Processing Analytics course.
This is the project 3 which is to build a real-time data pipeline using Apache Kafka and Spark Streaming
Apache Spark 3 - Spark Programming in Python for Beginners
All topics related to data streaming and real-time analysis
PySpark is a Python API for support Python with Spark. Whether it is to perform computations on large datasets or to just analyze them
ETL Pipeline Project
An on-time performance management system for airlines using Spark and Kafka streaming
How to build a complete Data Platform -> Here
Add a description, image, and links to the spark-streaming topic page so that developers can more easily learn about it.
To associate your repository with the spark-streaming topic, visit your repo's landing page and select "manage topics."