Aplicação de regex para validação de nomes em spark
-
Updated
Nov 25, 2022 - Python
Apache Spark is an open source distributed general-purpose cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
Aplicação de regex para validação de nomes em spark
Trying best case apache spark working environment for robust data pipelines
This repository contains a simple Flask application that serves as a customer feedback form. The submitted data is sent to a Kafka topic. The Kafka consumer, implemented as a Spark application, processes the data and writes it to a Cassandra table for further analysis.
Pyspark studies.
An on-time performance management system for airlines using Spark and Kafka streaming
Project that captures information about all Dark Souls 3 (DS3) weapons and performs textual analysis on.
A python program written for Spark clusters that utilizes the big data processing capabilities of the Apache Spark engine by using the sentiment analysis technique.
Finding the right mutual fund using Spark ML
IMDB Spark Project for "Prof2IT: Python for Big Data and Data Science" course by Kharkiv IT Cluster and Grid Dynamics.
Exploring streaming design patterns with Kafka and Spark Structural Streaming
Created by Matei Zaharia
Released May 26, 2014