A data pipeline management platform
-
Updated
Dec 9, 2022 - JavaScript
A data pipeline management platform
Linux pipe like concept implementation in go
POC in Apache Kafka and Spark Streaming using Avro serialization.
This personalized news is a complex web application to read personalized news. Major components includes React+Express frontend-backend, message queue based data pipelines, news topic classifiers, user preference predictions, and etc.
PostgreSQL, Data Modeling, Star Schema, ETL, Data Engineering
Latency Estimation for Neural Network Architecture
ETL pipeline with AWS Redshift orchestrated with Airflow
Codes for data flow between models, data post-process, and visualization
Create Data Pipeline with Apache Airflow for Sparkify Datasets.
Udacity Data Engineering Nanodegree - Project #2
An end to end data pipeline with Kafka Spark Streaming Integration
Short course: Introduction to Machine Learning
Transformation airbnb data set using dbt and snowflake, then visualizing data using preset
Data pipeline to gather data from chess website APIs using Airflow.
Исследование продаж компьютерных игр
An end-to-end data pipeline deployed on GCP that extracts cryptocurrency data for analytics.
Convolutional Neural Network capable of detecting brain tumors and respective locations from 5712 MRI brain scans
This project is a specialized Library Management System (LMS) built using MYSQL as the backend database. The database schema is designed to ensure data integrity and consistency, with tables storing information about users, books, transactions, staff.
A cutting-edge big data initiative aimed at creating a real-time data pipeline to analyze the popularity and sentiments of trending topics on Twitter.
The mini project for the course Database Technologies. The task is to take in data via a pipeline built using spark-streaming and kafka, and store the processed data into a SQLite database for further manipulation
Add a description, image, and links to the data-pipeline topic page so that developers can more easily learn about it.
To associate your repository with the data-pipeline topic, visit your repo's landing page and select "manage topics."