jysan bank
- Almaty, Kazakhstan
- https://stackoverflow.com/users/story/898042
Stars
📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.
This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.
Notes talking about the design and implementation of Apache Spark
This repo contains "Databricks Certified Data Engineer Professional" Questions and related docs.
Roadmap for Data Engineers. The goal of the roadmap is to help you get a job!
This repository will contain all of the resources for the Mage component of the Data Engineering Zoomcamp: https://github.com/DataTalksClub/data-engineering-zoomcamp/tree/main
My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggregates Twitter and US stock market data for user sentiment anal…
100+ Python challenging programming exercises
Practice your pandas skills!
An example project that demonstrates real-time big data stream processing using GigaSpaces
Data Engineering pet project covering GCP, Docker, workflow orchestration with Mage, data transformation with dbt, and batch processing via Spark
Pipeline that extracts data from Crinacle's Headphone and InEarMonitor databases and finalizes data for a Metabase Dashboard. The dashboard is then used to support a purchasing decision of which He…
The smart city reference pipeline shows how to integrate various media building blocks, with analytics powered by the OpenVINO™ Toolkit, for traffic or stadium sensing, analytics and management tasks.
This project shows how to capture changes from a PostgreSQL database and stream them into Kafka
Learn how to design, develop, deploy and iterate on production-grade ML applications.
My solutions to the book "A Collection of Data Science Take-Home Challenges"
Sample project to demonstrate data engineering best practices
DataTalks.Club's Data Engineering Zoomcamp Project
Final Project of the MLOps Zoomcamp hosted by DataTalksClub.
DataTalks.Club's Data Engineering Zoomcamp Project
A repo to track data engineering projects
A batch data pipeline that retrieves data from a user purchase table and a movie review table and transforms it into a user behaviour metric table.
A project portfolio to accompany my resume
Data Engineering Project in GCP