-
Updated
Jun 7, 2024 - Java
datawarehouse
Here are 395 public repositories matching this topic...
A project for creating a Data Warehouse, designing the ETL process, creating visualizations on Power BI and creating data mining models
-
Updated
Jun 7, 2024 - TSQL
A free to use dbt package for creating and loading Data Vault 2.0 compliant Data Warehouses (powered by dbt, an open source data engineering tool, registered trademark of dbt Labs)
-
Updated
Jun 7, 2024
Service for bulk-loading data to databases with automatic schema management (Redshift, Snowflake, BigQuery, ClickHouse, Postgres, MySQL)
-
Updated
Jun 7, 2024 - Go
Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.
-
Updated
Jun 7, 2024 - Java
Computer Science and Engineering (CSE) is a multidisciplinary field that combines elements of computer science and computer engineering to design, develop, and maintain computer systems and software. It is a rapidly evolving field that plays a crucial role in shaping the modern world.
-
Updated
Jun 5, 2024 - Jupyter Notebook
Repository for tutorials, information and notes on technology in general.
-
Updated
Jun 4, 2024 - Python
DIRECT, the Data Integration Run-time Execution Control Tool, is a data logistics control framework that can be used to monitor, log, audit and control data integration / ETL processes.
-
Updated
Jun 4, 2024 - TSQL
Docker를 사용하여 Hadoop 생태계의 구성 요소와 기타 필수 서비스를 컨테이너화하여 강력한 데이터 엔지니어링 환경을 설정하는 방법을 보여줍니다. 설정에는 Hadoop (HDFS, YARN), Apache Hive, PostgreSQL 및 Apache Airflow가 포함되며, 이들 모두가 원활하게 작동하도록 구성되어 있습니다.
-
Updated
May 29, 2024 - Shell
This repository contains a collection of Databases projects and code samples showcasing my skills and experience in SQL-PostgreSQL development. It serves as a portfolio to demonstrate my proficiency in various aspects of Database programming. Mostly, includes tasks about SQL, PostgreSQL and GIS.
-
Updated
May 27, 2024 - PLpgSQL
The project aims to enhance NLP capabilities for Amharic Language by developing a data corpus for various NLP applications. The project involves collecting, cleaning, processing data, developing APIs, and automating the pipeline.
-
Updated
May 30, 2024 - Jupyter Notebook
Hydra: Column-oriented Postgres. Add scalable analytics to your project in minutes.
-
Updated
May 21, 2024 - C
Developed a robust ETL pipeline for Next Cola Pvt. Ltd data which extracts data from many different OLTP sources, converts them into dimensions and facts and load into datawarehouse for analytical workload.
-
Updated
May 21, 2024 - Python
A Data Warehouse project based on Microsoft Northwind Database.
-
Updated
May 17, 2024
This project outlines the final project requirements for DAV6100 - Information Architectures, focusing on group assignments, scoring criteria, topic selection, core requirements, and project components such as design, development, visualization, and executive presentation.
-
Updated
May 14, 2024 - HTML
Roadmap for Data Engineering
-
Updated
May 9, 2024 - Java
An open source and free to use generic (basic) Microsoft SQL Server data warehouse
-
Updated
May 7, 2024 - TSQL
Soccer Players Data Analyst and Similar Players Finder
-
Updated
May 6, 2024 - Jupyter Notebook
Improve this page
Add a description, image, and links to the datawarehouse topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the datawarehouse topic, visit your repo's landing page and select "manage topics."