Skip to content
#

datawarehouse

Here are 80 public repositories matching this topic...

后端学习笔记,本项目存放了一些我阅读有关的技术类的书籍和部分源码阅读的笔记整理。 涉及范围包括后端开发中的计算机学科基础知识、高级语言的基础知识、源码阅读笔记、数据库知识、数据挖掘知识等,同时也会涉及到一些具体生产场景中会遇到的一些实际问题。 :-D

  • Updated Oct 12, 2019
  • Python

An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS application using Apache Airflow as an orchestration tool and various data warehouse technologies and finally using Apache Superset to connect to DWH for generating BI dashboards for weekly reports

  • Updated Dec 7, 2022
  • Python

This project promulgates an automated end-to-end ML pipeline that trains a biLSTM network for sentiment analysis, experiment tracking, benchmarking by model testing and evaluation, model transitioning to production followed by deployment into cloud instance via CI/CD

  • Updated Feb 1, 2023
  • Python

Improve this page

Add a description, image, and links to the datawarehouse topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the datawarehouse topic, visit your repo's landing page and select "manage topics."

Learn more