Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟
Updated May 16, 2024 - Scala
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Unified SQL Analytics Engine Based on SparkSQL
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark | This repo has been contributed to Apache Kyuubi | The project has been migrated to Apache Kyuubi
A framework for rapid reporting API development, with out-of-the-box support for high-cardinality dimension lookups with Druid.
Smart Automation Tool for building modern Data Lakes and Data Pipelines
Write ETL using your favorite SQL dialects
Hive-JDBC-Proxy is a high-performance proxy service for HiveServer2 and Spark ThriftServer. It provides load balancing and rule-based forwarding of Hive JDBC Client requests to HiveServer2 and Spark ThriftServer.
An Apache Spark-based data flow (ETL) framework that supports multiple read and write destinations of different types, as well as multiple categories of transformation rules.
📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
This project tracks lineage when using Spark. Our team aims to improve the handling of column relations during logical plan analysis.
Huemul BigDataGovernance is a framework that runs on Spark, Hive, and HDFS. It supports implementing a corporate single-source-of-truth data strategy based on data governance best practices. It lets you implement tables with Primary Key and Foreign Key control when inserting and updating data through the library, null validation, …