hive

全国大数据竞赛三等奖解决方案，省赛二等奖解决方案。一键安装大数据环境脚本，自动部署集群环境，包括zookeeper、hadoop、mysql、hive、spark以及一些基础环境。已通过实际服务器测试，效果极佳，仅需要输入密码等少量人为干预。解放安装部署配置所需人力。并添加若干scala案例，结合spark用以进行数据准备。

mysql shell scala spark hive hadoop bigdata zookeeper hdfs wordcount

Updated Sep 26, 2024
Scala

sparsecode / DaFlow

Star

Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.

json scala csv apache-spark hive hadoop avro etl parquet transformation-rules etl-framework etl-pipeline join-data

Updated Jun 7, 2021
Scala

Renien / ETL-Starter-Kit

Star

📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.

scala hive gradle bigdata datascience pig scalding azkaban datamining starter-project etl-framework mapreduce-jobs

Updated Mar 20, 2017
Scala

frankyu8 / ushas

Star

This project is used for tracking lineage when using spark. Our team is aimed at enhancing the ability of column relation during logical plan analysis.

metadata spark hive lineage column-lineage

Updated Jan 7, 2022
Scala

LoveNui / Customer-Viewership-Realtime-Analysis

Star

kafka spark hive hadoop etl hdfs nifi crm-analytics

Updated Jul 19, 2023
Scala

HuemulSolutions / huemul-bigdatagovernance

Star

Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, la…

Updated Apr 21, 2023
Scala

Improve this page

Add a description, image, and links to the hive topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hive topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hive

Here are 79 public repositories matching this topic...

geekyouth / SZT-bigdata

apache / kyuubi

Qihoo360 / XSQL

yaooqinn / spark-authorizer

51zero / eel-sdk

yahoo / maha

smart-data-lake / smart-data-lake

qubole / spark-acid

pkeropen / BigData-News

yaooqinn / itachi

haozhang-x / spark-waimai

ymericson / big-data-streets

SharpData / SharpETL

wushengyeyouya / Hive-JDBC-Proxy

ZongXR / BigData-Competition

sparsecode / DaFlow

Renien / ETL-Starter-Kit

frankyu8 / ushas

LoveNui / Customer-Viewership-Realtime-Analysis

HuemulSolutions / huemul-bigdatagovernance

Improve this page

Add this topic to your repo