#

hive-metastore

Here are 66 public repositories matching this topic...

ExpediaGroup / waggle-dance

Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.

hive federation metastore hive-metastore oss-portal-listed

Updated Jun 2, 2025
Java

ExpediaGroup / circus-train

Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.

bigquery big-data hive replication s3 replicate-data hive-metastore hive-table

Updated Mar 5, 2024
Java

aws-samples / aws-dbs-refarch-datalake

Reference Architectures for Datalakes on AWS

glue amazon-emr data-transformation data-lake data-catalog data-analytics hive-metastore emr-cluster ingest-data

Updated May 13, 2020
HTML

naushadh / hive-metastore

Apache Hive Metastore as a Standalone server in Docker

docker spark presto trino hive-metastore localstack

Updated Aug 22, 2024
Python

dominikhei / Local-Data-LakeHouse

Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testing.

data-lake minio trino hive-metastore apache-iceberg lakehouse data-lakehouse

Updated Sep 2, 2023
Dockerfile

hive-metastore-client

quintoandar / hive-metastore-client

A client for connecting and running DDLs on hive metastore.

python package hive etl data-engineering hive-metastore-client metastore hive-metastore ddls

Updated Mar 20, 2024
Thrift

beekeeper

ExpediaGroup / beekeeper

Service for automatically managing and cleaning up unreferenced data

java big-data hive s3 maintenance cleanup metastore hive-metastore oss-portal-featured

Updated May 28, 2025
Java

gmrqs / lasagna

A Docker Compose template that builds a interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive Metastore, Trino and Kafka

docker spark jupyter docker-compose pyspark minio spark-streaming jupyterlab trino hive-metastore

Updated Dec 19, 2024
Jupyter Notebook

apache-spark-docker

Wittline / apache-spark-docker

Dockerizing an Apache Spark Standalone Cluster

docker apache-spark hive docker-compose pyspark hdfs hadoop-cluster hue hadoop-docker dataengineering hive-metastore dataengineer

Updated Jun 29, 2022
VBA

thanhENC / e2e-data-platform

End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)

airflow spark docker-compose end-to-end data-platform dbt data-pipeline trino hive-metastore adventureworks delta-lake lightdash

Updated Oct 14, 2024
Python

san089 / Cloudera_Material

Cloudera_Material: Study Material to help people preparing for Cloudera CCA Spark and Hadoop Developer Exam (CCA175). Feel free to collaborate.

big-data spark hive hadoop bigdata cloudera pyspark cca flume certification sqoop cca175 hive-metastore sqoop-session sqoop-export sqoop-import

Updated Apr 21, 2020

apiary

ExpediaGroup / apiary

Apiary provides modules which can be combined to create a federated cloud data lake

aws hive datalake hive-metastore

Updated Apr 3, 2024

harrydevforlife / building-lakehouse

Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize and recommend app.

python airflow spark s3 metabase minio dbt flask-api hive-metastore delta-lake lakehouse

Updated Apr 20, 2024
Python

GoogleCloudPlatform / datacatalog-connectors-hive

Sample code with integration between Data Catalog and Hive data source.

python hive analytics gcp data-warehouse metadata-management hive-metastore apache-atlas datacatalog

Updated Jan 29, 2025
Python

ExpediaGroup / shunting-yard

Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.

big-data hive replication replicate-data hive-metastore hive-table circus-train

Updated Oct 11, 2021
Java

UrbanOS-Public / kdp

Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store

kubernetes minio prestodb hive-metastore

Updated Oct 20, 2022
Dockerfile

cloudera-labs / hms-mirror

"hms-mirror" is a utility used to bridge the gap between two clusters and migrate hive metadata.

hive hive-metastore

Updated May 27, 2025
Java

akolb1 / gometastore

Go Client for Hive Metastore

go golang hive rest-api thrift rest-client hms hive-metastore-client metastore hive-metastore

Updated Dec 18, 2022
Go

criccomini / hive-metastore-standalone

Apache Hive Metastore in Standalone Mode With Docker

docker presto hive hadoop prestodb trino hcatalog hive-metastore github-workflow github-workflows trinodb

Updated Jul 22, 2024
Dockerfile

criccomini / pymetastore

A Python Client for Hive Metastore

python hive thrift data-engineering hcatalog hive-metastore

Updated Dec 19, 2023
Python

Improve this page

Add a description, image, and links to the hive-metastore topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the hive-metastore topic, visit your repo's landing page and select "manage topics."