Skip to content
forked from apache/hudi

Upserts And Incremental Processing on Big Data

License

Notifications You must be signed in to change notification settings

thaiphamquoc/hudi

 
 

Repository files navigation

Hudi

Hudi (pronounced Hoodie) stands for Hadoop Upserts anD Incrementals. Hudi manages storage of large analytical datasets on HDFS and serve them out via two types of tables

  • Read Optimized Table - Provides excellent query performance via purely columnar storage (e.g. Parquet)
  • Near-Real time Table (WIP) - Provides queries on real-time data, using a combination of columnar & row based storage (e.g Parquet + Avro)

For more, head over here

About

Upserts And Incremental Processing on Big Data

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Java 95.6%
  • Scala 3.3%
  • Other 1.1%