Refactor the Hudi-based Spark library.
1. CDC Ingestion
   - Binary Log Ingestion (MySQL)
2. Document Ingestion
   - MongoDB
   - Elasticsearch
3. File Ingestion
   - Excel
4. RDB Ingestion
   - JDBC
Build a module with Maven, replacing [module] with the name of the module to build:
mvn clean package -pl [module] -am -Dmaven.test.skip=true
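As a rough sketch of how the build command above might be filled in for one module: the module name `cdc-ingestion` below is hypothetical and is not taken from this repository, so substitute an actual module directory. The script only assembles and prints the command so that it can be checked before it is run.

```shell
# Hypothetical module name (an assumption); substitute a module directory from this repository.
MODULE="cdc-ingestion"

# -pl builds only the listed module, -am also builds the modules it depends on,
# and -Dmaven.test.skip=true skips compiling and running the tests.
BUILD_CMD="mvn clean package -pl ${MODULE} -am -Dmaven.test.skip=true"

# Print the command for review; to run it, use: eval "${BUILD_CMD}"
echo "${BUILD_CMD}"
```

Because of `-am`, any modules that the selected module depends on are built first, so the command should work from a clean checkout without a prior full build.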
The library currently supports the following versions of components:
- Scala: 2.12.x
- Spark: 3.1.x
- Hudi: 0.9.0
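For trying things out interactively, a Spark shell with matching versions might be launched roughly as sketched below. This is an assumption about a typical Hudi setup, not a documented step of this library: the Spark patch release 3.1.2, the `spark-avro` package, and the Kryo serializer setting follow common Hudi usage rather than anything stated above. The script only prints the launch command.

```shell
# Versions from the list above; the 3.1.2 patch release is an assumption.
SCALA_BINARY="2.12"
SPARK_VERSION="3.1.2"
HUDI_VERSION="0.9.0"

# Maven coordinates of the Hudi Spark 3 bundle and the matching spark-avro module.
PACKAGES="org.apache.hudi:hudi-spark3-bundle_${SCALA_BINARY}:${HUDI_VERSION},org.apache.spark:spark-avro_${SCALA_BINARY}:${SPARK_VERSION}"

# Print the launch command so it can be reviewed before it is run.
echo "spark-shell --packages ${PACKAGES} --conf spark.serializer=org.apache.spark.serializer.KryoSerializer"
```

Keeping the Scala binary version in one variable helps avoid mixing `_2.11` and `_2.12` artifacts, which is a common cause of classpath errors.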