Skip to content

Commit

Permalink
[HUDI-5912] Update snapshot_exporter.md (#8142)
Browse files Browse the repository at this point in the history
Replace with the latest version
  • Loading branch information
clp007 committed Mar 15, 2023
1 parent 9b218d4 commit 7620e6b
Showing 1 changed file with 10 additions and 10 deletions.
20 changes: 10 additions & 10 deletions website/versioned_docs/version-0.13.0/snapshot_exporter.md
Original file line number Diff line number Diff line change
Expand Up @@ -28,10 +28,10 @@ query, perform any repartitioning if required and will write the data as Hudi, p
Exporter scans the source dataset and then makes a copy of it to the target output path.
```bash
spark-submit \
--jars "packaging/hudi-spark-bundle/target/hudi-spark-bundle_2.11-0.6.0-SNAPSHOT.jar" \
--jars "/opt/hudi-spark-bundle_2.12-0.13.0.jar" \
--deploy-mode "client" \
--class "org.apache.hudi.utilities.HoodieSnapshotExporter" \
packaging/hudi-utilities-bundle/target/hudi-utilities-bundle_2.11-0.6.0-SNAPSHOT.jar \
/opt/hudi-utilities-bundle_2.12-0.13.0.jar \
--source-base-path "/tmp/" \
--target-output-path "/tmp/exported/hudi/" \
--output-format "hudi"
Expand All @@ -42,12 +42,12 @@ The Exporter can also convert the source dataset into other formats. Currently o

```bash
spark-submit \
--jars "packaging/hudi-spark-bundle/target/hudi-spark-bundle_2.11-0.6.0-SNAPSHOT.jar" \
--jars "/opt/hudi-spark-bundle_2.12-0.13.0.jar" \
--deploy-mode "client" \
--class "org.apache.hudi.utilities.HoodieSnapshotExporter" \
packaging/hudi-utilities-bundle/target/hudi-utilities-bundle_2.11-0.6.0-SNAPSHOT.jar \
/opt/hudi-utilities-bundle_2.12-0.13.0.jar \
--source-base-path "/tmp/" \
--target-output-path "/tmp/exported/json/" \
--target-output-path "/tmp/exported/hudi/" \
--output-format "json" # or "parquet"
```

Expand All @@ -60,10 +60,10 @@ By default, if no partitioning parameters are given, the output dataset will hav
Example:
```bash
spark-submit \
--jars "packaging/hudi-spark-bundle/target/hudi-spark-bundle_2.11-0.6.0-SNAPSHOT.jar" \
--jars "/opt/hudi-spark-bundle_2.12-0.13.0.jar" \
--deploy-mode "client" \
--class "org.apache.hudi.utilities.HoodieSnapshotExporter" \
packaging/hudi-utilities-bundle/target/hudi-utilities-bundle_2.11-0.6.0-SNAPSHOT.jar \
/opt/hudi-utilities-bundle_2.12-0.13.0.jar \
--source-base-path "/tmp/" \
--target-output-path "/tmp/exported/json/" \
--output-format "json" \
Expand Down Expand Up @@ -105,11 +105,11 @@ After putting this class in `my-custom.jar`, which is then placed on the job cla

```bash
spark-submit \
--jars "packaging/hudi-spark-bundle/target/hudi-spark-bundle_2.11-0.6.0-SNAPSHOT.jar,my-custom.jar" \
--jars "/opt/hudi-spark-bundle_2.12-0.13.0.jar,my-custom.jar" \
--deploy-mode "client" \
--class "org.apache.hudi.utilities.HoodieSnapshotExporter" \
packaging/hudi-utilities-bundle/target/hudi-utilities-bundle_2.11-0.6.0-SNAPSHOT.jar \
/opt/hudi-utilities-bundle_2.12-0.13.0.jar \
--source-base-path "/tmp/" \
--target-output-path "/tmp/exported/json/" \
--output-format "json" \
--output-partitioner "com.foo.bar.MyPartitioner"
--output-partitioner "com.foo.bar.MyPartitioner"

0 comments on commit 7620e6b

Please sign in to comment.