Skip to content

docs(blog): minor update on 2025 Hudi blog#17753

Merged
xushiyan merged 1 commit intoapache:asf-sitefrom
xushiyan:minor-update
Dec 30, 2025
Merged

docs(blog): minor update on 2025 Hudi blog#17753
xushiyan merged 1 commit intoapache:asf-sitefrom
xushiyan:minor-update

Conversation

@xushiyan
Copy link
Member

Minor wording update

Copilot AI review requested due to automatic review settings December 30, 2025 16:15
@xushiyan xushiyan merged commit dc2b238 into apache:asf-site Dec 30, 2025
5 checks passed
Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR makes minor wording improvements to the Apache Hudi 2025 year-in-review blog post. The changes enhance clarity and precision by improving tense consistency, removing redundant words, and correcting terminology.

  • Improved tense consistency ("was" → "has been")
  • Removed redundant descriptors ("growing", "expanding")
  • Updated PMC terminology from "joined" to "was elected to"
  • Improved sentence structure for better readability

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Performance saw major gains across the board. Parquet binary copy for clustering delivered 10-15x faster execution with 95% compute reduction. Apache Flink writer achieved 2-3x improved throughput with Avro conversion eliminated in the write path. Apache Spark metadata-table streaming ran ~18% faster for update-heavy workloads. Indexing enhancements — partitioned record index, partition-level bucket index, HFile caching, and Bloom filters — delivered up to 4x speedup for lookups on massive tables.

Spark 4.0 and Flink 2.0 support were added. [Apache Polaris (Incubating)](https://polaris.apache.org/) catalog integration enabled multi-engine queries with unified governance. Operational simplicity improved with storage-based locking that eliminated external dependencies. New merge modes replaced legacy payload classes with declarative options, and SQL procedures enhanced table management directly in Spark SQL. See more details in the [release blog](https://hudi.apache.org/blog/2025/11/25/apache-hudi-release-1-1-announcement/).
Spark 4.0 and Flink 2.0 support was added. [Apache Polaris (Incubating)](https://polaris.apache.org/) catalog integration enabled multi-engine queries with unified governance. Operational simplicity improved with storage-based locking that eliminated external dependencies. New merge modes replaced legacy payload classes with declarative options, and SQL procedures enhanced table management directly in Spark SQL. See more details in the [release blog](https://hudi.apache.org/blog/2025/11/25/apache-hudi-release-1-1-announcement/).
Copy link

Copilot AI Dec 30, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The change from "were added" to "was added" may be grammatically incorrect. The subject "Spark 4.0 and Flink 2.0 support" is compound (two items), which typically takes a plural verb form. "were added" would be the grammatically correct choice unless the phrase is intended to refer to a single unified support feature.

Suggested change
Spark 4.0 and Flink 2.0 support was added. [Apache Polaris (Incubating)](https://polaris.apache.org/) catalog integration enabled multi-engine queries with unified governance. Operational simplicity improved with storage-based locking that eliminated external dependencies. New merge modes replaced legacy payload classes with declarative options, and SQL procedures enhanced table management directly in Spark SQL. See more details in the [release blog](https://hudi.apache.org/blog/2025/11/25/apache-hudi-release-1-1-announcement/).
Spark 4.0 and Flink 2.0 support were added. [Apache Polaris (Incubating)](https://polaris.apache.org/) catalog integration enabled multi-engine queries with unified governance. Operational simplicity improved with storage-based locking that eliminated external dependencies. New merge modes replaced legacy payload classes with declarative options, and SQL procedures enhanced table management directly in Spark SQL. See more details in the [release blog](https://hudi.apache.org/blog/2025/11/25/apache-hudi-release-1-1-announcement/).

Copilot uses AI. Check for mistakes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants