
feat(datafusion): add TRUNCATE TABLE and DROP PARTITION SQL support#292

Merged
JingsongLi merged 2 commits into apache:main from JingsongLi:truncate
Apr 28, 2026

Conversation

@JingsongLi
Contributor

Purpose

Wire existing TableCommit::truncate_table() and truncate_partitions() APIs to the DataFusion SQL layer, supporting:

  • TRUNCATE TABLE db.t
  • TRUNCATE TABLE db.t PARTITION (col = val, ...)
  • ALTER TABLE db.t DROP PARTITION (col = val, ...)


Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

@jerry-024 jerry-024 left a comment


Three must-fix items, posted inline. The HIGH one (partial partition spec) is silent wrong-data deletion, not just a behavior nuance.

```rust
}

let field = field_map.get(col_name.as_str()).ok_or_else(|| {
    DataFusionError::Plan(format!("Column '{col_name}' not found in table schema"))
```


HIGH: silent wrong-partition deletion on incomplete specs.

This builds `partition` only from keys the user supplied. The map flows through `TableCommit::truncate_partitions` into `partitions_to_bytes`:

```rust
let datum = p.get(key).cloned().flatten();  // missing key → None
```

None is encoded into the partition filter as IS NULL. So on a (dt, region) partitioned table, DROP PARTITION (dt = '2026-04-28') becomes a filter dt='2026-04-28' AND region IS NULL — it does not delete all dt='2026-04-28' partitions like Hive/Spark prefix semantics would. If a NULL-region partition exists, it gets deleted (silent wrong-target). If not, silent no-op. Either way, user intent is mishandled with no error.
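The collapse described above can be reproduced in isolation. `resolve` below is a hypothetical stand-in for the lookup inside `partitions_to_bytes`, not the actual code; it shows that an omitted key and an explicitly NULL key become indistinguishable:

```rust
use std::collections::HashMap;

// Stand-in for the lookup in partitions_to_bytes: a missing key and an
// explicit NULL both collapse to None, which downstream encodes as IS NULL.
fn resolve(spec: &HashMap<String, Option<String>>, key: &str) -> Option<String> {
    spec.get(key).cloned().flatten()
}

fn main() {
    // User wrote DROP PARTITION (dt = '2026-04-28') on a (dt, region) table.
    let spec = HashMap::from([("dt".to_string(), Some("2026-04-28".to_string()))]);
    assert_eq!(resolve(&spec, "dt"), Some("2026-04-28".to_string()));
    // `region` was never supplied, yet it resolves exactly like region = NULL:
    assert_eq!(resolve(&spec, "region"), None);
}
```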

Two acceptable fixes:

  • Reject incomplete specs explicitly: after the loop, if `partition.len() != partition_keys.len()`, return an `Err` listing the missing keys.
  • Implement true prefix-match deletion: requires a partition-filter primitive that matches on a subset of keys; not just a downstream-API tweak.

For reference, Java's Spark PaimonPartitionManagement.toPaimonPartitions does require(partitionFieldCount <= partitionKeys.length) and projects the partition row type so prefix specs work as Hive expects.
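A minimal sketch of the first fix, under assumed names: `check_complete_spec` is hypothetical, and plain `String` errors stand in for `DataFusionError::Plan`:

```rust
// Sketch of "reject incomplete specs": compare the supplied partition
// columns against the table's full partition key list and name what is
// missing, rather than silently encoding missing keys as NULL.
fn check_complete_spec(
    supplied: &[&str],
    partition_keys: &[&str],
) -> Result<(), String> {
    let missing: Vec<&str> = partition_keys
        .iter()
        .filter(|k| !supplied.contains(k))
        .cloned()
        .collect();
    if missing.is_empty() {
        Ok(())
    } else {
        Err(format!(
            "DROP PARTITION requires all partition keys; missing: {}",
            missing.join(", ")
        ))
    }
}

fn main() {
    assert!(check_complete_spec(&["dt", "region"], &["dt", "region"]).is_ok());
    let err = check_complete_spec(&["dt"], &["dt", "region"]).unwrap_err();
    assert!(err.contains("region"));
}
```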

```rust
    .map_err(to_datafusion_error)?;

let wb = table.new_write_builder();
let commit = wb.new_commit();
```


MEDIUM: empty PARTITION () clause falls through to a full-table truncate.

If truncate.partitions is Some(empty_vec) (whatever AST shape sqlparser produces for a degenerate parser edge case), this branch is skipped and execution drops into commit.truncate_table() below — the entire table is wiped despite the user writing a PARTITION clause. Opposite of intent.

Suggest tightening:

```rust
if let Some(partitions) = &truncate.partitions {
    if partitions.is_empty() {
        return Err(DataFusionError::Plan(
            "PARTITION clause requires at least one column = value".to_string(),
        ));
    }
    // ... existing branch
    return ok_result(...);
}
commit.truncate_table().await?;
```

This way only TRUNCATE TABLE t (no PARTITION clause at all, i.e. truncate.partitions is None) reaches truncate_table().

```rust
}
}
// DropPartitions is a data operation (not a schema change), so we handle it
// separately and return early — it cannot be combined with schema changes.
```


MEDIUM: if_exists dropped at three levels — none honored.

  1. Inner AlterTableOperation::DropPartitions { if_exists: _ } (this match arm): partition-level IF EXISTS is explicitly bound and discarded. The doc comment claims this "matches IF EXISTS semantics" because the underlying overwrite is a no-op on missing partitions (verified — truncate_partitions early-returns on empty resolved entries). But that makes plain DROP PARTITION behave identically to DROP PARTITION IF EXISTS. Spark's AlterTableDropPartitionCommand errors when IF EXISTS is omitted and the partition doesn't exist — this PR diverges silently.

  2. Outer Statement::AlterTable { if_exists, .. }: returning early from this branch into handle_drop_partitions skips whatever outer IF EXISTS handling the rest of handle_alter_table does. ALTER TABLE IF EXISTS missing_table DROP PARTITION (...) will fail with a hard error from catalog.get_table(...) instead of being a silent no-op.

  3. Statement::Truncate { if_exists, .. } (handler at line 570): sqlparser parses the flag (confirmed in sqlparser::ast::Truncate), but handle_truncate_table never reads it. TRUNCATE TABLE IF EXISTS missing_table errors instead of no-oping.

Suggested fix: at each layer, read the flag, and on get_table returning NotFound (or by list_tables first), short-circuit to ok_result when if_exists is set, otherwise propagate the error.
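A sketch of that short-circuit at one layer, with the catalog lookup reduced to a boolean and all names hypothetical (the real code would match on the catalog's NotFound error from `get_table`):

```rust
// Hypothetical sketch of honoring IF EXISTS: a missing table becomes a
// successful no-op when the flag is set, and an error otherwise.
#[derive(Debug, PartialEq)]
enum CatalogError {
    NotFound,
}

fn handle_truncate(
    table_exists: bool, // stand-in for catalog.get_table(...) succeeding
    if_exists: bool,
) -> Result<&'static str, CatalogError> {
    if !table_exists {
        if if_exists {
            // Short-circuit to the empty OK result instead of erroring.
            return Ok("no-op");
        }
        return Err(CatalogError::NotFound);
    }
    Ok("truncated")
}

fn main() {
    assert_eq!(handle_truncate(false, true), Ok("no-op"));
    assert_eq!(handle_truncate(false, false), Err(CatalogError::NotFound));
    assert_eq!(handle_truncate(true, false), Ok("truncated"));
}
```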


@jerry-024 jerry-024 left a comment


+1

@JingsongLi JingsongLi merged commit 07de603 into apache:main Apr 28, 2026
7 of 8 checks passed