feat(table): Support Dynamic Partition Overwrite #482

dttung2905 · 2025-07-08T21:59:46Z

No description provided.

Signed-off-by: dttung2905 <ttdao.2015@accountancy.smu.edu.sg>

laskoviymishka · 2025-07-09T09:31:04Z

table/transaction.go

+}
+
+// Delete performs a delete operation with the given filter and snapshot properties.
+func (t *Transaction) Delete(ctx context.Context, filter iceberg.BooleanExpression, snapshotProps iceberg.Properties) error {


I have some doubts here. DPO is supposed to delete whole files inside a partition, so the semantics of this Delete method are odd: from the description I would guess that it deletes rows matching the predicate, but in fact it deletes whole files.

Maybe it is worth renaming the method and making it private?
Is there any need to keep it public?

Yes, you are right. I think it should be a private method. I changed the name to make it more meaningful.

Signed-off-by: dttung2905 <ttdao.2015@accountancy.smu.edu.sg>

zeroshade · 2025-08-05T18:46:11Z

table/transaction.go

+	// Check that all partition fields use identity transforms
+	currentSpec := t.meta.CurrentSpec()
+	for field := range currentSpec.Fields() {
+		if _, ok := field.Transform.(iceberg.IdentityTransform); !ok {
+			return fmt.Errorf("%w: dynamic overwrite does not support non-identity-transform fields in partition spec: %s",
+				ErrInvalidOperation, field.Name)
+		}
+	}


is this defined in the spec? Or is this just a NotYetImplemented thing?

zeroshade · 2025-08-05T18:46:32Z

table/transaction.go

+	if tbl.NumRows() == 0 {
+		return nil
+	}


shouldn't this overwrite the partition with an empty partition?

zeroshade · 2025-08-05T18:48:21Z

table/transaction.go

+	var allDataFiles []iceberg.DataFile
+	for df, err := range dataFiles {
+		if err != nil {
+			return err
+		}
+		allDataFiles = append(allDataFiles, df)
+	}
+
+	partitionsToOverwrite := make(map[string]struct{})
+	for _, df := range allDataFiles {
+		partitionKey := fmt.Sprintf("%v", df.Partition())
+		partitionsToOverwrite[partitionKey] = struct{}{}
+	}


you can probably merge these loops

zeroshade · 2025-08-05T18:51:19Z

table/transaction.go

+		return err
+	}
+
+	deleteProducer := t.updateSnapshot(fs, snapshotProps).mergeOverwrite(nil)


shouldn't this use the commitUUID?

zeroshade · 2025-08-05T18:56:21Z

table/transaction.go

+			partitionExpr := partitionExprs[0]
+			for _, expr := range partitionExprs[1:] {
+				partitionExpr = iceberg.NewAnd(partitionExpr, expr)
+			}


this is already handled via NewAnd. You can do: partitionExpr := iceberg.NewAnd(partitionExprs[0], partitionExprs[1], partitionExprs[2:]...)
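
One caveat with the one-line form: indexing `partitionExprs[1]` panics when the slice holds a single expression, a case the original loop handled. The sketch below shows a guarded wrapper. It is self-contained, so `expr`, `newAnd`, and `andAll` are hypothetical stand-ins; `newAnd` only imitates the call shape quoted above (two required operands plus a variadic tail) and is not the library implementation.

```go
package main

import (
	"fmt"
	"strings"
)

// expr is a hypothetical stand-in for iceberg.BooleanExpression.
type expr string

// newAnd imitates the call shape quoted in the review for iceberg.NewAnd:
// two required operands followed by any number of additional ones.
func newAnd(left, right expr, addl ...expr) expr {
	parts := []string{string(left), string(right)}
	for _, e := range addl {
		parts = append(parts, string(e))
	}
	return expr("(" + strings.Join(parts, " AND ") + ")")
}

// andAll combines a slice of expressions, guarding the short cases that
// would otherwise index out of range.
func andAll(exprs []expr) (expr, error) {
	switch len(exprs) {
	case 0:
		return "", fmt.Errorf("no expressions to combine")
	case 1:
		return exprs[0], nil
	default:
		return newAnd(exprs[0], exprs[1], exprs[2:]...), nil
	}
}

func main() {
	combined, err := andAll([]expr{"year = 2025", "month = 8", "day = 5"})
	if err != nil {
		panic(err)
	}
	fmt.Println(combined)
}
```

The same guard would apply to an OR across partitions, since the variadic form there also needs at least two operands.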

zeroshade · 2025-08-05T18:58:13Z

table/transaction.go

+	result := expressions[0]
+	for _, expr := range expressions[1:] {
+		result = iceberg.NewOr(result, expr)
+	}


same comment as above, iceberg.NewOr already handles an arbitrary number of arguments so you don't have to do this loop manually

zeroshade · 2025-08-05T19:02:51Z

table/transaction.go

+func parsePartitionKey(partitionKey string, fieldNames []string) []interface{} {
+	// Simple parsing for demonstration - assumes a format like "field1=value1/field2=value2"
+	parts := strings.Split(partitionKey, "/")
+	values := make([]interface{}, len(fieldNames))


we have the schema, we can use the field names to determine the types so we know what type to parse into from the strings
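
A rough sketch of what type-aware parsing could look like, assuming the `field1=value1/field2=value2` format from the code comment above. It is self-contained, so the `kind` enum and the `kinds` map are hypothetical stand-ins for a type lookup against the table schema, and only a few primitive types are covered.

```go
package main

import (
	"fmt"
	"strconv"
	"strings"
)

// kind is a hypothetical stand-in for the field type looked up in the schema.
type kind int

const (
	kindInt64 kind = iota
	kindFloat64
	kindBool
	kindString
)

// parseValue converts the raw string into the Go type implied by k.
func parseValue(raw string, k kind) (any, error) {
	switch k {
	case kindInt64:
		return strconv.ParseInt(raw, 10, 64)
	case kindFloat64:
		return strconv.ParseFloat(raw, 64)
	case kindBool:
		return strconv.ParseBool(raw)
	case kindString:
		return raw, nil
	default:
		return nil, fmt.Errorf("unsupported kind %d", k)
	}
}

// parsePartitionKey parses "name=value/name=value" into typed values ordered
// like fieldNames, using kinds to pick the target type for each field.
func parsePartitionKey(key string, fieldNames []string, kinds map[string]kind) ([]any, error) {
	raw := make(map[string]string)
	for _, part := range strings.Split(key, "/") {
		name, value, ok := strings.Cut(part, "=")
		if !ok {
			return nil, fmt.Errorf("malformed partition segment %q", part)
		}
		raw[name] = value
	}
	values := make([]any, len(fieldNames))
	for i, name := range fieldNames {
		s, ok := raw[name]
		if !ok {
			return nil, fmt.Errorf("partition key is missing field %q", name)
		}
		k, ok := kinds[name]
		if !ok {
			return nil, fmt.Errorf("no type known for field %q", name)
		}
		v, err := parseValue(s, k)
		if err != nil {
			return nil, fmt.Errorf("field %q: %w", name, err)
		}
		values[i] = v
	}
	return values, nil
}

func main() {
	kinds := map[string]kind{"id": kindInt64, "region": kindString}
	values, err := parsePartitionKey("id=42/region=us", []string{"id", "region"}, kinds)
	if err != nil {
		panic(err)
	}
	fmt.Printf("%T %T\n", values[0], values[1])
}
```

This sketch does not handle values that themselves contain `/` or `=`; a string round trip is lossy in that case, so carrying the typed partition values through directly would sidestep parsing altogether.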

zeroshade · 2025-08-05T19:04:51Z

table/transaction.go

+	switch t := typ.(type) {
+	case iceberg.PrimitiveType:
+		switch t {
+		case iceberg.PrimitiveTypes.Int32:
+			if v, ok := value.(int32); ok {
+				return iceberg.EqualTo(term, v)
+			}
+		case iceberg.PrimitiveTypes.Int64:
+			if v, ok := value.(int64); ok {
+				return iceberg.EqualTo(term, v)
+			}
+		case iceberg.PrimitiveTypes.Float32:
+			if v, ok := value.(float32); ok {
+				return iceberg.EqualTo(term, v)
+			}
+		case iceberg.PrimitiveTypes.Float64:
+			if v, ok := value.(float64); ok {
+				return iceberg.EqualTo(term, v)
+			}
+		case iceberg.PrimitiveTypes.String:
+			if v, ok := value.(string); ok {
+				return iceberg.EqualTo(term, v)
+			}
+		case iceberg.PrimitiveTypes.Bool:
+			if v, ok := value.(bool); ok {
+				return iceberg.EqualTo(term, v)
+			}
+		}
+	}


the types and casting should be handled for you once the expression is bound. So you shouldn't need the iceberg.Type; just do a switch on value.(type) and call iceberg.EqualTo(term, v)
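
A minimal sketch of the suggested type switch, written against stand-ins so it compiles on its own. `predicate`, `literal`, `equalTo`, and `equalityFor` are hypothetical; `equalTo` is assumed to be generic over the literal type in the way `iceberg.EqualTo` is called above, and binding is not modelled here.

```go
package main

import "fmt"

// predicate is a hypothetical stand-in for an unbound equality predicate.
type predicate struct {
	term  string
	value any
}

// literal lists the value types this sketch accepts (an assumption, not the
// library's actual constraint).
type literal interface {
	bool | int32 | int64 | float32 | float64 | string
}

// equalTo is a hypothetical generic constructor standing in for iceberg.EqualTo.
func equalTo[T literal](term string, v T) predicate {
	return predicate{term: term, value: v}
}

// equalityFor dispatches on the dynamic type of value only. Each type needs
// its own case: in a multi-type case v would remain an interface value and
// could not be used as the type argument of the generic call.
func equalityFor(term string, value any) (predicate, error) {
	switch v := value.(type) {
	case bool:
		return equalTo(term, v), nil
	case int32:
		return equalTo(term, v), nil
	case int64:
		return equalTo(term, v), nil
	case float32:
		return equalTo(term, v), nil
	case float64:
		return equalTo(term, v), nil
	case string:
		return equalTo(term, v), nil
	default:
		return predicate{}, fmt.Errorf("unsupported partition value type %T", value)
	}
}

func main() {
	for _, v := range []any{int64(7), "us", true} {
		p, err := equalityFor("part", v)
		if err != nil {
			panic(err)
		}
		fmt.Printf("%s == %v (%T)\n", p.term, p.value, p.value)
	}
}
```

Returning an error from the default case, rather than silently falling through as the nested switch above does, makes unsupported value types visible to the caller.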

lliangyu-lin · 2025-08-07T18:40:59Z

table/transaction.go

+}
+
+// deleteFileByFilter performs a delete operation with the given filter and snapshot properties.
+func (t *Transaction) deleteFileByFilter(ctx context.Context, filter iceberg.BooleanExpression, snapshotProps iceberg.Properties) error {


I'm also working on a complete delete API (CoW) in #518 that can delete at both row level and file level based on a predicate.
Hopefully we won't need this method once the full delete API is supported.

Support Dynamic Partition Overwrite

de83515

Signed-off-by: dttung2905 <ttdao.2015@accountancy.smu.edu.sg>

laskoviymishka suggested changes Jul 9, 2025

Make deleteByFilter method private

931506d

Signed-off-by: dttung2905 <ttdao.2015@accountancy.smu.edu.sg>

zeroshade requested changes Aug 5, 2025

lliangyu-lin reviewed Aug 7, 2025

dttung2905 mentioned this pull request Aug 23, 2025

can you support partitioned tables? #536

Open
