
Drop table should not clean the folder for Nessie catalog #22392

Merged
merged 1 commit into from
Jul 4, 2024

Conversation

ajantha-bhat
Member

Description

The same table might still be live in other branches or tags. Therefore, dropping the table should not clean up the files as it is not reference-aware. Use the Nessie GC tool to clean up expired files. This behaviour is consistent with the Spark Nessie integration.

Additional context and related issues

Spark integration:
https://github.com/apache/iceberg/blob/main/nessie/src/main/java/org/apache/iceberg/nessie/NessieIcebergClient.java#L557-L562
Docs: https://iceberg.apache.org/docs/1.5.1/nessie/#further-use-cases
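
The reasoning above can be sketched as a toy model. The following is a minimal, self-contained illustration, not the Trino or Nessie API; the class and method names are invented. It models why a reference-aware catalog must not delete files on drop: the same table location may still be referenced from other branches, and only a GC pass over all references can delete safely.

```java
import java.util.*;

// Toy model (invented for illustration): each branch maps table names to a
// file location; "files" stands in for the object store.
class ToyNessie {
    final Map<String, Map<String, String>> branches = new HashMap<>();
    final Set<String> files = new HashSet<>();

    void createTable(String branch, String table, String location) {
        branches.computeIfAbsent(branch, b -> new HashMap<>()).put(table, location);
        files.add(location);
    }

    void branchFrom(String src, String dst) {
        branches.put(dst, new HashMap<>(branches.get(src)));
    }

    // Drop removes only this branch's catalog entry. It never deletes files,
    // because other branches or tags may still reference the same location.
    void dropTable(String branch, String table) {
        branches.get(branch).remove(table);
    }

    // GC (the role Nessie GC plays): delete files that no branch references.
    void gc() {
        Set<String> live = new HashSet<>();
        branches.values().forEach(tables -> live.addAll(tables.values()));
        files.retainAll(live);
    }
}
```

With a table created on `main` and a `dev` branch forked from it, dropping the table on `main` leaves the files intact; only after the last reference is gone does a GC pass remove them.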

Release notes

( ) This is not user-visible or is docs only, and no release notes are required.
( ) Release notes are required. Please propose a release note for me.
(x) Release notes are required, with the following suggested text:

# Section
* Fix some things. ({issue}`issuenumber`)

@cla-bot cla-bot bot added the cla-signed label Jun 14, 2024
@github-actions github-actions bot added the iceberg Iceberg connector label Jun 14, 2024
@@ -237,7 +237,8 @@ public void dropTable(ConnectorSession session, SchemaTableName schemaTableName)
     BaseTable table = (BaseTable) loadTable(session, schemaTableName);
     validateTableCanBeDropped(table);
     nessieClient.dropTable(toIdentifier(schemaTableName), true);
-    deleteTableDirectory(fileSystemFactory.create(session), schemaTableName, table.location());
Member Author

The initial PR probably copied this code from other catalogs and forgot to handle it.

The behaviour now matches the Spark integration, as expected.

@@ -859,7 +859,7 @@ private ZonedDateTime getSnapshotTime(String tableName, long snapshotId)
.getOnlyColumnAsSet());
}

-    private String getTableLocation(String tableName)
+    protected String getTableLocation(String tableName)
Member Author

Needed so that the Nessie tests can override this method.

@@ -140,8 +163,8 @@ protected void dropTableFromMetastore(String tableName)
@Override
protected String getMetadataLocation(String tableName)
{
-        // used when registering a table, which is not supported by the Nessie catalog
-        throw new UnsupportedOperationException("metadata location for register_table is not supported");
+        BaseTable table = (BaseTable) catalog.loadTable(TableIdentifier.of("tpch", tableName));
Member Author

getMetadataLocation is required functionality and is used for more than just register_table.

@@ -197,7 +220,7 @@ public void testRegisterTableWithDifferentTableName()
public void testRegisterTableWithMetadataFile()
{
assertThatThrownBy(super::testRegisterTableWithMetadataFile)
-                .hasMessageContaining("metadata location for register_table is not supported");
+                .hasMessageContaining("register_table procedure is disabled");
Member Author

Because getMetadataLocation is now implemented, the failure comes from the disabled register_table procedure instead.

assertThat(getQueryRunner().tableExists(getSession(), tableName)).isFalse();
assertThat(fileSystem.listFiles(tableLocation).hasNext())
.describedAs("Table location should exist")
.isTrue();
Member Author

@ajantha-bhat ajantha-bhat Jun 14, 2024

The base tests assume the table location does NOT exist after a drop, so this had to be overridden to match Nessie behaviour.

@ajantha-bhat
Member Author

cc: @dimas-b

@dimas-b dimas-b left a comment

Nice catch! Thanks, @ajantha-bhat !

@wendigo wendigo requested a review from findinpath June 14, 2024 13:20
@ajantha-bhat
Member Author

ping for review

@dimas-b

dimas-b commented Jun 21, 2024

@ajantha-bhat : Will this problem also exist when Nessie is used via its Iceberg REST Catalog API?

@ajantha-bhat
Member Author

@ajantha-bhat : Will this problem also exist when Nessie is used via its Iceberg REST Catalog API?

The server handles the delete implementation for the REST catalog as of now, so that problem does not exist for the REST catalog.

https://github.com/apache/iceberg/blob/a47937c0c1fcafe57d7dc83551d8c9a3ce0ab1b9/core/src/main/java/org/apache/iceberg/rest/RESTClient.java#L38-L71

@ajantha-bhat ajantha-bhat changed the title Nessie: Drop table should not clean the folder Drop table should not clean the folder for Nessie catalog Jun 23, 2024
The same table might still be live in other branches or tags.
Therefore, dropping the table should not clean up the files as it
is not reference-aware. Use the Nessie GC tool to clean up expired
files. This behavior is consistent with the Spark Nessie integration.
@ajantha-bhat
Member Author

@findepi or @findinpath: Can you please take a look?

@ajantha-bhat ajantha-bhat requested a review from ebyhr June 25, 2024 13:14
@ebyhr ebyhr removed their request for review June 26, 2024 00:42
@olivier-derom

We've been running into issues with Trino and nessie branching because of this.
This PR would be a great improvement, thanks!

@ajantha-bhat
Member Author

@olivier-derom: Thanks for using Nessie and Trino. We are working with the community to get this merged.

@mosabua
Member

mosabua commented Jul 3, 2024

At first glance this looks good to me. Maybe @cwsteinbach can help next.

@mosabua
Member

mosabua commented Jul 3, 2024

One question/remark I have is around consistency across different metastores/catalogs. It seems we are ending up in a situation where dropping a table has different results, in terms of metadata and file deletion, depending on which catalog is used (Nessie vs REST vs HMS vs ...). But I assume we can't really avoid that, given the expectation of consistent behaviour across query engines.

Under that assumption I think the implemented approach and reliance on the Nessie GC is fine. Unless of course there is development in the Iceberg spec around what is supposed to happen on dropping tables.

@cwsteinbach

Unless of course there is development in the Iceberg spec around what is supposed to happen on dropping tables.

@mosabua, the Iceberg table format spec doesn't cover this, and I don't expect that it ever will. It would be really nice to have consistent DDL behavior across engine/catalog combinations, but I don't think Iceberg can dictate DDL semantics for third-party systems, and even if they could, it would create a lot of pain for users in the form of backward-incompatible changes.

That said, the Iceberg REST Catalog API does make a distinction between only dropping the table record and dropping both the table record and the underlying data, and defaults to the former rather than the latter:

https://github.com/apache/iceberg/blob/d255c87b00c8ca422a1d32a33b3a9cfe2f04cea2/open-api/rest-catalog-open-api.yaml#L787
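For reference, that distinction surfaces as a query parameter on the spec's drop-table endpoint. The following is a schematic sketch of the two request forms (`{prefix}`, `{namespace}`, and `{table}` are placeholders from the spec, not concrete values):

```
# Drops only the catalog record (the default, purgeRequested=false):
DELETE /v1/{prefix}/namespaces/{namespace}/tables/{table}

# Drops the record and requests deletion of the table's data and metadata:
DELETE /v1/{prefix}/namespaces/{namespace}/tables/{table}?purgeRequested=true
```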

@ajantha-bhat
Member Author

ajantha-bhat commented Jul 4, 2024

Thanks @mosabua and @cwsteinbach for the inputs.

Yeah. Since Nessie provides catalog-level versioning, we can't delete a table's files without knowing everywhere it is referenced. Hence, we skip file cleanup on the usual drop path and recommend using Nessie GC. This is one of the complexities introduced by catalog-level versioning.

With REST API v2, we are planning to formally introduce catalog-level versioning capability in the REST catalog spec to make this clearer to users.

Lastly, let me know if anything needed for this PR to get merged. Thanks.

@wendigo wendigo merged commit 75a9a91 into trinodb:master Jul 4, 2024
43 checks passed
@wendigo
Contributor

wendigo commented Jul 4, 2024

Thanks @ajantha-bhat

@ajantha-bhat
Member Author

Thanks for the review and merge.

@github-actions github-actions bot added this to the 452 milestone Jul 4, 2024
@wendigo
Contributor

wendigo commented Jul 4, 2024

I've just updated nessie to the latest version as well @ajantha-bhat
