Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DynamoDB runtime dependencies for Delta S3 multi-writer #89

Merged
merged 1 commit into from
Oct 28, 2024

Conversation

istreeter
Copy link
Collaborator

Delta has a "S3 multi-cluster" mode, which allows safe cocurrent writes by multiple loaders.

To enable this mode, users can set the spark options documented by Delta:

  • spark.delta.logStore.s3.impl=io.delta.storage.S3DynamoDBLogStore
  • spark.io.delta.storage.S3DynamoDBLogStore.ddb.tableName=???
  • spark.io.delta.storage.S3DynamoDBLogStore.ddb.region=???
  • (and some others)

This commit adds the necessary runtime dependencies for this mode.

Delta has a "S3 multi-cluster" mode, which allows safe cocurrent writes
by multiple loaders.

To enable this mode, users can set the spark options documented by
Delta:

- `spark.delta.logStore.s3.impl=io.delta.storage.S3DynamoDBLogStore`
- `spark.io.delta.storage.S3DynamoDBLogStore.ddb.tableName=???`
- `spark.io.delta.storage.S3DynamoDBLogStore.ddb.region=???`
- (and some others)

This commit adds the necessary runtime dependencies for this mode.
@oguzhanunlu oguzhanunlu merged commit e39d411 into develop Oct 28, 2024
2 checks passed
@oguzhanunlu oguzhanunlu deleted the delta-dynamodb-runtime-dependencies branch October 28, 2024 12:52
istreeter added a commit that referenced this pull request Dec 31, 2024
In #89 we added support for Delta's "S3 multi-cluster" mode, which uses
a DynamoDB table to allow safe concurrent access by multiple loaders.

This PR extends the webhook feature, so the webhook receives a friendly
error message if the DynamoDB table is mis-configured
istreeter added a commit that referenced this pull request Dec 31, 2024
In #89 we added support for Delta's "S3 multi-cluster" mode, which uses
a DynamoDB table to allow safe concurrent access by multiple loaders.

This PR extends the webhook feature, so the webhook receives a friendly
error message if the DynamoDB table is mis-configured
istreeter added a commit that referenced this pull request Dec 31, 2024
In #89 we added support for Delta's "S3 multi-cluster" mode, which uses
a DynamoDB table to allow safe concurrent access by multiple loaders.

This PR extends the webhook feature, so the webhook receives a friendly
error message if the DynamoDB table is mis-configured
istreeter added a commit that referenced this pull request Dec 31, 2024
In #89 we added support for Delta's "S3 multi-cluster" mode, which uses
a DynamoDB table to allow safe concurrent access by multiple loaders.

This PR extends the webhook feature, so the webhook receives a friendly
error message if the DynamoDB table is mis-configured
istreeter added a commit that referenced this pull request Jan 3, 2025
In #89 we added support for Delta's "S3 multi-cluster" mode, which uses
a DynamoDB table to allow safe concurrent access by multiple loaders.

This PR extends the webhook feature, so the webhook receives a friendly
error message if the DynamoDB table is mis-configured
istreeter added a commit that referenced this pull request Jan 3, 2025
In #89 we added support for Delta's "S3 multi-cluster" mode, which uses
a DynamoDB table to allow safe concurrent access by multiple loaders.

This PR extends the webhook feature, so the webhook receives a friendly
error message if the DynamoDB table is mis-configured
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants