The delta extension adds support for the Delta Lake open-source storage format. It is built using the Delta Kernel. The extension offers read support for Delta tables, both local and remote.
For implementation details, see the announcement blog post.
Warning The
deltaextension is currently experimental and is only supported on given platforms.
The delta extension will be transparently autoloaded on first use from the official extension repository. If you would like to install and load it manually, run:
INSTALL delta; LOAD delta;
To scan a local Delta table, run:
SELECT *
FROM delta_scan('file:///some/path/on/local/machine'); To scan a Delta table in an S3 bucket, run:
SELECT *
FROM delta_scan('s3://some/delta/table'); For authenticating to S3 buckets, DuckDB Secrets are supported:
CREATE SECRET (
TYPE S3,
PROVIDER CREDENTIAL_CHAIN
);
SELECT *
FROM delta_scan('s3://some/delta/table/with/auth'); To scan public buckets on S3, you may need to pass the correct region by creating a secret containing the region of your public S3 bucket:
CREATE SECRET (
TYPE S3,
REGION 'my-region'
);
SELECT *
FROM delta_scan('s3://some/public/table/in/my-region'); To scan a Delta table in an Azure Blob Storage bucket, run:
SELECT *
FROM delta_scan('az://my-container/my-table'); For authenticating to Azure Blob Storage, DuckDB Secrets are supported:
CREATE SECRET (
TYPE AZURE,
PROVIDER CREDENTIAL_CHAIN
);
SELECT *
FROM delta_scan('az://my-container/my-table-with-auth'); While the delta extension is still experimental, many (scanning) features and optimizations are already supported:
More optimizations are going to be released in the future.
The delta extension requires DuckDB version 0.10.3 or newer.
The delta extension currently only supports the following platforms:
linux_amd64, linux_amd64_gcc4, and linux_arm64
osx_amd64 and osx_arm64
windows_amd64
Support for the other DuckDB platforms is work-in-progress.
© Copyright 2018–2024 Stichting DuckDB Foundation
Licensed under the MIT License.
https://duckdb.org/docs/extensions/delta.html