Kedro Databricks
A Kedro plugin for developing Kedro pipelines for Databricks. It strives to provide an excellent developer experience when using Kedro on Databricks.
Key Features
- Initialization: Transform your local Kedro project into a Databricks Asset Bundle.
- Generation: Generate Asset Bundle resource definitions from your Kedro pipelines.
- Deployment: Deploy your Kedro pipelines to Databricks as Jobs.
- Execution: Run your Kedro pipelines on Databricks straight from the command line.
- Cleanup: Remove all Databricks resources created by the plugin.
Advanced Capabilities
- Resource generation modes (node and pipeline) with support for custom generators.
- Flexible override model with defaults, named overrides, and regex-based overrides.
- Support for non-job resources (for example volumes) in conf/<env>/databricks.yml.
- Databricks CLI passthrough (-- ...) for advanced target/profile control.
- Automatic local data upload during deploy when _file_path is configured.
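To make the layered override model more concrete, here is a minimal Python sketch of how defaults, regex-based overrides, and named overrides could combine for a single resource. It is not the plugin's implementation. The `default` key, the `re:` prefix for regex keys, the precedence order (defaults, then regex matches, then exact name), and the function names `deep_merge` and `resolve_overrides` are all assumptions made for illustration; see the User Guide for the actual configuration syntax.

```python
import re
from copy import deepcopy
from typing import Any


def deep_merge(base: dict[str, Any], extra: dict[str, Any]) -> dict[str, Any]:
    """Return a new dict in which values from `extra` win over `base`."""
    merged = deepcopy(base)
    for key, value in extra.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            # Merge nested mappings key by key instead of replacing them.
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = deepcopy(value)
    return merged


def resolve_overrides(name: str, overrides: dict[str, dict[str, Any]]) -> dict[str, Any]:
    """Resolve the effective settings for the resource called `name`.

    Hypothetical layout of `overrides` (an assumption, not the plugin's schema):
      - "default":      settings applied to every resource
      - "re:<pattern>": settings applied when <pattern> fully matches `name`
      - "<name>":       settings applied to exactly that resource
    """
    # 1. Start from the defaults.
    resolved = deepcopy(overrides.get("default", {}))

    # 2. Layer every matching regex override on top, in declaration order.
    for key, value in overrides.items():
        if key.startswith("re:") and re.fullmatch(key[3:], name):
            resolved = deep_merge(resolved, value)

    # 3. An exact (named) override has the final say.
    if name != "default" and name in overrides:
        resolved = deep_merge(resolved, overrides[name])

    return resolved


# Example data: job names and settings are made up for this sketch.
overrides = {
    "default": {"max_retries": 1, "cluster": {"num_workers": 1}},
    "re:etl_.*": {"cluster": {"num_workers": 4}},
    "etl_daily": {"max_retries": 3},
}

etl_daily = resolve_overrides("etl_daily", overrides)
reporting = resolve_overrides("reporting", overrides)
```

In this sketch, `etl_daily` picks up the worker count from the regex override and the retry count from its named override, while `reporting` matches neither and keeps the defaults. Putting the most specific rule last is a design choice of the sketch, not a documented guarantee of the plugin.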
For practical examples and deep-dive configuration patterns, see the User Guide.