Databricks Terraform Provider: Automate Databricks Infrastructure with IaC

Manage your Databricks infrastructure and resources using HashiCorp Terraform. This provider enables declarative configuration of workspaces, clusters, jobs, permissions, and more.

Added on May 10, 2025
Stars: 513
Forks: 428
Language: Go

Project Introduction

Summary

The Databricks Terraform Provider allows users to manage Databricks infrastructure and resources using Terraform's Infrastructure as Code approach. It integrates with the Databricks API to provision and manage resources like workspaces, clusters, jobs, notebooks, permissions, and more.
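As a rough illustration of how a configuration connects to the Databricks API, the sketch below declares the provider and points it at a single workspace. The variable names (databricks_host, databricks_token) are placeholders chosen for this example, and token authentication is only one of several options the provider supports.

```hcl
terraform {
  required_providers {
    databricks = {
      source = "databricks/databricks"
    }
  }
}

# Hypothetical input variables; supply real values via tfvars or environment.
variable "databricks_host" {
  type        = string
  description = "Workspace URL (placeholder for this example)"
}

variable "databricks_token" {
  type        = string
  sensitive   = true
  description = "Personal access token (placeholder for this example)"
}

# Workspace-level provider configuration using token authentication.
provider "databricks" {
  host  = var.databricks_host
  token = var.databricks_token
}
```

Keeping the token in a variable marked sensitive keeps it out of plan output; in practice it would typically come from a secret store or environment variable rather than a committed file.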

Problem Solved

Manually configuring Databricks workspaces, clusters, jobs, and permissions across multiple environments is time-consuming, inconsistent, and error-prone. This provider allows you to define your Databricks infrastructure as code, enabling reproducible deployments and version control.

Core Features

Workspace Management

Declaratively manage Databricks workspaces across cloud providers (AWS, Azure, GCP).

Cluster Automation

Automate the creation and configuration of Databricks clusters, including instance pools and libraries.

Jobs and MLOps Resource Management

Define and manage Databricks Jobs and MLOps resources as code.

Identity and Access Management

Control user, group, and service principal access and permissions within Databricks.

Tech Stack

Terraform
Go
Databricks API

Use Cases

The Databricks Terraform Provider can be used for a wide range of infrastructure automation tasks within Databricks.

Provisioning Databricks Workspaces

Details

Provision a new Databricks workspace in your preferred cloud provider and configure its initial settings, including VPC/VNet peering and storage.

User Value

Rapidly deploy new Databricks environments for projects, teams, or testing.
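A minimal sketch of workspace provisioning, assuming an AWS deployment where the cross-account credentials, root storage configuration, and network have already been registered with the Databricks account. All variable names and the workspace name are hypothetical; Azure and GCP use different resources and are not shown here.

```hcl
terraform {
  required_providers {
    databricks = {
      source = "databricks/databricks"
    }
  }
}

# Hypothetical inputs: IDs of objects assumed to exist at the account level.
variable "databricks_account_id" { type = string }
variable "credentials_id" { type = string }
variable "storage_configuration_id" { type = string }
variable "network_id" { type = string }

# Account-level provider; authentication details are assumed to be supplied
# through the environment and are omitted from this sketch.
provider "databricks" {
  alias      = "account"
  host       = "https://accounts.cloud.databricks.com"
  account_id = var.databricks_account_id
}

resource "databricks_mws_workspaces" "dev" {
  provider       = databricks.account
  account_id     = var.databricks_account_id
  workspace_name = "analytics-dev" # example name
  aws_region     = "us-east-1"     # example region

  credentials_id           = var.credentials_id
  storage_configuration_id = var.storage_configuration_id
  network_id               = var.network_id
}

# URL of the new workspace, useful for configuring a workspace-level provider.
output "workspace_url" {
  value = databricks_mws_workspaces.dev.workspace_url
}
```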

Managing Databricks Clusters and Compute Resources

Details

Define standard cluster configurations (size, autoscaling, libraries) and deploy them consistently for different workloads.

User Value

Ensure consistent, cost-effective, and performant compute resources for all users and jobs.
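A standard cluster definition of the kind described above might look like the following sketch. It assumes the databricks provider is already configured elsewhere in the module; the cluster name, worker counts, idle timeout, and the pandas library are illustrative choices, not recommendations from the project.

```hcl
# Assumes the databricks provider is configured elsewhere in this module.

# Look up the latest long-term-support Databricks Runtime version.
data "databricks_spark_version" "latest_lts" {
  long_term_support = true
}

# Look up the smallest node type that has a local disk.
data "databricks_node_type" "smallest" {
  local_disk = true
}

resource "databricks_cluster" "shared_autoscaling" {
  cluster_name            = "shared-autoscaling" # example name
  spark_version           = data.databricks_spark_version.latest_lts.id
  node_type_id            = data.databricks_node_type.smallest.id
  autotermination_minutes = 20 # shut down after 20 idle minutes

  # Scale between 1 and 4 workers depending on load.
  autoscale {
    min_workers = 1
    max_workers = 4
  }

  # Install a PyPI library on the cluster.
  library {
    pypi {
      package = "pandas"
    }
  }
}
```

Resolving the runtime version and node type through data sources avoids hard-coding cloud-specific identifiers, which makes the same definition easier to reuse across environments.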

Automating Data Pipelines and ML Workflows

Details

Define and schedule data processing jobs or machine learning model training pipelines directly in your Terraform configuration.

User Value

Version control and automate the deployment of your operational workflows.
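One way a scheduled job could be expressed is sketched below: a single notebook task on a job cluster, run nightly. The job name, task key, notebook path, and schedule are hypothetical, and the provider is assumed to be configured elsewhere in the module.

```hcl
# Assumes the databricks provider is configured elsewhere in this module.

data "databricks_spark_version" "latest_lts" {
  long_term_support = true
}

data "databricks_node_type" "smallest" {
  local_disk = true
}

resource "databricks_job" "nightly_etl" {
  name = "nightly-etl" # example name

  task {
    task_key = "run_etl"

    # Hypothetical notebook path; the notebook must already exist.
    notebook_task {
      notebook_path = "/Shared/etl/nightly"
    }

    # Ephemeral job cluster created for each run.
    new_cluster {
      num_workers   = 2
      spark_version = data.databricks_spark_version.latest_lts.id
      node_type_id  = data.databricks_node_type.smallest.id
    }
  }

  # Quartz cron syntax: run every day at 02:00 UTC.
  schedule {
    quartz_cron_expression = "0 0 2 * * ?"
    timezone_id            = "UTC"
  }
}
```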

Implementing Access Control and Security Policies

Details

Manage user access, group memberships, and permissions on clusters, notebooks, and other workspace objects.

User Value

Enforce security best practices and compliance requirements programmatically.
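Access control along these lines could be sketched as follows: a group, a user added to it, and a permission grant on an existing cluster. The group name, user email, and the cluster_id variable are placeholders, and the provider is assumed to be configured elsewhere in the module.

```hcl
# Assumes the databricks provider is configured elsewhere in this module.

# Hypothetical input: ID of an existing cluster to grant access to.
variable "cluster_id" {
  type = string
}

resource "databricks_group" "data_engineers" {
  display_name = "data-engineers" # example group name
}

resource "databricks_user" "analyst" {
  user_name = "analyst@example.com" # placeholder user
}

# Add the user to the group.
resource "databricks_group_member" "analyst_in_engineers" {
  group_id  = databricks_group.data_engineers.id
  member_id = databricks_user.analyst.id
}

# Allow group members to attach to and restart the cluster.
resource "databricks_permissions" "cluster_usage" {
  cluster_id = var.cluster_id

  access_control {
    group_name       = databricks_group.data_engineers.display_name
    permission_level = "CAN_RESTART"
  }
}
```

Granting permissions to groups rather than individual users keeps the configuration small and lets membership changes happen without touching the permission grants.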
