Learning Path1: Explore Azure Databricks
Get Started With Azure Databricks
Identify Azure Databricks Workloads
Understand Key Concepts
Data Governance Using Unity Catalog and Microsoft Purview
Learning Path 2: Select and Configure Compute in Azure Databricks
Choose an appropriate compute type
Configure compute performance
Configure compute features
Install libraries for compute
Configure compute access
Learning Path 3: Create and organize objects in Unity Catalog
Learning Path 4: Secure Unity Catalog objects
Understand query lifecycle
Implement access control strategies
Understand fine-grained access control
Implement row filtering and column masking
Access Azure Key Vault secrets
Authenticate data access with service principals
Authenticate resource access with managed identities
Learning Path 5: Govern Unity Catalog objects
Create and preserve table definitions
Configure ABAC with tags and policies
Apply data retention policies
Set up and manage data lineage
Configure audit logging
Design secure Delta Sharing strategy
Learning Path 6: Design and implement data modeling with Azure Databricks
Design ingestion logic and data source configuration
Choose a data ingestion tool
Choose a data table format
Design and implement a data partitioning scheme
Choose a slowly changing dimension (SCD) type
Implement a slowly changing dimension (SCD) type 2
Design and implement a temporal (history) table to record changes over time
Choose granularity on a column or table based on requirements
Choose managed vs unmanaged tables
Design and implement a clustering strategy
Learning Path 7: Ingest data into Unity Catalog
Ingest data with Lakeflow Connect
Ingest data with notebooks
Ingest data with SQL methods
Ingest data with CDC feed
Ingest data with Spark Structured Streaming
Ingest data with Auto Loader
Ingest data with Lakeflow Spark Declarative Pipelines
Learning Path 8: Cleanse, transform, and load data into Unity Catalog
Profile data
Choose column data types
Resolve duplicates and nulls
Transform data with filters and aggregations
Transform data with joins and set operators
Transform data with denormalization and pivots
Load data with merge, insert, and append
Learning Path 9: Implement and manage data quality constraints with Azure Databricks
Implement validation checks
Implement data type checks
Detect and manage schema drift
Manage data quality with pipeline expectations.
Learning Path 10: Design and implement data pipelines with Azure Databricks
Design order of operations for a pipeline
Choose notebook vs Lakeflow Pipelines
Design Lakeflow job logic
Design error handling in pipelines and jobs
Create pipeline with notebook
Create pipeline with Lakeflow Spark Declarative Pipelines
Learning Path 11: Implement Lakeflow Jobs with Azure Databricks
Learning Path 12: Implement development lifecycle processes in Azure Databricks
Apply Git version control best practices
Manage branching and pull requests
Implement testing strategy
Configure and package Declarative Automation Bundles
Deploy bundle with Databricks CLI
Learning Path 13: Monitor, troubleshoot and optimize workloads in Azure Databricks
Monitor and manage cluster consumption
Troubleshoot and repair Lakeflow Jobs
Troubleshoot Spark jobs and notebooks
Implement log streaming with Azure Log Analytics