
When it helps, when it hurts, how to pick columns, and what KPIs to measure. Introduction: Why the Hype Around…

Optimal file sizes, compaction strategies, and how to keep your Delta tables lightning-fast. Why Small Files Are a Big Problem…

Here’s a simple line-by-line summary of the Serverless compute plane networking (08/04/2025): General idea; Serverless egress control (outbound connections); Network…

Here’s a simple, line-by-line summary of the Serverless compute limitations (09/29/2025): General limitations; Streaming limitations; Machine learning limitations; Notebook limitations…

Best practices for serverless compute: Big picture; Before you migrate; Ingesting data (getting data in); Querying external data (without moving…

Here’s a simple line-by-line summary of the important points from the Serverless Compute release notes (09/24/2025)…

How to detect skew, leverage AQE, use repartitioning patterns, and tune the shuffle service for blazing-fast jobs. Why Shuffle Is…

Benchmarking SQL/Delta workloads, common pitfalls, and a practical migration checklist. Why Photon Exists: Databricks introduced Photon, a vectorized query engine…

How to set min/max nodes, use termination settings, mix spot/preemptible nodes, and avoid “yo-yo” scaling. Why Autoscaling Is Tricky: Autoscaling…

How to pick nodes, cores, memory, and disk for ETL vs. ML vs. SQL—and when to scale up vs. out…

Lakehouse Federation lets you query external databases directly from Databricks—without copying the data into your lake. You point Unity Catalog…

What is a streaming table in Databricks? A streaming table is a Delta table that Databricks keeps up-to-date automatically as…

View vs. Materialized View. What it is: a view is a stored query whose results are computed each time you query it; a materialized view is a precomputed…

What is Databricks SQL (DBSQL)? Databricks SQL is the lakehouse analytics experience for running SQL, building dashboards, alerts, and jobs…

What “serverless” means on Databricks: Serverless lets you run Databricks workloads without creating or sizing clusters. Databricks provisions, scales, patches,…

Delta Sharing lets you publish live, governed data to other teams or external partners—without copying files or building custom APIs…

Why workspace‑catalog binding? By default, any workspace attached to the same Unity Catalog metastore can see and access catalogs (subject…

What is column‑level masking? Column masks hide or transform sensitive values (PII/PCI/PHI) at query time. Every read of a masked…

What you’ll learn. Prereqs: Unity Catalog enabled; DBR 12.2 LTS+ for reads (see notes for dedicated compute), and a SQL…

Secrets (passwords, keys, tokens) should never live in notebooks or job configs. Databricks gives you a built-in, governed place to…