TabPFN, by Prior Labs, applies the pre-trained LLM paradigm to tabular data, removing the need for traditional ML preprocessing and per-task training.

Platform

Partners

Databricks

21 min read

Data Engineering•2026-02-23

Spark Declarative Pipelines: Why Data Engineering Needs to Become End-to-End Declarative

Spark Declarative Pipelines (SDP) extends declarative data processing from individual queries to entire pipelines in Apache Spark, reducing operational burden for data engineering teams.

Data Engineering•2026-02-19

Use Genie Everywhere with Enterprise OAuth

Databricks Genie now supports enterprise OAuth to embed natural-language data analytics into Microsoft Teams and custom web apps.

Data Engineering•2026-02-18

Predictive Optimization at Scale: A Year of Innovation and What’s Next

This post covers Databricks' Predictive Optimization (PO) in Unity Catalog, which became the default platform behavior in 2025 for autonomous lakehouse table maintenance.

Architecture•2026-01-07

Why We Use Separate Tech Stacks for Personalization and Experimentation

Spotify explains why they maintain separate tech stacks for personalization and experimentation rather than combining them into one system.

Background Coding Agents: Predictable Results Through Strong Feedback Loops (Honk, Part 3)

Spotify shares how they designed reliable background coding agents ("Honk") using strong verification loops to minimize incorrect or broken pull requests at scale.

Background Coding Agents: Context Engineering (Honk, Part 2)

Spotify shares lessons from building a background coding agent ("Honk") that automates large-scale code migrations by generating mergeable pull requests across thousands of repositories.