67 articles
TabPFN, by Prior Labs, applies the pre-trained LLM paradigm to tabular data, removing the need for traditional ML preprocessing and per-task training.
Spark Declarative Pipelines (SDP) extends declarative data processing from individual queries to entire pipelines in Apache Spark, reducing operational burden for data engineering teams.
Databricks Genie now supports enterprise OAuth to embed natural-language data analytics into Microsoft Teams and custom web apps.
This post covers Predictive Optimization (PO) in Databricks Unity Catalog, which became the platform default in 2025 and handles lakehouse table maintenance autonomously.
Spotify explains why it maintains separate tech stacks for personalization and experimentation rather than combining them into one system.
Spotify shares how they designed reliable background coding agents ("Honk") using strong verification loops to minimize incorrect or broken pull requests at scale.
Spotify shares lessons from building a background coding agent ("Honk") that automates large-scale code migrations by generating mergeable pull requests across thousands of repositories.