Dropbox Tech Blog  logo Dropbox Tech Blog
|Machine Learning

Inside the feature store powering real-time AI in Dropbox Dash

2025-12-18
10 min read
0
by Hicham Badri,Appu Shaji,Craig Wilhite,Josh Clemm,Jason Shang,Artem Nabirkin

Endigest AI Core Summary

Dropbox Dash built a custom hybrid feature store to power real-time AI ranking across tens of thousands of work documents.

  • Infrastructure spans two environments (on-premises low-latency and cloud-based Spark), ruling out standard cloud-native feature stores
  • Feast was chosen as the orchestration layer; its Python serving path was replaced with a Go service achieving p95 latencies of 25-35ms
  • Dynovault (in-house DynamoDB-compatible storage) co-located with inference workloads delivers ~20ms client-side latency
  • A three-part ingestion system (batch, streaming, direct writes) keeps features fresh, with change detection cutting batch update times from 1+ hour to under 5 minutes
  • Only 1-5% of feature values change per 15-minute window; targeting that subset reduced write volumes from hundreds of millions to under one million records per run
Tags:
#LLM
#AI
#Machine Learning
#Dash