Pinterest Engineering Blog - Medium logoPinterest Engineering Blog - Medium
|AI

How Pinterest Built a Real‑Time Radar for Violative Content using AI

2025-12-08
10 min read
1
by Pinterest Engineering

Endigest AI Core Summary

Pinterest built a real-time AI-assisted system to measure the prevalence of policy-violating content by tracking what users actually see, not just what they report.

  • Prevalence is defined as the share of daily user impressions that include policy-violating content, reported with 95% confidence intervals
  • ML-assisted weighted reservoir sampling prioritizes high-risk, high-impression content while using inverse-probability weighting to keep estimates unbiased
  • A multimodal LLM (vision + text) labels sampled content at scale, running 15x faster and at significantly lower cost than human-only review
  • LLM quality is validated against human-labeled gold sets and monitored for model drift to maintain alignment with current policy
  • The system breaks down prevalence by policy area, sub-policy, surface (Homefeed, Search, etc.), geography, and user age buckets
Tags:
#ai
#engineering
#pinterest
#foundational-model
#trust-and-safety