|AI

How Pinterest Built a Real‑Time Radar for Violative Content using AI

2025-12-08

10 min read

by Pinterest Engineering

Tags:

engineering

foundational-model

trust-and-safety

Read Original

Get the latest tech trends every morning

Receive daily AI-curated summaries of engineering articles from top tech companies worldwide.

Endigest AI Core Summary

Pinterest built a real-time AI-assisted system to measure the prevalence of policy-violating content by tracking what users actually see, not just what they report.

•Prevalence is defined as the share of daily user impressions that include policy-violating content, reported with 95% confidence intervals
•ML-assisted weighted reservoir sampling prioritizes high-risk, high-impression content while using inverse-probability weighting to keep estimates unbiased
•A multimodal LLM (vision + text) labels sampled content at scale, running 15x faster and at significantly lower cost than human-only review
•LLM quality is validated against human-labeled gold sets and monitored for model drift to maintain alignment with current policy
•The system breaks down prevalence by policy area, sub-policy, surface (Homefeed, Search, etc.), geography, and user age buckets

How Pinterest Built a Real‑Time Radar for Violative Content using AI

Get the latest tech trends every morning

Endigest AI Core Summary

Related Articles

Developer's guide to Gemini Enterprise and A2UI integration

Boston Children’s uses AI to unlock new diagnoses

How Braintrust turns customer requests into code with Codex

May 29, 2026