Google Cloud introduces proxy models to accelerate LLM-powered SQL functions and reduce their cost.
- Proxy models replace LLM calls with pre-computed embeddings and lightweight logistic regression classifiers
- Achieves roughly 100x speedup and 1000x cost reduction compared to direct LLM invocations
- Proxy models are trained automatically on sampled data, with quality evaluated before or during query execution, in BigQuery and AlloyDB
- Maintains 90-116% relative accuracy on semantic filtering but fails on complex reasoning tasks
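The article does not include implementation details, but the core idea behind a proxy model can be sketched as follows: call the expensive LLM only on a small sample of rows, use its yes/no answers as labels, and train a logistic regression over pre-computed embeddings to answer the same semantic filter for all remaining rows. The embeddings and labels below are synthetic stand-ins, not Google Cloud's actual pipeline.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Stand-ins for pre-computed row embeddings; in a real system these
# would come from an embedding model, one vector per table row.
n_rows, dim = 1000, 64
embeddings = rng.normal(size=(n_rows, dim))

# Stand-in for LLM semantic-filter answers: here we fabricate labels
# from a hidden linear rule so the sketch is self-contained.
hidden_direction = rng.normal(size=dim)
llm_labels = (embeddings @ hidden_direction > 0).astype(int)

# The LLM is invoked only on a small training sample (the costly step);
# the lightweight proxy is fit on those sampled labels.
sample = rng.choice(n_rows, size=200, replace=False)
proxy = LogisticRegression(max_iter=1000)
proxy.fit(embeddings[sample], llm_labels[sample])

# At query time the proxy filters every row with no further LLM calls.
predictions = proxy.predict(embeddings)
accuracy = (predictions == llm_labels).mean()
print(f"proxy agreement with LLM labels: {accuracy:.2%}")
```

Because the proxy is just a linear model over embeddings, inference is a single matrix-vector product per row, which is where the reported orders-of-magnitude speed and cost gains would come from; the trade-off is that it can only mimic judgments expressible in the embedding space, consistent with the article's note that complex reasoning tasks fail.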
This summary was automatically generated by AI based on the original article and may not be fully accurate.