All Tech Blogs Explore Tags Send Feedback

Endigest

© 2026 Endigest. All rights reserved.

About
Privacy
Terms
Contact
RSS

3x Faster Search: Parallel Test-Time Scaling with Instructed-Retriever-1 | Endigest

Databricks

|AI

3x Faster Search: Parallel Test-Time Scaling with Instructed-Retriever-1

2026-06-04

1 min read

1

Tags:

Mosaic Research

Get the latest tech trends every morning

Receive daily AI-curated summaries of engineering articles from top tech companies worldwide.

Email address

Endigest AI Core Summary

Anthropic introduces Instructed-Retriever-1 to accelerate Knowledge Assistant search through parallel test-time operations.

•Search latency drops 3x and answer generation 2x via parallel query generation and reranking
•Single model handles query and evidence ranking in parallel, maintaining low latency
•Trained on synthetic enterprise environments, matching Claude Sonnet 4.5 quality on KARLBench
•Uses Mixture-of-Experts with FP8 quantization and speculative decoding for efficient serving
•Achieves 81.0 nDCG@10 on real workloads with end-to-end latency under 10 seconds

This summary was automatically generated by AI based on the original article and may not be fully accurate.

Related Articles

Nemotron 3.5 Content Safety: Customizable Multimodal Safety for Global Enterprise AI

The Hacker News

Agentic AI Is Transforming Defense, But Only Secure IT Infrastructure Will Maximize It

How to Fine-Tune Nemotron 3.5 ASR for Your Language, Domain, or Accent

How Endava is redesigning software delivery around AI agents