Explore real-world engineering experiences from top tech companies.
Cloudflare explains how it built infrastructure to run extra-large language models such as Kimi K2.5 on Workers AI, optimizing for agentic use cases with long contexts and frequent tool calls.
Cloudflare launches a unified AI inference layer providing single API access to 70+ models from 12+ providers, designed to address the challenges of building with multiple AI models and agents.
Databricks' partnership with Google Cloud enables enterprises to build production-ready agentic AI on governed data through native Gemini model integration.
Google Cloud showcases how production-grade AI and agentic platforms are transforming media and entertainment workflows through its partner ecosystem.
AI Search is a plug-and-play search primitive for agents that need to retrieve information from large documents.
Claude Opus 4.7 from Anthropic is now available on Vercel AI Gateway, optimized for complex agentic tasks.
Cloudflare Email Service enters public beta, enabling agents to use email as a native interface.
OpenAI introduces GPT-Rosalind, a frontier reasoning model designed for life sciences research.
GitLab Duo Agent Platform now supports Claude Opus 4.7, Anthropic's latest model, bringing improved reasoning and instruction adherence for agent workflows.
This article explores how code agents can assist in porting language models from the transformers library to MLX while maintaining high code quality standards.
GitLab 18.11 introduces AI agents and enhanced security features.
MaxText introduces Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) capabilities for post-training large language models on single-host TPU configurations.