Announcing Amazon SageMaker Inference for custom Amazon Nova models
2026-02-16
6 min read
2
by Channy Yun (윤석찬)
Endigest AI Core Summary
Amazon SageMaker Inference now supports GA deployment of custom Amazon Nova models for production-grade inference.
- •Covers Nova Micro, Nova Lite, and Nova 2 Lite models with continued pre-training, SFT, or reinforcement fine-tuning
- •Uses EC2 G5/G6 instances (more cost-efficient than P5) with auto-scaling based on 5-minute usage patterns
- •Configurable parameters include context length, concurrency, batch size, temperature, top_p, and reasoning effort
- •Deploy via SageMaker Studio UI or SDK using create_model, create_endpoint_config, and create_endpoint APIs
- •Available in us-east-1 and us-west-2 with per-hour billing and no minimum commitment
Tags:
#Amazon Nova
#Amazon SageMaker AI
#Artificial Intelligence
#Featured
#Launch
#News
