Amazon logo with curved arrow from A to Z forming a smile. Amazon — Nova Lite Model Details Capabilities and Features Pricing Programmatic Access Service Tiers Regional Availability Quotas and Limits Sample Code

Nova Lite

Amazon — Nova Lite

Model Details

Nova Lite is Amazon's low-cost multimodal model that processes text, images, and video inputs for tasks like document analysis and visual Q&A. For more information about model development and performance, see the model/service card.

Model launch date: Dec 05, 2024
Model EOL date: No sooner than 12/4/2025
End User License Agreements and Terms of Use: View
Model lifecycle: Active
Context window: 300K tokens
Max output tokens: 5K
Knowledge cutoff: Oct 2024

Input Modalities	Output Modalities	APIs supported	Endpoints supported
Audio	Embedding	`Responses`	`bedrock-runtime`
Image	Image	`Chat Completions`	`bedrock-mantle`
Speech	Speech	`Invoke`
Text	Text	`Converse`
Video	Video

Capabilities and Features

Bedrock Features

Features supported using bedrock-runtime endpoint

Supported	Not Supported
Response streaming Guardrails Client-side tool calling	Structured outputs

Prompt caching using bedrock-runtime endpoint

For more information, see Prompt caching for faster model inference.

Prompt caching supported	Min tokens per cache checkpoint	Max cache checkpoints per request	Supported TTL	Fields that accept prompt cache checkpoints
Yes	1K*	4	5 minutes	`system` and `messages`

* Amazon Nova models support a maximum of 20K tokens for prompt caching. Prompt caching is primarily for text prompts.

Pricing

For pricing, please refer to the Amazon Bedrock Pricing page.

Programmatic Access

Use the following model IDs and endpoint URLs to access this model programmatically. For more information about the available APIs and endpoints, see APIs supported and Endpoints supported.

Endpoint Model ID In-Region endpoint URL Geo inference ID Global inference ID

Endpoint	Model ID	In-Region endpoint URL	Geo inference ID	Global inference ID
`bedrock-runtime`	`amazon.nova-lite-v1:0`	`https://bedrock-runtime.{region}.amazonaws.com`	`us.amazon.nova-lite-v1:0` `eu.amazon.nova-lite-v1:0`	Not supported

bedrock-runtime

amazon.nova-lite-v1:0

https://bedrock-runtime.{region}.amazonaws.com

us.amazon.nova-lite-v1:0

eu.amazon.nova-lite-v1:0

Not supported

For example, if region is us-east-1 (N. Virginia), then the bedrock-runtime endpoint URL will be "https://bedrock-runtime.us-east-1.amazonaws.com" and for bedrock-mantle will be "https://bedrock-mantle.us-east-1.api.aws/v1".

Service Tiers

Amazon Bedrock offers multiple service tiers to match your workload requirements. Standard provides pay-per-token access with no commitment. Priority offers higher throughput with a time-based commitment. Flex provides lower-cost access for flexible, non-time-sensitive workloads. Reserved provides dedicated throughput with a term commitment for predictable workloads. For more information, see service tiers.

Standard	Priority	Flex	Reserved

Regional Availability

Regional availability at a glance

Bedrock offers three inference options: In-Region keeps requests within a single Region for strict compliance, Geo Cross-Region routes across Regions within a geography (US, EU, etc.) for higher throughput while respecting data residency, and Global Cross-Region routes anywhere worldwide for maximum throughput when there are no residency constraints. Refer to the Regional availability page for more details.

Region	In-Region	Geo	Global
`us-east-1` (N. Virginia)
`us-east-2` (Ohio)
`us-west-1` (N. California)
`us-west-2` (Oregon)
`us-gov-west-1` (GovCloud)
`eu-central-1` (Frankfurt)
`eu-north-1` (Stockholm)
`eu-south-1` (Milan)
`eu-south-2` (Spain)
`eu-west-1` (Ireland)
`eu-west-2` (London)
`eu-west-3` (Paris)
`ap-northeast-1` (Tokyo)
`ap-southeast-2` (Sydney)
`ap-southeast-3` (Jakarta)
`il-central-1` (Tel Aviv)
`me-central-1` (UAE)

Geo inference details

Geo: US

Geo Inference ID: us.amazon.nova-lite-v1:0

Source Region	Destination Regions
us-east-1 (N. Virginia)	us-east-1 (N. Virginia), us-east-2 (Ohio), us-west-2 (Oregon)
us-east-2 (Ohio)	us-east-1 (N. Virginia), us-east-2 (Ohio), us-west-2 (Oregon)
us-west-1 (N. California)	us-east-1 (N. Virginia), us-east-2 (Ohio), us-west-1 (N. California), us-west-2 (Oregon)
us-west-2 (Oregon)	us-east-1 (N. Virginia), us-east-2 (Ohio), us-west-2 (Oregon)

Geo: EU

Geo Inference ID: eu.amazon.nova-lite-v1:0

Source Region	Destination Regions
eu-central-1 (Frankfurt)	eu-central-1 (Frankfurt), eu-north-1 (Stockholm), eu-west-1 (Ireland), eu-west-3 (Paris)
eu-north-1 (Stockholm)	eu-central-1 (Frankfurt), eu-north-1 (Stockholm), eu-west-1 (Ireland), eu-west-3 (Paris)
eu-south-1 (Milan)	eu-central-1 (Frankfurt), eu-north-1 (Stockholm), eu-south-1 (Milan), eu-west-1 (Ireland), eu-west-3 (Paris)
eu-south-2 (Spain)	eu-central-1 (Frankfurt), eu-north-1 (Stockholm), eu-south-2 (Spain), eu-west-1 (Ireland), eu-west-3 (Paris)
eu-west-1 (Ireland)	eu-central-1 (Frankfurt), eu-north-1 (Stockholm), eu-west-1 (Ireland), eu-west-3 (Paris)
eu-west-3 (Paris)	eu-central-1 (Frankfurt), eu-north-1 (Stockholm), eu-west-1 (Ireland), eu-west-3 (Paris)
il-central-1 (Tel Aviv)	eu-central-1 (Frankfurt), eu-north-1 (Stockholm), eu-south-1 (Milan), eu-west-1 (Ireland), eu-west-3 (Paris), il-central-1 (Tel Aviv)

Quotas and Limits

Your AWS account has default quotas to maintain the performance of the service and to ensure appropriate usage of Amazon Bedrock. The default quotas assigned to an account might be updated depending on regional factors, payment history, fraudulent usage, and/or approval of a quota increase request. For more details, please refer to Quotas for Amazon Bedrock documentation and see the limits for the model.

Sample Code

Step 1 - AWS Account: If you have an AWS account already, skip this step. If you are new to AWS, sign up for an AWS account.

Step 2 - API key: Go to the Amazon Bedrock console and generate a long-term API key.

Step 3 - Get the SDK: To use this getting started guide, you must have Python already installed. Then install the relevant software depending on the APIs you are using.


pip install boto3

Step 4 - Set environment variables: Configure your environment to use the API key for authentication.


AWS_BEARER_TOKEN_BEDROCK="<provide your Bedrock API key>"

Step 5 - Run your first inference request: Save the file as bedrock-first-request.py

Javascript is disabled or is unavailable in your browser.

To use the Amazon Web Services Documentation, Javascript must be enabled. Please refer to your browser's Help pages for instructions.

Document Conventions

Nova Sonic

Nova Micro