Palmyra X4
Writer — Palmyra X4
Model Details
Palmyra X4 is Writer's enterprise LLM optimized for business writing, content generation, and knowledge work with strong instruction following. For more information about model development and performance, see the model/service card
Model launch date: Sep 2024
Model EOL date: No sooner than 4/28/2026
End User License Agreements and Terms of Use: View
Model lifecycle: Active
Context window: 128K tokens
Max output tokens: 8K
| Input Modalities | Output Modalities | APIs supported |
Endpoints supported |
|---|---|---|---|
Responses | bedrock-runtime | ||
Chat Completions | bedrock-mantle | ||
Invoke | |||
Converse | |||
Capabilities and Features
Bedrock Features
Features supported using bedrock-runtime endpoint
Pricing
For pricing, please refer to the Amazon Bedrock Pricing
Programmatic Access
Use the following model IDs and endpoint URLs to access this model programmatically. For more information about the available APIs and endpoints, see APIs supported
| Endpoint | Model ID | In-Region endpoint URL | Geo inference ID | Global inference ID |
|---|---|---|---|---|
bedrock-runtime |
writer.palmyra-x4-v1:0 |
https://bedrock-runtime.{region}.amazonaws.com |
us.writer.palmyra-x4-v1:0 |
Not supported |
For example, if region is us-east-1 (N. Virginia), then the bedrock-runtime endpoint URL will be "https://bedrock-runtime.us-east-1.amazonaws.com" and for bedrock-mantle will be "https://bedrock-mantle.us-east-1.api.aws/v1".
Service Tiers
Amazon Bedrock offers multiple service tiers to match your workload requirements. Standard provides pay-per-token access with no commitment. Priority offers higher throughput with a time-based commitment. Flex provides lower-cost access for flexible, non-time-sensitive workloads. Reserved provides dedicated throughput with a term commitment for predictable workloads. For more information, see service tiers
| Standard | Priority | Flex | Reserved |
|---|---|---|---|
Regional Availability
Regional availability at a glance
Bedrock offers three inference options: In-Region keeps requests within a single Region for strict compliance, Geo Cross-Region routes across Regions within a geography (US, EU, etc.) for higher throughput while respecting data residency, and Global Cross-Region routes anywhere worldwide for maximum throughput when there are no residency constraints. Refer to the Regional availability page for more details.
| Region | In-Region | Geo | Global |
|---|---|---|---|
us-east-1 (N. Virginia) | |||
us-east-2 (Ohio) | |||
us-west-1 (N. California) | |||
us-west-2 (Oregon) |
Geo inference details
Geo: US
Geo Inference ID: us.writer.palmyra-x4-v1:0
| Source Region | Destination Regions |
|---|---|
| us-east-1 (N. Virginia) | us-east-1 (N. Virginia), us-east-2 (Ohio), us-west-2 (Oregon) |
| us-east-2 (Ohio) | us-east-1 (N. Virginia), us-east-2 (Ohio), us-west-2 (Oregon) |
| us-west-1 (N. California) | us-east-1 (N. Virginia), us-east-2 (Ohio), us-west-1 (N. California), us-west-2 (Oregon) |
| us-west-2 (Oregon) | us-east-1 (N. Virginia), us-east-2 (Ohio), us-west-2 (Oregon) |
Sample Code
Step 1 - AWS Account: If you have an AWS account already, skip this step. If you are new to AWS, sign up for an AWS account
Step 2 - API key: Go to the Amazon Bedrock console
Step 3 - Get the SDK: To use this getting started guide, you must have Python already installed. Then install the relevant software depending on the APIs you are using.
pip install boto3
Step 4 - Set environment variables: Configure your environment to use the API key for authentication.
AWS_BEARER_TOKEN_BEDROCK="<provide your Bedrock API key>"
Step 5 - Run your first inference request: Save the file as bedrock-first-request.py