Serverless endpoint operations - Amazon SageMaker AI

Serverless endpoint operations

Unlike other SageMaker AI real-time endpoints, Serverless Inference manages compute resources for you, so you can focus on your ML model rather than on infrastructure. This guide covers the core operations for serverless endpoints: creating, invoking, updating, describing, and deleting an endpoint. You can manage serverless endpoints with the SageMaker AI console, the AWS SDKs, the Amazon SageMaker Python SDK, or the AWS CLI.
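As a rough illustration of these operations with the AWS SDK for Python (Boto3), the following sketch wraps each call in a small helper. It is a minimal example under stated assumptions, not a complete deployment script: the helper names, the variant name `AllTraffic`, and the default memory size and concurrency values are illustrative choices, and the model named by `model_name` is assumed to already exist in your account. The helpers take the clients as arguments, so the block itself does not import Boto3.

```python
from typing import Any, Dict

# Memory sizes (in MB) that ServerlessConfig accepted at the time of writing.
# Treat this tuple as an assumption and confirm it against the current API reference.
VALID_MEMORY_SIZES_MB = (1024, 2048, 3072, 4096, 5120, 6144)


def serverless_variant(model_name: str, memory_size_in_mb: int = 2048,
                       max_concurrency: int = 5) -> Dict[str, Any]:
    """Build a production variant that uses ServerlessConfig instead of instances."""
    if memory_size_in_mb not in VALID_MEMORY_SIZES_MB:
        raise ValueError(f"unsupported memory size: {memory_size_in_mb}")
    return {
        "VariantName": "AllTraffic",  # illustrative name
        "ModelName": model_name,      # an existing SageMaker AI model
        "ServerlessConfig": {
            "MemorySizeInMB": memory_size_in_mb,
            "MaxConcurrency": max_concurrency,
        },
    }


def create_serverless_endpoint(sm, endpoint_name: str, config_name: str,
                               model_name: str) -> None:
    """Create an endpoint configuration, then an endpoint that uses it."""
    sm.create_endpoint_config(
        EndpointConfigName=config_name,
        ProductionVariants=[serverless_variant(model_name)],
    )
    sm.create_endpoint(EndpointName=endpoint_name, EndpointConfigName=config_name)


def describe_status(sm, endpoint_name: str) -> str:
    """Return the endpoint status, for example 'Creating' or 'InService'."""
    return sm.describe_endpoint(EndpointName=endpoint_name)["EndpointStatus"]


def invoke(runtime, endpoint_name: str, payload: bytes,
           content_type: str = "application/json") -> bytes:
    """Send one request to the endpoint and return the raw response body."""
    response = runtime.invoke_endpoint(
        EndpointName=endpoint_name, ContentType=content_type, Body=payload
    )
    return response["Body"].read()


def update_serverless_endpoint(sm, endpoint_name: str, new_config_name: str) -> None:
    """Point the endpoint at a different, already created endpoint configuration."""
    sm.update_endpoint(EndpointName=endpoint_name, EndpointConfigName=new_config_name)


def delete_serverless_endpoint(sm, endpoint_name: str) -> None:
    """Delete the endpoint. The endpoint configuration and model are left in place."""
    sm.delete_endpoint(EndpointName=endpoint_name)
```

To run the helpers against a real account, pass `boto3.client("sagemaker")` as `sm` and `boto3.client("sagemaker-runtime")` as `runtime`. Control-plane calls and invocations use different clients, which is why `invoke` takes its own argument. An endpoint must reach the `InService` status before it can be invoked.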