
Setting up reward functions for open-weight models

Reward functions evaluate response quality and provide feedback signals for model training. You set up reward functions using custom Lambda functions, choosing an evaluation approach that matches your task requirements.

Custom Lambda functions for reward evaluation

Within your Lambda function, you have flexibility in how you implement the evaluation logic:

  • Objective tasks – For objective tasks like code generation or math reasoning, use verifiable rule-based graders that check correctness against known standards or test cases.

  • Subjective tasks – For subjective tasks like instruction following or chatbot interactions, call Amazon Bedrock foundation models as judges within your Lambda function to evaluate response quality based on your criteria.

Your Lambda function can implement complex logic, integrate external APIs, perform multi-step calculations, or combine multiple evaluation criteria depending on your task requirements.
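For instance, a minimal rule-based grader for a math task might extract the model's final numeric answer and compare it against a reference value carried in each record's metadata. The sketch below is illustrative, not prescriptive: the metadata key and the answer-extraction regex are assumptions, and the full input/output contract is described under Lambda function implementation details below.

import re

def lambda_handler(event, context):
    """Rule-based reward: exact-match grading of a final numeric answer.

    Assumes each record carries its ground truth under
    metadata["reference_answer"] (an illustrative key, not a required name).
    """
    results = []
    for record in event:
        # The assistant's reply is the last message in the conversation.
        response_text = record["messages"][-1]["content"]
        reference = str(record["metadata"]["reference_answer"])

        # Treat the last number in the response as the model's final answer.
        numbers = re.findall(r"-?\d+(?:\.\d+)?", response_text)
        predicted = numbers[-1] if numbers else None

        score = 1.0 if predicted == reference else 0.0
        results.append({
            "id": record["id"],
            "aggregate_reward_score": score,
            "metrics_list": [
                {"name": "exact_match", "value": score, "type": "Reward"}
            ],
        })
    return results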

Note

When using custom Lambda functions:

  • Increase the Lambda timeout from the default of 3 seconds; complex evaluations can require up to the 15-minute maximum (see the example after this note).

  • Your execution role needs permission to invoke the Lambda function, as described in Lambda permissions for reward functions.
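For example, assuming a function named my-reward-function (a placeholder), you can raise the timeout with the AWS SDK for Python:

import boto3

# Raise an existing reward function's timeout to the 15-minute maximum.
# "my-reward-function" is a placeholder name.
lambda_client = boto3.client("lambda")
lambda_client.update_function_configuration(
    FunctionName="my-reward-function",
    Timeout=900,  # seconds; 900 is the Lambda maximum (15 minutes)
)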

Lambda function implementation details

When implementing custom Lambda reward functions, your function must accept and return data in the following formats.

Input structure
[{ "id": "123", "messages": [ { "role": "user", "content": "Do you have a dedicated security team?" }, { "role": "assistant", "content": "As an AI developed by Amazon, I don not have a dedicated security team..." } ], "metadata": { "reference_answer": { "compliant": "No", "explanation": "As an AI developed by Company, I do not have a traditional security team..." }, "my_key": "sample-001" } }]
Output structure
[{ "id": "123", "aggregate_reward_score": 0.85, "metrics_list": [ { "name": "accuracy", "value": 0.9, "type": "Reward" }, { "name": "policy_compliance", "value": 0.8, "type": "Metric" } ] }]

Design guidelines

  • Rank responses – Give the best answer a clearly higher score

  • Use consistent checks – Evaluate task completion, format adherence, safety, and reasonable length

  • Maintain stable scaling – Keep scores normalized to a fixed range and avoid scoring rules the model can exploit (see the sketch after this list)
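As one way to follow these guidelines, the helper below combines several per-criterion checks into a single normalized aggregate; the criterion names and weights are illustrative assumptions, not part of the interface.

def aggregate_score(metrics, weights=None):
    """Combine per-criterion scores (each in [0, 1]) into one reward.

    Clamping each component keeps a single exploited criterion from
    dominating the aggregate; the weights here are illustrative.
    """
    weights = weights or {"task_completion": 0.5, "format": 0.2,
                          "safety": 0.2, "length": 0.1}
    total = sum(
        weights[name] * max(0.0, min(1.0, value))
        for name, value in metrics.items() if name in weights
    )
    return total / sum(weights.values())

# Example: a strong answer with a minor format issue.
print(aggregate_score({"task_completion": 1.0, "format": 0.5,
                       "safety": 1.0, "length": 0.9}))  # 0.89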