sagemaker-runtime

moto.sagemakerruntime.models.SageMakerRuntimeBackend

Example usage

python

@mock_sagemakerruntime def test_sagemakerruntime_behaviour: boto3.client("sagemaker-runtime") ...

Implemented features for this service

[X] invoke_endpoint

This call will return static data by default.

You can use a dedicated API to override this, by configuring a queue of expected results.

A request to get_query_results will take the first result from that queue. Subsequent requests using the same details will return the same result. Other requests using a different QueryExecutionId will take the next result from the queue, or return static data if the queue is empty.

Configuring this queue by making an HTTP request to /moto-api/static/sagemaker/endpoint-results. An example invocation looks like this:

python

expected_results = {

"account_id": "123456789012", # This is the default - can be omitted "region": "us-east-1", # This is the default - can be omitted "results": [ { "Body": "first body", "ContentType": "text/xml", "InvokedProductionVariant": "prod", "CustomAttributes": "my_attr", }, # other results as required ],

} requests.post( "http://motoapi.amazonaws.com:5000/moto-api/static/sagemaker/endpoint-results", json=expected_results, )

client = boto3.client("sagemaker", region_name="us-east-1") details = client.invoke_endpoint(EndpointName="asdf", Body="qwer")
[ ] invoke_endpoint_async
[ ] invoke_endpoint_with_response_stream