EC2 instances

You can spin up a Cortex cluster on a variety of AWS instance types. If you are unsure which instance type to pick, use the options below as a starting point. This guide is not comprehensive; refer to the AWS documentation for the full list of instance types and their specifications.

T3 instances

T3 instances are useful for development clusters that primarily run model inferences with low compute and memory resource utilization.

Example: iris classification with scikit-learn
Compute: low
Memory: low
GPU: no
AWS cost: starts at $0.0052 per hour*

C5 instances

C5 instances are useful for clusters that primarily run model inferences with medium compute and low memory resource utilization.

Example: language identification with fastText
Compute: medium
Memory: low
GPU: no
AWS cost: starts at $0.085 per hour*

M5 instances

M5 instances are useful for clusters that primarily run model inferences with low compute and medium memory resource utilization.

Example: MPG estimation with scikit-learn
Compute: low
Memory: medium
GPU: no
AWS cost: starts at $0.096 per hour*

G4 instances

G4 instances are useful for clusters that primarily run model inferences with high compute and medium memory resource utilization that can run on GPUs.

Example: image classification with Inception
Compute: high
Memory: medium
GPU: yes
AWS cost: starts at $0.526 per hour*

P2 instances

P2 instances are useful for clusters that primarily run model inferences with high compute and high memory resource utilization that can run on GPUs.

Example: text generation with GPT-2
Compute: high
Memory: high
GPU: yes
AWS cost: starts at $0.900 per hour*

* On-demand pricing for the US West (Oregon) AWS region.
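The profiles above can be condensed into a small lookup. The following is a minimal sketch, not part of Cortex: the names `Family`, `FAMILIES`, `suggest_family`, and `monthly_cost` are hypothetical, the "meets or exceeds" matching rule is an assumption, and the 730 hours per month figure is an approximation. Prices are the starting on-demand prices listed above, which apply to the smallest size in each family and may have changed since.

```python
from typing import NamedTuple, Optional


class Family(NamedTuple):
    name: str
    compute: str           # "low", "medium", or "high"
    memory: str            # "low", "medium", or "high"
    gpu: bool
    starting_price: float  # USD per hour, on-demand, US West (Oregon)


# Values copied from the sections above.
FAMILIES = [
    Family("T3", "low", "low", False, 0.0052),
    Family("C5", "medium", "low", False, 0.085),
    Family("M5", "low", "medium", False, 0.096),
    Family("G4", "high", "medium", True, 0.526),
    Family("P2", "high", "high", True, 0.900),
]

LEVELS = {"low": 0, "medium": 1, "high": 2}


def suggest_family(compute: str, memory: str, gpu: bool) -> Optional[Family]:
    """Return the cheapest listed family that meets or exceeds the profile.

    Assumption: a family qualifies if its GPU flag matches and its compute
    and memory levels are at least the requested ones. Returns None when no
    listed family qualifies.
    """
    candidates = [
        f for f in FAMILIES
        if f.gpu == gpu
        and LEVELS[f.compute] >= LEVELS[compute]
        and LEVELS[f.memory] >= LEVELS[memory]
    ]
    if not candidates:
        return None
    return min(candidates, key=lambda f: f.starting_price)


def monthly_cost(family: Family, instances: int = 1, hours: float = 730) -> float:
    """Rough lower bound on monthly cost at the family's starting price."""
    return family.starting_price * instances * hours


if __name__ == "__main__":
    choice = suggest_family("medium", "low", gpu=False)
    if choice is not None:
        print(choice.name, round(monthly_cost(choice, instances=2), 2))
```

Here `suggest_family("medium", "low", gpu=False)` selects the C5 entry, and a `None` result means none of the families listed here fits, in which case the AWS documentation is the place to look. Treat the cost figure as a lower bound, since larger sizes within a family cost more than the starting price.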
