Add AWS EC2 instances #777

tnthornton · 2021-07-22T17:39:45Z

Description of your changes

Add ec2/instance

Fixes #85

I have:

Read and followed Crossplane's contribution process.
Run make reviewable test to ensure this PR is ready for review.

How has this code been tested

basic create/delete so far

## create
[add-ec2-instances][~/codez/provider-aws]$ kubectl apply -f examples/ec2/instance.yaml
instance.ec2.aws.crossplane.io/sample-instance36 created

## list
[add-ec2-instances][~/codez/provider-aws]$ kubectl get instance sample-instance36
NAME                READY   SYNCED   INSTANCES   RUNNING   AGE
sample-instance36   True    True     2           2         37m
[add-ec2-instances][~/codez/provider-aws]$ kubectl get instance sample-instance36 -o wide
NAME                READY   SYNCED   INSTANCES   RUNNING   AGE   ID                  PENDING   SHUTTINGDOWN   STOPPED   STOPPING   TERMINATED
sample-instance36   True    True     2           2         37m   sample-instance36   0         0              0         0          0

## delete
[add-ec2-instances][~/codez/provider-aws]$ kubectl delete instance sample-instance36
instance.ec2.aws.crossplane.io "sample-instance36" deleted
[add-ec2-instances][~/codez/provider-aws]$ kubectl get instance
No resources found
[add-ec2-instances][~/codez/provider-aws]$ kubectl get instance sample-instance36
Error from server (NotFound): instances.ec2.aws.crossplane.io "sample-instance36" not found

tnthornton · 2021-07-22T17:44:20Z

This is very much still a WIP. FWIW, I started from the vpc resource which is where you'll see a number references to vpc in the comments.

tnthornton · 2021-07-26T19:38:02Z

Alrighty, I think I got to a point where it would be good to understand what the community's desired outcome is.

Background

As @muvaf called out in #85 (comment), the AWS SDK API defines a RunInstances API for creating/running EC2 Instances, which means we can't generate most of the code through the ACK Generation Tool. 🤷

While manually implementing the API, most of it seems pretty straightforward until you get to maxCount and minCount (https://github.com/crossplane/provider-aws/pull/777/files#diff-4c32c59577ee9db86c31a77678689c5bc9128ae2a875b514ec393f12a67ffd62R197). These properties allow you to define the max/min number of instances to launch. In practice that's really no big deal; however given we're trying to externally manage these resources the behavior ends up being something we probably need to think about how to handle.

The behavior that I'm seeing is that each instance ends up with its own instanceID. It looks like in practice we use the resource ID we get back from AWS to keep track of the resource (and subsequently add it to the resource as the external-name).
Note there are other details about the individual instances that are different as well, for example IPv4 addresses, etc.

Problem

It's probably easiest to outline the issue in pictures.
Given a Instance definition like the following (note I added the EC2 Instance special tag to make it easier to pick out the instances for this example):

apiVersion: ec2.aws.crossplane.io/v1alpha1
kind: Instance
metadata:
  name: pr-example
spec:
  forProvider:
    region: us-east-1
    imageId: ami-0dc2d3e4c0f9ebd18
    maxCount: 4
    minCount: 1
    tags:
      - key: Name
        value: pr-example  
  providerConfigRef:
    name: example

We get the following:

[add-ec2-instances][~/codez/provider-aws]$ kubectl apply -f examples/ec2/instance.yaml
instance.ec2.aws.crossplane.io/pr-example created

[add-ec2-instances][~/codez/provider-aws]$ kubectl get instance pr-example
NAME         READY   SYNCED   INSTANCES   RUNNING   AGE
pr-example   True    True     4           4         3m

Note the different instanceIDs and different IPv4 addresses and DNS.

If we were just dealing with a handful of instances getting spun up, I think adding a slice of instanceIDs to the external-name annotation wouldn't be a big deal, however this pretty quickly gets into a state that doesn't scale well. Imagine someone asking for 100 instances, 1000 instances, 10k instances.

Solution

Currently, I've implemented grouping based on the external labels (tags) that are applied to the instances so I can observe the instances as a group and delete as a group through performing a DescribeInstanceInput that has a filter using those external labels. That all seems to work just fine.

Where I'm starting to get into behavior I'm not sure whether or not its desirable is in keeping track of what we observe about the instances. For example, should we be tracking the DNS and addresses for these instances in order to display that to the end user? If so, it seems like we can totally get into the problematic behavior I outlined above when we're talking about 100+ instances.

What are your thoughts?

cc: @negz @hasheddan @muvaf @jbw976

tnthornton · 2021-07-26T19:44:35Z

I guess one thought is if it's not possible to remotely ask for that many resources, the above issue is probably moot.

tnthornton · 2021-07-26T20:08:07Z

Doing some research on the side, it looks like in my example above I'm using On-Demand instances, specifying the m type (m1.small). If that is true, it looks like for the account that I deployed into (which could be different for others), the limit on the at account is:

but, for m1.small's (at least what I have provisioned so far), the vCPUs that are allocated are only 1:

So unless I'm chasing something off base, it looks like the given configuration could go as high as 512 before I hit the account limit (assuming those were the only instances running).

negz · 2021-07-26T20:51:37Z

@tnthornton my intuition here is that this is one of the cases in which we need to 'interpret' the API a little to make it a little more declarative, while still aiming for (mostly) high fidelity. I would not expose the MinCount and MaxCount parameters under spec.forProvider (or at all) and instead always call RunInstances with a max and min of 1. This way if someone wants three instances they must declaratively create three Instance resources, not one Instance resource.

I just took a quick look at how our friends over on the Terraform project handle this, and it appears they use a fixed max/min; https://github.com/hashicorp/terraform-provider-aws/blob/e9309698/aws/resource_aws_instance.go#L739

tnthornton · 2021-07-26T20:57:31Z

Awesome. That makes things easier.

tnthornton · 2021-07-26T21:06:32Z

While we're on the subject of 'interpreting' the API, there's this setting that's available that seems dangerous:

	// If you set this parameter to true, you can't terminate the instance using
	// the Amazon EC2 console, CLI, or API; otherwise, you can. To change this attribute
	// after launch, use ModifyInstanceAttribute (https://docs.aws.amazon.com/AWSEC2/latest/APIReference/API_ModifyInstanceAttribute.html).
	// Alternatively, if you set InstanceInitiatedShutdownBehavior to terminate,
	// you can terminate the instance by running the shutdown command from the instance.
	//
	// Default: false
	// +optional
	DisableAPITermination *bool `json:"disableAPITermination,omitempty"`

"Dangerous" in the sense that it could result in an unpleasant experience for the end user. Our friends over at Terraform include it https://github.com/hashicorp/terraform-provider-aws/blob/e9309698/aws/resource_aws_instance.go#L728, however arguably we have a slightly different use case and are less "fire and forget".

Thoughts?

AaronME · 2021-07-26T22:21:40Z

The MinCount/MaxCount was meant to act as a gate on the instance launches. From the docs

The minimum number of instances to launch. If you specify a minimum that is more instances than Amazon EC2 can launch in the target Availability Zone, Amazon EC2 launches no instances.

One of the use-cases would be launching a block of workers to perform a bulk data task and needing to ensure these instances all exist in the same az. Using RunInstances the deployment tool could search for an az with enough capacity to handle the processing.

If we felt like honoring the behavior, we could implement "RunInstances" as an "order for instances," with no promise to manage the instances that are created. We could capture the instancesSet that is returned when the order is fulfilled. The user could capture this InstanceSet and pass it to a TerminateInstances Resource for cleanup.

This pattern works for the AWS API, because a failed call to "RunInstances" leaves no artifacts. In crossplane there would be a number of "failed" "RunInstances" objects that require cleaning up.

As @negz already said, the Instance resource in should always be a single (min=1, max=1).

tnthornton · 2021-07-26T22:48:27Z

@AaronME that's an interesting idea. So in my mind that would be something similar to a replicaSet.

negz · 2021-07-26T23:03:24Z

I think we should stick with a single Instance for the time being - we could potentially consider a higher level set of instances as a separate concern in future if we see use cases arising.

While we're on the subject of 'interpreting' the API, there's this setting that's available that seems dangerous

If I'm following correctly, I think we can leave this one in. It seems like it's disabled by default, can be updated once set, and when enabled will prevent us from being able to delete the instance? We have a similar option exposed on the RDSInstance type per https://github.com/crossplane/provider-aws/blob/009f048/apis/database/v1beta1/rdsinstance_types.go#L291. If folks enable it, they must then disable it before they delete the resource.

tnthornton · 2021-07-26T23:08:13Z

I think we should stick with a single Instance for the time being - we could potentially consider a higher level set of instances as a separate concern in future if we see use cases arising.

Sounds good.

If I'm following correctly, I think we can leave this one in. It seems like it's disabled by default, can be updated once set, and when enabled will prevent us from being able to delete the instance? We have a similar option exposed on the RDSInstance type per https://github.com/crossplane/provider-aws/blob/009f048/apis/database/v1beta1/rdsinstance_types.go#L291. If folks enable it, they must then disable it before they delete the resource.

Yep, spot on. I couldn't quite find a similar approach (poor searching skills apparently) which was why I raised it.

Alrighty with that all cleared up, I'll adjust the current impl to focus on just the single Instance and then cut over to support Updates so if users do enable DisableAPITermination they can subsequently disable it again.

Signed-off-by: Taylor Thornton <taylor@upbound.io>

add additional instance counts to api output Signed-off-by: Taylor Thornton <taylor@upbound.io>

adds support for user defined tags adjusts how we are grouping instances from the special 'Name' tag to using the external resource labeling Signed-off-by: Taylor Thornton <taylor@upbound.io>

add DisableAPITermination adjust management approach to focus on a single Instance object rather than a group Signed-off-by: Taylor Thornton <taylor@upbound.io>

Signed-off-by: Taylor Thornton <taylor@upbound.io>

adds late-init flow Signed-off-by: Taylor Thornton <taylor@upbound.io>

Signed-off-by: Taylor Thornton <taylor@upbound.io>

…al as they aren't guaranteed to come back Signed-off-by: Taylor Thornton <taylor@upbound.io>

Signed-off-by: Taylor Thornton <taylor@upbound.io>

AaronME

@tnthornton update loop is gone. This is good to merge.

Thank you for the contribution!

negz · 2021-10-07T23:06:32Z

Thanks @tnthornton and @AaronME for working to get this merged - I think this will be huge for our users.

tnthornton marked this pull request as draft July 22, 2021 17:40

jbw976 mentioned this pull request Jul 23, 2021

Add support for Amazon EC2 Instances #85

Closed

tnthornton marked this pull request as ready for review July 28, 2021 22:08

tnthornton force-pushed the add-ec2-instances branch from ecc3b9a to 9b3683f Compare July 29, 2021 16:02

tnthornton requested review from hasheddan and muvaf July 29, 2021 18:05

tnthornton force-pushed the add-ec2-instances branch 2 times, most recently from b61051d to e6cf850 Compare August 1, 2021 15:12

AaronME added the size/L label Aug 12, 2021

tnthornton force-pushed the add-ec2-instances branch 6 times, most recently from 898e33f to 9e58b42 Compare September 13, 2021 22:11

AaronME self-assigned this Sep 24, 2021

AaronME requested review from AaronME and removed request for muvaf and hasheddan September 24, 2021 17:32

tnthornton added 23 commits October 4, 2021 13:50

adds support for specifying metadataOptions

63a80d2

Signed-off-by: Taylor Thornton <taylor@upbound.io>

adds support for specifying networkInterfaces

8c7a5ea

Signed-off-by: Taylor Thornton <taylor@upbound.io>

adds support for specifying placement

697cb8d

Signed-off-by: Taylor Thornton <taylor@upbound.io>

adds support for specifying privateIPAddress

d974015

Signed-off-by: Taylor Thornton <taylor@upbound.io>

add support for specifying securityGroups and subnetID

748c8dc

Signed-off-by: Taylor Thornton <taylor@upbound.io>

add support for adding securityGroups and a subnet through refs

74c2d39

Signed-off-by: Taylor Thornton <taylor@upbound.io>

minor cleanups

d375fe0

Signed-off-by: Taylor Thornton <taylor@upbound.io>

shore up based on make reviewable test

0c6e88c

Signed-off-by: Taylor Thornton <taylor@upbound.io>

expand how multiple instance states are handled

8e296aa

add additional instance counts to api output Signed-off-by: Taylor Thornton <taylor@upbound.io>

adds external resource labeling

3cac96a

adds support for user defined tags adjusts how we are grouping instances from the special 'Name' tag to using the external resource labeling Signed-off-by: Taylor Thornton <taylor@upbound.io>

remove min/max count per PR discussion

1910737

add DisableAPITermination adjust management approach to focus on a single Instance object rather than a group Signed-off-by: Taylor Thornton <taylor@upbound.io>

build out GenerateInstanceObservation

dbd4f09

Signed-off-by: Taylor Thornton <taylor@upbound.io>

built out observe flow

8d748ee

adds late-init flow Signed-off-by: Taylor Thornton <taylor@upbound.io>

add tests around generate helper functions

9b66bc2

Signed-off-by: Taylor Thornton <taylor@upbound.io>

fix common tests

9332d45

Signed-off-by: Taylor Thornton <taylor@upbound.io>

add update flow

ddebc5e

Signed-off-by: Taylor Thornton <taylor@upbound.io>

make generate to fix diff

05a290e

Signed-off-by: Taylor Thornton <taylor@upbound.io>

update resource definition to allow fields in atProvider to be option…

4adb4ee

…al as they aren't guaranteed to come back Signed-off-by: Taylor Thornton <taylor@upbound.io>

added storageversion to pass make reviewable

1ccfb36

Signed-off-by: Taylor Thornton <taylor@upbound.io>

remove duplicate type definition for instance

3d58a71

Signed-off-by: Taylor Thornton <taylor@upbound.io>

updates per PR comments

17efa3e

Signed-off-by: Taylor Thornton <taylor@upbound.io>

remove securityGroups from spec in favor of securityGroupIDs

75f0e03

Signed-off-by: Taylor Thornton <taylor@upbound.io>

IsInstanceUpToDate using incorrect SecGroups compare

51fb4cc

Signed-off-by: Taylor Thornton <taylor@upbound.io>

tnthornton force-pushed the add-ec2-instances branch from 368293a to 51fb4cc Compare October 4, 2021 20:50

update to current aws-sdk-go used in master

a6b9ff1

Signed-off-by: Taylor Thornton <taylor@upbound.io>

tnthornton force-pushed the add-ec2-instances branch from 1d0e155 to a6b9ff1 Compare October 4, 2021 22:54

AaronME approved these changes Oct 5, 2021

View reviewed changes

AaronME merged commit 76bc036 into crossplane-contrib:master Oct 5, 2021

tektondeploy pushed a commit to gtn3010/provider-aws that referenced this pull request Mar 12, 2024

Update CODEOWNERS file (crossplane-contrib#777)

4e147dc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add AWS EC2 instances #777

Add AWS EC2 instances #777

tnthornton commented Jul 22, 2021 •

edited

Loading

tnthornton commented Jul 22, 2021

tnthornton commented Jul 26, 2021 •

edited

Loading

tnthornton commented Jul 26, 2021

tnthornton commented Jul 26, 2021

negz commented Jul 26, 2021 •

edited

Loading

tnthornton commented Jul 26, 2021

tnthornton commented Jul 26, 2021 •

edited

Loading

AaronME commented Jul 26, 2021 •

edited

Loading

tnthornton commented Jul 26, 2021

negz commented Jul 26, 2021

tnthornton commented Jul 26, 2021

AaronME left a comment

negz commented Oct 7, 2021

Add AWS EC2 instances #777

Add AWS EC2 instances #777

Conversation

tnthornton commented Jul 22, 2021 • edited Loading

Description of your changes

How has this code been tested

tnthornton commented Jul 22, 2021

tnthornton commented Jul 26, 2021 • edited Loading

Background

Problem

Solution

tnthornton commented Jul 26, 2021

tnthornton commented Jul 26, 2021

negz commented Jul 26, 2021 • edited Loading

tnthornton commented Jul 26, 2021

tnthornton commented Jul 26, 2021 • edited Loading

AaronME commented Jul 26, 2021 • edited Loading

tnthornton commented Jul 26, 2021

negz commented Jul 26, 2021

tnthornton commented Jul 26, 2021

AaronME left a comment

Choose a reason for hiding this comment

negz commented Oct 7, 2021

tnthornton commented Jul 22, 2021 •

edited

Loading

tnthornton commented Jul 26, 2021 •

edited

Loading

negz commented Jul 26, 2021 •

edited

Loading

tnthornton commented Jul 26, 2021 •

edited

Loading

AaronME commented Jul 26, 2021 •

edited

Loading