
Dynamic stack linking #109

Closed
joeduffy opened this issue Mar 2, 2017 · 8 comments
Labels
p1 A bug severe enough to be the next item assigned to an engineer

@joeduffy
Member

joeduffy commented Mar 2, 2017

Our current model acts more like static linking, in that the resources for dependencies become part of the resource topology in the consuming library. This is okay in some cases -- like when you're building a larger topology out of other smaller 1st party components -- but not in other cases -- like when you are stitching together a complex topology built out of other separately deployed pieces.

As a concrete example, imagine you've factored your overall stack into three key pieces: 1) at the very bottom, one for infrastructure; 2) in the middle, another one for persistent state (presumably parameterized so that restoration from backup state is possible); and 3) at the top, services and applications. Each is revved and deployed independently -- with increasing frequency from bottom-to-top -- ideally in a way that isolates impacts to the other layers as much as possible.

So we will definitely want the equivalent to dynamic linking. But this is pretty fundamental.

As step 1, we should model resource dependencies using URIs. This affords some flexibility, at least conceptually, in resolving them to physical resources. It also gives us guidance from existing schemes for versioning (make a new URI), redirects, and so on. This presumably means we need some way of resolving inter-stack dependencies, however -- e.g., a Pulumi "DNS" server.

Even after that, many complexities remain; for example, infrastructure is not always perfectly versionable without a cascading impact to dependencies. A change to the infrastructure that necessitates rebuilding and redeploying the database and/or application tier, for instance, needs to carry that knowledge in a way that at least notifies the operator, if not actually doing something automatically. Even changes that merely alter output values that might have been depended upon, versus destroying and recreating resources, will have similar cascading impacts.

I'm putting this in S10 for consideration. I'm not yet certain when to bite this off, but my inclination is "soon" since it will have some fairly fundamental architectural impacts that we want to front-load.
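As a rough illustration of the URI idea above, here is a small sketch of building and parsing versioned resource URNs. The helper names and the `@version` suffix are purely hypothetical, not an actual Pulumi API; only the `urn:coconut:<name>` scheme comes from the commit referenced below.

```typescript
// Hypothetical helpers illustrating URI/URN-based resource references.
// "Versioning by making a new URI" is modeled here by encoding an
// optional version into the URN itself; the suffix syntax is illustrative.

interface ResourceUrn {
    name: string;
    version?: string;
}

function makeUrn(ref: ResourceUrn): string {
    return ref.version === undefined
        ? `urn:coconut:${ref.name}`
        : `urn:coconut:${ref.name}@${ref.version}`;
}

function parseUrn(urn: string): ResourceUrn {
    const prefix = "urn:coconut:";
    if (!urn.startsWith(prefix)) {
        throw new Error(`not a coconut URN: ${urn}`);
    }
    const rest = urn.slice(prefix.length);
    const at = rest.lastIndexOf("@");
    return at === -1
        ? { name: rest }
        : { name: rest.slice(0, at), version: rest.slice(at + 1) };
}
```

A resolver (the hypothetical "DNS" server) would then map such URNs to physical resources, with a version bump producing a distinct URN rather than mutating an existing binding.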

@joeduffy joeduffy added this to the S10 milestone Mar 2, 2017
joeduffy added a commit that referenced this issue Mar 3, 2017
This change is mostly just a rename of Moniker to URN.  It does also
prefix resource URNs to have a standard URN namespace; in other words,
"urn:coconut:<name>", where <name> is the same as the prior Moniker.

This is a minor step that helps to prepare us for #109.
@joeduffy
Member Author

joeduffy commented Mar 6, 2017

This could also be a premium feature.

@joeduffy joeduffy modified the milestones: S3, S2 May 24, 2017
@joeduffy joeduffy modified the milestones: 0.5, 0.3 Jun 5, 2017
joeduffy added a commit that referenced this issue Jun 20, 2017
This change implements the `get` function for resources.  Per #83,
this allows Lumi scripts to actually read from the target environment.

For example, we can now look up a SecurityGroup from its ARN:

    let group = aws.ec2.SecurityGroup.get(
        "arn:aws:ec2:us-west-2:153052954103:security-group:sg-02150d79");

The returned object is a fully functional resource object.  So, we can then
link it up with an EC2 instance, for example, in the usual ways:

    let instance = new aws.ec2.Instance(..., {
        securityGroups: [ group ],
    });

This didn't require any changes to the RPC or provider model, since we
already implement the Get function.

There are a few loose ends; two are short term:

    1) URNs are not rehydrated.
    2) Query is not yet implemented.

One is mid-term:

    3) We probably want a URN-based lookup function.  But we will likely
       wait until we tackle #109 before adding this.

And one is long term (and subtle):

    4) These amount to I/O and are not repeatable!  A change in the target
       environment may cause a script to generate a different plan
       intermittently.  Most likely we want to apply a different kind of
       deployment "policy" for such scripts.  These are inching towards the
       scripting model of #121, which is an entirely different
       beast than the repeatable immutable infrastructure deployments.

Finally, it is worth noting that with this, we have some of the fundamental
underpinnings required to finally tackle "inference" (#142).
@joeduffy joeduffy removed this from the 0.5 milestone Aug 28, 2017
@joeduffy joeduffy added this to the 0.8 milestone Oct 10, 2017
@joeduffy
Member Author

This won't block our customer conversions this sprint, since we can just use strings and other workarounds. However, we should think about the design here, and if something obvious arises, we should do it. Either way, let's try to end the sprint with an idea of what to do in 0.9.

@joeduffy
Member Author

One idea I had here is to use JavaScript exports to define what is available across stacks.

For example, let's say that I have a stack, mystack, that defines a queue I want to consume elsewhere. To make it accessible, I simply say export:

export let queue = new aws.sqs.Queue("requests", ...);

Now in the consuming stack, I can say something like this:

let queue = aws.sqs.Queue.discover("mystack", "requests");

This may be an "abuse" of the module system, but I think I like it. If you prefer static linking, go ahead and statically link; if you prefer dynamic linking, you can do that using discover.

@lukehoban
Member

If you are referring to it by the urnName requests, why would you need to require exporting it from the source stack? Is that just to allow selectively exposing only a subset of names? It does feel like a strange abuse of the module system (which we may want to abuse for a variety of other purposes in the future :-)) -- especially since the export name is not even how it's being exported.

An alternative might be just pulumi.export("exportedname", queue), which also allows using a stable export name instead of directly referencing the internal urnName.

@joeduffy
Member Author

We found a workaround for this, and so I'm pushing this to 0.9.

@joeduffy joeduffy modified the milestones: 0.8, 0.9 Oct 19, 2017
@joeduffy joeduffy modified the milestones: 0.9, 0.11 Nov 13, 2017
@joeduffy joeduffy modified the milestones: 0.11, 0.14 Feb 12, 2018
@joeduffy joeduffy modified the milestones: 0.14, 0.16 Mar 24, 2018
@briandrennan

A change to the infrastructure that necessitates rebuilding and redeploying the database and/or application tier, for instance, needs to carry that knowledge in a way that at least notifies the operator, if not actually doing something automatically.

Isn't this a bit of an orchestration problem? Given the example of a non-compatible database migration, wouldn't it be possible to do something along the lines of creating a separate Pulumi program that handles provisioning the new infrastructure, then setting up a task (AWS Lambda, Azure Serverless, etc.) to handle the migration, and then making the relevant changes in the primary stack?

I'm not sure if this helps, but I'm dealing with something that I think is similar to what you've described @joeduffy: http://michaeljswart.com/2012/04/modifying-tables-online-part-1-migration-strategy/

@joeduffy joeduffy assigned pgavlin and unassigned mmdriley and joeduffy Jul 4, 2018
@jnancel

jnancel commented Jul 5, 2018

How about making some kind of authorization system for stacks to read the checkpoints of other stacks? In that case, there's no need to export things from the source stack -- only to authorize access. In the destination stack, you then reference the source stack whose checkpoint you want to read, then the name of the resource, then the output you need.
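A minimal sketch of that idea, assuming the checkpoint is available as parsed JSON and that authorization is just a list of reader stacks. All of the type and function names here are hypothetical, not an actual Pulumi API, and the real checkpoint format is much richer than this.

```typescript
// Hypothetical, simplified checkpoint shapes for illustration only.
interface CheckpointResource {
    urn: string;
    outputs: Record<string, unknown>;
}

interface Checkpoint {
    stack: string;
    readers: string[];              // stacks authorized to read this checkpoint
    resources: CheckpointResource[];
}

// Read a single output of a named resource from another stack's checkpoint,
// refusing unless the consuming stack has been granted access.
function readStackOutput(
    checkpoint: Checkpoint,
    consumer: string,
    resourceUrn: string,
    output: string,
): unknown {
    if (!checkpoint.readers.includes(consumer)) {
        throw new Error(`${consumer} is not authorized to read ${checkpoint.stack}`);
    }
    const res = checkpoint.resources.find(r => r.urn === resourceUrn);
    if (res === undefined || !(output in res.outputs)) {
        throw new Error(`no output ${output} on ${resourceUrn}`);
    }
    return res.outputs[output];
}
```

The appeal of this shape is that the source stack opts in once (the authorization) rather than having to enumerate every exported value up front.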

@lukehoban lukehoban modified the milestones: 0.16, 0.17 Jul 26, 2018
@lukehoban lukehoban modified the milestones: 0.17, 0.18 Aug 19, 2018
@lukehoban lukehoban modified the milestones: 0.18, 0.19 Sep 13, 2018
@joeduffy joeduffy changed the title Dynamic linking Dynamic stack linking Oct 29, 2018
@joeduffy joeduffy added the p1 A bug severe enough to be the next item assigned to an engineer label Nov 2, 2018
pgavlin added a commit that referenced this issue Nov 8, 2018
These changes add a new API to the Pulumi SDK, `service.getStack`, that
returns the outputs of a given stack. The Pulumi account performing the
deployment that calls this API must have access to the indicated stack
or the call will fail.

This API is implemented as an invoke that is implemented by a builtin
provider managed by the engine. This provider will be used for any
custom resources and invokes inside the `pulumi:pulumi` module.
Currently this provider's API is exactly the `pulumi:pulumi:getStack`
invoke.

This is the short-term fix for #109.
pgavlin added a commit that referenced this issue Nov 8, 2018
pgavlin added a commit that referenced this issue Nov 9, 2018
@pgavlin
Member

pgavlin commented Nov 9, 2018

Here are the proposed approaches:

Short term

We will introduce a new custom resource type, pulumi.service.StackReference, which accepts a stack name as an argument and exposes the stack's output properties. Limiting the returned data to outputs maintains a useful encapsulation boundary: the specific implementation details of the upstream stack are not exposed to the downstream stack. Using a custom resource provides a number of benefits:

  • Resources are stored in the checkpoint file. This gives us a simple way in the near term to tell whether or not a stack depends on other stacks from the checkpoint file alone. It does not, however, allow us to easily discover whether or not a stack is depended on by other stacks.
  • Resources have lifecycles. This allows us to easily determine when new references are created and old references are removed. In the medium-term, we could perform actual CRUD operations in the resource's lifecycle methods in order to reify these stack dependencies in the Pulumi service.
  • Custom resources are implemented by language-independent providers rather than language-dependent SDKs. This allows us to write the underlying code once. It can then be exposed rather easily to JS, Go, Python, etc.

We will also update the Pulumi console and Pulumi CLI to display stack references in a richer manner than normal resources.

Below is an example of three stacks that are layered using StackReference. The first stack creates a VPC, the second stack creates an EKS cluster in the VPC, and the third stack deploys a Helm chart to the EKS cluster.

base-vpc stack:

```typescript
import * as awsinfra from "@pulumi/aws-infra";

// Create and export a VPC.
const vpc = new awsinfra.Network("network");
export const network = vpc;
```

eks-cluster stack:

```typescript
import * as pulumi from "@pulumi/pulumi";
import * as eks from "@pulumi/eks";

// Import the base VPC from our base stack.
const baseVpc = new pulumi.service.StackReference("myorg/base-vpc").output("network");

// Create an EKS cluster.
const cluster = new eks.Cluster("cluster", {
    vpcId: baseVpc.apply(vpc => vpc.vpcId),
    subnetIds: baseVpc.apply(vpc => vpc.subnetIds),
});

// Export the cluster's kubeconfig.
export const kubeconfig = cluster.kubeconfig;
```

app-hackmd stack:

```typescript
import * as pulumi from "@pulumi/pulumi";
import * as k8s from "@pulumi/kubernetes";

// Import the EKS cluster stack.
const kubeconfig = new pulumi.service.StackReference("myorg/eks").output("kubeconfig");

// Create a k8s provider that targets the EKS cluster.
const k8sProvider = new k8s.Provider("eks", { kubeconfig: kubeconfig.apply(JSON.stringify) });

// Deploy the HackMD Helm chart.
const hackmd = new k8s.helm.v2.Chart("hackmd", {
    repo: "stable",
    chart: "hackmd",
    version: "0.1.1",
    values: {
        service: {
            type: "LoadBalancer",
            port: 80,
        },
    },
}, { providers: { kubernetes: k8sProvider } });
```

Medium term

We will flesh out the StackReference type:

  • We will add support for configuring the backend used to fetch the stack (e.g. service vs. local, API token, etc.)
  • We will flesh out the CRUD operations to expose reified stack dependencies in the service and update the console and CLI to display a list of a stack's dependent stacks.
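One way the configurable backend could look, sketched with an in-memory backend standing in for the service and local implementations. The interface and class names here are hypothetical, not the actual SDK surface.

```typescript
// Hypothetical backend abstraction: anything that can resolve a stack's
// outputs -- the Pulumi service, a local file backend, etc.
interface StackBackend {
    getStackOutputs(stackName: string): Record<string, unknown>;
}

// An in-memory backend for illustration and testing.
class InMemoryBackend implements StackBackend {
    constructor(private stacks: Record<string, Record<string, unknown>>) {}

    getStackOutputs(stackName: string): Record<string, unknown> {
        const outputs = this.stacks[stackName];
        if (outputs === undefined) {
            throw new Error(`unknown stack: ${stackName}`);
        }
        return outputs;
    }
}

// A StackReference that accepts the backend as configuration, rather than
// being hard-wired to a single resolution mechanism.
class StackReference {
    private outputs: Record<string, unknown>;

    constructor(stackName: string, backend: StackBackend) {
        this.outputs = backend.getStackOutputs(stackName);
    }

    output(name: string): unknown {
        return this.outputs[name];
    }
}
```

Separating the backend behind an interface keeps the StackReference programming model identical regardless of where the upstream stack's state actually lives.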

pgavlin added a commit that referenced this issue Nov 12, 2018

7 participants