Provide docs/standards about client/worker/proxy interactions in redeployable taskcluster by petemoore · Pull Request #128 · taskcluster/taskcluster-rfcs

petemoore · 2018-09-25T18:03:50Z

Changes are required for clients of taskcluster services when taskcluster becomes redeployable.

This RFC serves to define the standards by which taskcluster deployments will publish a manifest of their service definitions, how client generators will query and interpret this manifest and associated reference and schema documents in order to generate clients, what the architectural changes to generated clients will be, how users of those clients (workers, worker authentication proxies, command line tools, software libraries) will be adapted in order to use the new clients, and how generated clients and client generators will be built, released and deployed.

This entails changes to the way workers will share deployment / authentication proxy configuration information with tasks, versioning of the API / Event reference schemas, versioning of the services definitions based on those API and Event reference schemas, and the possibility for clients to connect to different taskcluster deployments.

…ons in redeployable taskcluster

djmitche · 2018-09-25T18:22:10Z

Is this going to include schemas for reference files and so on (like https://bugzilla.mozilla.org/show_bug.cgi?id=1476602)? If so, I'll stop working on that until this RFC is complete.

imbstack

Looking good so far.

rfcs/0128-redeployable-clients.md

djmitche · 2018-10-23T14:14:12Z

rfcs/0128-redeployable-clients.md

+
+```
+{
+  "version" : 1,


We have found it easier to just include a $schema property to indicate the version of a document.

djmitche · 2018-10-23T14:16:06Z

rfcs/0128-redeployable-clients.md

+    "<serviceName>": {
+      "api": true|false,
+      "exchanges": true|false,
+    },


This seems to omit the API version.

Why switch from relative URLs to true/false? Both are probably fine, but the URLs are a little more discoverable for people trying to navigate these files by hand.

imbstack · 2018-10-24T18:19:11Z

rfcs/0128-redeployable-clients.md

+example:
+
+```
+queue := queue.NewFromEnvVars()


Let's just leave env var stuff to the person using the library? I don't think we need to build in any alternative ways of configuring other than passing in creds probably.

I guess for node.js and python, that wouldn't be the end of the world. The problem comes when you have static language clients where the client gets compiled into some other binary that get used in other places (for example, the taskcluster-client is used internally by generic-worker, but also by taskcluster-proxy, for example). In the case of generic worker, for example, it will fetch the root URL from the generic-worker.config file, rather than the TASKCLUSTER_ROOT_URL environment variable. If it were for some reason set in the environment, and then it took it from there, it might not necessarily be the correct TASKCLUSTER_ROOT_URL to use.

Also consider the situation where e.g. somebody writes a tool that syncs role definitions between two taskcluster deployments. If internally the client uses the environment variable TASKCLUSTER_ROOT_URL to discover which cluster it needs to talk to, only one can be specified in that process. It kind of assumes it is only talking to one deployment.

When writing executable code, such as a command line tool or web service, where you know what your tool is and what it is being used for, and have a feeling about the context it might be called from, it seems reasonable to take config from environment variables (e.g. for writing a web service). You also know only one value is required for each parameter (unlike the sync example above where internally in the code multiple clients can be created). But when it isn't a service or command line application you are writing, but just a library which is internally used by other command line tools and web services, (for example), it feels cleaner and more appropriate to take configuration as parameters passed to methods and functions. By providing a utility method to fetch config from env vars, you expose the possibility to the calling code to configure based on env vars, but you don't force it. Then it is up to the author of the command line tool / service / etc that calls your library to decide explicitly whether they want that or not. Then another advantage of being explicit is it is a little less magical and easier to reason about when you are degugging that code and trying to work out where it got the configuration from. I think. :-)

I think we all agree here -- libraries shouldn't be reading env variables without explicit instruction from the caller.

We just need to roll out that change gently since it will break every user of the client in possibly-subtle ways.

lol, I guess I misread @imbstack's comment, and thought he meant we should support only accepting env vars, but now I see he meant the opposite, so I didn't need to write all that! 😆

I thought we all agreed that all env vars will no longer be automatically read by the clients, but the text of the RFC still refers only to TASKCLUSTER_ROOT_URL. I think clients should be consistent in which env vars they read, and they currently read TASKCLUSTER_CLIENT_ID etc.

This will be a big breaking change for our users (and I've held @tomprince back from using newer clients in anticipation), so I think we should be explicit about it and make sure they get a chance to comment here. Alternately, we could remove this bit from this RFC and move it to a new one.

djmitche

Regarding the renaming (api -> http, events/exchanges -> amqp), I don't feel like the new names make more sense than the old, but the old aren't perfect. It'd be nice to be consistent, whatever we choose, but the fewer things we rename, the easier this work will be. Renaming always seems easy, then you end up breaking things in production because you missed somewhere!

djmitche · 2018-10-24T19:40:04Z

rfcs/0128-redeployable-clients.md

+
+# 3. Details
+
+## 3.1 The development lifecycle of APIs


In an earlier draft this was labeled as "current" -- I kind of liked that

Whoops that got lost in a rewrite, I'm putting it back in. Nice spot!

djmitche · 2018-10-24T19:41:35Z

rfcs/0128-redeployable-clients.md

+currently existing reference schemas are these:
+
+* [HTTP reference schema](https://schemas.taskcluster.net/base/v1/api-reference.json)
+* [AMQP 0.9.1 reference schema](https://schemas.taskcluster.net/base/v1/exchanges-reference.json)


We have always called these "api" and "events", so let's continue with those names. Actually, we've sometimes called the latter "exchanges", so fixing that to "events" would be good.

djmitche · 2018-10-24T19:42:44Z

rfcs/0128-redeployable-clients.md

+   versions, and released
+6. Released software is tested and deployed
+
+## 3.2 Changes to publishing API manifest


Maybe call this section 4, proposed changes?

rfcs/0128-redeployable-clients.md

djmitche · 2018-10-24T20:22:05Z

rfcs/0128-redeployable-clients.md

+references/schemas that the client reads dynamically should also be a
+dependency of the project, which may be either frozen references/schemas,
+fetched from a language package, or fetched dynamically during build/CI from a
+taskcluster deployment.


I'm not sure what this means -- how can the references be a dependency?

rfcs/0128-redeployable-clients.md

djmitche · 2018-10-24T20:32:19Z

rfcs/0128-redeployable-clients.md

+
+```go
+func HTTPReference(rootURL string, version string) string
+func AMQPReference(rootURL string, version string) string


These are equivalent to the existing ApiReference and ExchangesReference methods, but omit the service. I'm not sure how that would work? Also, given that the manifest format allows arbitrary URLs, I think these methods are unnecessary?

djmitche · 2018-10-24T20:32:38Z

rfcs/0128-redeployable-clients.md

+
+These will return absolute urls to the `*-reference.json` documents.
+
+## Changes to building docs site


Let's leave these out..

petemoore · 2018-10-25T13:20:31Z

Regarding the renaming (api -> http, events/exchanges -> amqp), I don't feel like the new names make more sense than the old, but the old aren't perfect. It'd be nice to be consistent, whatever we choose, but the fewer things we rename, the easier this work will be. Renaming always seems easy, then you end up breaking things in production because you missed somewhere!

That is a fair point. And I agree the new names aren't considerably better, so I will go back to the old names. The main naming issue I was trying to solve was that we have no collective term for API meaning HTTP API or AMQP API. We use API for the collective term, but then also sometimes just meaning the HTTP API (e.g. in the name apis.json). For example, does API references mean something that adheres to api-reference.json or exchange-reference.json, or just the former?

Maybe I can just update the docs to make it clear in each place I reference these terms, what I mean...

jhford

Looking good, @petemoore !

jhford · 2018-10-31T11:56:46Z

rfcs/0128-redeployable-clients.md

+## 4.1 Changes to publishing API manifest
+
+1. If a service of a taskcluster deployment provides an API interface, the
+   cluster may host the API reference document under


is this intentionally may and not must?

It was just to cover the case that you have an API but you don't want to publish it and people/clients to start using it...

rfcs/0128-redeployable-clients.md

jhford · 2018-10-31T12:00:55Z

rfcs/0128-redeployable-clients.md

+```
+
+How this list is generated by taskcluster is compiled and served by
+taskcluster-references is not the concern of this RFC, but it is recommended


A well formed manifest to drive client generation seems like a foundational element of the design. I don't want to speak for @petemoore , but I suspect what he's trying to say is that creating the manifest is out of scope for this document, but that the format which this document expects is in scope.

jhford · 2018-10-31T12:02:02Z

rfcs/0128-redeployable-clients.md

+generate the URL paths, rather than requiring the consumer of this library to
+use taskcluster-lib-urls. This keeps the involvement of taskcluster-lib-urls as
+high up in the stack as possible, which makes the lower parts of the stack more
+generic/flexible with fewer concerns.


I really like this design consideration!

jhford · 2018-10-31T12:03:29Z

rfcs/0128-redeployable-clients.md

+manifest simply says "these are the APIs I declare, here is where you can fetch
+their references, and they are self-describing, so go ask them". It doesn't
+burn in any concerns about URL path building, or the types of reference we
+support.


I really like this approach, I think it's the right way to handle the manifest.

jhford · 2018-10-31T12:06:22Z

rfcs/0128-redeployable-clients.md

+## Changes to publication of API references and schemas
+
+The implementation must serve the described resources under the given URLs set
+out in this document. The author is not concerned with how a service declares


I think it's useful information to clearly mark what is and isn't in scope.

jhford

This looks good to me!

jhford · 2018-11-01T12:43:46Z

rfcs/0128-redeployable-clients.md

+  `TASKCLUSTER_PROXY_URL` and `TASKCLUSTER_ROOT_URL`. This gives them the
+  freedom to refer to either the proxy or the target service, as required.
+  Since they must explicitly configure the root url when using a taskcluster
+  client, both endpoints are at their disposal, based on what they wish to do.


petemoore · 2018-11-02T08:20:23Z

Note, I'll squash commits when I land, of course, and give a reasonable commit message!

This is ready for review, I am not intending to land any more changes.

djmitche

I like this. I'm a little worried that some JSON-schema tooling will struggle with the metadata field, but as long as that's not the case it sounds like a really elegant solution, not least because it means we can make "trivial" changes to the reference schema (such as modifying a description) without bumping the version number.

We had earlier planned to have the version number in the reference schemas. That would allow us to have services that are publishing references adhering to different schema versions to co-exist in the same deployment (and would couple nicely with the metadata property, so nobody needs to parse "-v1" out of a filename).

Would the metadata-based versioning apply to the manifest schema (api-manifest.json) as well?

Could we rename reference.json to indicate that it is a metaschema, not a reference or schema? Perhaps metadata-metaschema.json or something like that?

djmitche · 2018-11-02T14:44:16Z

rfcs/0128-redeployable-clients.md

+example:
+
+```
+queue := queue.NewFromEnvVars()


I thought we all agreed that all env vars will no longer be automatically read by the clients, but the text of the RFC still refers only to TASKCLUSTER_ROOT_URL. I think clients should be consistent in which env vars they read, and they currently read TASKCLUSTER_CLIENT_ID etc.

This will be a big breaking change for our users (and I've held @tomprince back from using newer clients in anticipation), so I think we should be explicit about it and make sure they get a chance to comment here. Alternately, we could remove this bit from this RFC and move it to a new one.

djmitche · 2018-11-02T14:48:04Z

rfcs/0128-redeployable-clients.md

+* teams that deploy their own taskcluster environments, are able to include
+  additional references for programming interfaces not covered by the core
+  platform (for example, maybe it is desired to provide APIs for talking with
+  databases, other messaging buses, monitoring tools, ...)


Indeed, I see that you've been careful to indicate what this RFC requires, and what it merely discusses. I'm strongly opposed to a few of the things it discusses, but I'm happy to agree to the RFC only on the basis of what it requires.

imbstack

Seems generally reasonable to me!

imbstack · 2018-10-31T16:27:49Z

rfcs/0128-redeployable-clients.md

+
+* building taskcluster clients, that provide language-level programming
+  interfaces to taskcluster APIs
+* refreshing `taskcluster-raw-docs` AWS S3 bucket, which is used by taskcluster


I don't believe we actually use these for docs. Every service just publishes to that bucket directly.

Ah OK.

Do you know where the docs site fetches the references and schemas from?

djmitche · 2018-11-07T16:42:07Z

So, the meaning of the env vars section remains ambiguous, and there appears to have been no final-comment period here in which to catch that issue or check that release engineering and other client consumers are OK with the change. I don't want to delay this by standing on ceremony, and I think we have consensus on most of this, but I'm worried that we've missed discussion of a consumer-facing breaking change. Maybe we should consider pulling out the env-var bit into a separate RFC?

petemoore · 2018-11-07T18:03:38Z

there appears to have been no final-comment period here in which to catch that issue or check that release engineering and other client consumers are OK with the change

Apologies, I forgot about applying the tags, but I think we have a general consensus based on the review approvals. It is always plausible for there to be slight changes during the implementation phase, but I think as long as we are vocal about changes coming, and communicate these things well, we should be ok. When users upgrade a major version, they are generally aware there can be breaking changes - the mistake we have made in the past is not being very vocal about it and explicit in release notes. As long as the release notes are very explicit about the breaking changes, and we communicate them well, users can decide if they wish to upgrade, and how long they need for adjusting to the changes. I think this is where we have fallen down in the past.

petemoore mentioned this pull request Sep 25, 2018

Provide docs/standards about client/worker/proxy interactions in redeployable taskcluster #127

Closed

RFC 0128 - Provide docs/standards about client/worker/proxy interacti…

f15ee45

…ons in redeployable taskcluster

petemoore force-pushed the rfc0128 branch from 0173a98 to f15ee45 Compare September 25, 2018 18:08

petemoore self-assigned this Sep 25, 2018

WIP

eb69c3b

imbstack reviewed Sep 27, 2018

View reviewed changes

rfcs/0128-redeployable-clients.md Outdated Show resolved Hide resolved

rfcs/0128-redeployable-clients.md Outdated Show resolved Hide resolved

petemoore added 5 commits October 23, 2018 10:11

wip

91516ea

wip

7aad533

wip

fbd1c81

wip

aa3cca0

wip

ed3bf5c

djmitche reviewed Oct 23, 2018

View reviewed changes

petemoore added 5 commits October 23, 2018 19:15

WIP

9cd896c

wip

1a67c8b

WIP

b0d1052

WIP

d6ec2d0

WIP

f134fef

djmitche self-requested a review October 23, 2018 18:32

imbstack reviewed Oct 24, 2018

View reviewed changes

djmitche reviewed Oct 24, 2018

View reviewed changes

petemoore added 8 commits October 25, 2018 15:26

WIP

be2c15a

WIP

0ca9ca3

WIP

8b96252

wip

7e98906

wip

0c6f3a3

wip

9e613b6

wip

76baf27

wip

af0cbe0

petemoore added 7 commits October 30, 2018 15:56

wip

2125ba1

wip

e493835

WIP

3db75cf

WIP

75987e5

WIP

c7ea283

WIP

eee94cb

WIP

6b17272

jhford previously approved these changes Oct 31, 2018

View reviewed changes

WIP

c1ef4cd

petemoore dismissed jhford’s stale review via c1ef4cd October 31, 2018 13:11

WIP

15d0d48

petemoore requested review from djmitche, helfi92, imbstack, jhford, owlishDeveloper and walac October 31, 2018 13:41

jhford approved these changes Nov 1, 2018

View reviewed changes

djmitche approved these changes Nov 2, 2018

View reviewed changes

imbstack approved these changes Nov 6, 2018

View reviewed changes

petemoore merged commit f168db0 into master Nov 6, 2018


		These will return absolute urls to the `*-reference.json` documents.

		## Changes to building docs site

Conversation

petemoore commented Sep 25, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

djmitche commented Sep 25, 2018

Uh oh!

imbstack left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

petemoore Oct 25, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

djmitche left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

petemoore commented Oct 25, 2018

Uh oh!

jhford left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

jhford left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

petemoore commented Nov 2, 2018

Uh oh!

petemoore commented Sep 25, 2018 •

edited

Loading

petemoore Oct 25, 2018 •

edited

Loading

petemoore commented Nov 7, 2018 •

edited

Loading