Add support for verifying the Kubernetes connection #2156

Fryguy · 2015-03-13T22:40:04Z

This will prevent worker cycling when the Kubernetes endpoint is not
available.

@abonas Please review.

Even though there are no credentials I need an authentications record to keep track of the verification status. So, I created a dummy record for now. I figure we can change that record into a certificate based record when we get to certificates.

@h-kataria This brings up the question of "verifying" the connection in the UI, which will now be needed, in case the connection goes bad. The user will want to go into the EMS and "revalidate" it to make it active again. However since there are no credentials, I'm not sure how you want to present this in the UI. cc @jrafanie

@abonas I this this PR would be better served with a method on Kubeclient called verify? or something similar. You can see what I did for Foreman here. What do you think? Basically, the method tries to connect and does a lightweight action. Additionally it verifies that the content it gets back is what is expected. This is important since you could technically hit any https://example.com/api and it could come back ok, so we want to verify it's the right one.

Fryguy · 2015-03-13T22:40:44Z

vmdb/app/models/ems_kubernetes.rb

@@ -20,7 +27,8 @@ def self.raw_connect(hostname, port, api_version)
  def self.raw_api_endpoint(hostname, port)
    require 'uri'

-    uri = URI::HTTP.build(:path => "/api", :port => port.to_i)
+    port &&= port.to_i
+    uri = URI::HTTP.build(:path => "/api", :port => port)


This was a bug...if the user did not enter a port, it should no be passed. This code was formerly passing 0

uri = URI::HTTP.build(:path => "/api", :port => port.try(:to_i))

uri = URI::HTTP.build(:path => "/api", :port => port.try(:to_i))

I like this syntax!

This was a bug...if the user did not enter a port, it should no be passed. This code was formerly passing

good point, great catch

port.try(:to_i)

Interesting. I thought that since nil.to_i == 0, then nil.try(:to_i) would also be 0, but I just tested and it's nil. Pretty cool...I'll change it.

Cool. Looks like this still works with rails 4.0's try.

NilClass#try is defined as def try ; nil ; end
Object#try has changed in rails 4.0 (as brandon has pointed out) to now have something like a responds_to? in it.

abonas · 2015-03-15T16:16:11Z

@abonas I this this PR would be better served with a method on Kubeclient called verify? or something similar. You can see what I did for Foreman here. What do you think? Basically, the method tries to connect and does a lightweight action. Additionally it verifies that the content it gets back is what is expected. This is important since you could technically hit any https://example.com/api and it could come back ok, so we want to verify it's the right one.

You are raising a good point, and I've started thinking about this.
Couple of thoughts/questions:
a. There are no lightweight actions on that api afaik, and we also are dependent on k8s for this.
Also, not always entities are present, so I am not sure what we can specifically validate that is lightweight and is present on all environments of k8s (especially since the api is still unstable and non backwards comp. changes are made)

b. User can point it mistakenly on any server, that's correct, but getting not expected content will not harm, as we will be searching for entities that are of an interest for us (for example, replication controllers) and if they won't be there, then they won't be there. It will also require some validation action "are you k8s" on k8s side and it might be problematic to add it there (or looking forward - openshift might not be interested to identify itself as k8s for that matter)

c. How does it behave with rhev? what is validated there?

d. It does make me wonder though, why not build the endpoint in the kubeclient (adding the "/api" part in kubeclient that is) and provide only the version and the host/port in the manageiq provider side, what do you think?
(currently this is done in manageiq: uri = URI::HTTP.build(:path => "/api", :port => port.to_i))

abonas · 2015-03-15T16:16:54Z

This will prevent worker cycling when the Kubernetes endpoint is not
available.

so it means there's no retry? how can it be "restarted/refreshed" to make it understand the endpoint became available?

Fryguy · 2015-03-16T14:08:43Z

a. There are no lightweight actions on that api afaik, and we also are dependent on k8s for this.
Also, not always entities are present, so I am not sure what we can specifically validate that is lightweight and is present on all environments of k8s (especially since the api is still unstable and non backwards comp. changes are made)

Well, that stinks. Many APIs have a root request that will give you basic information like version. Does k8s have something like this? For now I used endpoints, and I'm basically testing that it doesn't blow up with 401 or 404 errors. Is endpoints an acceptable thing to call? It sounded like something that would be relatively small.

b. User can point it mistakenly on any server, that's correct, but getting not expected content will not harm, as we will be searching for entities that are of an interest for us (for example, replication controllers) and if they won't be there, then they won't be there. It will also require some validation action "are you k8s" on k8s side and it might be problematic to add it there (or looking forward - openshift might not be interested to identify itself as k8s for that matter)

For fun, I pointed it to google.com and it blew up with JSON::ParserError: 757: unexpected token at '<!DOCTYPE html>... I'll add that error to the list of errors in this PR.

c. How does it behave with rhev? what is validated there?

We actually had a similar problem with discovery on RHEV in #1929, where we were just looking to see if it responded on a certain port and /api path. As you can imagine, there were false positives, so we changed it to look for a file called "engine.ssh.key.txt".

That is for discovery though. For verification, we just hit /api and look for URI::InvalidURIError. So, I think we should beef that up to do something similar to what discovery does. cc @jrafanie

d. It does make me wonder though, why not build the endpoint in the kubeclient (adding the "/api" part in kubeclient that is) and provide only the version and the host/port in the manageiq provider side, what do you think?
(currently this is done in manageiq: uri = URI::HTTP.build(:path => "/api", :port => port.to_i))

I think this is a great idea. The less a caller needs to know the better. I could see the Kubeclient.new taking either a full URI like it does now, or take parts (hostname and an optional port and path), and build its own URI. Would it make sense to make the version optional as well, defaulting to the most commonly used one?

Fryguy · 2015-03-16T14:26:48Z

so it means there's no retry? how can it be "restarted/refreshed" to make it understand the endpoint became available?

This is what i pinged @h-kataria about in the initial post. We would need a verify button. Additionally there is a scheduled task that verifies all connections every hour or so. We can probably change that if needed.

@jrafanie This reminds me. I feel like the task should check good connections every hour, but should check bad connection more frequently. Like every 5 minutes? Generally if a connection goes bad, you want it back ASAP if it gets restored. Thoughts?

abonas · 2015-03-16T15:00:44Z

a. There are no lightweight actions on that api afaik, and we also are dependent on k8s for this.
Also, not always entities are present, so I am not sure what we can specifically validate that is lightweight and is present on all environments of k8s (especially since the api is still unstable and non backwards comp. changes are made)

Well, that stinks. Many APIs have a root request that will give you basic information like version. Does k8s have something like this? For now I used endpoints, and I'm basically testing that it doesn't blow up with 401 or 404 errors. Is endpoints an acceptable thing to call? It sounded like something that would be relatively small.

endpoints can be very heavy in real environments so I wouldn't be using it.
the root /api responds with the available versions so perhaps it's better to use that.
ideally I would check the api/versionName (e.g. api/v1beta3/) but unfortunately that yields 404.

b. User can point it mistakenly on any server, that's correct, but getting not expected content will not harm, as we will be searching for entities that are of an interest for us (for example, replication controllers) and if they won't be there, then they won't be there. It will also require some validation action "are you k8s" on k8s side and it might be problematic to add it there (or looking forward - openshift might not be interested to identify itself as k8s for that matter)

For fun, I pointed it to google.com and it blew up with JSON::ParserError: 757: unexpected token at '... I'll add that error to the list of errors in this PR.

this is due to the following issue, I'd be happy to discuss the proper ways to solve it on kubeclient repo:
ManageIQ/kubeclient#35

c. How does it behave with rhev? what is validated there?
We actually had a similar problem with discovery on RHEV in #1929, where we were just looking to see if it responded on a certain port and /api path. As you can imagine, there were false positives, so we changed it to look for a file called "engine.ssh.key.txt".

That is for discovery though. For verification, we just hit /api and look for URI::InvalidURIError. So, I think we should beef that up to do something similar to what discovery does. cc @jrafanie
d. It does make me wonder though, why not build the endpoint in the kubeclient (adding the "/api" part in kubeclient that is) and provide only the version and the host/port in the manageiq provider side, what do you think?
(currently this is done in manageiq: uri = URI::HTTP.build(:path => "/api", :port => port.to_i))
I think this is a great idea. The less a caller needs to know the better. I could see the Kubeclient.new taking either a full URI like it does now, or take parts (hostname and an optional port and path), and build its own URI. Would it make sense to make the version optional as well, defaulting to the most commonly used one?

OK I'll take a note to refactor the calls for less information to be passed.
And I guess we could default the api version too, my guess is that some of the api versions will be deprecated soon and others renamed on kube side anyway towards their 1.0, I am waiting on their answer on that.

Fryguy · 2015-03-16T15:23:49Z

the root /api responds with the available versions so perhaps it's better to use that.

How do I access that via Kubeclient? Is there a .api method, or would that need to be added?

abonas · 2015-03-16T16:31:25Z

How do I access that via Kubeclient? Is there a .api method, or would that need to be added?

no, there's no such method, we might need to add it. I'll open an issue and link it here.

jrafanie · 2015-03-16T18:12:33Z

That is for discovery though. For verification, we just hit /api and look for URI::InvalidURIError. So, I think we should beef that up to do something similar to what discovery does. cc @jrafanie

I haven't looked at what verification does. Please open an issue and reference #1929 as we shouldn't just assume anything that responds to a very generic rest api endpoint with valid credentials is actually talking to rhevm/ovirt.

jrafanie · 2015-03-16T18:14:14Z

@jrafanie This reminds me. I feel like the task should check good connections every hour, but should check bad connection more frequently. Like every 5 minutes? Generally if a connection goes bad, you want it back ASAP if it gets restored. Thoughts?

Same here, we should open an issue to get that done, it sounds simple to add a new schedule to check currently bad authentications much more frequently and change the existing one to check only good authentications.

jrafanie · 2015-03-16T18:15:49Z

Maybe we should layout all of the current problems with authentication/verify_credentials in one github issue with bullet items so we don't lose them. I'm still trying to get #1990 to completion though we keep coming up with cleaner implementation ideas. 😺

miq-bot · 2015-03-24T17:13:06Z

<pr_mergeability_checker />This pull request is not mergeable. Please rebase and repush.

jrafanie · 2015-03-24T17:34:25Z

@Fryguy I think #2325 getting merged made this conflicted, let me know if you need any help using the new syntax. It should be much cleaner.

Fryguy · 2015-03-24T17:59:20Z

Thanks @jrafanie. I'm just waiting on the changes for kubeclient before
making changes here.
On Mar 24, 2015 1:34 PM, "Joe Rafaniello" notifications@github.com wrote:

@Fryguy https://github.com/Fryguy I think #2325
#2325 getting merged made this
conflicted, let me know if you need any help using the new syntax. It
should be much cleaner.

—
Reply to this email directly or view it on GitHub
#2156 (comment).

Fryguy · 2015-03-25T21:09:39Z

Updated with using the proposed Kubeclient#valid? method from ManageIQ/kubeclient#54 . So, this PR now depends on that change getting in and released. It will need a version bump of the gem as well.

Fryguy · 2015-03-25T21:19:19Z

Changed method name to api_valid on the kubeclient side, so updated here as well.

abonas · 2015-03-26T09:28:59Z

Updated with using the proposed Kubeclient#valid? method from ManageIQ/kubeclient#54 . So, this PR now depends on that change getting in and released. It will need a version bump of the gem as well.

new kubeclient is released (0.1.8)

Fryguy · 2015-03-27T19:05:24Z

@h-kataria Can you comment on my question to you in the original post?

Fryguy · 2015-03-27T19:36:14Z

Updated PR with the changes to kubeclient. Just waiting on @h-kataria on how to proceed with respect to the UI.

@h-kataria In addition, I see that port is missing in the UI. I think that's a mistake.

miq-bot · 2015-03-27T19:36:18Z

<gemfile_checker />@JPrause @jvlcek Gemfile changes detected in commit Fryguy@4221115. Please review.

h-kataria · 2015-03-27T20:12:57Z

@Fryguy About the missing "port" field, we provided just a skeleton of Containers UI, i think @erinboyd might be enhancing that screen to add any missing fields on that screen.

Regarding verification of credentials, currently Container Provider screen does have Credentials box so user can enter credentials and validate/save them as they would for any other type of Provider. Are we removing credentials box out of the Container Provider screen? if yes, then we would need a "Validate" button on that screen next to one of the other required fields such as Hostname or IPAddress.

Fryguy · 2015-03-30T14:42:51Z

@h-kataria Oh I see. So, then I think this is good to merge, and we'll have to verify that the verify button stays in the UI changes made by @erinboyd

erinboyd · 2015-03-30T22:21:38Z

@Fryguy @h-kataria @abonas I removed the credentials box from the new provider screen...so will we need an additional button that validates the connection manually? Is this what you are saying?

h-kataria · 2015-03-30T22:27:13Z

@erinboyd @abonas If you have removed Credentials box from Containers Provider screen then yes you do need add a validate button on that screen somewhere next to IP Address/Hostname box depending upon whichever is a required field, and have it do similar thing that other Validate button does in Credentials box

erinboyd · 2015-03-30T22:41:36Z

@h-kataria okay. I will do this.

miq-bot · 2015-04-02T18:08:16Z

<pr_mergeability_checker />This pull request is not mergeable. Please rebase and repush.

This will prevent worker cycling when the Kubernetes endpoint is not available.

Fryguy · 2015-04-20T18:09:42Z

@abonas I've rebased and fix the conflicts.

miq-bot · 2015-04-20T18:15:26Z

Checked commit Fryguy@4427ddd with rubocop 0.27.1
2 files checked, 6 offenses detected

vmdb/app/models/ems_kubernetes.rb

🔶 - Line 46, Col 38 - Style/RedundantSelf - Redundant self detected.

vmdb/spec/models/ext_management_system_spec.rb

🔹 - Line 120, Col 121 - Metrics/LineLength - Line is too long. [142/120]
🔹 - Line 121, Col 121 - Metrics/LineLength - Line is too long. [142/120]
🔹 - Line 127, Col 121 - Metrics/LineLength - Line is too long. [128/120]
🔹 - Line 128, Col 121 - Metrics/LineLength - Line is too long. [128/120]
🔹 - Line 129, Col 121 - Metrics/LineLength - Line is too long. [128/120]

abonas · 2015-04-20T20:07:28Z

cc @simon3z

simon3z · 2015-04-20T20:49:29Z

vmdb/app/models/ems_kubernetes.rb

-  def verify_credentials(_auth_type = nil, _options = {})
-    # TODO: support real authentication using certificates
-    true
+    with_provider_connection(options, &:api_valid?)


Validation here should be a check that remote endpoint supports the API that we know about (v1beta3 / future v1.0).
I'll send a separate patch about that.

isn't that the job of the api_valid? method on kubeclient?

simon3z · 2015-04-20T20:52:13Z

We need a real validation about the api, I'll send a patch. In general 👍 💯

blomquisg · 2015-04-20T22:42:29Z

@Fryguy, legit travis failures?

simon3z · 2015-04-29T16:27:24Z

@blomquisg @Fryguy I based my patches on this PR, and I fixed the travis failure. I kept @Fryguy 's authorship on the patch, obviously, and I added a real validation here: #2696

Can you try review/merge it? Thanks.

simon3z · 2015-04-30T12:26:38Z

vmdb/app/models/ems_kubernetes.rb

  end

-  def authentication_status_ok?(_type = nil)
+  private


Adding private here you mask other public methods (all_computer_system_ids, aggregate_logical_cpus, aggregate_memory).

Doh, must have happened when I rebased.

Fryguy · 2015-05-07T15:53:04Z

Subsumed into #2696

Fryguy added enhancement providers/containers labels Mar 13, 2015

Fryguy reviewed Mar 13, 2015
View reviewed changes

abonas mentioned this pull request Mar 16, 2015

Add defaults when kubeclient is initialized ManageIQ/kubeclient#38

Closed

abonas mentioned this pull request Mar 16, 2015

Add a "connection check" method on api root ManageIQ/kubeclient#40

Closed

Fryguy changed the title ~~Add support for verifying the Kubernetes connection~~ [WIP] Add support for verifying the Kubernetes connection Mar 25, 2015

Fryguy mentioned this pull request Mar 25, 2015

Add Kubeclient#api_valid? ManageIQ/kubeclient#54

Merged

Fryguy force-pushed the fix_kubernetes_connection_validation branch from 622033b to 6ea4a2a Compare March 25, 2015 21:08

Fryguy changed the title ~~[WIP] Add support for verifying the Kubernetes connection~~ [WIP Depends on abonas/kubeclient#54] Add support for verifying the Kubernetes connection Mar 25, 2015

Fryguy force-pushed the fix_kubernetes_connection_validation branch from 6ea4a2a to ab41a13 Compare March 25, 2015 21:18

Fryguy force-pushed the fix_kubernetes_connection_validation branch from ab41a13 to 4221115 Compare March 27, 2015 19:34

Fryguy changed the title ~~[WIP Depends on abonas/kubeclient#54] Add support for verifying the Kubernetes connection~~ [WIP] Add support for verifying the Kubernetes connection Mar 27, 2015

Fryguy added the wip label Mar 27, 2015

Fryguy added the dependencies label Mar 27, 2015

Fryguy removed the wip label Mar 30, 2015

Fryguy changed the title ~~[WIP] Add support for verifying the Kubernetes connection~~ Add support for verifying the Kubernetes connection Mar 30, 2015

This was referenced Apr 10, 2015

Update auth status when clicking validate for existing ems #1990

Merged

Addition of OpenShift provider #2611

Merged

Add support for verifying the Kubernetes connection

4427ddd

This will prevent worker cycling when the Kubernetes endpoint is not available.

Fryguy force-pushed the fix_kubernetes_connection_validation branch from 4221115 to 4427ddd Compare April 20, 2015 18:09

simon3z reviewed Apr 20, 2015
View reviewed changes

simon3z mentioned this pull request Apr 21, 2015

kubernetes: verify endpoint api version #2696

Merged

simon3z reviewed Apr 30, 2015
View reviewed changes

Fryguy closed this May 7, 2015

Fryguy deleted the fix_kubernetes_connection_validation branch July 2, 2015 21:48

smopihub mentioned this pull request Jun 18, 2018

Can't enable Embedded Ansible #17600

Closed

Add support for verifying the Kubernetes connection #2156

Add support for verifying the Kubernetes connection #2156

Conversation

Fryguy commented Mar 13, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

abonas commented Mar 15, 2015

abonas commented Mar 15, 2015

Fryguy commented Mar 16, 2015

Fryguy commented Mar 16, 2015

abonas commented Mar 16, 2015

Fryguy commented Mar 16, 2015

abonas commented Mar 16, 2015

jrafanie commented Mar 16, 2015

jrafanie commented Mar 16, 2015

jrafanie commented Mar 16, 2015

miq-bot commented Mar 24, 2015

jrafanie commented Mar 24, 2015

Fryguy commented Mar 24, 2015

Fryguy commented Mar 25, 2015

Fryguy commented Mar 25, 2015

abonas commented Mar 26, 2015

Fryguy commented Mar 27, 2015

Fryguy commented Mar 27, 2015

miq-bot commented Mar 27, 2015

h-kataria commented Mar 27, 2015

Fryguy commented Mar 30, 2015

erinboyd commented Mar 30, 2015

h-kataria commented Mar 30, 2015

erinboyd commented Mar 30, 2015

miq-bot commented Apr 2, 2015

Fryguy commented Apr 20, 2015

miq-bot commented Apr 20, 2015

abonas commented Apr 20, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

simon3z commented Apr 20, 2015

blomquisg commented Apr 20, 2015

simon3z commented Apr 29, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Fryguy commented May 7, 2015