
pkg/daemon: ensure /home/core/.ssh is there, not just /home/core #448

Merged: 1 commit merged into openshift:master on Feb 19, 2019

Conversation

@runcom (Member) commented on Feb 17, 2019

Signed-off-by: Antonio Murdaca <runcom@linux.com>

- What I did

The code was only checking for /home/core, but we later try to write to /home/core/.ssh (that folder is almost certainly there already, but this needs to be fixed anyway).

I've also centralized file writes in #401 to reduce the chances of doing this again.
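
As a minimal sketch of the bug and the fix: only coreUserSSHPath and the MkdirAll call come from the diff further down; os.MkdirAll stands in here for the daemon's fileSystemClient wrapper, and the 0700 mode is illustrative rather than taken from the repo.

```go
package main

import (
	"os"
	"path/filepath"
)

const coreUserSSHPath = "/home/core/.ssh"

func main() {
	// Before: filepath.Dir strips the last path element, so this
	// ensured /home/core existed rather than /home/core/.ssh.
	if err := os.MkdirAll(filepath.Dir(coreUserSSHPath), 0700); err != nil {
		panic(err)
	}

	// After (presumably): target the .ssh directory itself.
	if err := os.MkdirAll(coreUserSSHPath, 0700); err != nil {
		panic(err)
	}
}
```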

- How to verify it

- Description for the changelog

@openshift-ci-robot added the approved label on Feb 17, 2019
@openshift-ci-robot added the size/XS label on Feb 17, 2019
@runcom changed the title from "pkg/daemon: ensure .ssh is there, not /home/core" to "pkg/daemon: ensure /home/core/.ssh is there, not just /home/core" on Feb 17, 2019
Commit message: "Since we later write inside /home/core/.ssh."

Signed-off-by: Antonio Murdaca <runcom@linux.com>
@@ -608,9 +608,8 @@ func (dn *Daemon) updateSSHKeys(newUsers []ignv2_2types.PasswdUser) error {
// Keys should only be written to "/home/core/.ssh"
// Once Users are supported fully this should be writing to PasswdUser.HomeDir
glog.Infof("Writing SSHKeys at %q", coreUserSSHPath)

if err := dn.fileSystemClient.MkdirAll(filepath.Dir(coreUserSSHPath), os.FileMode(0600)); err != nil {
@runcom (Member, Author) commented on Feb 17, 2019

for clarity:

  • filepath.Dir("/home/core/.ssh") == "/home/core"
  • filepath.Dir("/home/core/.ssh/") == "/home/core/.ssh"

just a trailing slash... but we don't need filepath.Dir here anyway
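
A runnable illustration of those two cases, using only the standard library (nothing here is repo-specific):

```go
package main

import (
	"fmt"
	"path/filepath"
)

func main() {
	// Dir drops the final path element, then cleans the result.
	fmt.Println(filepath.Dir("/home/core/.ssh")) // /home/core

	// With a trailing slash the final element is empty, so the
	// ".ssh" component survives after the slash is cleaned away.
	fmt.Println(filepath.Dir("/home/core/.ssh/")) // /home/core/.ssh
}
```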

@kikisdeliveryservice (Contributor) commented on Feb 18, 2019

Gah, I lost that in the refactor from the .join to the const. Good catch!

@runcom (Member, Author) commented on Feb 17, 2019

unit flake from #417

/retest

@runcom (Member, Author) commented on Feb 17, 2019

HAProxy e2e-aws flake

/retest

@runcom (Member, Author) commented on Feb 17, 2019

failure looks like the bug fixed in #442

/retest

@kikisdeliveryservice (Contributor) commented on Feb 18, 2019

/lgtm

@openshift-ci-robot added the lgtm label on Feb 18, 2019
@runcom (Member, Author) commented on Feb 18, 2019

Cluster operator network error

/retest

@kikisdeliveryservice (Contributor) commented on Feb 18, 2019

"Cluster operator network has not yet reported success" -> then timed out.
/test e2e-aws

@runcom (Member, Author) commented on Feb 18, 2019

time="2019-02-18T21:33:32Z" level=info msg="Waiting up to 30m0s for the cluster to initialize..."
time="2019-02-18T21:33:32Z" level=debug msg="Still waiting for the cluster to initialize..."
time="2019-02-18T21:33:40Z" level=debug msg="Still waiting for the cluster to initialize..."
time="2019-02-18T21:33:55Z" level=debug msg="Still waiting for the cluster to initialize..."
time="2019-02-18T21:34:10Z" level=debug msg="Still waiting for the cluster to initialize..."
time="2019-02-18T21:35:40Z" level=debug msg="Still waiting for the cluster to initialize..."
time="2019-02-18T21:36:25Z" level=debug msg="Still waiting for the cluster to initialize..."
time="2019-02-18T21:36:40Z" level=debug msg="Still waiting for the cluster to initialize..."
time="2019-02-18T21:39:40Z" level=debug msg="Still waiting for the cluster to initialize: Cluster operator monitoring is reporting a failure: Failed to rollout the stack. Error: running task Updating Prometheus-k8s failed: waiting for Prometheus object changes failed: waiting for Prometheus: retrieving Prometheus object failed: Get https://172.30.0.1:443/apis/monitoring.coreos.com/v1/namespaces/openshift-monitoring/prometheuses/k8s: dial tcp 172.30.0.1:443: connect: connection refused"
time="2019-02-18T21:43:40Z" level=debug msg="Still waiting for the cluster to initialize: Cluster operator network has not yet reported success"
time="2019-02-18T21:44:25Z" level=debug msg="Still waiting for the cluster to initialize..."
time="2019-02-18T21:47:40Z" level=debug msg="Still waiting for the cluster to initialize: Cluster operator network has not yet reported success"
time="2019-02-18T21:49:25Z" level=debug msg="Still waiting for the cluster to initialize..."
time="2019-02-18T21:52:55Z" level=debug msg="Still waiting for the cluster to initialize: Cluster operator network has not yet reported success"
time="2019-02-18T21:56:40Z" level=debug msg="Still waiting for the cluster to initialize..."
time="2019-02-18T21:59:55Z" level=debug msg="Still waiting for the cluster to initialize: Cluster operator network has not yet reported success"
time="2019-02-18T22:03:25Z" level=debug msg="Still waiting for the cluster to initialize..."
time="2019-02-18T22:03:32Z" level=fatal msg="failed to initialize the cluster: timed out waiting for the condition"

/retest

@ashcrow (Member) commented on Feb 18, 2019

/lgtm

@openshift-ci-robot commented

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ashcrow, kikisdeliveryservice, runcom

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:
  • OWNERS [ashcrow,kikisdeliveryservice,runcom]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@runcom (Member, Author) commented on Feb 18, 2019

level=fatal msg="failed to initialize the cluster: Cluster operator network has not yet reported success"

FYI: that error is already reported correctly and is probably a container runtime bug.

/retest

@openshift-merge-robot merged commit 86a5a55 into openshift:master on Feb 19, 2019
@runcom deleted the fix-filepathdir branch on February 19, 2019 at 07:59