Bug 1769879: add AdditionalTrustBundles to master and worker shims #2639

iamemilio · 2019-11-07T16:40:50Z

If a cluster uses self-signed certificates, the master and worker nodes created by cluster-api will be unable to retrieve their Ignition configs unless they trust the CA. This fix adds the user-provided CAs found in the AdditionalTrustBundle field to the master and worker Ignition pointer configs (shims).
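For readers unfamiliar with pointer configs, here is a rough sketch of the shape this description proposes, written in Python for illustration only (the installer itself is not Python). The function name `pointer_config`, the host `api-int.example.test`, the Ignition spec version `2.2.0`, and port `22623` are assumptions made for this example and are not taken from the PR diff.

```python
import base64
import json


def pointer_config(role, api_int_host, ca_bundles):
    """Build a minimal Ignition pointer config (shim) as a plain dict.

    Hypothetical helper: each PEM string in ca_bundles is embedded as a
    base64 data URL so the node trusts it when fetching its full config.
    """
    cas = [
        {
            "source": "data:text/plain;charset=utf-8;base64,"
            + base64.b64encode(pem.encode()).decode()
        }
        for pem in ca_bundles
    ]
    return {
        "ignition": {
            "version": "2.2.0",  # assumed spec version
            "config": {
                # Assumed endpoint layout for the in-cluster config server
                "append": [
                    {"source": f"https://{api_int_host}:22623/config/{role}"}
                ]
            },
            "security": {"tls": {"certificateAuthorities": cas}},
        }
    }


if __name__ == "__main__":
    shim = pointer_config("worker", "api-int.example.test", ["ROOT-CA-PEM", "EXTRA-CA-PEM"])
    print(json.dumps(shim, indent=2))
```

The only difference between the existing behaviour and the proposed one, in this sketch, is how many entries end up in `certificateAuthorities`.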

openshift-ci-robot · 2019-11-07T16:40:52Z

@iamemilio: This pull request references Bugzilla bug 1769879, which is invalid:

expected the bug to target the "4.3.0" release, but it targets "---" instead

Comment /bugzilla refresh to re-evaluate validity if changes to the Bugzilla bug are made, or edit the title of this pull request to link to a different bug.


In response to this:

Bug 1769879: add AdditionalTrustBundles to master and worker shims

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

openshift-ci-robot · 2019-11-07T16:41:02Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: iamemilio
To complete the pull request process, please assign abhinavdahiya
You can assign the PR to them by writing /assign @abhinavdahiya in a comment when ready.

The full list of commands accepted by this bot can be found here.


Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

abhinavdahiya · 2019-11-07T17:40:00Z

/test e2e-azure

abhinavdahiya · 2019-11-07T17:40:08Z

/test e2e-gcp

abhinavdahiya · 2019-11-07T17:40:16Z

/test e2e-metal

iamemilio · 2019-11-07T18:46:29Z

/test e2e-metal

iamemilio · 2019-11-07T19:50:46Z

/test e2e-azure

sdodson · 2019-11-07T20:13:48Z

/test e2e-metal

openshift-ci-robot · 2019-11-07T21:49:05Z

@iamemilio: The following test failed, say /retest to rerun them all:

| Test name | Commit | Details | Rerun command |
| --- | --- | --- | --- |
| ci/prow/e2e-aws-scaleup-rhel7 | `8d0022b` | link | `/test e2e-aws-scaleup-rhel7` |

Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR.


wking · 2019-11-08T01:56:31Z

Who manages this in-cluster as the configured CAs evolve and we continue to launch machines?

wking · 2019-11-08T01:57:37Z

Can we have the machine-config operator generate these instead?

tomassedovic · 2019-11-08T10:00:29Z

@wking I think the PR description is misleading here.

According to the BZ the issue is with the machine-controller pod being unable to talk to the OpenStack API due to a certificate failure.

Internally-deployed OpenStack clusters often run their API under an effectively self-signed cert (trusted inside the organisation but not publicly). In such cases, the CA needs to be added to the OpenStack VMs via Ignition or any OpenStack API request fails.

These certificates are managed by the OpenStack administrators and they're outside of the OpenShift cert management.

@iamemilio assuming what I wrote above is correct, will you please update the PR description? Masters and workers should get their Ignition from the OpenShift cluster, so there should be no certificate issues there. It's about contacting the OpenStack APIs with cluster-api-provider-openstack.

tomassedovic · 2019-11-08T10:01:54Z

/bugzilla refresh

openshift-ci-robot · 2019-11-08T10:01:59Z

@tomassedovic: This pull request references Bugzilla bug 1769879, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker.


In response to this:

/bugzilla refresh


abhinavdahiya · 2019-11-08T15:12:16Z

The worker and master pointer configs fetch their configs from the ignition-server running in the cluster, and they don't need the additional trust bundle.

iamemilio · 2019-11-08T15:14:07Z

/test e2e-azure

abhinavdahiya · 2019-11-08T15:15:59Z

> Internally-deployed OpenStack clusters often run their API under an effectively self-signed cert (trusted inside the organisation but not publicly).

The current change doesn't fix that.

The additional trust bundle is already delivered to all the machines.

The machine-controller is the one that doesn't have the trust, because the machine's trusted bundle is not used by pods unless they mount it in from the host.
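To illustrate the point about pod-level trust, here is a minimal sketch in Python (the machine-controller itself is written in Go, so this is an analogy, not its code). A TLS client inside a container only trusts what its own process loads; a CA that exists only on the host has to be mounted in and loaded explicitly. The function name `client_context` and the path shown in the usage line are hypothetical.

```python
import os
import ssl


def client_context(extra_ca_path=None):
    """Return a TLS client context trusting the default CAs plus an optional extra bundle."""
    # Loads only the trust store visible to this process (inside a pod,
    # that is the container image's store, not the host's).
    ctx = ssl.create_default_context()
    if extra_ca_path and os.path.exists(extra_ca_path):
        # Only when a bundle has been mounted into the container does the
        # additional CA become trusted for outgoing API calls.
        ctx.load_verify_locations(cafile=extra_ca_path)
    return ctx


if __name__ == "__main__":
    # Hypothetical mount path; without the mount, verification of a
    # privately signed endpoint fails with an unknown-authority error.
    ctx = client_context("/etc/pki/extra/ca-bundle.pem")
    print(ctx.verify_mode)
```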

iamemilio · 2019-11-08T15:31:18Z

/hold I might have misunderstood the root problem David was reporting. Holding until I dig into it further.

jobcespedes · 2019-11-13T20:10:33Z

Facing the same error message. I tried to use the internal (insecure) endpoint in clouds.yaml. However, the machine controller still tries the public endpoint:

Error listing the instances (machine/actuator.go 472): Get service list err: Get https://<IP>:13774/v2.1/servers/detail?name=dev-mgqts-worker-fhnl7: x509: certificate signed by unknown authority

https://<IP>:13774 is the nova (osapi) public endpoint.

This is a lab running Newton, with some modifications so far to make the installer compatible, so you may want to ignore my case.

iamemilio · 2019-11-14T20:54:17Z

Closing, since this does not solve the root problem. Pod-based services that interact with the OpenStack API need to trust the additionalCAbundle, which needs to be injected via a configmap. Open discussions regarding current and future solutions to this problem are occurring. One possible part of the solution being considered for the short term is #2658.
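As a rough illustration of what "injected via a configmap" could look like, the sketch below builds a ConfigMap manifest as a plain Python dict. It is not the approach taken in #2658 or anywhere else in this thread: the function name `ca_configmap`, the object name, the namespace, and the data key `ca-bundle.pem` are all hypothetical placeholders.

```python
import json


def ca_configmap(name, namespace, pem_bundle, key="ca-bundle.pem"):
    """Return a ConfigMap manifest (as a dict) carrying a PEM CA bundle.

    All names are placeholders; a pod would mount this ConfigMap as a
    volume and point its TLS client at the resulting file.
    """
    return {
        "apiVersion": "v1",
        "kind": "ConfigMap",
        "metadata": {"name": name, "namespace": namespace},
        "data": {key: pem_bundle},
    }


if __name__ == "__main__":
    manifest = ca_configmap(
        "example-ca-bundle",   # hypothetical object name
        "example-namespace",   # hypothetical namespace
        "-----BEGIN CERTIFICATE-----\n...\n-----END CERTIFICATE-----\n",
    )
    print(json.dumps(manifest, indent=2))
```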

add AdditionalTrustBundles to master and worker shims

8d0022b

openshift-ci-robot added the bugzilla/invalid-bug label (indicates that a referenced Bugzilla bug is invalid for the branch this PR is targeting) Nov 7, 2019

openshift-ci-robot added the size/S label (denotes a PR that changes 10-29 lines, ignoring generated files) Nov 7, 2019

openshift-ci-robot requested review from jcpowermac and mtnbikenc November 7, 2019 16:41

openshift-ci-robot added the bugzilla/valid-bug label (indicates that a referenced Bugzilla bug is valid for the branch this PR is targeting) Nov 8, 2019

openshift-ci-robot removed the bugzilla/invalid-bug label (indicates that a referenced Bugzilla bug is invalid for the branch this PR is targeting) Nov 8, 2019

openshift-ci-robot added the do-not-merge/hold label (indicates that a PR should not merge because someone has issued a /hold command) Nov 8, 2019

iamemilio closed this Nov 14, 2019

iamemilio deleted the workers_certs branch November 14, 2019 20:54
