Improved platform tests #1291

soltysh · 2022-11-28T14:52:49Z

/assign @deads2k

enhancements/testing/improved-platform-tests.md

bparees · 2022-12-01T18:05:27Z

enhancements/testing/improved-platform-tests.md

+The major issue with splitting the test binaries to be built from two separate
+repositories is that developers wishing to run kubernetes tests will have to
+build the binary manually, from a separate repository and ensure the binary
+is available during test runs.


or just pull down the latest tests image.

developers wishing to run kubernetes tests will have to
build the binary manually, from a separate repository

@bparees it's not clear to me how it affects the suite kubernetes/conformance currently included in the openshift-tests binary when running:

openshift-tests run kubernetes/conformance [args...]

Will the suite kubernetes/conformance keeps available without additional steps or we'll need to run it from the upstream way (building/downloading the binary [, using sonobuoy's embedded k8s suites], etc) ?

really a question for @soltysh at this point. The current implementation includes the k8s tests in the openshift-tests image and in a separate k8s image, so you can run them from either location (i.e. just having openshift-tests is sufficient to run the k8s tests, though the version of the k8s tests may vary between the two images).

But that is currently done by vendoring the k8s tests into origin, which ultimately we want to stop doing. So i think the end solution looks more like:

separate test images for different tests/suites/components/areas
an aggregated image (openshift-tests) that is built by pulling all those separate images together

and only the aggregated image ships in the payload, and it's all that is needed to run all the tests. That would maintain the current end user experience.

but that is not what has been built/delivered so far, out of a need for expediency.

@mtulio there are 2 possible ways with the current approach:

run openshift-tests as you did before, which will try to pull the appropriate k8s-tests binary from the release image, and it will use that binary for the test execution;

use OPENSHIFT_SKIP_EXTERNAL_TESTS=true env variable, when invoking openshift-tests which will use the embedded version of k8s conformance tests;

We are guaranteeing that the latter approach will also work for cases when pulling the binary from release is not possible for various reasons. When it comes to k8s conformance (which don't change between minor versions) that shouldn't be a problem for you.

Have a look at Risks and Mitigations section, if you don't find answers there, lemme know I'll gladly add whatever details are required.

enhancements/testing/improved-platform-tests.md

dhellmann · 2022-12-14T15:53:29Z

@elmiko @rvanderp3 @mtulio @bostrt @lobziik @julienlim FYI, since this will affect the VCSP work and certification tool. You'll want to work with @soltysh to make sure someone from your team is on the reviewer list for this enhancement.

enhancements/testing/improved-platform-tests.md

mtulio · 2022-12-14T20:24:30Z

enhancements/testing/improved-platform-tests.md

+### Non-Goals
+
+1. Abandon existing tests.
+2. Change the available functionality of `openshift-tests`.


IIUC it is not a goal to change the current content of openshift-tests suites, mainly the conformance ones (openshift/conformance and kubernetes/conformance), is that correct?

My concern is the e2e tests from kubernetes/K8s suite are in openshift/conformance, this suite is a base suite used to validate external clusters by partners on VCSP program - and we understand that the e2e tests in this suite cover all needed to validate an OCP installation.

$ ./openshift-tests run openshift/conformance --dry-run |grep -F "$(./openshift-tests run kubernetes/conformance --dry-run |tail -n1)" "[sig-storage] Subpath Atomic writer volumes should support subpaths with secret pod [Excluded:WindowsDocker] [Conformance] [Suite:openshift/conformance/parallel/minimal] [Suite:k8s]" $ ./openshift-tests run openshift/conformance --dry-run |grep -F "$(./openshift-tests run kubernetes/conformance --dry-run |head -n1)" "[sig-api-machinery] AdmissionWebhook [Privileged:ClusterAdmin] listing mutating webhooks should work [Conformance] [Suite:openshift/conformance/parallel/minimal] [Suite:k8s]" $ ./openshift-tests run openshift/conformance --dry-run |grep -F "$(./openshift-tests run kubernetes/conformance --dry-run |shuf |head -n1)" "[sig-api-machinery] Watchers should observe an object deletion if it stops meeting the requirements of the selector [Conformance] [Suite:openshift/conformance/parallel/minimal] [Suite:k8s]"

everything you said is true, i'm not clear what the concern is though?

we can meet the goal of having k8s tests run as part of the openshift/conformance suite w/o including the k8s tests in the openshift-tests binary.

we can meet the goal of having k8s tests run as part of the openshift/conformance suite w/o including the k8s tests in the openshift-tests binary.

That's what I would like to make sure I got correctly to track changes on the VCSP program/OPCT.

Overall the two steps in this proposal will bring a huge benefit.

@mtulio correct, we will ensure the current workloads continue working, at least wrt k8s conformance tests.

elmiko · 2022-12-14T20:31:11Z

adding myself to cc list for this pr, i have not had a chance to fully digest it yet though.

enhancements/testing/improved-platform-tests.md

elmiko

this makes sense to me and generally seems like a good idea.

openshift-bot · 2023-01-14T01:15:23Z

Inactive enhancement proposals go stale after 28d of inactivity.

See https://github.com/openshift/enhancements#life-cycle for details.

Mark the proposal as fresh by commenting /remove-lifecycle stale.
Stale proposals rot after an additional 7d of inactivity and eventually close.
Exclude this proposal from closing by commenting /lifecycle frozen.

If this proposal is safe to close now please do so with /close.

/lifecycle stale

bertinatto · 2023-01-17T11:54:01Z

/remove-lifecycle stale

openshift-bot · 2023-01-25T00:45:12Z

Stale enhancement proposals rot after 7d of inactivity.

See https://github.com/openshift/enhancements#life-cycle for details.

Mark the proposal as fresh by commenting /remove-lifecycle rotten.
Rotten proposals close after an additional 7d of inactivity.
Exclude this proposal from closing by commenting /lifecycle frozen.

If this proposal is safe to close now please do so with /close.

/lifecycle rotten
/remove-lifecycle stale

openshift-bot · 2023-02-01T08:15:54Z

Rotten enhancement proposals close after 7d of inactivity.

See https://github.com/openshift/enhancements#life-cycle for details.

Reopen the proposal by commenting /reopen.
Mark the proposal as fresh by commenting /remove-lifecycle rotten.
Exclude this proposal from closing again by commenting /lifecycle frozen.

/close

openshift-ci · 2023-02-01T08:16:06Z

@openshift-bot: Closed this PR.

Details

In response to this:

Rotten enhancement proposals close after 7d of inactivity.

See https://github.com/openshift/enhancements#life-cycle for details.

Reopen the proposal by commenting /reopen.
Mark the proposal as fresh by commenting /remove-lifecycle rotten.
Exclude this proposal from closing again by commenting /lifecycle frozen.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

enhancements/testing/improved-platform-tests.md

bparees · 2023-06-20T17:39:57Z

enhancements/testing/improved-platform-tests.md

+2. Remove the requirement that all test code lives in a single repository.
+3. Maintain `openshift-tests` API for running tests.
+4. Ensure that kubernetes tests are still vendored in `openshift-tests` binary,
+   to allow for a fallback when the release payload image is not reachable.


I see this one as temporary....once we start having all teams moving their e2e tests into their own repos, we're just going to have to accept that the test images need to be available or the test job fails.

and we're not going to want to maintain this vendoring indefinitely just to give us a fallback (not to mention that falling back is risky since now you're not necessarily running the tests you thought you were)

For some cases the fallback will be a must, microshift comes to my mind immediately, since they don't provide release image. I'm not sure about hypershift, either. So at least for k8s-related tests I think that might be longer than just temporary. We'll need to re-assess this at a later point in time. That's why I've explicitly called out k8s tests only for backwards compatibility.

I've added this to "Even more future work" section, there's still work about upgrade tests, so this is a good future thing to look at .

if nothing else, fallback needs to be an opt-in behavior (do not fallback to another version of the test code silently or by default).

but this is another reason why producing a single image that contains all the test binaries would be a better solution in my mind.

cc @dhellmann since he can better speak to what microshift does today or would want to support in the future.

Yeah, that explicit fallback is being tracked in openshift/origin#28000 which will eventually allow us to drop the fallback we currently have.

enhancements/testing/improved-platform-tests.md

bparees · 2023-06-20T17:53:42Z

enhancements/testing/improved-platform-tests.md

+
+This proposal might lead to unnecessary proliferation of external tests binaries,
+which might cause problems when the release image is not available, or for local
+development.


hard to imagine a situation where the release image is not available to someone who's trying to run tests

What about microshift?

i'll have to defer to @dhellmann on how they manage this today (where they get the openshift-tests binary from, how they distribute it to customers) on the implications there, but in my mind it's another argument for why we're better off having a single test image that contains all the test binaries. Microshift can then consume/reference that image.

There is no release payload for MicroShift. It looks like the CI job that uses openshift-tests pulls it from an image built as part of the CI system? https://github.com/openshift/release/blob/master/ci-operator/config/openshift/microshift/openshift-microshift-main.yaml#L47

@pacevedom, @pmtk , or @copejon can any of you explain where that binary comes from?

We do not ship the openshift-tests binary as part of MicroShift.

Following the breadcrumbs it's: inputs.test-bin which is

test-bin: name: "4.14" namespace: ocp tag: tests

Which is imported: Tagging ocp/4.14:tests into pipeline:test-bin.
I think it's this image config

tl;dr: we just take it from a promoted image

the workflow where someone is trying to test outside of our CI system and has no release payload?

if they aren't having to modify the tests themselves, they could still use the test binaries from a recent payload image(since @soltysh's current proposal is that there would be multiple test images, one per binary, i defer to him on exactly how a consumer/client identifies all the images needed, pulls them, and extracts all the binaries, but that's something that CI will need to be doing too and i expect it will be relatively straightforward and handled by the openshift-tests binary itself)

if they are modifying the tests locally, then i'd expect them to be able to:

build/get a copy of the openshift-tests wrapper binary

invoke it while pointing it at their local externalized test binary for their component

(it would only run the tests that are built into openshift-tests plus the tests in their binary, but usually when people are doing this they are already narrowing the set of tests down to a few specific ones anyway).

Do those two options cover your use cases?

Number 2 is covered in risks sections, where I'm talking about possibility to use locally available binaries instead of pulling them from release. It's not implemented, but it's rather simple. I'll add it to future work section.

The only downside is you need to know which binaries you care about, but in case of microshift it's rather simple b/c you'll most likely care about k8s mostly.

Off the top of my head, MicroShift would need tests for Routes, SCCs, and oc, as well. Our current goal is to make the entire suite "work" (pass or skip), so we can add a job to ensure new tests also work on MicroShift (pass or skip). When the suite is split up, we could avoid doing that for the entire suite and focus on the parts that are most important.

If there's going to be a way to use local binaries for the tests, we should be able to come up with a way to get the "right" binaries, so I think my concerns would be covered by adding a bit of detail to the future work section as you suggest.

Based on Fabio's comment we'll stick with in-binary k8s tests for now, and we'll slowly work our way out in the future, remembering about cases like CSI and Microshift.

soltysh · 2023-06-26T14:17:20Z

just wanted to share that i've been working with upstream sig cloud provider in attempts to create more generic tests for CCMs, you can see the ideas we've been discussing here https://hackmd.io/@elmiko/BJGn1SQU3

@elmiko thx for that, but I think k8s is in better situation, since they could just export the framework for easier consumption, and the fact that they are directly invoking ginkgo they don't require the same sophistication when splitting the tests into separate binaries, since you can easily treat each one as a separate unit (they can be combined, but I'd probably discourage that).

Our case is a bit different in that we have the set of monitors which are probing the cluster throughout the entire test run, and only then we execute single tests. Upstream k8s doesn't have that functionality, which makes their tests' execution a bit simpler 😉

enhancements/testing/improved-platform-tests.md

bertinatto · 2023-06-29T16:20:50Z

enhancements/testing/improved-platform-tests.md

+   and minimize time when we start running newest kubernetes tests.
+2. Remove the requirement that all test code lives in a single repository.
+3. Maintain `openshift-tests` API for running tests.
+4. Ensure that kubernetes tests are still vendored in `openshift-tests` binary,


IMO this is a must for now, but I'm not sure if it's worth to keep this goal in the long-term. The fallback argument is important during the first stages, but eventually it's all about maintainability winning the game.

enhancements/testing/improved-platform-tests.md

bertinatto

LGTM.

I went through this enhancement and except for nits, I don't have anything to add.

I propose we merge this and work out the remaining details in separate changes.

deads2k · 2023-07-06T20:10:03Z

enhancements/testing/improved-platform-tests.md

+the kubernetes tests vendored since they don't provide release image from which
+we can extract the kubernetes test binaries.
+
+Re-evaluate [alternative approach](#build-time-assembly---aggregating-binaries)


It's probably worth starting this conversation with testplatform in 4.15 to understand what capabilities our CI build system provides and how difficult this actually is long term. I'd really like to see us not have vendored tests. That will discourage separation beyond a few well known cases.

deads2k · 2023-07-06T20:11:21Z

We have a significant improvement already merged. I'm ok landing this as a description, but we need to avoid losing sight of future improvements since we're still stuck vendoring at the moment.

deads2k · 2023-07-06T20:11:26Z

/approve

openshift-ci · 2023-07-06T20:11:50Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: deads2k

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Details

Needs approval from an approver in each of these files:

~~OWNERS~~ [deads2k]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

bertinatto

/lgtm

openshift-ci · 2023-07-07T14:24:15Z

@soltysh: all tests passed!

Full PR test history. Your PR dashboard.

Details

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here.

pierreprinetti · 2023-07-07T09:47:09Z

enhancements/testing/improved-platform-tests.md

+To achieve the first step of the proposal, we need to:
+1. Provide a library for building `<binary>-tests` which exposes two commands:
+   - `list` - responsible for listing tests in JSON format;
+   - `run` - run a single test, returning results in ginkgo compatible format;


This could be run-test, consistently with the main openshift-tests command. Having a compatible API potentially enables interesting future evolutions. For example, external tests may eventually grow some independence from openshift-tests for debugging purposes, and implement other bits of openshift-tests' API.

That's a good catch, thank you. The actual implementation is using run-test, see https://github.com/openshift/kubernetes/blob/0e0d15b865ffc36177dc8770b4723dc14476a630/openshift-hack/cmd/k8s-tests/k8s-tests.go#L64 Fix in #1547

openshift-ci bot assigned deads2k Nov 28, 2022

openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Nov 28, 2022

openshift-ci bot requested review from derekwaynecarr and zaneb November 28, 2022 14:54

deads2k reviewed Nov 28, 2022

View reviewed changes

enhancements/testing/improved-platform-tests.md Outdated Show resolved Hide resolved

dgoodwin reviewed Nov 28, 2022

View reviewed changes

enhancements/testing/improved-platform-tests.md Outdated Show resolved Hide resolved

bparees reviewed Dec 1, 2022

View reviewed changes

enhancements/testing/improved-platform-tests.md Show resolved Hide resolved

bertinatto reviewed Dec 2, 2022

View reviewed changes

enhancements/testing/improved-platform-tests.md Outdated Show resolved Hide resolved

jsafrane reviewed Dec 9, 2022

View reviewed changes

enhancements/testing/improved-platform-tests.md Outdated Show resolved Hide resolved

dhellmann reviewed Dec 14, 2022

View reviewed changes

enhancements/testing/improved-platform-tests.md Outdated Show resolved Hide resolved

enhancements/testing/improved-platform-tests.md Outdated Show resolved Hide resolved

enhancements/testing/improved-platform-tests.md Outdated Show resolved Hide resolved

mtulio reviewed Dec 14, 2022

View reviewed changes

mtulio reviewed Dec 15, 2022

View reviewed changes

enhancements/testing/improved-platform-tests.md Outdated Show resolved Hide resolved

elmiko reviewed Dec 16, 2022

View reviewed changes

openshift-ci bot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 14, 2023

openshift-ci bot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Jan 25, 2023

openshift-ci bot closed this Feb 1, 2023

This was referenced Mar 31, 2023

Running tests using external binary openshift/origin#27570

Merged

Add wrapper which will allow running o/k tests as external binary in origin openshift/kubernetes#1485

Merged

soltysh reopened this Apr 25, 2023

soltysh force-pushed the test_improvements branch 2 times, most recently from bb05711 to 1ae56a3 Compare April 25, 2023 13:54

soltysh force-pushed the test_improvements branch from 239f7ca to 49b77b4 Compare June 20, 2023 14:20

soltysh changed the title ~~[WIP] Initial take for improved platform tests~~ Improved platform tests Jun 20, 2023

openshift-ci bot removed the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Jun 20, 2023

bparees reviewed Jun 20, 2023

View reviewed changes

soltysh added 2 commits June 26, 2023 15:55

Fix risks and mitigrations formating

d3f982d

Address comments from bparees and mtulio

44017a4

bparees reviewed Jun 26, 2023

View reviewed changes

enhancements/testing/improved-platform-tests.md Outdated Show resolved Hide resolved

enhancements/testing/improved-platform-tests.md Outdated Show resolved Hide resolved

enhancements/testing/improved-platform-tests.md Show resolved Hide resolved

soltysh added 2 commits June 29, 2023 13:35

Address additional comments

c9c599c

Add aggregating binaries alternative as a future consideration

6fbe0ba

bparees reviewed Jun 29, 2023

View reviewed changes

bertinatto reviewed Jun 29, 2023

View reviewed changes

ingvagabund reviewed Jun 30, 2023

View reviewed changes

enhancements/testing/improved-platform-tests.md Show resolved Hide resolved

enhancements/testing/improved-platform-tests.md Show resolved Hide resolved

bertinatto reviewed Jun 30, 2023

View reviewed changes

soltysh added the tide/merge-method-squash Denotes a PR that should be squashed by tide when it merges. label Jun 30, 2023

soltysh added 2 commits June 30, 2023 14:54

More comments addressed

5906924

More comments addressed

c825fb2

deads2k reviewed Jul 6, 2023

View reviewed changes

openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jul 6, 2023

bertinatto reviewed Jul 7, 2023

View reviewed changes

openshift-ci bot assigned bertinatto Jul 7, 2023

openshift-ci bot added the lgtm Indicates that a PR is ready to be merged. label Jul 7, 2023

openshift-merge-robot merged commit c77298b into openshift:master Jul 7, 2023

soltysh deleted the test_improvements branch July 7, 2023 14:40

pierreprinetti reviewed Jul 7, 2023

View reviewed changes

soltysh mentioned this pull request Jan 24, 2024

tests: update command name to reflect reality #1547

Merged

Improved platform tests #1291

Improved platform tests #1291

Uh oh!

Conversation

soltysh commented Nov 28, 2022

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

dhellmann commented Dec 14, 2022

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

elmiko commented Dec 14, 2022

Uh oh!

Uh oh!

elmiko left a comment

Choose a reason for hiding this comment

Uh oh!

openshift-bot commented Jan 14, 2023

Uh oh!

bertinatto commented Jan 17, 2023

Uh oh!

openshift-bot commented Jan 25, 2023

Uh oh!

openshift-bot commented Feb 1, 2023

Uh oh!

openshift-ci bot commented Feb 1, 2023

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!