OCPBUGS-61976: fix(azure-metrics): add retry logic with exponential backoff for loadbalancer lookup #30278
base: main
Conversation
Skipping CI for Draft Pull Request.

/test unit

/test verify

/test e2e-azure

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/a65e5280-955a-11f0-9c55-15d55240b00c-0

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/a9663cae-955a-11f0-85a6-3df681f70f55-0

/payload-job periodic-ci-openshift-hypershift-release-4.21-periodics-e2e-azure-aks-ovn-conformance

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/ae7da330-955a-11f0-8893-f3540b276085-0

/test e2e-azure

/payload-job periodic-ci-openshift-hypershift-release-4.21-periodics-e2e-azure-aks-ovn-conformance

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/649155e0-978b-11f0-8b53-b976e8dd9b68-0

/payload-job periodic-ci-openshift-hypershift-release-4.20-periodics-e2e-azure-aks-ovn-conformance

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/0a2b4970-978c-11f0-9d3c-513df797b711-0

/payload periodic-ci-openshift-hypershift-release-4.21-periodics-e2e-aks

/payload-job periodic-ci-openshift-hypershift-release-4.20-periodics-e2e-aks

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/cb48dc40-9790-11f0-8d01-b886ad32595f-0

/unassign @deads2k

/assign @bryan-cox

/assign @sjenning

/unassign @p0lyn0mial

/uncc @p0lyn0mial

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/79d17600-9863-11f0-9f22-6a1bc45aa370-0

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/7f18ee90-9863-11f0-9dd3-db03123ef720-0

/payload-job periodic-ci-openshift-hypershift-release-4.20-periodics-e2e-aks

/payload-job periodic-ci-openshift-hypershift-release-4.21-periodics-e2e-aks

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/8a399860-9863-11f0-8e5f-3ff9deaf6ded-0

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/8dbbe150-9863-11f0-9145-34b833cb87f9-0

There are periodics failing because of this: openshift/hypershift#6872

/lgtm

[APPROVALNOTIFIER] This PR is NOT APPROVED
This pull-request has been approved by: jparrill, sjenning
The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing

Job Failure Risk Analysis for sha: c76c042

/payload-job periodic-ci-openshift-hypershift-release-4.21-periodics-e2e-aks

/payload-job periodic-ci-openshift-hypershift-release-4.20-periodics-e2e-azure-aks-ovn-conformance

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/a8d7b8a0-9918-11f0-9a5c-dc8c920a72ee-0

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/ad0c88b0-9918-11f0-81a9-15908416e332-0

/payload-job periodic-ci-openshift-hypershift-release-4.21-periodics-e2e-azure-aks-ovn-conformance

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/b1e95890-9918-11f0-97b9-0f33933e851b-0

/payload-job periodic-ci-openshift-hypershift-release-4.20-periodics-e2e-azure-aks-ovn-conformance

@jparrill: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command
See details on https://pr-payload-tests.ci.openshift.org/runs/ci/b539e320-9918-11f0-97c9-0cbc1f5822e1-0

/retest

@jparrill: The following tests failed, say
What this PR does / why we need it
Azure AKS conformance tests were failing because the azure-metrics-collector
was attempting to access load balancers before they were fully created,
resulting in 404 ResourceNotFound errors.
This change adds retry logic with exponential backoff to the getLoadBalancerID
function:
The fix ensures the metrics collector waits for load balancer creation
instead of failing immediately on timing issues.
Which issue(s) this PR fixes
Additional Info
periodic-ci-openshift-hypershift-release-4.20-periodics-e2e-azure-aks-ovn-conformance
periodic-ci-openshift-hypershift-release-4.21-periodics-e2e-azure-aks-ovn-conformance
periodic-ci-openshift-hypershift-release-4.20-periodics-e2e-aks
periodic-ci-openshift-hypershift-release-4.21-periodics-e2e-aks