Skip to content

Readiness and Liveness Probe Failing under High CPU Load  #4505

@sreedharbukya

Description

@sreedharbukya

In my production cluster, under high CPU resouce utilization readiness and liveness probe is failing constantly.

I have setup ingress-nginx using helm package

ingress-nginx 3 Thu Aug 29 00:32:40 2019 DEPLOYED nginx-ingress-1.15.0 0.25.0 ingress-nginx

This is event logs from ingress-nginx namesapce.

kubectl get events -n  ingress-nginx
LAST SEEN   TYPE      REASON      KIND   MESSAGE
28m         Warning   Unhealthy   Pod    Readiness probe failed: Get http://100.96.166.13:10254/healthz: net/http: request canceled (Client.Timeout exceeded while awaiting headers)
42m         Warning   Unhealthy   Pod    Liveness probe failed: Get http://100.96.166.13:10254/healthz: net/http: request canceled (Client.Timeout exceeded while awaiting headers)
2m46s       Warning   Unhealthy   Pod    Readiness probe failed: Get http://100.96.164.19:10254/healthz: net/http: request canceled (Client.Timeout exceeded while awaiting headers)
14m         Warning   Unhealthy   Pod    Liveness probe failed: Get http://100.96.164.19:10254/healthz: net/http: request canceled (Client.Timeout exceeded while awaiting headers)
24m         Warning   Unhealthy   Pod    Liveness probe failed: HTTP probe failed with statuscode: 500
9m52s       Warning   Unhealthy   Pod    Liveness probe failed: Get http://100.96.166.12:10254/healthz: net/http: request canceled (Client.Timeout exceeded while awaiting headers)
47m         Warning   Unhealthy   Pod    Readiness probe failed: Get http://100.96.166.12:10254/healthz: net/http: request canceled (Client.Timeout exceeded while awaiting headers)

Is this a BUG REPORT or FEATURE REQUEST? (choose one):
BUG REPORT

NGINX Ingress controller version:
0.25.0

Kubernetes version (use kubectl version):
v1.12.8

Environment:

  • Cloud provider or hardware configuration: AWS
  • OS (e.g. from /etc/os-release): Debian GNU/Linux 9 (stretch)
  • Kernel (e.g. uname -a): 4.9.0-9-amd64
  • Install tools:
  • Others:

What happened:
ingress pods keep restarting.

What you expected to happen:

How to reproduce it (as minimally and precisely as possible):

Anything else we need to know:

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions