read: connection reset by peer at high load #14239

@ashrafguitoni

What version of Knative?

1.7.1

Note: I was able to reproduce the issue in Knative 1.9.X as well.

Expected Behavior

We run a microservices architecture in which some Knative services call other Knative services. Under high load, we expect the system to keep serving requests normally.

Note: some of the services have two user containers: a Python application run via dumb-init and an instance of NVIDIA Triton. The Python application does terminate gracefully (based on the container's log messages when a pod is terminated).

Actual Behavior

Under high load, this error appears (rarely, but consistently):

read tcp 127.0.0.1:49906->127.0.0.1:8877: read: connection reset by peer

Note: 8877 is the port of our application container.

Steps to Reproduce the Problem

I'm not sure whether this is easily reproducible with bare-bones services, but I can try to create a minimal reproducible example if the error isn't already well known.
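
A minimal sketch of what a load harness for such a reproduction could look like, using only the Python standard library. It is not the setup used in our cluster: `run_load`, `fire`, and `classify` are hypothetical helper names, and the request count and concurrency are placeholder values. Because the reset reported above happens on the loopback connection to port 8877, the caller may see it as an HTTP error status rather than as a raw socket error, so the sketch tallies both.

```python
import collections
import concurrent.futures
import urllib.error
import urllib.request


def classify(exc):
    """Map an exception to a short bucket name for counting."""
    # URLError wraps the underlying OSError in .reason; other errors are raw.
    reason = getattr(exc, "reason", exc)
    # Also matches http.client.RemoteDisconnected, a ConnectionResetError subclass.
    if isinstance(reason, ConnectionResetError):
        return "connection_reset"
    return type(reason).__name__


def fire(url, timeout=5.0):
    """Send one GET request and return an outcome label."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            resp.read()
            return f"http_{resp.status}"
    except urllib.error.HTTPError as exc:
        # 4xx/5xx responses, e.g. an error returned by an intermediate proxy.
        return f"http_{exc.code}"
    except Exception as exc:
        return classify(exc)


def run_load(url, total=1000, concurrency=50):
    """Send `total` requests with `concurrency` worker threads; count outcomes."""
    with concurrent.futures.ThreadPoolExecutor(max_workers=concurrency) as pool:
        return collections.Counter(pool.map(fire, [url] * total))
```

For example, `run_load("http://my-service.default.example.com/", total=5000, concurrency=200)` (a placeholder URL) returns a `Counter` of outcome labels. Any non-zero `connection_reset` or `http_5xx` bucket would then need to be correlated with the service's logs to confirm it is the same error.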

Labels

kind/bug: Categorizes issue or PR as related to a bug.
lifecycle/stale: Denotes an issue or PR has remained open with no activity and has become stale.
triage/needs-user-input: Issues which are waiting on a response from the reporter.