Use docker log rotation mechanism instead of logrotate #40634

crassirostris · 2017-01-27T22:57:04Z

This is a solution for #38495.

Instead of rotating logs using logrotate tool, which is configured quite rigidly, this PR makes docker responsible for the rotation and makes it possible to configure docker logging parameters. It solves the following problems:

Logging agent will stop loosing lines upon rotation
Container's logs size will be more strictly constrained. Instead of checking the size hourly, size will be checked upon write, preventing Log rotating not enabled - master killed logs filled up #27754

It's still far from ideal, for example setting logging options per pod, as suggested in #15478 would be much more flexible, but latter approach requires deep changes, including changes in API, which may be in vain because of CRI and long-term vision for logging.

Changes include:

Change in salt. It's possible to configure docker log parameters, using variables in pillar. They're exported from env variables on gce, but for different cloud provider they have to be exported first.
Change in configure-helper.sh scripts for those os on gce that don't use salt + default values exposed via env variables

This change may be problematic for kubelet logs functionality with CRI enabled, that will be tackled in the follow-up PR, if confirmed.

CC @piosz @Random-Liu @yujuhong @dashpole @dchen1107 @vishh @kubernetes/sig-node-pr-reviews

On GCI by default logrotate is disabled for application containers in favor of rotation mechanism provided by docker logging driver.

k8s-reviewable · 2017-01-27T22:58:10Z

This change is

vishh · 2017-01-28T00:36:11Z

@dashpole can you do an initial review?

dashpole · 2017-01-28T00:43:16Z

Sure

dashpole · 2017-01-28T00:54:38Z

Do we need to do anything for providers other than GCE?

crassirostris · 2017-01-28T01:02:54Z

@dashpole If provider uses salt, it will pick up this change automatically

Otherwise this change doesn't influence provider, but its owners can make mirroring changes. As for ability to configure parameters using environment of kube-up.sh for different providers, as I've mentioned in the PR text, it requires copying several variables to pillar, which can also be done by provider's owners.

dashpole · 2017-01-30T17:33:10Z

Just to confirm, this only disables logrotate for docker, right?
One thing we may want to discuss are the defaults. The default in this PR is 5 rotations of 10mb each.
@crassirostris were you able to re-run your experiment to see if the missing log lines problem was fixed?
Code changes lgtm

yujuhong · 2017-01-30T17:43:02Z

This PR changes all OS images in GCE clusters to use docker's native log rotation. Could we instead change only the configuration relevant to GKE (i.e., GCI)?

Also ping @dchen1107, who wanted to take a look.

crassirostris · 2017-02-21T18:31:19Z

@mikedanese It needs your approval

piosz · 2017-02-22T11:04:14Z

@mikedanese @roberthbailey we need this for 1.6

roberthbailey · 2017-02-23T06:10:24Z

i approve conditional on @dchen1107 giving this an lgtm

piosz · 2017-02-23T06:24:42Z

@roberthbailey thanks!

dchen1107 · 2017-02-23T17:52:34Z

Sorry for the late response. Thought we already agreed upon the scope (limited to GCI image only) and the solution through an offline discussions.

/lgtm and
/approve

k8s-github-robot · 2017-02-23T17:59:14Z

[APPROVALNOTIFIER] This PR is APPROVED

The following people have approved this PR: Crassirostris, dchen1107

Needs approval from an approver in each of these OWNERS Files:

~~cluster/OWNERS~~ [dchen1107]
~~cluster/gce/OWNERS~~ [dchen1107]

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

crassirostris · 2017-02-24T15:12:09Z

Applying lgtm from Dawn's comment

crassirostris · 2017-02-24T15:51:20Z

@k8s-bot kubemark e2e test this

crassirostris · 2017-02-24T15:53:53Z

@k8s-bot kops aws e2e test this

crassirostris · 2017-02-24T16:29:02Z

@k8s-bot kubemark e2e test this kubernetes/test-infra#2012

crassirostris · 2017-02-24T17:47:43Z

@k8s-bot cvm gce e2e test this
@k8s-bot kubemark e2e test this

k8s-github-robot · 2017-02-27T04:39:33Z

Automatic merge from submit-queue

kfox1111 · 2017-03-10T01:59:26Z

I'm guessing this is probably a rare edge case, but may possibly be hit if the log server being shipped to is down for a while. What happens if the log shipper was restarted before all the old data was shipped and after the symlinks were updated? The shipper would no longer know about the old file and loose data?

crassirostris · 2017-03-10T02:28:44Z

@kfox1111 Yes, that situation is possible, you have to be ready.

Moreover, if log shipper for some reason doesn't keep a track of a log file for some time and rotation happens twice, some potion of the logs is lost too. That doesn't depend on the way log files are rotated, or the way logs are written, it still may happen with journald or logrotate.

krmayankk · 2017-05-09T20:20:14Z

@crassirostris is this fix only for GCE based k8s deployments or for bare metal as well ?

dashpole · 2017-05-09T20:23:40Z

Its only for GCE based deployments

piosz · 2017-05-11T13:40:30Z

Due to some objections that I don't remember (@crassirostris can explain) we introduced this change only for GCE using GCI/COS. You can you similar approach in your deployment.

crassirostris · 2017-05-11T13:47:24Z

B/c it would be too disturbing otherwise, with possible long-lasting implications in the setups we don't control and don't test

alexbrand · 2017-05-23T17:43:16Z

Hey @crassirostris,

The logging documentation here says:

An important consideration in node-level logging is implementing log rotation, so that logs don’t consume all available storage on the node. Kubernetes uses the logrotate tool to implement log rotation.
Kubernetes performs log rotation daily, or if the log file grows beyond 10MB in size. Each rotation belongs to a single container; if the container repeatedly fails or the pod is evicted, all previous rotations for the container are lost. By default, Kubernetes keeps up to five logging rotations per container.

Are these docs correct? It seems like rotation is not happening on a cluster I setup outside of GCE. I am wondering if those docs are just out of date, or if I am misunderstanding something.

crassirostris · 2017-05-23T18:38:18Z

@alexbrand The documentation is little bit obsolete for the COS image on GCP, otherwise (e.g. debian on GCP or ubuntu on AWS) it's actually true. BUT it only applies to clusters brought up by kube-up.sh script, I think that's your problem. Generally, kubernetes per se doesn't handle log rotation (as for now, that's yet to be discussed), you have to set it up independently in your installation, e.g. by configuring logrotate script in crone, as kube-up.sh does it by default or by configuring the parameters of Docker the way it's done in cos on GCP now. I hope that answers your question.

Sorry for that misunderstanding, I'll patch the documentation. Thanks a lot for pointing that out!

alexbrand · 2017-05-23T18:40:46Z

@crassirostris Got it! That is what I kinda assumed, but wanted to make sure. Cheers!

crassirostris added the sig/node Categorizes an issue or PR as relevant to SIG Node. label Jan 27, 2017

crassirostris added this to the v1.6 milestone Jan 27, 2017

k8s-ci-robot added cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Jan 27, 2017

k8s-github-robot assigned roberthbailey Jan 27, 2017

k8s-github-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. release-note Denotes a PR that will be considered when it comes time to generate release notes. labels Jan 27, 2017

vishh assigned dashpole and unassigned roberthbailey Jan 28, 2017

yujuhong requested a review from dchen1107 January 28, 2017 01:35

k8s-github-robot assigned roberthbailey and zmerlynn and unassigned dashpole and roberthbailey Jan 30, 2017

zmerlynn assigned dashpole and roberthbailey and unassigned zmerlynn and roberthbailey Jan 30, 2017

k8s-github-robot assigned mikedanese and eparis and unassigned dashpole and mikedanese Jan 30, 2017

apelisse assigned dashpole Jan 31, 2017

piosz assigned dchen1107 Feb 23, 2017

k8s-github-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Feb 23, 2017

crassirostris added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Feb 24, 2017

k8s-github-robot merged commit b18bad1 into kubernetes:master Feb 27, 2017

faraazkhan mentioned this pull request Mar 10, 2017

Use built in log rotation capabilities of the docker daemon kubernetes/kops#2095

Merged

crassirostris mentioned this pull request Apr 5, 2017

Create guidelines for consuming application logs #42718

Closed

chancez mentioned this pull request Apr 11, 2017

Enable customization of Docker log rotation coreos/tectonic-installer#221

Closed

crassirostris mentioned this pull request May 26, 2017

Fix log rotation description in the logging doc kubernetes/website#3918

Merged

blakebarnett mentioned this pull request Sep 22, 2017

Log rotation for containers running on master nodes kubernetes/kops#3410

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use docker log rotation mechanism instead of logrotate #40634

Use docker log rotation mechanism instead of logrotate #40634

crassirostris commented Jan 27, 2017 •

edited

k8s-reviewable commented Jan 27, 2017

vishh commented Jan 28, 2017

dashpole commented Jan 28, 2017

dashpole commented Jan 28, 2017

crassirostris commented Jan 28, 2017

dashpole commented Jan 30, 2017

yujuhong commented Jan 30, 2017

crassirostris commented Feb 21, 2017

piosz commented Feb 22, 2017

roberthbailey commented Feb 23, 2017 •

edited

piosz commented Feb 23, 2017

dchen1107 commented Feb 23, 2017

k8s-github-robot commented Feb 23, 2017

crassirostris commented Feb 24, 2017

crassirostris commented Feb 24, 2017

crassirostris commented Feb 24, 2017

crassirostris commented Feb 24, 2017 •

edited by fejta

crassirostris commented Feb 24, 2017

k8s-github-robot commented Feb 27, 2017

kfox1111 commented Mar 10, 2017

crassirostris commented Mar 10, 2017

krmayankk commented May 9, 2017

dashpole commented May 9, 2017

piosz commented May 11, 2017

crassirostris commented May 11, 2017

alexbrand commented May 23, 2017

crassirostris commented May 23, 2017

alexbrand commented May 23, 2017

Use docker log rotation mechanism instead of logrotate #40634

Use docker log rotation mechanism instead of logrotate #40634

Conversation

crassirostris commented Jan 27, 2017 • edited

k8s-reviewable commented Jan 27, 2017

vishh commented Jan 28, 2017

dashpole commented Jan 28, 2017

dashpole commented Jan 28, 2017

crassirostris commented Jan 28, 2017

dashpole commented Jan 30, 2017

yujuhong commented Jan 30, 2017

crassirostris commented Feb 21, 2017

piosz commented Feb 22, 2017

roberthbailey commented Feb 23, 2017 • edited

piosz commented Feb 23, 2017

dchen1107 commented Feb 23, 2017

k8s-github-robot commented Feb 23, 2017

crassirostris commented Feb 24, 2017

crassirostris commented Feb 24, 2017

crassirostris commented Feb 24, 2017

crassirostris commented Feb 24, 2017 • edited by fejta

crassirostris commented Feb 24, 2017

k8s-github-robot commented Feb 27, 2017

kfox1111 commented Mar 10, 2017

crassirostris commented Mar 10, 2017

krmayankk commented May 9, 2017

dashpole commented May 9, 2017

piosz commented May 11, 2017

crassirostris commented May 11, 2017

alexbrand commented May 23, 2017

crassirostris commented May 23, 2017

alexbrand commented May 23, 2017

crassirostris commented Jan 27, 2017 •

edited

roberthbailey commented Feb 23, 2017 •

edited

crassirostris commented Feb 24, 2017 •

edited by fejta