Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rotate pgAdmin/gunicorn logs according to retentionPeriod #4101

Merged
merged 9 commits into from
Mar 7, 2025

Conversation

benjaminjb
Copy link
Contributor

Checklist:

  • Have you added an explanation of what your changes do and why you'd like them to be included?
  • Have you updated or added documentation for the change, as applicable?
  • Have you tested your changes on all related environments with successful results, as applicable?
    • Have you added automated tests?

Type of Changes:

  • New feature
  • Bug fix
  • Documentation
  • Testing enhancement
  • Other

What is the current behavior (link to any open issues here)?

pgAdmin/Gunicorn log rotation is hard-coded

What is the new behavior (if this is a feature change)?

  • Breaking change (fix or feature that would cause existing functionality to change)

pgAdmin has a built in logfile rotation system, as does gunicorn. We want to use those systems, but respect the retention setting that a user sets in the instrumentation section. This PR uses the RetentionPeriod to

  • set the number of backups for gunicorn and pgAdmin;
  • set the rotation cycle for pgAdmin.

Other Information:

Issues: [PGO-2168]

Copy link
Contributor

@dsessler7 dsessler7 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Comment on lines 381 to 388
)
var (
maxBackupRetentionNumber = "1"
// One day in minutes for pgadmin rotation
pgAdminRetentionPeriod = "24 * 60"
// Daily rotation for gunicorn rotation
gunicornRetentionPeriod = "D"
)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thinking out loud: So it seems like we are defaulting to retaining a day's worth of logs whether we have the OpenTelemetryLogs feature enabled or not? Which looks like it was already the case for pgAdmin, but for gunicorn the behavior is changing from retaining based on size to time?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🤔 Yeah -- and actually if we go back far enough, we weren't actually making any changes to the default logging until we added OTel. Which makes me think we should maybe only add these logging configs if OTel is enabled...

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

mmhmm

Copy link
Contributor Author

@benjaminjb benjaminjb Mar 3, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK, we made some changes about where we're logging, so I couldn't put all the logging stuff in as optional, but made some changes to get us closer to that original state if you're not using the new telemetry.

Issues: [PGO-2168]
@benjaminjb benjaminjb force-pushed the benjb/pgadmin-logrotate branch from 2d85f6d to 1d8abd3 Compare March 2, 2025 03:40
Comment on lines 454 to 455
logconfig_dict['loggers']['gunicorn.access']['handlers'] = ['file']
logconfig_dict['loggers']['gunicorn.error']['handlers'] = ['file']
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

🤔 What are these handlers before we overwrite them?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We start with the default settings from here, so

    "loggers": {
        "gunicorn.error": {
            "level": "INFO",
            "handlers": ["error_console"],
            "propagate": True,
            "qualname": "gunicorn.error"
        },

        "gunicorn.access": {
            "level": "INFO",
            "handlers": ["console"],
            "propagate": True,
            "qualname": "gunicorn.access"
        }
    },

So we set all our logger.*.handlers to file, and then below we set the handlers.file, so we skip using the default handlers (right below in the default settings):

    "handlers": {
        "console": {
            "class": "logging.StreamHandler",
            "formatter": "generic",
            "stream": "ext://sys.stdout"
        },
        "error_console": {
            "class": "logging.StreamHandler",
            "formatter": "generic",
            "stream": "ext://sys.stderr"
        },
    },

Copy link
Member

@cbandy cbandy left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for making those changes. One question, but not a blocker.

Comment on lines 272 to 273
" --reload-extra-file " + configMountPath + `/` + gunicornLogConfFilePath +
" --log-config-json " + configMountPath + `/` + gunicornLogConfFilePath +
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍🏻 Nice! We can do a lot with config files.

❓ My read of reload_extra_files is that it requires reload to be enabled. Do we use that?

❓ I see reload_extra_files and logconfig_json mentioned in the config file settings. Is it possible to set these through settings in generateGunicornConfig()? 🤔 If so, we might end up with all JSON and no change to the entrypoint?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

See benoitc/gunicorn#3353 for an issue I ran into.

Not in that GH issue, but as a precursor to it: all the reload stuff is expressly for development. So I was thinking of adding a SIGHUP signal into our mtime checking loop, only SIGHUP doesn't work -- but SIGKILL might if we want to do that every time a cluster changes or the config changes. (The mtime for all those files change together since they are in the same configmap.)

@benjaminjb benjaminjb merged commit c5665b6 into main Mar 7, 2025
19 checks passed
@benjaminjb benjaminjb deleted the benjb/pgadmin-logrotate branch March 7, 2025 00:06
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants