Skip to content
This repository has been archived by the owner on Jan 19, 2023. It is now read-only.

pypi/cdn-log-archiver

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PyPI CDN Log Archiver

  1. Get raw logs from S3 like object store, written by the cdn
  2. Archive and compress them
  3. Upload archive back to S3 like object store, in a sane structure
  4. Remove raw logs that were archived

Usage

You'll need to export some env variables to get started:

$ export ACCESS_KEY=<your_access_key>
$ export SECRET_KEY=<your_access_key>
$ export PYPI_LOG_BUCKET=<bucket_name>
$ export S3_HOST=<s3_like_host> (e.g. s3.amazonaws.com, objects.dreamhost.com)

Optionally you can set the date to archive like so:

$ export PYPI_LOG_DATE=2014-02-11

You can enable some debug output with:

$ export DEBUG=10

If you don't set PYPI_LOG_DATE, the script will default to looking for yesterdays date.

Now you should be able to run it:

$ python pypi-cdn-log-archiver.py
112373 bytes written to testpypi-cdn-logs/archive/2014/2

About

Compress raw logs, organize them.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages