The commoncrawl bucket stores many files containing lists of URLs, which you can use to download a collection of files produced by a web crawler.
For this task, the bucket commoncrawl
and the key crawl-data/CC-MAIN-2022-05/wet.paths.gz
were provided. The objective is to pick a URL from the path file located in the bucket and download a file containing the web crawler's data.
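The steps above can be sketched in Python. This is a minimal example, not the repository's actual code: it assumes anonymous (unsigned) S3 access via boto3, and that entries in wet.paths.gz are relative keys that can also be fetched from the public HTTPS mirror at data.commoncrawl.org (the bucket name and key come from the task description; the mirror URL and output filename are assumptions).

```python
import gzip
import io

# Bucket and key from the task description; the HTTPS mirror is an assumption.
BUCKET = "commoncrawl"
PATHS_KEY = "crawl-data/CC-MAIN-2022-05/wet.paths.gz"
HTTPS_PREFIX = "https://data.commoncrawl.org/"


def full_url(path: str) -> str:
    """Turn a relative key from wet.paths.gz into a downloadable HTTPS URL."""
    return HTTPS_PREFIX + path.strip()


def pick_first_path(paths_gz_bytes: bytes) -> str:
    """Decompress the gzipped path file and return its first entry."""
    with gzip.open(io.BytesIO(paths_gz_bytes), "rt") as fh:
        return fh.readline().strip()


if __name__ == "__main__":
    # Unsigned requests let you read the public bucket without AWS credentials
    # (requires `pip install boto3`).
    import boto3
    from botocore import UNSIGNED
    from botocore.config import Config

    s3 = boto3.client("s3", config=Config(signature_version=UNSIGNED))
    obj = s3.get_object(Bucket=BUCKET, Key=PATHS_KEY)
    first = pick_first_path(obj["Body"].read())
    print("Downloading", full_url(first))
    # Download the crawler-data file itself straight from S3.
    s3.download_file(BUCKET, first, "crawl-data.warc.wet.gz")
```

The network calls are kept under the `__main__` guard so the two helpers can be reused or tested without touching S3.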
Krisalyd/aws-s3-file-downloader
Testing file download from AWS's S3 Bucket with Python.