Services: S3, Athena, Redshift, Glue, Quicksight, Sagemaker, IoT, Kinesis

sujikathir/Using-AWS-S3-for-Data-Storage

AWS S3 Project: Working with S3 Buckets and Data Files

This README provides detailed instructions on how to create an S3 bucket, upload raw data files, query data with S3 Select, and change object properties such as server-side encryption and storage class. Follow these steps to replicate the setup and operations.

Prerequisites

  • AWS account
  • AWS Management Console access

Steps to Create and Configure an S3 Bucket

  1. Create an S3 Bucket
  • Navigate to S3 in the AWS Management Console.
  • Click on the "Create bucket" button on the right side.

  • Enter a name for your S3 bucket using the format myawsticketdata followed by your unique ID, because bucket names must be globally unique. Review the naming rules if needed.
  • Select the AWS Region named "US West (Oregon) us-west-2".
  • Under the Block Public Access header, clear the checkbox for "Block all public access". Then select the checkbox to acknowledge that the current settings might result in the bucket and its objects becoming public.

  • Leave the Default encryption and other settings with their defaults. Click on the "Create bucket" button.
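
The console steps above can also be sketched with boto3. This is a minimal sketch, not the method used in this project: it assumes boto3 is installed and AWS credentials are configured, and the helper names (build_create_bucket_args, create_bucket) and the example bucket name are hypothetical.

```python
def build_create_bucket_args(bucket_name: str, region: str = "us-west-2") -> dict:
    """Build keyword arguments for the S3 create_bucket call."""
    args = {"Bucket": bucket_name}
    # us-east-1 is the default region and must not be sent as a LocationConstraint.
    if region != "us-east-1":
        args["CreateBucketConfiguration"] = {"LocationConstraint": region}
    return args


# Equivalent of clearing "Block all public access" in the console:
# all four block settings are turned off.
PUBLIC_ACCESS_OFF = {
    "BlockPublicAcls": False,
    "IgnorePublicAcls": False,
    "BlockPublicPolicy": False,
    "RestrictPublicBuckets": False,
}


def create_bucket(bucket_name: str, region: str = "us-west-2") -> None:
    """Create the bucket and turn off the public access block (calls AWS)."""
    import boto3  # imported lazily so the helper above works without boto3

    s3 = boto3.client("s3", region_name=region)
    s3.create_bucket(**build_create_bucket_args(bucket_name, region))
    s3.put_public_access_block(
        Bucket=bucket_name,
        PublicAccessBlockConfiguration=PUBLIC_ACCESS_OFF,
    )


# Example call (hypothetical bucket name; replace the suffix with your unique ID):
# create_bucket("myawsticketdata-yourid")
```

Default encryption and the other settings are left untouched here, matching the console walkthrough.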

  2. Upload Raw Data Files to the S3 Bucket
  • Download the .txt data files provided in this repository.
  • Ensure you have seven folders each containing a data file.
  • Choose the bucket you created in the previous steps by clicking on the blue name of the bucket.

  • After the bucket opens, click the "Upload" button.

  • Monitor the file upload progress status bar.
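
As a rough scripted alternative to the console upload, the sketch below walks a local directory and uploads every file, keeping the folder name as the key prefix. It assumes boto3 and configured credentials; plan_uploads and upload_folder are hypothetical helper names, and the local directory layout is an assumption based on the folders described above.

```python
from pathlib import Path


def plan_uploads(root) -> list:
    """Return sorted (local_path, s3_key) pairs for every file under root."""
    root = Path(root)
    return sorted(
        (str(path), path.relative_to(root).as_posix())
        for path in root.rglob("*")
        if path.is_file()
    )


def upload_folder(root, bucket_name: str) -> None:
    """Upload every file under root, preserving folder names as key prefixes."""
    import boto3  # imported lazily so plan_uploads works without boto3

    s3 = boto3.client("s3")
    for local_path, key in plan_uploads(root):
        s3.upload_file(local_path, bucket_name, key)
        print(f"uploaded {key}")


# Example call (hypothetical local path and bucket name):
# upload_folder("data", "myawsticketdata-yourid")
```

Printing each key after its upload is a simple stand-in for the progress bar shown in the console.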

  3. Query Data with S3 Select
  • Open one of the uploaded data files and click on the "Versions" tab to check whether versioning is turned on.

  • After choosing the data file, click the "Object actions" menu and choose "Query with S3 Select".

  • It will open the input and output settings. Click on the "Run SQL query" button to see the records.

  • Choose the "Add SQL from templates" button.

  • Choose the radio button beside SELECT COUNT(*) FROM s3object s. Click the "Copy SQL" button.

  • Paste the copied SQL over the previous query and click on the "Run SQL query" button. This query returns the number of records in the data file.
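
The same count query can be sketched with boto3's select_object_content call. This is an illustration under assumptions rather than the project's own code: the helper names are hypothetical, the input is treated as delimited text without a header row, and the field delimiter defaults to a comma, which may not match the actual data files.

```python
COUNT_QUERY = "SELECT COUNT(*) FROM s3object s"


def build_select_args(bucket_name: str, key: str,
                      expression: str = COUNT_QUERY,
                      delimiter: str = ",") -> dict:
    """Build keyword arguments for select_object_content on a delimited file."""
    return {
        "Bucket": bucket_name,
        "Key": key,
        "Expression": expression,
        "ExpressionType": "SQL",
        # Assumption: no header row; adjust the delimiter to match the data file.
        "InputSerialization": {
            "CSV": {"FileHeaderInfo": "NONE", "FieldDelimiter": delimiter}
        },
        "OutputSerialization": {"CSV": {}},
    }


def collect_records(events) -> str:
    """Join the payload bytes of all Records events from the response stream."""
    chunks = [event["Records"]["Payload"] for event in events if "Records" in event]
    return b"".join(chunks).decode("utf-8")


def run_select(bucket_name: str, key: str,
               expression: str = COUNT_QUERY, delimiter: str = ",") -> str:
    """Run the query against one object and return the result as text (calls AWS)."""
    import boto3  # imported lazily so the helpers above work without boto3

    s3 = boto3.client("s3")
    response = s3.select_object_content(
        **build_select_args(bucket_name, key, expression, delimiter)
    )
    return collect_records(response["Payload"])


# Example call (hypothetical bucket name and object key):
# print(run_select("myawsticketdata-yourid", "folder/datafile.txt"))
```

The response payload is an event stream, so the result is assembled from its Records events; Stats and End events are ignored.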

  4. Change Properties of Encryption and Type of Storage
  • Go to the bucket you created and click on it. Click on the "Objects" tab. Select all the previously loaded data folders/files. Click the "Actions" menu and choose "Edit server-side encryption".

  • In the Server-side encryption settings section, click the radio button to "Enable" and then click the "Save changes" button. Click the "Close" button.
  • Select all the folders, click on the "Actions" menu, and select "Edit storage class".
  • Choose Intelligent-Tiering and then click on the "Save changes" button.
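
For existing objects, one way to approximate both console edits in code is to copy each object onto itself with new settings. The sketch below assumes boto3 with configured credentials, assumes Amazon S3 managed keys (AES256) as the encryption type since the steps above do not name one, and uses hypothetical helper names. A single copy_object call handles objects up to 5 GB.

```python
def build_copy_args(bucket_name: str, key: str,
                    sse: str = "AES256",
                    storage_class: str = "INTELLIGENT_TIERING") -> dict:
    """Build copy_object arguments that rewrite an object in place."""
    return {
        "Bucket": bucket_name,
        "Key": key,
        # Source and destination are the same object.
        "CopySource": {"Bucket": bucket_name, "Key": key},
        "ServerSideEncryption": sse,      # assumption: SSE-S3 managed keys
        "StorageClass": storage_class,    # Intelligent-Tiering, as chosen above
        "MetadataDirective": "COPY",      # keep the existing object metadata
    }


def update_all_objects(bucket_name: str) -> None:
    """Apply encryption and storage class to every object in the bucket (calls AWS)."""
    import boto3  # imported lazily so build_copy_args works without boto3

    s3 = boto3.client("s3")
    paginator = s3.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=bucket_name):
        for obj in page.get("Contents", []):
            s3.copy_object(**build_copy_args(bucket_name, obj["Key"]))


# Example call (hypothetical bucket name):
# update_all_objects("myawsticketdata-yourid")
```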
