Skip to content

Awarded 1st position in 'Google-Krenicki Hackathon' for cloud based ML framework to predict US household income for real-estate brokerage firm; using Google Colab and AWS S3

Notifications You must be signed in to change notification settings

akshay-madar/applied-ML-hackathon-by-krenicki

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Applied Machine Learning Hackathon by Krenicki Center at Krannert and Google

  • Awarded 1st position for implementing cloud based ML framework to predict US household income for real-estate brokerage firm
  • Identified target segments for promoting low-cost housing options and used smote for oversampling to achieve AUC of 90.6

BAIM team wins hackathon with Krenicki Center for Business Analytics and Machine Learning:

Official Krannert News featured on - https://krannert.purdue.edu/news/features/?story=5773

Purdue Krenicki Center for BA and ML tweets - @PurdueKrenicki:

Abhishek Kumar, Technical Lead at Google AI, discussing #MachineLearning for decision making at @PurdueKrannert. The event is organized by @PurdueKrenicki. It also consists of a hackathon in which 42 MS-BAIM students participate. pic.twitter.com/AoEe0xAW6S

— Purdue Krenicki Center for BA and ML (@PurdueKrenicki) October 25, 2019

Purdue MBA tweets - @PurdueMBA:

A team of MS Business Analytics & Information Management program students, Akshay Madar, Robin Jindal, Karthik, Dibyamshu Shrestha, and Mehul Zawar, won the Applied Machine Learning Hackathon hosted by Krenicki Center last week! Great job team! #BoilerUp! pic.twitter.com/bssruNkIxb

— Purdue MBA (@PurdueMBA) November 1, 2019

Machine Learning algorithms used:

Logistic Regression, SVM, Decision Tree, Random Forest, Neural Networks

Why Google Colaboratory?

Colaboratory, or "Colab" for short, allows you to write and execute Python in your browser, with

  • Zero configuration required
  • Free access to GPUs
  • Easy sharing

Whether you're a student, a data scientist or an AI researcher, Colab can make your work easier. You can also harness the full power of popular Python libraries to analyze and visualize data.

Why Amazon S3?

Amazon Simple Storage Service (Amazon S3) is an object storage service that offers industry-leading scalability, data availability, security, and performance. This means customers of all sizes and industries can use it to store and protect any amount of data for a range of use cases, such as websites, mobile applications, backup and restore, archive, enterprise applications, IoT devices, and big data analytics. Amazon S3 provides easy-to-use management features so you can organize your data and configure finely-tuned access controls to meet your specific business, organizational, and compliance requirements.

Boto 3:

Boto is the Amazon Web Services (AWS) SDK for Python. It enables Python developers to create, configure, and manage AWS services, such as EC2 and S3. Boto provides an easy to use, object-oriented API, as well as low-level access to AWS services.

Set up Boto credentials to pull data from S3:

 import boto3  
 import botocore  
 BUCKET_NAME = 'amazing-bucket-am-1' # replace with your bucket name
 
 # enter authentication credentials
 #  Credentials for your AWS account can be found in the IAM Console. You can create or use an existing user. Go to manage access keys and generate a new set of keys.
 s3 = boto3.resource('s3', aws_access_key_id = 'ENTER YOUR ACCESS KEY', aws_secret_access_key= 'ENTER YOUR SECRET KEY')  

Download "training.csv" from S3:

KEY = 'hackathon/training.csv' # replace with your object key  

try:  
  # we are trying to download training set from s3 with name `training.csv` to colab dir with name `training.csv`  
  s3.Bucket(BUCKET_NAME).download_file(KEY, 'training.csv')  
except botocore.exceptions.ClientError as e:  
  if e.response['Error']['Code'] == "404":  
    print("The object does not exist.")  
  else:  
    raise  

About

Awarded 1st position in 'Google-Krenicki Hackathon' for cloud based ML framework to predict US household income for real-estate brokerage firm; using Google Colab and AWS S3

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages