Skip to content

IV Sem IDS Course project done under Dr. Sakthi Balan Muthiah and Dr. Subrat Kumar Dash

License

Notifications You must be signed in to change notification settings

anshuljain21120/IDS-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 

Repository files navigation

IDS-Project

Problem Description

Applying ML Classification algorithms on the data set and getting inferences from the data. You may use the appropriate ML algorithm packages available in R or Python. But know which algorithm you use and know the concept behind it.

Project Guidelines

  1. Collect a dataset from https://archive.ics.uci.edu/ml/datasets.php
  2. Understand the various features (columns) of the dataset.
  3. Do preprocessing on the dataset.
  4. Do preliminary analysis on the dataset as much as needed and get a summarisation of your inferences with proper validation
  5. You may use ether R or Python for this project
  6. You may use any appropriate ML classification algorithm (you need to justify why you are using the same)
  7. If the dataset is huge start doing your analysis on a random sample dataset from the original huge dataset and slowly scale it up.
  8. Write a complete report as with needed plots, charts, bar graphs, pie charts and so on and submit it. Report should also contain the description about the data set, preprocessing done - why, how and what, analysis done / ML classification done, for analysis what is the purpose of doing, for ML why specific algorithms / packages were used, what is the output, what are you inferences from the output, what are your questions about the data? what are questions addressed by your analysis and so on.
  9. Report should also contain all the details from where you collected the dataset, how many rows and columns are there, what are the features, why we do such and such preprocessing and after preprocessing what is changed in the dataset, what are your basic questions about your data, how you are going to address those questions through an analysis (give this with substantiation about why you use such analysis).

My Solution