Skip to content

AM1CODES/Kaggle-Survey-2020-Competition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

drawing

Kaggle-Survey-2020-Commpetition

The code in this Repo is submission for the annual Kaggle ML & DS survey Competition.

Our goal

In this competition we were given the data of the annual survey that Kaggle conducts every year and we were asked to explore this data and come up with some conclusions that might be unique and wouldn't be visible if we just glanced through the data.

I performed EDA on the data on features like - Age of a person, the country they live in, The Language they use to code, the IDEs they preder, their job titles and much more. After exploration we were able to come up with some conclusions which i have mentioned at the end of my notebook.

Few Results and Observations

  1. Most Kaggle user's are quite young with their age between 22-29.
  2. The Number of Men using Kaggle is huge as compared to the Woman. But we could see a significant growth in number of female Kagglers recently.
  3. Most Kaggler's are from India followed by USA and other countries.
  4. Most Kagglers have a Master's Degree.
  5. Majority of Kagglers are Students followed by Data Scientists and Machine Learning Engineers.
  6. Most Kagglers have Experience of 3-5 Years in the Programming and then there are Kagglers with an experience of 1-2 years.
  7. Most Kagglers use Python followed by SQL and R.
  8. The most Preffered IDEs are Jupyter, VScode and PyCharm.
  9. Most Recommended Languages for Data Science Beginners is Python followed by R.
  10. The most used data visualization Libraries are Matplotlib and Seaborn.
  11. The most used framework for Machine learning and Deep learning is Sci-Kit learn followed by Tensorflow along with Keras.
  12. The Most commonly used algorithms are Regression based followed by Decision trees, random forests and so on.
  13. Most users share their work on Github followed by Kaggle and Colab. There are also many who dont like to share their work.
  14. Most users preferred Coursera to learn Data science and Machine Learning followed by Kaggle Courses and Udemy.
  15. Most users make use of Kaggle notebooks and forums to stay updated about latest Data science and ML topics followed by Youtube and Blogs on various websites.

About

This was my submission for Annual Kaggle Survey Competition.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages