
This repository collects the projects carried out while learning and experimenting with Big Data Analytics. The scripts build machine learning models for making predictions.


gauravsharma884/Big-Data-And-Data-Analytics


Big-Data-And-Data-Analytics

This repository contains the following project files:

  • Start Up Succes Prediction.ipynb
  • NBA Players Career Duration.ipynb

Table of contents for the project files:

  1. Data Exploration
  2. Exploratory Data Analysis and Developing Insights with Data Visualization
  3. Feature Engineering (creating new features)
  4. Discovering whether the data possesses any structure
  5. Data pipelining for Data Processing and Modeling
  6. Cross-Validation
  7. Hyperparameter tuning using both Grid Search and Randomized Search
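
The notebooks themselves are not reproduced here, so the following is only a dependency-free sketch of what steps 6 and 7 involve: k-fold cross-validation, an exhaustive grid search, and a randomized search over the same parameter space. The toy one-dimensional data, the nearest-neighbour classifier, and every function name below are assumptions made for illustration, not the repository's code. In practice, a library such as scikit-learn provides ready-made equivalents (`Pipeline`, `GridSearchCV`, `RandomizedSearchCV`).

```python
import random
from itertools import product
from statistics import mean


def k_fold_indices(n_samples, n_folds):
    """Yield (train_idx, test_idx) pairs; each sample is tested exactly once."""
    indices = list(range(n_samples))
    for fold in range(n_folds):
        test_idx = indices[fold::n_folds]  # every n_folds-th sample, offset by fold
        held_out = set(test_idx)
        train_idx = [i for i in indices if i not in held_out]
        yield train_idx, test_idx


def knn_predict(train_X, train_y, x, k, weighted):
    """Classify x by an (optionally distance-weighted) vote of its k nearest neighbours."""
    nearest = sorted(zip(train_X, train_y), key=lambda p: abs(p[0] - x))[:k]
    votes = {}
    for xi, yi in nearest:
        weight = 1.0 / (abs(xi - x) + 1e-9) if weighted else 1.0
        votes[yi] = votes.get(yi, 0.0) + weight
    return max(votes, key=votes.get)


def cross_val_accuracy(X, y, params, n_folds=5):
    """Mean accuracy over the folds for one hyperparameter setting."""
    scores = []
    for train_idx, test_idx in k_fold_indices(len(X), n_folds):
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        correct = sum(knn_predict(train_X, train_y, X[i], **params) == y[i]
                      for i in test_idx)
        scores.append(correct / len(test_idx))
    return mean(scores)


def best_of(X, y, candidates):
    """Return (params, score) for the candidate with the highest CV accuracy."""
    scored = [(cross_val_accuracy(X, y, p), p) for p in candidates]
    best_score, best_params = max(scored, key=lambda pair: pair[0])
    return best_params, best_score


def grid_search(X, y, grid):
    """Evaluate every combination of the listed hyperparameter values."""
    candidates = [dict(zip(grid, values)) for values in product(*grid.values())]
    return best_of(X, y, candidates)


def randomized_search(X, y, grid, n_iter, rng):
    """Evaluate only n_iter randomly sampled combinations."""
    candidates = [{name: rng.choice(values) for name, values in grid.items()}
                  for _ in range(n_iter)]
    return best_of(X, y, candidates)


# Hypothetical toy data: two classes drawn from overlapping Gaussians.
rng = random.Random(0)
X = [rng.gauss(0.0, 1.0) for _ in range(40)] + [rng.gauss(3.0, 1.0) for _ in range(40)]
y = [0] * 40 + [1] * 40
grid = {"k": [1, 3, 5, 7], "weighted": [False, True]}

grid_params, grid_score = grid_search(X, y, grid)
random_params, random_score = randomized_search(X, y, grid, n_iter=4, rng=rng)
print("grid search:", grid_params, round(grid_score, 3))
print("randomized search:", random_params, round(random_score, 3))
```

Grid search scores all eight combinations, while randomized search scores only the four it samples. Randomized search is therefore cheaper but can miss the best setting, which is the usual reason for trying both.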

About the projects:

Start Up Success Prediction

This project builds a model that forecasts the success of a startup from a set of variables and identifies which factors contribute most to that success.

  • Context

A startup or start-up is a company or project begun by an entrepreneur to seek, develop, and validate a scalable economic model. While entrepreneurship refers to all new businesses, including self-employment and businesses that never intend to become registered, startups refer to new businesses that intend to grow large beyond the solo founder. Startups face high uncertainty and have high rates of failure, but a minority of them do go on to be successful and influential. Some startups become unicorns: privately held startup companies valued at over US$1 billion. [Source of information: Wikipedia]


Startups play a major role in economic growth. They bring new ideas, spur innovation, and create employment, thereby moving the economy. The number of startups has grown rapidly over the past few years. Predicting the success of a startup allows investors to find companies with the potential for rapid growth and so stay one step ahead of the competition.

  • Problem Statement

A startup is a newly established business, typically growing with the support of digital services, and startups have become an important element of innovation systems and economies around the world. The startup ecosystem is growing very rapidly, yet startups still need a lot of funding while operating with a minimal team. It is therefore very important for venture capitalists (VCs) to monitor a startup's performance, so that it can inform the decision either to fund the startup and drive its growth or to decline to take part in funding. To monitor startup performance, it is important to analyze what makes a startup successful and how to determine its success.

  • Goal

The goal is to determine whether a startup will be successful or not.


NBA Players Career Duration

Will NBA Rookies' Careers last for 5 years or not?

This project predicts the career duration of NBA players, specifically whether new players will last for 5 years in the league or not.


  • Context

In terms of competitiveness, work ethic, and training mentality, few leagues worldwide are as hard as the National Basketball Association. Whether a rookie (new player) succeeds depends on many variables, especially on his performance in the first season. Statistics about such players can sometimes be used to predict whether they will last 5 years in the NBA or not.

  • Content

The tabular data contains 22 columns, all describing a player's performance record, such as the number of 3-point shots made.
