# Introduction to Analysis

## Goal

The goal of this project is to use Python to investigate research questions about the National Basketball Association (NBA).

## Datasets

This project uses two datasets:
1. A collection of NBA player statistics from the 2012-13 through 2021-22 seasons, covering both the regular season and the playoffs.
2. A database of NBA player information.

## Questions to Answer

1.  Who are the most efficient scorers in the league?

2.  How do player height and weight correlate with performance stats?

3.  Which colleges produce the best NBA talent?
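As a rough illustration of how question 2 might be approached, the sketch below computes pairwise Pearson correlations with `DataFrame.corr`. It runs on invented toy data, not the project's datasets. The column names `HEIGHT` and `WEIGHT`, the "feet-inches" height format, and the helper `height_to_inches` are all assumptions made for the example; the real player database may store these fields differently.

```python
import pandas as pd


def height_to_inches(height: str) -> int:
    """Convert a 'feet-inches' string such as '6-9' to total inches (hypothetical helper)."""
    feet, inches = height.split("-")
    return int(feet) * 12 + int(inches)


# Toy data: column names and values are made up for illustration only.
toy = pd.DataFrame({
    "HEIGHT": ["6-0", "6-3", "6-6", "6-9", "7-0"],
    "WEIGHT": [180, 195, 215, 235, 250],
    "RPG": [2.5, 3.5, 5.0, 7.5, 10.0],
    "APG": [7.0, 5.5, 4.0, 2.5, 1.5],
})
toy["HEIGHT_IN"] = toy["HEIGHT"].map(height_to_inches)

# Pearson correlation between every pair of numeric columns.
corr = toy[["HEIGHT_IN", "WEIGHT", "RPG", "APG"]].corr()

# Keep only the cells relating body measurements to performance stats.
height_weight_vs_stats = corr.loc[["HEIGHT_IN", "WEIGHT"], ["RPG", "APG"]]
print(height_weight_vs_stats.round(2))
```

On the real data, the same `corr` call could be applied to the per-game columns, and the resulting matrix passed to a seaborn heatmap for a visual summary.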

## Base Code Used to Answer the Questions

In [30]:
# Importing the Libraries
import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

# Loading the Data
df1 = pd.read_excel('nba_player_data.xlsx')
df2 = pd.read_csv('PlayerIndex_nba_stats.csv')

# Merging the DataFrames
df2 = df2.rename(columns={'PERSON_ID': 'PLAYER_ID'})
df = pd.merge(df1, df2, on='PLAYER_ID', how='left')

# Data Cleaning
df.drop(columns=['RANK'], inplace=True)
df['season_start_year'] = df['Year'].str[:4].astype(int)
df['TEAM'] = df['TEAM'].replace(to_replace=['NOP','NOH'], value='NO')
df['Season_type'] = df['Season_type'].replace('Regular%20Season','Regular Season')

# Create Per Game Average Columns
# PTS, REB and AST exist in both source files, so the merge suffixes the df1 season totals with _x
df['PPG'] = df['PTS_x'] / df['GP']
df['RPG'] = df['REB_x'] / df['GP']
df['APG'] = df['AST_x'] / df['GP']
df['SPG'] = df['STL'] / df['GP']
df['BPG'] = df['BLK'] / df['GP']
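The `_x` suffixes in the per-game columns come from how pandas handles column names that appear in both DataFrames during a merge. The following is a minimal sketch on invented toy data (the frames `season_totals` and `player_index` are stand-ins, not the real files) showing the suffixing, the per-game calculation, and the season-year parsing. It also passes `indicator=True`, which is not part of the base code, as one possible way to spot season rows that found no match in the player database after the left merge.

```python
import pandas as pd

# Toy stand-ins for df1 (season totals) and df2 (player index); all values are invented.
season_totals = pd.DataFrame({
    "PLAYER_ID": [1, 2, 3],
    "Year": ["2012-13", "2012-13", "2013-14"],
    "GP": [80, 40, 10],
    "PTS": [1600, 400, 50],
})
player_index = pd.DataFrame({
    "PERSON_ID": [1, 2],
    "PTS": [20.5, 9.8],  # same column name as in season_totals, so it will clash
})

# Align the key name, then left-merge so every season row is kept.
player_index = player_index.rename(columns={"PERSON_ID": "PLAYER_ID"})
merged = pd.merge(
    season_totals, player_index, on="PLAYER_ID", how="left", indicator=True
)

# Overlapping names are suffixed: PTS_x comes from the left frame, PTS_y from the right.
merged["PPG"] = merged["PTS_x"] / merged["GP"]

# "2012-13" -> 2012
merged["season_start_year"] = merged["Year"].str[:4].astype(int)

# Rows with no matching player carry NaN in the right-hand columns.
unmatched = merged[merged["_merge"] == "left_only"]
print(merged[["PLAYER_ID", "PTS_x", "PTS_y", "PPG", "season_start_year", "_merge"]])
```

If the unmatched set is non-empty on the real data, any analysis that relies on player-database fields would need to drop or otherwise handle those rows.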

## Data

In [32]:
df

| Unnamed: 0 | Year | Season_type | PLAYER_ID | PLAYER | TEAM | GP | MIN | FGM | FGA | FG_PCT | ... | AST_y | STATS_TIMEFRAME | FROM_YEAR | TO_YEAR | season_start_year | PPG | RPG | APG | SPG | BPG |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2012-13 | Regular Season | 201142 | Kevin Durant | OKC | 81 | 3119 | 731 | 1433 | 0.510 | ... | 4.2 | Season | 2007 | 2024 | 2012 | 28.148148 | 7.901235 | 4.617284 | 1.432099 | 1.296296 |
| 1 | 2012-13 | Regular Season | 977 | Kobe Bryant | LAL | 78 | 3013 | 738 | 1595 | 0.463 | ... | 4.7 | Career | 1996 | 2015 | 2012 | 27.346154 | 5.551282 | 6.012821 | 1.358974 | 0.320513 |
| 2 | 2012-13 | Regular Season | 2544 | LeBron James | MIA | 76 | 2877 | 765 | 1354 | 0.565 | ... | 8.2 | Season | 2003 | 2024 | 2012 | 26.789474 | 8.026316 | 7.250000 | 1.697368 | 0.881579 |
| 3 | 2012-13 | Regular Season | 201935 | James Harden | HOU | 78 | 2985 | 585 | 1337 | 0.438 | ... | 8.7 | Season | 2009 | 2024 | 2012 | 25.935897 | 4.858974 | 5.833333 | 1.820513 | 0.487179 |
| 4 | 2012-13 | Regular Season | 2546 | Carmelo Anthony | NYK | 67 | 2482 | 669 | 1489 | 0.449 | ... | 2.7 | Career | 2003 | 2021 | 2012 | 28.656716 | 6.865672 | 2.552239 | 0.776119 | 0.477612 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 7288 | 2021-22 | Playoffs | 1629006 | Josh Okogie | MIN | 1 | 2 | 0 | 0 | 0.000 | ... | 0.8 | Season | 2018 | 2024 | 2021 | 0.000000 | 0.000000 | 0.000000 | 0.000000 | 0.000000 |
| 7289 | 2021-22 | Playoffs | 1630556 | Kessler Edwards | BKN | 2 | 7 | 0 | 0 | 0.000 | ... | 1.1 | Season | 2021 | 2024 | 2021 | 0.000000 | 0.000000 | 0.500000 | 0.500000 | 0.000000 |
| 7290 | 2021-22 | Playoffs | 1630201 | Malachi Flynn | TOR | 6 | 36 | 0 | 7 | 0.000 | ... | 1.8 | Season | 2020 | 2024 | 2021 | 0.000000 | 0.500000 | 0.500000 | 0.166667 | 0.000000 |
| 7291 | 2021-22 | Playoffs | 202693 | Markieff Morris | MIA | 2 | 3 | 0 | 1 | 0.000 | ... | 1.4 | Season | 2011 | 2024 | 2021 | 0.000000 | 0.500000 | 0.000000 | 0.000000 | 0.000000 |
