# INTRODUCTION

#### In this notebook, we use NumPy to run a basic numerical analysis of the Video Game Sales dataset. We extract the sales columns into NumPy arrays, compute aggregate statistics (regional totals, plus the mean, median, standard deviation, and maximum of global sales), and apply a boolean filter to find games with global sales above 20 million. pandas is used only to load the CSV file; all calculations are done with NumPy.

In [None]:
import numpy as np
import pandas as pd

# Load the dataset using pandas to easily access columns
df = pd.read_csv('/kaggle/input/videogamesales/vgsales.csv')



# --- NumPy Analysis ---

# Extract numerical sales data into NumPy arrays

In [2]:
na_sales = df['NA_Sales'].to_numpy()
eu_sales = df['EU_Sales'].to_numpy()
jp_sales = df['JP_Sales'].to_numpy()
global_sales = df['Global_Sales'].to_numpy()

# 1. Calculate Total Sales for Each Region

In [3]:
total_na_sales = np.sum(na_sales)
total_eu_sales = np.sum(eu_sales)
total_jp_sales = np.sum(jp_sales)

print("--- Total Sales (in Millions) ---")
print(f"Total North American Sales: {total_na_sales:.2f}")
print(f"Total European Sales: {total_eu_sales:.2f}")
print(f"Total Japanese Sales: {total_jp_sales:.2f}")
print("\n")

--- Total Sales (in Millions) ---
Total North American Sales: 4392.95
Total European Sales: 2434.13
Total Japanese Sales: 1291.02
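
The regional totals are easier to compare as shares of a combined total. The sketch below is one possible way to do that with NumPy, not part of the original analysis: it stacks the regional arrays into a 2-D array and sums along an axis. The sample values are hypothetical stand-ins for the real columns, and the shares cover only the three regions shown (the dataset's other regions are not included).

```python
import numpy as np

# Hypothetical sample values (in millions), standing in for the real columns
na_sales = np.array([41.49, 29.08, 15.85])
eu_sales = np.array([29.02, 3.58, 12.88])
jp_sales = np.array([3.77, 6.81, 3.79])

# Stack the regions into a 2-D array: one row per region, one column per game
regional_sales = np.vstack([na_sales, eu_sales, jp_sales])

# Summing along axis=1 collapses the game dimension, leaving one total per region
regional_totals = regional_sales.sum(axis=1)

# Each region's share of the three-region total, as a percentage
regional_share = regional_totals / regional_totals.sum() * 100

for region, total, share in zip(["NA", "EU", "JP"], regional_totals, regional_share):
    print(f"{region}: {total:.2f} million ({share:.1f}%)")
```

In the notebook itself, the three arrays extracted from `df` could be passed to `np.vstack` in place of the sample values.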




# 2. Calculate Basic Statistics for Global Sales

In [4]:
mean_global_sales = np.mean(global_sales)
median_global_sales = np.median(global_sales)
std_dev_global_sales = np.std(global_sales)
max_global_sales = np.max(global_sales)

print("--- Global Sales Statistics ---")
print(f"Average Global Sales per Game: {mean_global_sales:.2f} Million")
print(f"Median Global Sales per Game: {median_global_sales:.2f} Million")
print(f"Standard Deviation of Global Sales: {std_dev_global_sales:.2f} Million")
print(f"Highest Global Sales for a Single Game: {max_global_sales:.2f} Million")
print("\n")

--- Global Sales Statistics ---
Average Global Sales per Game: 0.54 Million
Median Global Sales per Game: 0.17 Million
Standard Deviation of Global Sales: 1.55 Million
Highest Global Sales for a Single Game: 82.74 Million
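
The mean (0.54) is about three times the median (0.17), which suggests a right-skewed distribution: a few very large sellers pull the average up. Percentiles are one way to look at that shape. The sketch below uses a small hypothetical array rather than the real column. It also shows that `np.std` computes the population standard deviation by default (`ddof=0`); passing `ddof=1` gives the sample version.

```python
import numpy as np

# Hypothetical right-skewed sample (in millions), not the real dataset
global_sales = np.array([0.02, 0.05, 0.10, 0.17, 0.30, 0.60, 1.20, 5.00, 30.00])

# Percentiles show where most values sit compared with the extreme tail
p25, p50, p75, p99 = np.percentile(global_sales, [25, 50, 75, 99])

# np.std defaults to the population formula (ddof=0); ddof=1 gives the sample formula
population_std = np.std(global_sales)
sample_std = np.std(global_sales, ddof=1)

print(f"25th / 50th / 75th percentile: {p25:.2f} / {p50:.2f} / {p75:.2f}")
print(f"99th percentile: {p99:.2f}")
print(f"Mean: {np.mean(global_sales):.2f}, median: {np.median(global_sales):.2f}")
print(f"Population std: {population_std:.2f}, sample std: {sample_std:.2f}")
```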




# 3. Find Games with Global Sales > 20 Million

In [5]:
high_selling_games = global_sales[global_sales > 20]
print("--- High-Performing Games ---")
print(f"Number of games with global sales exceeding 20 million: {len(high_selling_games)}")
print("Sales figures for these games (in millions):")
print(high_selling_games)

--- High-Performing Games ---
Number of games with global sales exceeding 20 million: 20
Sales figures for these games (in millions):
[82.74 40.24 35.82 33.   31.37 30.26 30.01 29.02 28.62 28.31 24.76 23.42
 23.1  22.72 22.   21.82 21.4  20.81 20.61 20.22]
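
The filter above returns only the sales figures, not the games they belong to. Because a boolean mask can index any array of the same length, the same mask can pull out the matching titles. The sketch below uses hypothetical game names and values. In the notebook, the names would presumably come from something like `df['Name'].to_numpy()`, assuming the dataset has a `Name` column.

```python
import numpy as np

# Hypothetical names and sales (in millions), standing in for the real columns
names = np.array(["Game A", "Game B", "Game C", "Game D"])
global_sales = np.array([82.74, 0.54, 21.40, 3.10])

# Build the boolean mask once, then reuse it on both arrays
mask = global_sales > 20
top_names = names[mask]
top_sales = global_sales[mask]

# argsort gives ascending order; reverse it to list the best sellers first
order = np.argsort(top_sales)[::-1]

for name, sales in zip(top_names[order], top_sales[order]):
    print(f"{name}: {sales:.2f} million")
```

Storing the mask in a variable keeps the names and the sales figures aligned, since both are filtered with the same condition.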
