# Financial News and Stock Price Integration Dataset - EDA
This notebook performs Exploratory Data Analysis (EDA) on the Financial News and Stock Price Integration Dataset (FNSPID).
## Dataset Description
- **headline**: Headline (title) of the news article
- **url**: Direct link to the full news article
- **publisher**: Author or creator of the article
- **date**: Publication date and time (UTC-4)
- **stock**: Stock ticker symbol
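
Because the **date** column is described as UTC-4, it is usually worth parsing it into timezone-aware timestamps before any time-based analysis. The sketch below is illustrative only: the sample rows and the exact timestamp format (`YYYY-MM-DD HH:MM:SS-04:00`) are assumptions, not values taken from the dataset.

```python
from datetime import timedelta, timezone

import pandas as pd

# Hypothetical sample rows; the real file's timestamp format may differ.
sample = pd.DataFrame({
    "headline": ["Example headline A", "Example headline B"],
    "date": ["2020-06-05 10:30:54-04:00", "2020-06-03 10:45:20-04:00"],
    "stock": ["A", "A"],
})

# Parse to timezone-aware timestamps, normalised to UTC.
sample["date"] = pd.to_datetime(sample["date"], utc=True)

# Convert back to a fixed UTC-4 offset for local-time analysis.
UTC_MINUS_4 = timezone(timedelta(hours=-4))
sample["date_local"] = sample["date"].dt.tz_convert(UTC_MINUS_4)

print(sample[["date", "date_local"]])
```

Keeping a UTC column alongside the local one makes it easier to align news timestamps with price data from other sources.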

In [1]:
# Standard library
import os
import sys
import warnings
from datetime import datetime

# Third-party libraries
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
import seaborn as sns

# Make the project's src/ directory importable before loading custom modules
sys.path.append('../src')

# Custom project modules
from data_loader import FinancialDataLoader
from eda_analyzer import FinancialEDAAnalyzer
from utils import save_analysis_results, format_large_numbers, get_data_quality_report

# Configure plotting
plt.style.use('seaborn-v0_8')
sns.set_palette("husl")
warnings.filterwarnings('ignore')

# Set display options
pd.set_option('display.max_columns', None)
pd.set_option('display.max_rows', 100)
print("Libraries imported successfully!")

Libraries imported successfully!


## 1. Data Loading and Initial Exploration

In [2]:
# Initialize data loader
data_path = "../data/raw_analyst_ratings.csv"
loader = FinancialDataLoader(data_path)

# Load the dataset
print("Loading dataset...")
df = loader.load_data()
print(f"Dataset loaded successfully! Shape: {df.shape}")

INFO:data_loader:Loading data from ../data/raw_analyst_ratings.csv
Loading dataset...
INFO:data_loader:Data loaded successfully. Shape: (1407328, 6)
Dataset loaded successfully! Shape: (1407328, 6)
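
The loaded shape reports six columns, while the dataset description lists five. One quick way to find out what the extra column is would be to compare the loaded column names against the documented ones. The sketch below uses a small in-memory CSV rather than the real file; the leading unnamed index column in it is an assumption about one common cause of such a mismatch, not a confirmed property of `raw_analyst_ratings.csv`.

```python
import io

import pandas as pd

# Hypothetical CSV with an unnamed leading index column (assumption).
csv_text = """,headline,url,publisher,date,stock
0,Example headline A,https://example.com/a,Example Publisher,2020-06-05 10:30:54-04:00,A
1,Example headline B,https://example.com/b,Example Publisher,2020-06-03 10:45:20-04:00,A
"""
toy = pd.read_csv(io.StringIO(csv_text))

# Columns listed in the dataset description above.
documented = ["headline", "url", "publisher", "date", "stock"]

# Any loaded column that the description does not mention.
extra = [col for col in toy.columns if col not in documented]

print(f"Shape: {toy.shape}")
print(f"Undocumented columns: {extra}")
```

Running the same comparison on `df.columns` would show which column accounts for the sixth entry in the real dataset.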
