<img src="greenland_mineral_banner_1769856229280.png" width="100%" style="border-radius: 15px; margin-bottom: 20px;">

# 🧊 Greenland's Subsurface Wealth: A Comprehensive Mineral Analysis
### *Unlocking the Arctic's Potential for a Green Future*

**Author:** Touseef Afridi

**Date:** January 2026

**Hashtags:** #Geology #Greenland #DataScience #CriticalMinerals #Sustainability #GEUS

---

## 📖 Introduction

Greenland is increasingly recognized as a global frontier for **critical raw materials (CRMs)**. As the world transitions toward renewable energy and digital economies, the demand for minerals like **Rare Earth Elements (REE)**, **Lithium**, and **Graphite** has surged. Greenland, with its unique geological history and vast untapped resources, stands as a strategic partner in diversifying global supply chains.

> **Scientific Context:** According to a recent assessment by the **Geological Survey of Denmark and Greenland (GEUS)**, Greenland hosts 25 of the 34 raw materials that the European Commission classifies as critical, that is, of high economic importance and high supply risk (GEUS, 2023).

This notebook provides an intermediate-level analysis of Greenland's mineral occurrences, focusing on geospatial distribution, geological descriptions, and thematic clustering.

### 🎯 Objectives
1. **Geospatial Mapping**: Visualize the distribution of occurrences across Greenland's coastlines.
2. **Description Analysis**: Extract insights from textual geological reports using Natural Language Processing (NLP).
3. **Mineral Clustering**: Identify high-potential zones for specific mineral groups.
4. **Educational Value**: Provide a professional-grade template for geological data exploration.

---

## 🛠 Setup & Data Importing

In [None]:
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
import plotly.express as px
import plotly.graph_objects as go
from wordcloud import WordCloud
import re
import warnings

# Aesthetics Setup
warnings.filterwarnings('ignore')
plt.style.use('ggplot')
sns.set_palette("viridis")
pd.set_option('display.max_colwidth', 150)

print("✅ Environment Ready!")

## 📂 Loading the Dataset

In [None]:
file_path = "greenland_data.csv"
df = pd.read_csv(file_path)

print(f"Dataset Loaded successfully with {df.shape[0]} rows and {df.shape[1]} columns.")
df.head()
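
Before cleaning, it can help to confirm that the columns this notebook relies on are actually present. The following is a minimal sketch rather than part of the original analysis: `EXPECTED_COLUMNS` and `missing_columns` are hypothetical names, the column list is inferred from the cells in this notebook, and the sample frame uses made-up values in place of `greenland_data.csv`.

```python
import pandas as pd

# Columns referenced by later cells in this notebook (assumed; the real CSV may hold more).
EXPECTED_COLUMNS = ["name", "description", "latitude", "longitude", "source_type", "region"]


def missing_columns(frame: pd.DataFrame, expected=EXPECTED_COLUMNS) -> list:
    """Return the expected column names that are absent from the frame."""
    return [col for col in expected if col not in frame.columns]


# Tiny made-up frame standing in for the real dataset (illustrative values only).
sample = pd.DataFrame(
    {
        "name": ["Site A"],
        "description": ["Gold in quartz veins"],
        "latitude": [64.2],
        "longitude": [-51.7],
        "source_type": ["occurrence"],
    }
)

absent = missing_columns(sample)
print("Missing columns:", absent)
```

Calling `missing_columns(df)` right after `pd.read_csv` would surface a renamed or absent column before it causes a `KeyError` further down.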

## 🧹 Data Cleaning & Preprocessing

Before diving into the analysis, we need to make sure the data is clean. Specifically, we'll address:
1. **Missing Values**: Filling missing `name` and `description` entries with placeholder text.
2. **Coordinate Validation**: Counting missing `latitude` and `longitude` values so that only mappable rows reach the map.
3. **Type Inspection**: Reviewing column data types with `df.info()` to spot columns that need conversion.

In [None]:
# Initial cleaning: fill missing text fields with placeholders
df['name'] = df['name'].fillna('Unknown Occurrence')
df['description'] = df['description'].fillna('No detailed description provided.')

# Check for coordinate validity
print("Missing Coordinates:")
print(df[['latitude', 'longitude']].isnull().sum())

# Summary of types
df.info()
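
The cell above counts missing coordinates but does not check that the remaining values are numeric or plausible. One possible way to do that is sketched below. This is an assumption-laden illustration: the bounding box is a rough approximation of Greenland's extent chosen for this example, `flag_valid_coordinates` and `coords_valid` are hypothetical names, and the sample rows are invented.

```python
import pandas as pd

# Rough bounding box around Greenland (approximate values, chosen for illustration).
LAT_MIN, LAT_MAX = 59.0, 84.0
LON_MIN, LON_MAX = -75.0, -10.0


def flag_valid_coordinates(frame: pd.DataFrame) -> pd.DataFrame:
    """Coerce coordinates to numbers and flag rows that fall inside the bounding box."""
    out = frame.copy()
    # Non-numeric entries such as "n/a" become NaN instead of raising an error.
    out["latitude"] = pd.to_numeric(out["latitude"], errors="coerce")
    out["longitude"] = pd.to_numeric(out["longitude"], errors="coerce")
    # between() is False for NaN, so missing values are flagged as invalid too.
    out["coords_valid"] = (
        out["latitude"].between(LAT_MIN, LAT_MAX)
        & out["longitude"].between(LON_MIN, LON_MAX)
    )
    return out


# Invented rows: two plausible sites, one unparseable latitude, one out-of-range latitude.
sample = pd.DataFrame(
    {
        "latitude": ["64.18", 70.5, "n/a", 12.0],
        "longitude": [-51.7, "-22.0", -40.0, -45.0],
    }
)

checked = flag_valid_coordinates(sample)
print(checked)
```

Keeping a boolean flag instead of dropping rows makes it easy to inspect the rejected records before deciding how to handle them.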

## 🌍 Geospatial Distribution of Minerals

Greenland's known mineral occurrences lie mainly along its ice-free coastal margin, where bedrock is exposed beyond the edge of the ice sheet. We'll use **Plotly** to build an interactive map for exploring these sites.

In [None]:
# Filter rows with valid coordinates
map_df = df.dropna(subset=['latitude', 'longitude'])

fig = px.scatter_mapbox(
    map_df, 
    lat="latitude", 
    lon="longitude", 
    hover_name="name", 
    hover_data=["source_type", "region"], 
    color="source_type",
    zoom=2, 
    height=800,
    title="<b>Interactive Map of Greenland Mineral Occurrences</b>",
    color_discrete_sequence=px.colors.qualitative.Dark24
)

# Token-free basemap and tight margins around the figure
fig.update_layout(mapbox_style="carto-positron", margin={"r": 0, "t": 50, "l": 0, "b": 0})

fig.show()
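
The map shows where occurrences sit, but it does not quantify how dense each area is. A simple way to approximate that, sketched here under stated assumptions, is to count occurrences per latitude/longitude grid cell. `grid_counts` and `cell_deg` are hypothetical names, the coordinates are invented, and degree-based cells shrink in width toward the pole, so this is only a coarse first pass and not a proper clustering method.

```python
import numpy as np
import pandas as pd


def grid_counts(frame: pd.DataFrame, cell_deg: float = 1.0) -> pd.DataFrame:
    """Count rows per grid cell, labelling each cell by its lower-left corner."""
    cells = pd.DataFrame(
        {
            "lat_bin": np.floor(frame["latitude"] / cell_deg) * cell_deg,
            "lon_bin": np.floor(frame["longitude"] / cell_deg) * cell_deg,
        }
    )
    return (
        cells.groupby(["lat_bin", "lon_bin"])
        .size()
        .reset_index(name="n_occurrences")
        .sort_values("n_occurrences", ascending=False)
        .reset_index(drop=True)
    )


# Invented coordinates: three sites close together, two isolated ones.
sample = pd.DataFrame(
    {
        "latitude": [64.1, 64.7, 64.9, 60.9, 70.5],
        "longitude": [-51.2, -51.8, -51.5, -46.0, -22.3],
    }
)

densest = grid_counts(sample, cell_deg=1.0)
print(densest)
```

Applied to `map_df`, the top rows of the result would point to the cells worth a closer look on the interactive map.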

## 🔬 Textual Insights: Extracting Mineral Keywords

The `description` column contains a wealth of geological information. We'll use Natural Language Processing (NLP) to identify the most frequently mentioned minerals and geological terms.

### Why does this matter?
Identifying keywords like **'Gold'**, **'Copper'**, or **'Zinc'** within scientific descriptions allows us to categorize occurrences even when explicit labels are missing (GEUS MiMa Report, 2022).

In [None]:
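# Commodity keywords searched for in each description (case-insensitive, whole words only)
MINERALS = ['gold', 'copper', 'zinc', 'iron', 'lead', 'nickel', 'lithium', 'graphite', 'rare earth', 'platinum', 'tungsten', 'molybdenum']

def extract_minerals(text):
    """Return a comma-separated string of the keywords found in text, or 'Other' if none match."""
    found = [m for m in MINERALS if re.search(r'\b' + re.escape(m) + r'\b', text, re.IGNORECASE)]
    return ", ".join(found) if found else "Other"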

df['extracted_minerals'] = df['description'].astype(str).apply(extract_minerals)

mineral_counts = df['extracted_minerals'].str.split(', ').explode().value_counts()
mineral_counts = mineral_counts[mineral_counts.index != "Other"]

plt.figure(figsize=(12, 6))
sns.barplot(x=mineral_counts.values, y=mineral_counts.index, palette="magma")
plt.title("Frequency of Key Mineral Mentions in Descriptions")
plt.xlabel("Count")
plt.ylabel("Mineral Type")
plt.show()
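
Whole-word keyword matching is simple, but it has blind spots that are worth seeing on concrete text. The sketch below uses a subset of the keyword list above and invented sentences; `PATTERNS` and `find_minerals` are hypothetical names introduced for this illustration only.

```python
import re

# Subset of the notebook's keyword list, kept short for the example.
MINERALS = ["gold", "copper", "zinc", "iron", "lead", "rare earth"]

# One compiled pattern per keyword; re.escape guards against special characters.
PATTERNS = {
    m: re.compile(r"\b" + re.escape(m) + r"\b", re.IGNORECASE) for m in MINERALS
}


def find_minerals(text: str) -> list:
    """Return the keywords that appear as whole words in the text."""
    return [m for m, pattern in PATTERNS.items() if pattern.search(text)]


# Invented sentences that probe typical edge cases.
examples = [
    "Gold-bearing quartz veins with minor copper sulphides.",    # hyphen counts as a boundary
    "Rare earth mineralisation hosted in a layered intrusion.",  # multi-word keyword
    "Goldfields-style target; ironstone horizon mapped.",        # compounds are not matched
    "Shear zones lead to the coast.",                            # the verb 'lead' is a false positive
]

for sentence in examples:
    print(find_minerals(sentence), "<-", sentence)
```

The last two sentences show both failure modes: compound words are missed, and ambiguous words such as "lead" are counted even when they are not a commodity. Counts in the bar chart should be read with that in mind.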

## 🔠 Word Cloud of Geological Context

To better understand the geological environment (e.g., *gneiss*, *granite*, *schist*), we generate a word cloud from all descriptions.

In [None]:
text = " ".join(df['description'].astype(str))
wordcloud = WordCloud(width=800, height=400, background_color='white', colormap='viridis').generate(text)

plt.figure(figsize=(15, 7))
plt.imshow(wordcloud, interpolation='bilinear')
plt.axis("off")
plt.title("Geological Theme Analysis: Word Cloud of Descriptions")
plt.show()
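
A word cloud gives a visual impression but no numbers. As a complement, term frequencies can be tabulated directly. The sketch below relies only on the standard library; the stop-word set is deliberately tiny and illustrative, `top_terms` is a hypothetical helper, and the descriptions are invented.

```python
import re
from collections import Counter

# Very small stop-word set for illustration; a real run would need a fuller list.
STOP_WORDS = {"the", "a", "an", "and", "of", "in", "with", "is", "are", "by", "to"}


def top_terms(descriptions, n=5):
    """Return the n most frequent lowercase words, skipping stop words and very short tokens."""
    counts = Counter()
    for text in descriptions:
        tokens = re.findall(r"[a-z]+", text.lower())
        counts.update(t for t in tokens if t not in STOP_WORDS and len(t) > 2)
    return counts.most_common(n)


# Invented descriptions standing in for the `description` column.
sample_descriptions = [
    "Gold in quartz veins cutting gneiss.",
    "Banded gneiss with disseminated sulphides.",
    "Quartz veins in granite and gneiss.",
]

top = top_terms(sample_descriptions, n=3)
print(top)
```

On the real data, `top_terms(df['description'].astype(str))` would give a ranked list that can be compared against what stands out in the word cloud.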

## 📈 Regional Analysis

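Which regions in Greenland have the highest density of mineral occurrences? Regional mapping is essential for infrastructure planning (GEUS, 2023). As a first step, the cell below charts how the records are split across `source_type` categories, which shows what kinds of sources feed the dataset before any regional comparison is made.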

In [None]:
# Count records per data source type (note: this is not a per-region count)
source_type_counts = df['source_type'].value_counts()

plt.figure(figsize=(10, 6))
plt.pie(source_type_counts, labels=source_type_counts.index, autopct='%1.1f%%', startangle=140, colors=sns.color_palette("pastel"))
plt.title("Distribution of Data Source Types")
plt.show()
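
The pie chart above summarizes `source_type`, so it does not yet answer the regional question. A per-region tally could look like the sketch below. It assumes the dataset has a `region` column (the map cell passes one to `hover_data`); the region labels and the `"Unspecified"` placeholder are invented for illustration.

```python
import pandas as pd

# Invented region labels, including one missing value.
sample = pd.DataFrame(
    {"region": ["South", "West", "South", None, "North", "South"]}
)

# Count occurrences per region, keeping missing labels visible under a placeholder.
region_counts = sample["region"].fillna("Unspecified").value_counts()

print(region_counts)
```

With the real data, replacing `sample` with `df` would give counts that could be drawn as a horizontal bar chart, which tends to stay readable with many categories.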

## 🏁 Conclusion & Future Directions

This analysis highlights the immense diversity of Greenland's mineral wealth. From the **Iron** deposits in the Thule region to the **REE** potential in the South, the data underscores Greenland's strategic importance.

### Key Findings:
- **Geospatial Clustering**: Occurrences cluster along the exposed coastal margin, in areas where Proterozoic and Archean crust crops out.
- **Mineral Diversity**: Textual analysis reveals frequent mentions of Gold, Iron, and Copper, aligning with global exploration trends.
- **Data Potential**: The descriptive fields hold significant untapped potential for advanced ML-based classification.

### 📚 References
- **GEUS (2023).** *Greenland's Mineral Potential: A Review of Critical Raw Materials.* Geological Survey of Denmark and Greenland.
- **GEUS MiMa Reports.** *Regularly updated assessments from the Center for Minerals and Materials (MiMa).*
- **European Commission.** *Critical Raw Materials Act & Strategic Lists.*

---
**End of Analysis**