# Introduction to Python for Data Engineering

This notebook demonstrates how to read data from a public API, process it to create useful information, and store the result in an SQLite database.

It assumes you have read [this article](https://www.startdataengineering.com/post/python-for-de/#extract--load-read-and-write-data-to-any-system).

**Acknowledgment**: Parts of this notebook were developed with the assistance of ChatGPT, an AI language model by OpenAI, to support instructional design and coding tasks related to data engineering concepts.

In [None]:
# Install required packages
!pip install requests



In [None]:
# Import libraries
import requests
import sqlite3
from datetime import datetime

In [None]:
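# Read data from the CoinGecko API

target_currency = input("Enter target currency code (e.g., zar, eur, gbp, jpy): ").strip().lower()

url = f"https://api.coingecko.com/api/v3/simple/price?ids=bitcoin&vs_currencies=usd,{target_currency}"
response = requests.get(url, timeout=10)  # Avoid hanging forever on a slow connection
response.raise_for_status()  # Fail early on HTTP errors instead of parsing an error page
data = response.json()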

data


{'bitcoin': {'usd': 118306}}
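
The response is a nested dictionary keyed first by coin ID and then by currency code. The output above contains only `usd`, which suggests that a requested currency can be missing from the response. A minimal sketch of a defensive lookup follows; `extract_price` is a hypothetical helper name that is not part of the original notebook, and the sample payload is copied from the output above.

```python
def extract_price(payload, coin, currency):
    """Return the price of `coin` in `currency`, or None if it is missing.

    Hypothetical helper for illustration; not part of the original notebook.
    """
    coin_prices = payload.get(coin, {})
    return coin_prices.get(currency.lower())


# Sample payload copied from the output above
sample = {"bitcoin": {"usd": 118306}}

print(extract_price(sample, "bitcoin", "usd"))  # price is present
print(extract_price(sample, "bitcoin", "zar"))  # currency is absent, so None
```

Returning `None` rather than raising keeps the decision about how to handle a missing currency with the caller.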

We will extract the Bitcoin price in USD and in the chosen target currency. The helper function below converts a USD amount to ZAR using an estimated exchange rate (18.5 ZAR per USD by default).
This is an example of turning raw data into useful **information**.

In [None]:
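# Function to convert USD to ZAR using an estimated exchange rate
def convert_usd_to_zar(usd_amount, rate=18.5):
    """Return usd_amount converted to ZAR at the given rate, rounded to 2 decimals."""
    return round(usd_amount * rate, 2)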
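
As a quick check of the arithmetic, the sketch below applies the helper to the USD price shown earlier (118306). The function is repeated only so that the example runs on its own, and the rate of 19.0 in the second call is an arbitrary value chosen for illustration.

```python
def convert_usd_to_zar(usd_amount, rate=18.5):
    # Same helper as in the cell above, repeated so this example is self-contained
    return round(usd_amount * rate, 2)


# 118306 USD at the default estimated rate of 18.5 ZAR per USD
estimated_zar = convert_usd_to_zar(118306)
print(estimated_zar)

# A different rate can be passed explicitly (19.0 is an arbitrary illustration)
print(convert_usd_to_zar(100, rate=19.0))
```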

In [None]:
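# Extract the USD price and the price in the target currency
usd_price = data["bitcoin"]["usd"]
# .get() returns None if the response has no entry for target_currency
target_price = data["bitcoin"].get(target_currency)

# Record when the price was read (local time, ISO 8601 format)
timestamp = datetime.now().isoformat()

print(f"Bitcoin price in USD: {usd_price}")
print(f"Bitcoin price in {target_currency.upper()}: {target_price}")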


Bitcoin price in USD: 118306
Estimated price in ZAR: 2188661.0
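
`datetime.now().isoformat()` produces a string in local time with no timezone information. ISO 8601 strings in the same format sort chronologically as plain text, which is what makes ordering by a `TEXT` timestamp column work. The sketch below assumes that UTC timestamps would be acceptable here (the notebook itself uses local time); the two sample strings are taken from the notebook's final output.

```python
from datetime import datetime, timezone

# Timezone-aware UTC timestamp (assumption: the notebook itself uses naive local time)
utc_timestamp = datetime.now(timezone.utc).isoformat()
parsed = datetime.fromisoformat(utc_timestamp)  # round-trips back to a datetime

# ISO 8601 strings of the same format sort chronologically as text
stamps = ["2025-07-30T07:34:20.348960", "2025-07-30T06:37:14.703790"]
ordered = sorted(stamps)
print(ordered[0])  # the earlier of the two timestamps
```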


In [None]:
# Connect to SQLite database (or create it)
conn = sqlite3.connect("bitcoin_prices.db")
cursor = conn.cursor()

# Create table if it doesn't exist
cursor.execute('''
CREATE TABLE IF NOT EXISTS prices (
    -- Lines starting with "--" are SQL comments; /* ... */ encloses multi-line comments.
    -- If each timestamp should be unique (e.g., one price per moment), use:
    -- timestamp TEXT PRIMARY KEY,
    -- If multiple prices per second are expected, or a simple unique ID is preferred, add:
    -- id INTEGER PRIMARY KEY AUTOINCREMENT,
    timestamp TEXT,
    usd REAL,
    zar REAL
)
''')

# Insert the new row: three values for the three columns
# (the zar column stores the price in the target currency)
cursor.execute("INSERT INTO prices VALUES (?, ?, ?)", (timestamp, usd_price, target_price))
conn.commit()
conn.close()
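
The write step can also be sketched with context managers, so that the transaction is committed or rolled back automatically and the connection is always closed. This is one possible variant, not the notebook's method: it uses an in-memory database (`:memory:`) so that no file is created, names the columns explicitly in the `INSERT`, and reuses values from the notebook's output as sample data.

```python
import sqlite3
from contextlib import closing

# In-memory database so the example leaves no file behind (assumption for illustration)
with closing(sqlite3.connect(":memory:")) as conn:
    conn.execute("CREATE TABLE IF NOT EXISTS prices (timestamp TEXT, usd REAL, zar REAL)")

    # `with conn:` commits on success and rolls back if an exception is raised
    with conn:
        conn.execute(
            "INSERT INTO prices (timestamp, usd, zar) VALUES (?, ?, ?)",
            ("2025-07-30T07:34:20.348960", 118306, 2188661.0),
        )

    stored = conn.execute("SELECT timestamp, usd, zar FROM prices").fetchall()

print(stored)
```

Naming the columns in the `INSERT` statement means a mismatch between the number of placeholders and the number of values is easier to spot, and the statement keeps working if a column is later added to the table.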


In [None]:
# Check if data was successfully written
conn = sqlite3.connect("bitcoin_prices.db")
cursor = conn.cursor()

# Show the last 5 rows
rows = cursor.execute("SELECT * FROM prices ORDER BY timestamp DESC LIMIT 5").fetchall()
for row in rows:
    print(row)

conn.close()


('2025-07-30T07:34:20.348960', 118306.0, 2188661.0)
('2025-07-30T06:37:14.703790', 118174.0, 2186219.0)
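
Once rows are stored, SQL can derive further information from them. As a rough sketch, the query below computes the exchange rate implied by each stored row (`zar / usd`). It loads the two rows printed above into an in-memory database so that it runs on its own; `implied_rate` is a column alias introduced here for illustration.

```python
import sqlite3
from contextlib import closing

# The two rows printed above, used as sample data
rows = [
    ("2025-07-30T07:34:20.348960", 118306.0, 2188661.0),
    ("2025-07-30T06:37:14.703790", 118174.0, 2186219.0),
]

with closing(sqlite3.connect(":memory:")) as conn:
    conn.row_factory = sqlite3.Row  # allows access to values by column name
    conn.execute("CREATE TABLE prices (timestamp TEXT, usd REAL, zar REAL)")
    conn.executemany("INSERT INTO prices VALUES (?, ?, ?)", rows)
    result = conn.execute(
        "SELECT timestamp, zar / usd AS implied_rate FROM prices ORDER BY timestamp DESC"
    ).fetchall()
    implied_rates = [(r["timestamp"], r["implied_rate"]) for r in result]

for ts, rate in implied_rates:
    print(ts, rate)
```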
