# Measuring Data Accuracy

**Activity Overview**: Assess data accuracy by comparing it to a trusted source and detecting incorrect values or mismatches.

## Title: Product Pricing

**Task**: Compare a dataset of product prices with the latest official price list.

**Steps**:
1. Obtain the latest product price list from the official company website.
2. Compare each product price in the dataset against the corresponding price in the official list.
3. Identify any discrepancies and mark them for correction.

In [None]:
# Write your code from here
import numpy as np
import pandas as pd

# Step 1: Load both datasets (each CSV is assumed to have columns: product_id, price)
company_prices = pd.read_csv('company_prices.csv')        # Dataset under review
trusted_prices = pd.read_csv('trusted_prices.csv')        # Trusted official price list

# Step 2: Merge datasets on product_id for comparison.
# An outer merge keeps products that appear in only one file;
# the default inner merge would silently drop them from the comparison.
merged_prices = pd.merge(company_prices, trusted_prices, on='product_id',
                         how='outer', suffixes=('_company', '_trusted'))

# Step 3: Identify discrepancies where prices do not match.
# np.isclose avoids false mismatches caused by floating-point rounding;
# the half-cent tolerance is an assumption and should be adjusted as needed.
# A product missing from either file has a NaN price and is flagged as a mismatch.
merged_prices['price_match'] = np.isclose(
    merged_prices['price_company'], merged_prices['price_trusted'], rtol=0, atol=0.005
)

# Extract mismatched prices
discrepancies = merged_prices[~merged_prices['price_match']]

# Report the discrepancies for review
if discrepancies.empty:
    print("All product prices match the trusted source.")
else:
    print("Discrepancies found:")
    print(discrepancies[['product_id', 'price_company', 'price_trusted']])
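
The comparison above lists individual mismatches, but the activity is about measuring accuracy, so it can help to condense the result into a single number. The following is a minimal sketch of one possible metric: the share of products on the trusted list whose dataset price agrees within a small tolerance. The helper name `accuracy_rate`, the half-cent tolerance, and the inline sample data are all assumptions made for illustration, not part of the original exercise.

```python
import numpy as np
import pandas as pd


def accuracy_rate(company: pd.DataFrame, trusted: pd.DataFrame, tol: float = 0.005) -> float:
    """Return the share of trusted products whose company price matches within `tol`."""
    # Left-merge from the trusted list so every official product is counted,
    # including ones missing from the company dataset (their price becomes NaN).
    merged = trusted.merge(company, on="product_id", how="left",
                           suffixes=("_trusted", "_company"))
    # np.isclose returns False when either value is NaN,
    # so a missing product counts as inaccurate.
    matches = np.isclose(merged["price_company"], merged["price_trusted"],
                         rtol=0, atol=tol)
    return float(matches.mean())


# Hypothetical sample data, for illustration only
trusted = pd.DataFrame({"product_id": [1, 2, 3, 4],
                        "price": [9.99, 19.50, 5.00, 12.00]})
company = pd.DataFrame({"product_id": [1, 2, 3],
                        "price": [9.99, 18.50, 5.00]})

rate = accuracy_rate(company, trusted)
print(f"Accuracy: {rate:.0%}")
```

In this sample, product 2 has a different price and product 4 is missing from the dataset, so two of the four trusted products match. The sketch assumes `product_id` is unique in both frames; duplicate IDs would multiply rows in the merge and skew the rate.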
