# Ensuring Consistency

**Activity Overview**: Ensure consistency by identifying and resolving conflicting values across datasets.

## Title: Customer Address Discrepancies

**Task**: Resolve customer address mismatches between the CRM and marketing databases.

**Steps**:
1. Compare customer addresses in the CRM with those in the marketing database.
2. Identify records with conflicting address information.
3. Propose a method to consolidate records with verified addresses.
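
A raw string comparison in steps 1 and 2 will also flag purely cosmetic differences (case, extra spaces, "Street" vs "St") as conflicts. The sketch below shows one way to normalize addresses before comparing them. The helper `normalize_address` and the `SUFFIXES` map are hypothetical names introduced here for illustration, and the abbreviation list is an assumption that would need to be extended for real data.

```python
import pandas as pd

# Hypothetical abbreviation map (an assumption); extend it to match your own data.
SUFFIXES = {"street": "st", "avenue": "ave", "road": "rd"}

def normalize_address(address: str) -> str:
    """Lowercase, drop periods, collapse whitespace, and abbreviate common suffixes."""
    tokens = address.lower().replace(".", "").split()
    return " ".join(SUFFIXES.get(token, token) for token in tokens)

# Small illustrative inputs, not taken from a real system.
crm = pd.Series(["123 Maple St", "456 Oak St"])
marketing = pd.Series(["123  maple street.", "456 Oak Ave"])

# Compare the normalized forms rather than the raw strings.
is_conflict = crm.map(normalize_address) != marketing.map(normalize_address)
print(is_conflict.tolist())  # [False, True]
```

Only the second pair is reported, because "St" and "Ave" are genuinely different suffixes, while the first pair differs in formatting alone.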

In [1]:
# Write your code from here
import pandas as pd

# Simulated CRM dataset
crm_data = pd.DataFrame({
    'customer_id': [1, 2, 3, 4],
    'address': ['123 Maple St', '456 Oak St', '789 Pine St', '101 Elm St']
})

# Simulated Marketing dataset
marketing_data = pd.DataFrame({
    'customer_id': [1, 2, 3, 4],
    'address': ['123 Maple St', '456 Oak Ave', '789 Pine St', '102 Elm St']
})

# Merge datasets on customer_id
merged = pd.merge(crm_data, marketing_data, on='customer_id', suffixes=('_crm', '_marketing'))

# Identify mismatched addresses
address_conflicts = merged[merged['address_crm'] != merged['address_marketing']]

print("Conflicting address records:")
print(address_conflicts)

# Example resolution rule (demo only): always keep the CRM address.
# No verification check is applied here, so marketing addresses are never used.
resolved_addresses = merged.copy()
resolved_addresses['resolved_address'] = resolved_addresses['address_crm']  # Prioritize CRM

print("\nResolved addresses:")
print(resolved_addresses[['customer_id', 'resolved_address']])

Conflicting address records:
   customer_id address_crm address_marketing
1            2  456 Oak St       456 Oak Ave
3            4  101 Elm St        102 Elm St

Resolved addresses:
   customer_id resolved_address
0            1     123 Maple St
1            2       456 Oak St
2            3      789 Pine St
3            4       101 Elm St
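
The cell above always keeps the CRM address. For step 3, a rule that actually depends on verification might look like the following sketch. It assumes a hypothetical boolean column `marketing_verified` that marks marketing addresses confirmed by some independent check. Neither that column nor its values come from the datasets above; they are invented for illustration.

```python
import pandas as pd

# Same merged records as in the exercise, plus a hypothetical verification flag.
merged = pd.DataFrame({
    "customer_id": [1, 2, 3, 4],
    "address_crm": ["123 Maple St", "456 Oak St", "789 Pine St", "101 Elm St"],
    "address_marketing": ["123 Maple St", "456 Oak Ave", "789 Pine St", "102 Elm St"],
    # Assumption: True where the marketing address was independently verified.
    "marketing_verified": [False, True, False, False],
})

# Use the marketing address only where it is verified; otherwise keep the CRM address.
merged["resolved_address"] = merged["address_marketing"].where(
    merged["marketing_verified"], merged["address_crm"]
)

# Flag conflicts that no verified source settles, so they can be reviewed manually.
merged["needs_review"] = (
    (merged["address_crm"] != merged["address_marketing"])
    & ~merged["marketing_verified"]
)

print(merged[["customer_id", "resolved_address", "needs_review"]])
```

With these illustrative flags, customer 2 resolves to the verified marketing address, while customer 4 keeps the CRM address and is marked for review because neither source has been verified.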
