# Data Analysis of Revenues by Location

In [26]:
import boto3
import pandas as pd
import plotly.express as px
# Step 1: Create the boto3 S3 client
s3 = boto3.client('s3')
bucket_name = 'testingpipelineyesner'
doc_key = 'traffic/sales.csv'

# Step 2: Fetch the CSV object from the bucket
summary_response = s3.get_object(Bucket=bucket_name, Key=doc_key)

# Step 3: Parse the semicolon-separated body; the first column holds the row index
summary_df = pd.read_csv(summary_response['Body'], sep=';', index_col=0)
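
The parsing step can be tried without S3 access. The following is a minimal sketch, assuming a small in-memory byte string in place of the object body; `raw_bytes`, `sample_df`, and the two data rows are illustrative stand-ins, not the contents of the real `traffic/sales.csv`.

```python
import io

import pandas as pd

# Hypothetical stand-in for the S3 object body: a semicolon-separated CSV
# whose first (unnamed) column is the saved row index.
raw_bytes = (
    b";order_date;total_price;location\n"
    b"0;2023-01-01;122.46;Norrisstad\n"
    b"1;2023-03-18;277.59;Davisburgh\n"
)

# read_csv accepts a binary file-like object, so BytesIO plays the role of the body here.
sample_df = pd.read_csv(io.BytesIO(raw_bytes), sep=";", index_col=0)

print(sample_df.columns.tolist())  # the index column is not listed among the columns
print(sample_df.index.tolist())
```

With `index_col=0`, the first column becomes the row index rather than a data column, which is why it does not appear in `columns`.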


In [27]:
summary_df.drop(columns=['order_id', 'customer_id', 'product_id', 'quantity', 'price', 'name_y', 'name_x'])

| | order_date | total_price | location |
|---|---|---|---|
| 0 | 2023-01-01 | 122.46 | Norrisstad |
| 1 | 2023-01-02 | 122.46 | Norrisstad |
| 2 | 2023-01-04 | 122.46 | Norrisstad |
| 3 | 2023-03-29 | 289.20 | Norrisstad |
| 4 | 2023-01-08 | 31.92 | Norrisstad |
| ... | ... | ... | ... |
| 95 | 2023-03-18 | 277.59 | Davisburgh |
| 96 | 2023-04-04 | 185.06 | Davisburgh |
| 97 | 2023-02-27 | 86.36 | Davisburgh |
| 98 | 2023-03-02 | 86.36 | Davisburgh |
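
Note that `DataFrame.drop` returns a new frame by default, so the cell that produced this output only changes what is displayed; `summary_df` itself keeps all of its columns. A minimal sketch of that behavior, assuming a hypothetical two-row frame (`df` and `trimmed` are illustrative names):

```python
import pandas as pd

# Hypothetical two-row frame with a subset of the notebook's columns.
df = pd.DataFrame({
    "order_id": [1, 2],
    "total_price": [122.46, 277.59],
    "location": ["Norrisstad", "Davisburgh"],
})

# drop() leaves df untouched and returns a new DataFrame without the column.
trimmed = df.drop(columns=["order_id"])

print(list(df.columns))       # still includes order_id
print(list(trimmed.columns))  # order_id removed
```

To keep the reduced frame for later cells, the result would need to be assigned to a variable, as `trimmed` is here.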


In [28]:

grouped_df = summary_df.groupby('location').agg({'total_price': 'sum'}).reset_index()

fig = px.bar(grouped_df, x='location', y='total_price',
             text='total_price', title='Revenues by Location',
             labels={'total_price': 'Revenues', 'location': 'Location'},
             color='total_price', color_continuous_scale='Viridis')

fig.update_traces(texttemplate='%{text:.2s}', textposition='outside')
fig.update_layout(uniformtext_minsize=8, uniformtext_mode='hide')

fig.show()

**Chart Title: "Revenues by Location"**

**Introduction:**
The chart shows the total revenue generated by each location over the period covered by the dataset. Each bar represents one location (for example, Norrisstad or Davisburgh), and its height gives the total revenue for that location.

**Interpretation:**
- **Bar Height:** The height of each bar is the sum of `total_price` for the corresponding location. The taller the bar, the greater that location's total revenue.
  
- **Bar Color:** Bar color encodes the same total on the Viridis scale, which runs from dark purple for lower totals to yellow for higher totals.
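
To make the bar heights concrete, here is a small sketch of the same `groupby` aggregation. It assumes only the nine rows visible in the truncated notebook output, so the totals it prints are for that sample and not for the full dataset; `rows`, `sample_df`, and `totals` are illustrative names.

```python
import pandas as pd

# Only the nine rows visible in the truncated output, not the full dataset.
rows = [
    ("Norrisstad", 122.46), ("Norrisstad", 122.46), ("Norrisstad", 122.46),
    ("Norrisstad", 289.20), ("Norrisstad", 31.92),
    ("Davisburgh", 277.59), ("Davisburgh", 185.06),
    ("Davisburgh", 86.36), ("Davisburgh", 86.36),
]
sample_df = pd.DataFrame(rows, columns=["location", "total_price"])

# Same aggregation as the notebook: one summed total_price per location.
totals = sample_df.groupby("location").agg({"total_price": "sum"}).reset_index()

print(totals)
```

For these sample rows the totals come to about 635.37 for Davisburgh and 688.50 for Norrisstad; each value is what the chart would draw as one bar.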

**Specific Readings:**
- **X-Axis Labels (Location):** The labels on the X-axis show the location names (for example, Norrisstad and Davisburgh).
  
- **Y-Axis Labels (Total Revenues):** The Y-axis, labeled "Revenues", gives the scale against which each location's total revenue is read.

**Conclusions:**
- The chart makes it easy to identify the locations that contributed most to total revenues during the analyzed period.
  
- Comparing the bars side by side shows how large the revenue gaps between locations are.
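
The same comparison can be expressed numerically by ranking locations and computing each one's share of the total. This is a sketch under stated assumptions: the two totals are those of the nine sample rows shown in the notebook output, not of the full dataset, and `share_pct` is a hypothetical column name introduced for illustration.

```python
import pandas as pd

# Totals for the displayed sample rows only (illustrative, not the full dataset).
totals = pd.DataFrame({
    "location": ["Davisburgh", "Norrisstad"],
    "total_price": [635.37, 688.50],
})

# Rank locations from highest to lowest revenue.
ranked = totals.sort_values("total_price", ascending=False).reset_index(drop=True)

# Each location's percentage share of the combined revenue, rounded to one decimal.
ranked["share_pct"] = (ranked["total_price"] / ranked["total_price"].sum() * 100).round(1)

print(ranked)
```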

This chart gives an easily interpretable view of how total revenues are distributed among locations, which supports decisions based on each location's relative performance.

- https://www.linkedin.com/in/yesner-salgado/
- https://yesnersalgado.me/