When the heat generation required for the manufacturing and transportation of products is factored in, _greenhouse gas emissions attributable to products, from food to sneakers to appliances, make up more than 75% of global emissions._ -[The Carbon Catalogue](https://www.nature.com/articles/s41597-022-01178-9)

This data, which is publicly available on [nature.com](https://www.nature.com/articles/s41597-022-01178-9), contains product carbon footprints (PCFs) for various companies. A PCF is the greenhouse gas emissions attributable to a given product, measured in CO<sub>2</sub>e (carbon dioxide equivalent).

The data is stored in a PostgreSQL database containing one table, `product_emissions`, which records PCFs by product as well as the stage of production in which the emissions occurred. Here are the columns of `product_emissions` and their data types:

### `product_emissions`

| field                              | data type |
|------------------------------------|-----------|
| `id`                                 | `VARCHAR`   |
| `year`                               | `INT`       |
| `product_name`                       | `VARCHAR`   |
| `company`                            | `VARCHAR`   |
| `country`                            | `VARCHAR`   |
| `industry_group`                     | `VARCHAR`   |
| `weight_kg`                          | `NUMERIC`   |
| `carbon_footprint_pcf`               | `NUMERIC`   |
| `upstream_percent_total_pcf`         | `VARCHAR`   |
| `operations_percent_total_pcf`       | `VARCHAR`   |
| `downstream_percent_total_pcf`       | `VARCHAR`   |


In [None]:
-- See what's in the product_emissions table
SELECT * 
FROM product_emissions;

-- Most recent year for which carbon emissions data was collected
SELECT MAX(year) AS most_recent_date
FROM product_emissions;

-- First year for which carbon emissions data was collected
SELECT MIN(year) AS first_date
FROM product_emissions;

-- How many countries was data collected for?
SELECT COUNT(DISTINCT country) AS num_countries
FROM product_emissions;

-- Which countries was data collected for?
SELECT DISTINCT country
FROM product_emissions;

-- How many distinct industry groups are there?
SELECT COUNT(DISTINCT industry_group) AS num_industry_groups
FROM product_emissions;

-- Which industry groups are there?
SELECT DISTINCT industry_group
FROM product_emissions;

-- What is the average weight_kg and carbon_footprint_pcf for each company?
SELECT company, ROUND(AVG(weight_kg),1) AS avg_weight_kg, ROUND(AVG(carbon_footprint_pcf),1) AS avg_carbon_footprint_pcf
FROM product_emissions
GROUP BY company
ORDER BY avg_carbon_footprint_pcf DESC, avg_weight_kg DESC;

-- What is the average weight_kg and carbon_footprint_pcf for each country?
SELECT country, ROUND(AVG(weight_kg),1) AS avg_weight_kg, ROUND(AVG(carbon_footprint_pcf),1) AS avg_carbon_footprint_pcf
FROM product_emissions
GROUP BY country
ORDER BY avg_carbon_footprint_pcf DESC, avg_weight_kg DESC;

-- What is the average weight_kg and carbon_footprint_pcf for each country in each year?
SELECT country, year, ROUND(AVG(weight_kg),1) AS avg_weight_kg, ROUND(AVG(carbon_footprint_pcf),1) AS avg_carbon_footprint_pcf
FROM product_emissions
GROUP BY country, year
ORDER BY country, year;

-- Number of companies in each industry group, with the total PCF for that group,
-- restricted to the most recent year in the data.
-- COUNT(DISTINCT company) counts companies; COUNT(*) would count product rows instead.
SELECT industry_group,
       COUNT(DISTINCT company) AS num_companies,
       ROUND(SUM(carbon_footprint_pcf), 1) AS total_industry_footprint
FROM product_emissions
WHERE year = (SELECT MAX(year)
              FROM product_emissions)
GROUP BY industry_group
ORDER BY total_industry_footprint DESC;
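
-- The table also records the stage of production in which emissions occurred,
-- but the three *_percent_total_pcf columns are VARCHAR, so they must be cast
-- before any arithmetic. A minimal sketch, not a definitive approach.
-- ASSUMPTION: usable values look like '57.5' or '57.5%'. The actual format has
-- not been checked here; anything that does not match the pattern is treated
-- as NULL and therefore ignored by AVG.
-- Average upstream share of the PCF for each industry group:
SELECT industry_group,
       ROUND(AVG(
           CASE
               WHEN upstream_percent_total_pcf ~ '^[0-9]+(\.[0-9]+)?%?$'
               THEN CAST(REPLACE(upstream_percent_total_pcf, '%', '') AS NUMERIC)
           END
       ), 1) AS avg_upstream_percent
FROM product_emissions
GROUP BY industry_group
ORDER BY avg_upstream_percent DESC NULLS LAST;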