According to [The Carbon Catalogue](https://www.nature.com/articles/s41597-022-01178-9), once the heat generation required to manufacture and transport products is factored in, _greenhouse gas emissions attributable to products, from food to sneakers to appliances, make up more than 75% of global emissions._

Our data, which is publicly available on [nature.com](https://www.nature.com/articles/s41597-022-01178-9), contains product carbon footprints (PCFs) for products from various companies. A PCF is the greenhouse gas emissions attributable to a given product, measured in CO<sub>2</sub>e (carbon dioxide equivalent).

This data is stored in a PostgreSQL database containing one table, `product_emissions`, which records the PCF of each product as well as the stage of production in which those emissions occurred. Here's a snapshot of the columns in `product_emissions`:

### `product_emissions`

| field                              | data type |
|------------------------------------|-----------|
| `id`                                 | `VARCHAR`   |
| `year`                               | `INT`       |
| `product_name`                       | `VARCHAR`   |
| `company`                            | `VARCHAR`   |
| `country`                            | `VARCHAR`   |
| `industry_group`                     | `VARCHAR`   |
| `weight_kg`                          | `NUMERIC`   |
| `carbon_footprint_pcf`               | `NUMERIC`   |
| `upstream_percent_total_pcf`         | `VARCHAR`   |
| `operations_percent_total_pcf`       | `VARCHAR`   |
| `downstream_percent_total_pcf`       | `VARCHAR`   |
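
As a rough sketch, the schema above can be written out as a `CREATE TABLE` statement. The example uses Python's built-in `sqlite3` module with an in-memory database purely as a stand-in for the PostgreSQL instance; that substitution is an assumption made for portability, and SQLite only records the declared type names rather than enforcing them the way PostgreSQL would.

```python
import sqlite3

# In-memory SQLite stands in for the PostgreSQL database (an assumption).
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE product_emissions (
        id                            VARCHAR,
        year                          INT,
        product_name                  VARCHAR,
        company                       VARCHAR,
        country                       VARCHAR,
        industry_group                VARCHAR,
        weight_kg                     NUMERIC,
        carbon_footprint_pcf          NUMERIC,
        upstream_percent_total_pcf    VARCHAR,
        operations_percent_total_pcf  VARCHAR,
        downstream_percent_total_pcf  VARCHAR
    )
""")

# PRAGMA table_info yields one row per column:
# (cid, name, type, notnull, dflt_value, pk).
columns = [
    (row[1], row[2])
    for row in conn.execute("PRAGMA table_info(product_emissions)")
]
for name, data_type in columns:
    print(f"{name:32} {data_type}")
```

Note that the three stage columns are declared as `VARCHAR`, not `NUMERIC`, which is why the stage-level query later in the notebook casts them before averaging.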

You'll use this data to examine the carbon footprint of each industry in the dataset! 

### Calculate Total Industry Carbon Footprint for 2017

In [4]:
SELECT
    industry_group,
    COUNT(DISTINCT company) AS num_companies,
    ROUND(SUM(carbon_footprint_pcf), 1) AS total_industry_footprint
FROM product_emissions
WHERE year = 2017
GROUP BY industry_group
ORDER BY total_industry_footprint DESC;

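
A SQL statement placed directly in a Python cell is parsed as Python and fails before it ever reaches the database. One way around that, sketched here, is to pass the query as a string through a database connection. The in-memory `sqlite3` database and the sample rows are hypothetical stand-ins, not values from the real dataset; against PostgreSQL the same query would go through a PostgreSQL driver instead.

```python
import sqlite3

# In-memory SQLite stands in for the PostgreSQL database (an assumption).
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE product_emissions (
        year INT,
        company VARCHAR,
        industry_group VARCHAR,
        carbon_footprint_pcf NUMERIC
    )
""")

# Hypothetical rows for illustration only.
sample_rows = [
    (2017, "Company A", "Materials", 100.5),
    (2017, "Company B", "Materials", 50.2),
    (2017, "Company A", "Materials", 10.0),
    (2017, "Company C", "Software", 5.0),
    (2016, "Company D", "Software", 999.0),  # excluded by the year filter
]
conn.executemany("INSERT INTO product_emissions VALUES (?, ?, ?, ?)", sample_rows)

# The year is bound as an integer parameter, matching the INT column.
QUERY = """
    SELECT industry_group,
           COUNT(DISTINCT company) AS num_companies,
           ROUND(SUM(carbon_footprint_pcf), 1) AS total_industry_footprint
    FROM product_emissions
    WHERE year = ?
    GROUP BY industry_group
    ORDER BY total_industry_footprint DESC
"""
totals_2017 = conn.execute(QUERY, (2017,)).fetchall()
for row in totals_2017:
    print(row)
```

`COUNT(DISTINCT company)` counts Company A once even though it contributes two products, so `num_companies` reflects companies rather than product rows.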

### Examine Carbon Footprints Across Industries

In [5]:
SELECT
    industry_group,
    AVG(carbon_footprint_pcf) AS avg_carbon_footprint
FROM
    product_emissions
GROUP BY
    industry_group
ORDER BY
    avg_carbon_footprint DESC;


### Identify Key Contributors

In [None]:
SELECT
    company,
    industry_group,
    SUM(carbon_footprint_pcf) AS total_carbon_footprint
FROM
    product_emissions
GROUP BY
    company, industry_group
ORDER BY
    total_carbon_footprint DESC
LIMIT 10;
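
The ranking query above can be tried out on a small scale as follows. This is a minimal sketch under stated assumptions: the in-memory `sqlite3` database replaces PostgreSQL, the company names and footprints are made up, and the limit is passed as a parameter (2 here, rather than 10) so the cutoff is visible on five rows.

```python
import sqlite3

# In-memory SQLite stands in for the PostgreSQL database (an assumption).
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE product_emissions (
        company VARCHAR,
        industry_group VARCHAR,
        carbon_footprint_pcf NUMERIC
    )
""")

# Hypothetical rows for illustration only.
sample_rows = [
    ("Company A", "Materials", 300.0),
    ("Company A", "Materials", 200.0),
    ("Company B", "Automobiles", 450.0),
    ("Company C", "Software", 20.0),
    ("Company C", "Software", 5.0),
]
conn.executemany("INSERT INTO product_emissions VALUES (?, ?, ?)", sample_rows)

TOP_QUERY = """
    SELECT company,
           industry_group,
           SUM(carbon_footprint_pcf) AS total_carbon_footprint
    FROM product_emissions
    GROUP BY company, industry_group
    ORDER BY total_carbon_footprint DESC
    LIMIT ?
"""
top_contributors = conn.execute(TOP_QUERY, (2,)).fetchall()
for company, industry, total in top_contributors:
    print(f"{company:10} {industry:12} {total}")
```

Company A ranks first because its two products are summed before sorting; no single row of Company A exceeds Company B's footprint on its own.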

### Analyze Emissions by Production Stage

In [None]:
SELECT
    industry_group,
    AVG(CAST(upstream_percent_total_pcf AS NUMERIC)) AS avg_upstream_percent,
    AVG(CAST(operations_percent_total_pcf AS NUMERIC)) AS avg_operations_percent,
    AVG(CAST(downstream_percent_total_pcf AS NUMERIC)) AS avg_downstream_percent
FROM
    product_emissions
GROUP BY
    industry_group;
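
Because the stage columns are stored as `VARCHAR`, a plain `CAST(... AS NUMERIC)` in PostgreSQL raises an error on any value that is not a valid number. The sketch below shows one defensive alternative: parse the strings in Python and skip anything unparseable before averaging. It assumes, without confirmation from the dataset, that values might carry a trailing `%` or a textual placeholder; `parse_percent` is a hypothetical helper, the rows are made up, and in-memory `sqlite3` again stands in for PostgreSQL.

```python
import sqlite3
from collections import defaultdict


def parse_percent(text):
    """Return the value as a float, or None when the text is not a number."""
    try:
        return float(text.strip().rstrip("%"))
    except (AttributeError, ValueError):
        return None


# In-memory SQLite stands in for the PostgreSQL database (an assumption).
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE product_emissions (
        industry_group VARCHAR,
        upstream_percent_total_pcf VARCHAR,
        operations_percent_total_pcf VARCHAR,
        downstream_percent_total_pcf VARCHAR
    )
""")

# Hypothetical rows, including one with non-numeric placeholders.
sample_rows = [
    ("Materials", "60%", "30%", "10%"),
    ("Materials", "40%", "50%", "10%"),
    ("Materials", "N/a", "N/a", "N/a"),
    ("Software", "20.5", "70.5", "9.0"),
]
conn.executemany("INSERT INTO product_emissions VALUES (?, ?, ?, ?)", sample_rows)

# Collect parseable values per industry: (upstream, operations, downstream).
stage_values = defaultdict(lambda: ([], [], []))
rows = conn.execute("""
    SELECT industry_group,
           upstream_percent_total_pcf,
           operations_percent_total_pcf,
           downstream_percent_total_pcf
    FROM product_emissions
""")
for industry, *stages in rows:
    for bucket, raw in zip(stage_values[industry], stages):
        value = parse_percent(raw)
        if value is not None:
            bucket.append(value)

# Average each stage; None marks a stage with no usable values.
stage_averages = {
    industry: tuple(sum(b) / len(b) if b else None for b in buckets)
    for industry, buckets in stage_values.items()
}
for industry, averages in stage_averages.items():
    print(industry, averages)
```

Skipping unparseable rows means each average is taken only over products that report stage-level data, so the number of products behind each figure may differ from the industry's total product count.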