# Topics

Project topics have been identified across common sectors, technologies and research fields. The classification process was iterated multiple times as part of the analysis and will continue to evolve as niches emerge and develop. While it may be difficult to compare the scope of the topics directly, their relative sizes allow us to identify neglected, vibrant, and emerging areas.

In [2]:
import numpy as np
import pandas as pd
import plotly.io as pio
import plotly.graph_objects as go
import plotly.express as px
from opensustain_template import *

In [3]:
df_active = pd.read_csv("../csv/project_analysis.csv")

In [4]:
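# Count projects per topic. Naming both columns explicitly keeps the result
# the same across pandas versions (value_counts() naming changed in pandas 2.0).
topic_his = (
    df_active["topic"]
    .value_counts()
    .rename_axis("topic_names")
    .reset_index(name="projects")
)

# Horizontal bar chart: one bar per topic, bar length = number of projects.
fig = px.bar(
    topic_his,
    x="projects",
    y="topic_names",
    orientation="h",
)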

fig.update_layout(
    height=1000,
    yaxis_title=None,
    xaxis_title="Projects",
    title="Projects within Topics",
    coloraxis_colorbar=dict(title="DDS"),
    hoverlabel=dict(bgcolor="white"),
    margin=dict(l=300, r=0, b=0, t=40),
    showlegend=False,
)
# marker_color and logo_img come from the opensustain_template import above.
fig.update_traces(marker_color=marker_color)
fig.add_layout_image(
    dict(
        source=logo_img,
        xref="paper", yref="paper",
        x=1, y=1,
        sizex=0.06, sizey=0.06,
        xanchor="right", yanchor="top"
    )
)
fig.show()

```{figure} data:image/gif;base64,R0lGODlhAQABAIAAAAAAAP///yH5BAEAAAAALAAAAAABAAEAAAIBRAA7
:figclass: caption-hack
:name: projects-within-topics

Number of individual projects within topics
```

**~45% of all identified projects can be found within Climate Data Processing and Access, Biosphere, Hydrosphere, Water Supply and Quality, Energy System Modelling, Mobility and Transportation, and Buildings and Heating.** This is likely due to the research maturity of these fields, the multitude of scientific organisations behind them, and the relatively good availability of open data from natural and engineered systems in these categories. We can see strong open source ecosystems, particularly in the field of Energy Modelling and Renewable Energy, such as Photovoltaics or Wind Energy. However, despite the central importance of batteries for energy storage, only a few OSS projects are under development in that area. Of particular interest are areas where software plays a central role but only a small number of projects can be identified.
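A combined share like the one quoted above can be derived directly from the `topic` column. The following is a minimal sketch, not the notebook's actual computation: the helper `combined_share` and the toy data are hypothetical, and the topic labels are placeholders rather than real counts from `project_analysis.csv`.

```python
import pandas as pd

# Toy data for illustration only; the real analysis reads
# ../csv/project_analysis.csv, and these counts are invented.
toy = pd.DataFrame(
    {"topic": ["Topic A"] * 4 + ["Topic B"] * 3 + ["Topic C"] * 2 + ["Topic D"] * 1}
)


def combined_share(df, topics, column="topic"):
    """Return the percentage of rows whose topic is one of `topics`."""
    # isin() yields a boolean mask; its mean is the fraction of matching rows.
    return df[column].isin(topics).mean() * 100


share = combined_share(toy, ["Topic A", "Topic B"])
print(f"{share:.1f}% of projects fall into the selected topics")
```

Applied to `df_active` with the seven topic names listed above, the same helper should give the combined share for the real data, assuming the labels match the values in the `topic` column exactly.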

**Open source solutions are largely underrepresented within [Sustainable Investment](https://opensustain.tech/#sustainable-investment), representing only 1.15 % (a total of 11 projects) of all identified projects.** Despite ongoing discussions about ESG (Environmental, Social and Governance) ratings regarding their quality and transparency, the field is dominated by proprietary closed-source frameworks and datasets. The lack of open source and open science in sustainable investment reflects the lack of transparent impact measurement and evaluation, which is key in financing a sustainable transformation. **Also, the field of Energy and Resource Consumption shows a very low level of OSS developments, at only 0.28 %.**
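Per-topic percentages such as these can be tabulated next to the raw counts, which also makes it easy to list topics below a chosen threshold. This is a sketch under stated assumptions: `topic_shares`, the 15 % threshold, and the toy data are hypothetical and only illustrate the calculation.

```python
import pandas as pd


def topic_shares(df, column="topic"):
    """Return project counts and percentage shares per topic, largest first."""
    counts = df[column].value_counts()
    shares = (counts / counts.sum() * 100).round(2)
    table = pd.DataFrame({"projects": counts, "share_percent": shares})
    return table.rename_axis("topic_names").reset_index()


# Toy data for illustration only; these counts are invented.
toy = pd.DataFrame({"topic": ["Topic A"] * 6 + ["Topic B"] * 3 + ["Topic C"] * 1})

table = topic_shares(toy)
# Topics whose share falls below an arbitrary, illustrative threshold.
low = table[table["share_percent"] < 15]
print(table)
print(low)
```

The threshold is a free parameter; which value counts as "underrepresented" is a judgment call and is not defined in the analysis above.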

**In Emission Observation and Modeling, only 21 developments have been identified, representing 2.1% of all projects.** Despite the major impact of anthropogenic emissions on the climate, there is a clear lack of open source tools, platforms, and communities that reflect the magnitude of the challenges we are facing. An open source community that brings together the various emissions monitoring and modelling datasets from around the world within a single platform would have a substantial business opportunity. Such a platform would be critical for creating transparency around pressing issues such as carbon trading, carbon taxes, and company sustainability assessments. Electricity Maps has demonstrated, with great success, how this approach works when applied to local energy grids: hundreds of scientists and developers collaborate openly to integrate existing, publicly available data into one digital platform. There are promising new developments in this space, such as [The Global Registry of Fossil Fuels](https://fossilfuelregistry.org/).

**Topics with low OSS representation include bioenergy, hydrogen, and carbon capture.** This is likely due to the nascent nature of these areas and the smaller academic communities working in them. These technologies have a higher degree of uncertainty, with intellectual property closely guarded by a few for-profit companies. The small number of open source projects makes it difficult to quantify, transparently and independently, the sustainable developments in this area.

**Lastly, topics like carbon offsets or climate neutrality disclosure could not be investigated due to a general lack of OSS projects.** Despite intensive research, no OSS project or organisation (with the exception of [CarbonPlan](https://carbonplan.org/)) could be found that provides comprehensive and scientifically sound calculations and methodologies for the climate neutrality and carbon offset claims made by individual companies. Statements about the environmental impact of companies are primarily based on black box algorithms and analyses performed by companies and consultancies, which makes sustainability claims that include carbon offsets rather opaque.

```{figure} ../images/oco2peak.jpeg
---
width: 70%
---
The goal of [oco2peak](https://github.com/dataforgoodfr/batch7_satellite_ges) is to localize CO2 emissions on Earth based on the carbon concentration data measured by NASA's OCO-2 satellite. It is one of the few software tools in the field of Emission Observation and Modeling that has been released under an open licence and is still being developed further.
```
