# Licences

**The use of various licences reveals potential intellectual property issues related to the use of software packages as well as the general openness of the projects.** Licences ensure that communities behind many of today's most innovative technologies can protect their creations in whatever way they see fit.

**Permissive licences such as BSD, Apache and MIT are the most popular in this investigation. The MIT licence was the top choice, used in 26.1% of the projects.** MIT is a short and simple permissive licence. Permissive licences are easier to use in commercial products because, unlike copyleft licences, they do not require derived code to be released under a similar licence. Permissive licences like MIT therefore create many opportunities for reuse and commercialisation of OSS projects within proprietary software. On the other hand, projects under such a licence may be put at risk by their use in proprietary products. Companies are not required to release bug fixes or enhancements, so improvements may never be contributed back to the open source project. This can lead to developers losing the motivation to invest further time, because their work is being used without any support for its development.

**The second most used licence category was Custom, with 23.8%.** All projects whose licence could not be mapped to a standard open source licence are counted as custom. Modified versions of standard open source licences also fall into this category.
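The mapping rule described above can be sketched as follows. This is a minimal illustration, not the procedure actually used in the study: the set `STANDARD_LICENCES` and the helper `normalise_licence` are hypothetical names, and the list of recognised identifiers is an assumption made for the example.

```python
# Hypothetical set of identifiers treated as "standard" for this sketch;
# the list used in the actual analysis is not shown in the text.
STANDARD_LICENCES = {"MIT", "Apache-2.0", "BSD-3-Clause", "GPL-3.0"}


def normalise_licence(raw: str) -> str:
    """Return the identifier if it is a known standard licence, else "Custom"."""
    cleaned = raw.strip()
    return cleaned if cleaned in STANDARD_LICENCES else "Custom"


examples = ["MIT", " GPL-3.0 ", "MIT with additional clause"]
mapped = [normalise_licence(name) for name in examples]
print(mapped)
```

Anything that does not match a recognised identifier exactly, including a standard licence with modifications, ends up in the Custom bucket, which mirrors the rule stated in the text.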

**GPL 3.0 is the third most popular licence, used by 17.5% of all projects.** This strong copyleft licence requires that the complete source code of licensed works and modifications be made available under the same licence. The GPL was created to prevent software from becoming proprietary. Copyright and licence notices must be preserved, and contributors provide an express grant of patent rights. Copyleft licences are more prevalent in business models that rely on direct monetisation of the core project.

Within the individual categories, however, no clear preference for one licence or another can be identified. Compared with insights published by Mend, the distribution of licences differs in multiple ways. Instead of Apache, GPL 3.0 is the most commonly used standard licence after MIT.
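One way to check for a per-category preference is a row-normalised cross-tabulation. The sketch below uses a small made-up sample; the `category` column name and all values are assumptions for illustration and are not taken from the study's data.

```python
import pandas as pd

# Hypothetical sample data; the "category" column name is an assumption.
df = pd.DataFrame(
    {
        "category": ["Energy", "Energy", "Climate", "Climate", "Climate"],
        "license": ["MIT", "GPL-3.0", "MIT", "Custom", "MIT"],
    }
)

# normalize="index" divides each row by its total, so every category sums to 1
# and the licence shares can be compared across categories of different size.
shares_by_category = pd.crosstab(df["category"], df["license"], normalize="index")
print(shares_by_category.round(2))
```

If one licence dominated a category, its row would show a share close to 1 in that column.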

In [1]:
import numpy as np
import pandas as pd
import plotly.io as pio
import plotly.graph_objects as go
import plotly.express as px
from opensustain_template import *

In [2]:
df_active = pd.read_csv("../csv/project_analysis.csv")

In [3]:
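# Count projects per licence. Naming the count column explicitly keeps this
# working on pandas 1.x and 2.x, which label the value_counts() result differently.
license_his = (
    df_active["license"]
    .value_counts()
    .rename_axis("license_names")
    .reset_index(name="count")
)
fig = px.pie(
    license_his,
    values="count",
    names="license_names",
    color_discrete_sequence=color_discrete_sequence,  # presumably provided by opensustain_template
    hole=0.2,
)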

fig.update_layout(title="Distribution of Licences", showlegend=False, font_size=16)
fig.update_traces(textposition='inside', textinfo='percent+label', marker=dict(line=dict(color='#000000', width=1)))
fig.add_layout_image(
    dict(
        source="logo.png",
        xref="paper", yref="paper",
        x=1, y=1,
        sizex=0.15, sizey=0.15,
        xanchor="right", yanchor="top"
    )
)
fig.show()
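
The percentage figures quoted in the text correspond to relative frequencies of the licence column. A minimal sketch of that calculation, using a made-up series rather than the contents of `project_analysis.csv`:

```python
import pandas as pd

# Hypothetical licence values, for illustration only.
licences = pd.Series(["MIT", "MIT", "Custom", "GPL-3.0"], name="license")

# normalize=True returns fractions instead of counts; scale to percent and round.
share_pct = licences.value_counts(normalize=True).mul(100).round(1)
print(share_pct)
```

Applied to the real `license` column, the same call would give the shares that the pie chart displays as labels.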