# Darwin's Naturalization Hypothesis

The invasion success of non-native species is higher in areas that are poor in closely related species than in areas that are rich in closely related species.

This example notebook loads the [ORKG comparison data](https://orkg.org/orkg/comparison/R53407) for this hypothesis, then processes and visualizes it to show how much support the hypothesis has in the literature.

In [None]:
from orkg import ORKG
import numpy as np
import matplotlib.pyplot as plt

In [None]:
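# Connect to the public ORKG instance and its comparison (simcomp) service.
orkg = ORKG(host='https://orkg.org/orkg', simcomp_host='https://orkg.org/orkg/simcomp')

# Fetch the comparison as a DataFrame, then transpose it so that
# properties such as 'stand of hypothesis' become columns.
df = orkg.contributions.compare_dataframe(comparison_id='R53407')
df = df.T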

In [None]:
df.head(5)

**Q1: How much support does the hypothesis have?**

In [None]:
# Count how many contributions take each stand on the hypothesis.
df1 = df.value_counts(['stand of hypothesis']).reset_index(name='count')
df1

In [None]:
# Pie chart of the share of contributions per stand.
labels = np.array(df1['stand of hypothesis']).astype(str)
sizes = np.array(df1['count']).astype(int)

fig1, ax1 = plt.subplots()
ax1.pie(sizes, labels=labels, autopct='%1.1f%%', startangle=90)
plt.show()
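
The percentages drawn on the pie chart can also be read off as a table. The following is a minimal sketch on a small, made-up DataFrame (`toy` and its values are hypothetical and are not taken from the ORKG comparison); it assumes only that the column is named `stand of hypothesis`, as in the cells above.

```python
import pandas as pd

# Hypothetical stand-in for the comparison table; the real `df` comes from ORKG.
toy = pd.DataFrame({
    'stand of hypothesis': ['Supported', 'Questioned', 'Supported', 'Undecided'],
})

# normalize=True returns fractions instead of raw counts;
# multiplying by 100 turns them into percentages.
shares = toy['stand of hypothesis'].value_counts(normalize=True).mul(100).round(1)
print(shares)
```

Replacing `toy` with `df` should give the same percentages that `autopct` prints on the chart, up to rounding.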

**Q2: What kind of species have been investigated?**

In [None]:
df['Investigated species'].value_counts()

**Q3: How much support does the hypothesis have for plants (as investigated species)?**

In [None]:
# Keep only the contributions that investigated plants, then count stands.
df_plants = df[df['Investigated species'] == 'Plants']
df_plants = df_plants.value_counts(['stand of hypothesis']).reset_index(name='count')

labels = np.array(df_plants['stand of hypothesis']).astype(str)
sizes = np.array(df_plants['count']).astype(int)

fig1, ax1 = plt.subplots()
ax1.pie(sizes, labels=labels, autopct='%1.1f%%', startangle=90)
plt.show()
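
The same question can be asked for every species group at once rather than for plants only. One possible approach, sketched here on made-up data (`toy` and all of its values are hypothetical), is a cross-tabulation of the two columns used above.

```python
import pandas as pd

# Hypothetical stand-in for the comparison table; values are illustrative only.
toy = pd.DataFrame({
    'Investigated species': ['Plants', 'Plants', 'Fishes', 'Plants'],
    'stand of hypothesis': ['Supported', 'Questioned', 'Supported', 'Supported'],
})

# Rows: species group; columns: stand; cells: number of contributions.
table = pd.crosstab(toy['Investigated species'], toy['stand of hypothesis'])
print(table)
```

The `Plants` row of such a table holds the counts that the pie chart above displays as shares.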

**Q4: During which years were articles related to this hypothesis published most frequently?**

In [None]:
df.value_counts(['Study date']).reset_index(name='Count')
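
`value_counts` orders the result by frequency, which answers the question directly. To read the counts as a timeline instead, they can be sorted by year. This is a minimal sketch on made-up data; it assumes that `Study date` holds four-digit years as strings, which may not match the actual comparison data.

```python
import pandas as pd

# Hypothetical years; the real values come from the 'Study date' column of `df`.
toy = pd.DataFrame({'Study date': ['2010', '2012', '2010', '2008', '2012', '2010']})

per_year = toy['Study date'].value_counts()

# Year with the most entries.
most_common_year = per_year.idxmax()

# Same counts, ordered chronologically rather than by frequency.
chronological = per_year.sort_index()
print(most_common_year)
print(chronological)
```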

**Q5: Which regions have been studied most frequently?**

In [None]:
df['Continent'].value_counts()