# Your First Binary Simulations with POSYDON 🌠

**Tutorial goal:**

In this tutorial, we will run a small population of 10 binaries locally, and explore how to manipulate the output data from your population.

**New concepts:**

- Population ini file

If you haven't done so yet, export the POSYDON path environment variables.
Set these variables in your `.bash_profile` or `.zshrc` if you use POSYDON regularly.

In [1]:
%env PATH_TO_POSYDON=/YOUR/POSYDON/PATH/
%env PATH_TO_POSYDON_DATA=/YOUR/POSYDON_DATA/PATH/

env: PATH_TO_POSYDON=/YOUR/POSYDON/PATH/
env: PATH_TO_POSYDON_DATA=/YOUR/POSYDON_DATA/PATH/
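Before going further, it can help to confirm that both variables are visible to Python. The snippet below is a minimal sketch using only the standard library; the helper name `missing_posydon_vars` is hypothetical and not part of POSYDON.

```python
import os

# The two variable names come from the cell above; the helper itself is illustrative.
REQUIRED_VARS = ("PATH_TO_POSYDON", "PATH_TO_POSYDON_DATA")

def missing_posydon_vars(environ=os.environ):
    """Return the required variable names that are unset or empty."""
    return [name for name in REQUIRED_VARS if not environ.get(name)]

missing = missing_posydon_vars()
if missing:
    print("Please set:", ", ".join(missing))
else:
    print("POSYDON environment variables are set.")
```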


## Creating the Initialisation File

----
To run population synthesis with POSYDON, a `population_params.ini` file is required.
This file describes how the stellar population is created and which prescriptions and parameters are used in each evolutionary step.

POSYDON comes with a default `population_params_default.ini` file, found at `PATH_TO_POSYDON/posydon/popsyn`; you can also have a look at it [here](population_params.ini).

The file is split into three main parts. You can find more details about their properties by clicking on their links.
1. **[SimulationProperties]():**
   - Describes the properties and parameters of the different steps in the evolution of a binary system.
2. **[BinaryPopulation]():**
   - Parameters of the initial sampling of the binary population, such as the initial mass function, period distribution, and metallicity.
   - Also contains parameters on how the population is run, such as how many binaries are kept in memory.
3. **[SavingOutput]():**
   - Describes which data from the binary and each individual star are written to the output files.

We will copy the default population run parameter file to the current folder.

In [1]:
import os
import shutil
from posydon.config import PATH_TO_POSYDON

path_to_params = os.path.join(PATH_TO_POSYDON, "posydon/popsyn/population_params_default.ini")
shutil.copyfile(path_to_params, './population_params.ini')

'./population_params.ini'
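Note that `shutil.copyfile` silently overwrites an existing destination, so re-running the cell above would discard any edits you made to `./population_params.ini`. A minimal sketch of a guard against that, using a hypothetical helper `copy_if_missing` that is not part of POSYDON:

```python
import os
import shutil

def copy_if_missing(source, destination):
    """Copy source to destination unless destination already exists.

    Returns True if a copy was made, False if the existing file was kept.
    """
    if os.path.exists(destination):
        return False
    shutil.copyfile(source, destination)
    return True
```

In this tutorial it would be called as `copy_if_missing(path_to_params, './population_params.ini')`.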

# Creating and running a binary population

The copied `population_params.ini` contains the parameters to run 10 binaries at a metallicity of $Z=10^{-4}\,Z_\odot$.

If you open the file and scroll down to the **BinaryPopulation** section, you will see how they're defined:

```
metallicity = [0.0001] # [2., 1., 0.45, 0.2, 0.1, 0.01, 0.001, 0.0001]
# In units of solar metallicity
...
number_of_binaries = 10
# int
```
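If you want to check these two values without opening the file by hand, something like the following sketch may work. It parses a hypothetical excerpt modelled on the fragment above with Python's standard `configparser`; the section name `BinaryPopulation` is taken from the text, and the real file may need different parser options.

```python
import ast
import configparser

# Hypothetical excerpt modelled on the fragment above; the real file has many more entries.
SAMPLE_INI = """
[BinaryPopulation]
metallicity = [0.0001] # [2., 1., 0.45, 0.2, 0.1, 0.01, 0.001, 0.0001]
# In units of solar metallicity
number_of_binaries = 10
# int
"""

# Treat " # ..." after a value as an inline comment.
parser = configparser.ConfigParser(inline_comment_prefixes=("#",))
parser.read_string(SAMPLE_INI)

section = parser["BinaryPopulation"]
metallicities = ast.literal_eval(section["metallicity"])  # list of floats
number_of_binaries = section.getint("number_of_binaries")

print(metallicities, number_of_binaries)
```

`ast.literal_eval` is used instead of `eval` so that only Python literals, such as the list of metallicities, are accepted.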

If you would like to run a small population in a notebook, you can use the `PopulationRunner` to do this. If you want to run a specific binary instead, have a look at the [Binary Tutorial]().

The `PopulationRunner` class takes the `.ini` file and sets up a population run.

This will create [BinaryPopulations]() for each metallicity defined in the `.ini` file.
In this case, we can check and see that a single BinaryPopulation is created and contains 10 binaries.

In [2]:
from posydon.popsyn.synthetic_population import PopulationRunner
poprun = PopulationRunner('./population_params.ini')

In [3]:
print('Number of binary populations:',len(poprun.binary_populations))
print('Metallicity:', poprun.binary_populations[0].metallicity)
print('Number of binaries:', poprun.binary_populations[0].number_of_binaries)  

Number of binary populations: 1
Metallicity: 0.0001
Number of binaries: 10


After setting up the population run, we can evolve it. This should take about 20 seconds, but depends on your machine.

In [4]:
poprun.evolve()

# Inspecting the population: Population class

When you ran the population, you might have seen that a temporary folder with the name `1e-04_Zsun_batches` was created while the binaries were evolved.
Batches of the population are temporarily saved in this folder.
After the binary evolution has finished, the binaries in the folder are moved to a single file named `1e-04_Zsun_population.h5`. This is done automatically when you run a population using the `PopulationRunner` class.

The created file contains 3 main components:

1. **history:** the evolution of each individual binary, stored as a pandas DataFrame.
2. **oneline:** a single line per binary describing the initial and final conditions and some one-off parameters, such as the metallicity.
3. **mass_per_met:** some metadata on the population, such as the total simulated mass, the actual underlying mass of the population, and the number of binaries in the file.

The `Population` class provides an interface to these components in the file, so that you can share population runs and work with large populations that do not fit in memory.


<div class="alert alert-block alert-warning"><b>Older Population Files</b> 

If you're using older population files, you can make them compatible with the `Population` class by calling `Population(pop_file, metallicity, ini_file)`, where the `metallicity` is in solar units. You will only need to do this once; afterwards you can initialise the class as usual.</div>

In [5]:
from posydon.popsyn.synthetic_population import Population

In [6]:
pop = Population('1e-04_Zsun_population.h5')

The `pop.mass_per_met` attribute shows some basic information about the population you've just created.


1. The index is the metallicity of your population in solar units.
2. **simulated_mass** is the total ZAMS mass that has been evolved in the population.
3. **underlying_mass** is the actual mass of the population if one integrates the IMF and period distribution fully.
4. **number_of_systems** is the number of systems in the file, here 10.

In [7]:
pop.mass_per_met

metallicity,simulated_mass,underlying_mass,number_of_systems
0.0001,242.972243,1366.433688,10
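To make the relation between these columns concrete, here is a small sketch with the numbers copied from the output above. It computes the fraction of the underlying mass that was actually simulated and the number of systems per unit underlying mass, a normalisation often used when turning counts into rates. Treating the masses as solar masses is an assumption here.

```python
# Values copied from the mass_per_met output above.
simulated_mass = 242.972243     # total ZAMS mass actually evolved (assumed Msun)
underlying_mass = 1366.433688   # mass of the full underlying population (assumed Msun)
number_of_systems = 10

# Fraction of the underlying population's mass that was simulated.
simulated_fraction = simulated_mass / underlying_mass

# Number of simulated systems per unit mass of the underlying population.
systems_per_mass = number_of_systems / underlying_mass

print(f"simulated fraction: {simulated_fraction:.4f}")
print(f"systems per unit mass: {systems_per_mass:.5f}")
```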


There are some additional metadata properties available, such as:
- `metallicities`: the metallicities in the file in absolute metallicity
- `solar_metallicities`: the metallicities in the file in solar metallicity
- `number_of_systems`: the total number of systems in the Population file
- `indices`: the indices of the binaries in the file
- `columns`: the columns available in the `history` and `oneline` dataframes
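The two metallicity properties differ only by a factor of the solar metallicity. The sketch below illustrates the conversion; the value `Z_SUN = 0.0142` is an assumption, so check which solar metallicity your POSYDON version uses.

```python
# Assumed solar metallicity; check the value used by your POSYDON version.
Z_SUN = 0.0142

def solar_to_absolute(z_solar, z_sun=Z_SUN):
    """Convert a metallicity in solar units to an absolute mass fraction."""
    return z_solar * z_sun

print(solar_to_absolute(1e-4))
```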

In [8]:
# you can also access the total number of systems in the file with
print(pop.number_of_systems)

10


## Population.history

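`pop.history` loads the full history of all the binaries into memory.
You can access an individual binary or a selection of the population using several methods:


1. `pop.history[5]`: a single binary by its index
2. `pop.history[[0,4]]`: several binaries by their indices
3. `pop.history['time']`: a single column for all binaries
4. `pop.history.select()`: rows matching a query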

The `select` function is the most powerful way to access the binaries, because it allows you to perform selections based on the columns available in the history DataFrame.

For example, we can select on `state == 'RLO1'`, which returns all the rows in which RLO1 occurs.
At the same time, we can request only specific columns, such as `time` and `state`.


In [9]:
# select only binary_index 5
pop.history[5]

binary_index,state,event,time,orbital_period,eccentricity,lg_mtransfer_rate,step_names,step_times,S1_state,S1_mass,...,S2_he_core_mass,S2_he_core_radius,S2_co_core_mass,S2_co_core_radius,S2_center_h1,S2_center_he4,S2_surface_h1,S2_surface_he4,S2_surf_avg_omega_div_omega_crit,S2_spin
5,detached,ZAMS,0.0,15.529847,0.0,,initial_cond,0.0,H-rich_Core_H_burning,23.295575,...,,,,,0.750999,0.249,,,,
5,RLO1,CC1,8770880.0,18.001391,0.0,-4.2781,step_HMS_HMS,0.598279,H-rich_Central_C_depletion,11.73852,...,0.0,0.0,0.0,0.0,0.313222,0.6867762,0.462349,0.537642,0.989594,12.225433
5,detached,,8770880.0,17.686081,0.051478,,step_SN,0.000337,BH,9.83993,...,0.0,0.0,0.0,0.0,0.313222,0.6867762,0.462349,0.537642,0.989594,12.225433
5,RLO2,oRLO2,11936150.0,16.854724,0.0,,step_detached,0.146234,BH,9.83993,...,7.39892,,6.135466,0.282459,0.0,0.06130808,0.750999,0.248974,0.291823,3.687763
5,RLO2,CC2,12027510.0,20.136671,0.0,-5.077129,step_CO_HMS_RLO,0.08373,BH,9.878767,...,7.592484,0.269422,6.241159,0.095801,0.0,1.606272e-12,0.359439,0.64056,0.514568,0.226465
5,detached,,12027510.0,13.092692,0.469924,,step_SN,0.000465,BH,9.878767,...,,,,,,,,,,0.003096
5,detached,maxtime,13800000000.0,12.925701,0.466729,,step_dco,0.002545,BH,9.878767,...,,,,,,,,,,0.003096
5,detached,END,13800000000.0,12.925701,0.466729,,step_end,1.7e-05,BH,9.878767,...,,,,,,,,,,0.003096


In [10]:
# select binary 0 and 4
pop.history[[0,4]]

binary_index,state,event,time,orbital_period,eccentricity,lg_mtransfer_rate,step_names,step_times,S1_state,S1_mass,...,S2_he_core_mass,S2_he_core_radius,S2_co_core_mass,S2_co_core_radius,S2_center_h1,S2_center_he4,S2_surface_h1,S2_surface_he4,S2_surf_avg_omega_div_omega_crit,S2_spin
0,detached,ZAMS,0.0,1428.226281,0.0,,initial_cond,0.0,H-rich_Core_H_burning,7.475807,...,,,,,0.750999,0.249,,,,
0,RLO1,oCE1,43492320.0,657.384166,0.0,-1.278859,step_HMS_HMS,0.091135,H-rich_Central_He_depleted,7.382683,...,0.0,0.0,0.0,0.0,0.745515,0.254449,0.72498,0.274996,0.9855,4.536438
0,merged,oMerging1,43492320.0,657.384166,0.0,-1.278859,step_CE,5.8e-05,H-rich_Central_He_depleted,7.382683,...,0.0,0.0,0.0,0.0,0.745515,0.254449,0.72498,0.274996,0.9855,4.536438
0,merged,CC1,43543690.0,,,,step_merged,1.971473,H-rich_Central_C_depletion,8.039695,...,,,,,,,,,,
0,merged,,43543690.0,,,,step_SN,0.000468,NS,1.150693,...,,,,,,,,,,
0,merged,END,43543690.0,,,,step_end,1.9e-05,NS,1.150693,...,,,,,,,,,,
4,detached,ZAMS,0.0,16.263438,0.0,,initial_cond,0.0,H-rich_Core_H_burning,11.293998,...,,,,,0.750999,0.249,,,,
4,RLO1,oCE1,22125980.0,5.063589,0.0,-0.990378,step_HMS_HMS,0.008569,H-rich_Central_He_depleted,9.391559,...,0.0,0.0,0.0,0.0,0.694209,0.30579,0.750854,0.249121,0.988594,11.733179
4,merged,oMerging1,22125980.0,5.063589,0.0,-0.990378,step_CE,3e-05,H-rich_Central_He_depleted,9.391559,...,0.0,0.0,0.0,0.0,0.694209,0.30579,0.750854,0.249121,0.988594,11.733179
4,merged,CC1,22142100.0,,,,step_merged,0.137982,H-rich_Central_C_depletion,12.082614,...,,,,,,,,,,


In [11]:
pop.history['time']

binary_index,time
0,0.000000e+00
0,4.349232e+07
0,4.349232e+07
0,4.354369e+07
0,4.354369e+07
...,...
9,5.235732e+07
9,5.235893e+07
9,5.235893e+07
9,5.235893e+07


You can also check what columns are available in the history file:

In [12]:
pop.history.columns

Index(['state', 'event', 'time', 'orbital_period', 'eccentricity',
       'lg_mtransfer_rate', 'step_names', 'step_times', 'S1_state', 'S1_mass',
       'S1_log_R', 'S1_log_L', 'S1_lg_mdot', 'S1_he_core_mass',
       'S1_he_core_radius', 'S1_co_core_mass', 'S1_co_core_radius',
       'S1_center_h1', 'S1_center_he4', 'S1_surface_h1', 'S1_surface_he4',
       'S1_surf_avg_omega_div_omega_crit', 'S1_spin', 'S2_state', 'S2_mass',
       'S2_log_R', 'S2_log_L', 'S2_lg_mdot', 'S2_he_core_mass',
       'S2_he_core_radius', 'S2_co_core_mass', 'S2_co_core_radius',
       'S2_center_h1', 'S2_center_he4', 'S2_surface_h1', 'S2_surface_he4',
       'S2_surf_avg_omega_div_omega_crit', 'S2_spin'],
      dtype='object')

In [13]:
# using the select function: all rows of binary 9
pop.history.select(where='index == 9')
# all rows with state RLO1, keeping only the time and state columns
pop.history.select(where='state == RLO1', columns=['time', 'state'])
# rows 10 up to (but not including) 16; only this last result is displayed
pop.history.select(start=10, stop=16)


binary_index,state,event,time,orbital_period,eccentricity,lg_mtransfer_rate,step_names,step_times,S1_state,S1_mass,...,S2_he_core_mass,S2_he_core_radius,S2_co_core_mass,S2_co_core_radius,S2_center_h1,S2_center_he4,S2_surface_h1,S2_surface_he4,S2_surf_avg_omega_div_omega_crit,S2_spin
1,disrupted,,22870350.0,,,,step_SN,0.000472,NS,1.41161,...,,,,,,,,,,0.0
1,disrupted,END,22870350.0,,,,step_end,2.1e-05,NS,1.41161,...,,,,,,,,,,0.0
2,detached,ZAMS,0.0,1.488813,0.0,,initial_cond,0.0,H-rich_Core_H_burning,15.738981,...,,,,,0.750999,0.249,,,,
2,RLO1,CC1,14192290.0,1.625174,0.0,-4.148457,step_HMS_HMS,0.0087,H-rich_Central_C_depletion,6.560375,...,0.0,0.0,0.0,0.0,0.248156,0.751843,0.302142,0.697856,0.527531,8.071886
2,detached,,14192290.0,1.003052,0.555484,,step_SN,0.000314,NS,1.306695,...,0.0,0.0,0.0,0.0,0.248156,0.751843,0.302142,0.697856,0.527531,8.071886
2,initial_RLOF,,14192290.0,1.003052,0.555484,,step_detached,1.918676,NS,1.306695,...,0.0,0.0,0.0,0.0,0.248156,0.751843,0.302142,0.697856,0.527531,8.071886


You might have noticed while using the above functions that not all binaries have the same number of rows in their history.
You can access these lengths with `pop.history_lengths`. This information is also stored in the population file.

In [14]:
pop.history_lengths

Unnamed: 0_level_0,index
index,Unnamed: 1_level_1
0,6
1,6
2,5
3,6
4,6
5,8
6,6
7,6
8,7
9,8
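These lengths also tell you which binary owns a given row of the history table, which is useful when selecting by row range with `start` and `stop`. The sketch below assumes that the rows are stored contiguously in order of binary index; the helper `binary_for_row` is hypothetical and not part of POSYDON.

```python
from itertools import accumulate

# History lengths copied from the output above, ordered by binary index 0..9.
history_lengths = [6, 6, 5, 6, 6, 8, 6, 6, 7, 8]

# Exclusive end row of each binary, and the matching start rows.
stops = list(accumulate(history_lengths))
starts = [0] + stops[:-1]

def binary_for_row(row):
    """Return the binary index that owns a given row of the history table."""
    for binary_index, (start, stop) in enumerate(zip(starts, stops)):
        if start <= row < stop:
            return binary_index
    raise IndexError(f"row {row} is outside the table")

# Rows 10 to 15, the same range as select(start=10, stop=16).
print([binary_for_row(row) for row in range(10, 16)])
```

Under that assumption, rows 10 to 15 belong to binaries 1 and 2, which matches the binary indices shown in the output of `pop.history.select(start=10, stop=16)`.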


## Population.oneline

`Population.oneline` provides an interface to the oneline DataFrame in the population file, with functionality similar to `Population.history`.

In [15]:
pop.oneline[5]
pop.oneline[[0,4]]
pop.oneline.select(where='index == 9')
pop.oneline.select(where='S1_mass_i > 20', columns=['state_i','S1_state_i', 'S1_mass_i', 'S2_state_i','S2_mass_i'])

binary_index,state_i,S1_state_i,S1_mass_i,S2_state_i,S2_mass_i
5,detached,H-rich_Core_H_burning,23.295575,H-rich_Core_H_burning,17.488602
7,detached,H-rich_Core_H_burning,21.32455,H-rich_Core_H_burning,12.479684
8,detached,H-rich_Core_H_burning,33.782806,H-rich_Core_H_burning,18.035359


In [16]:
pop.oneline.columns

Index(['state_i', 'event_i', 'time_i', 'orbital_period_i', 'eccentricity_i',
       'lg_mtransfer_rate_i', 'step_names_i', 'step_times_i', 'state_f',
       'event_f',
       ...
       'interp_class_CO_HMS_RLO', 'interp_class_CO_HeMS',
       'interp_class_CO_HeMS_RLO', 'mt_history_HMS_HMS',
       'mt_history_CO_HMS_RLO', 'mt_history_CO_HeMS', 'mt_history_CO_HeMS_RLO'],
      dtype='object', length=101)

## Population.formation_channels

You might be interested in the formation pathways or channels a binary has followed through its evolution.

This is not a standard output of the population synthesis, but you can calculate it and add it to the population file.
If you would like more detail on the initial mass transfer, you can set `mt_history=True`.

This will write the formation channels to the population file, where they can be accessed with `Population.formation_channels`.


In [17]:
pop.calculate_formation_channels(mt_history=True)

In [18]:
# the formation channels are loaded in with pop.formation_channels
pop.formation_channels

binary_index,channel_debug,channel
0,ZAMS_oCE1_oMerging1_CC1_END,ZAMS_oCE1_oMerging1_CC1_END
1,ZAMS_oRLO1_CC1_CC2_END,ZAMS_oRLO1_CC1_CC2_END
2,ZAMS_oRLO1_CC1_END,ZAMS_oRLO1_CC1_END
3,ZAMS_oCE1_oMerging1_CC1_END,ZAMS_oCE1_oMerging1_CC1_END
4,ZAMS_oCE1_oMerging1_CC1_END,ZAMS_oCE1_oMerging1_CC1_END
5,ZAMS_oRLO1_CC1_oRLO2_CC2_maxtime_END,ZAMS_oRLO1_CC1_oRLO2_CC2_maxtime_END
6,ZAMS_oCE1_oMerging1_CC1_END,ZAMS_oCE1_oMerging1_CC1_END
7,ZAMS_oCE1_oMerging1_CC1_END,ZAMS_oCE1_oMerging1_CC1_END
8,ZAMS_oRLO1_CC1_oRLO2_CC2_END,ZAMS_oRLO1_CC1_oRLO2_CC2_END
9,ZAMS_oRLO1_CC1_oRLO2_oCE2_oMerging2_END,ZAMS_oRLO1_CC1_oRLO2_oCE2_oMerging2_END
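A common next step is to count how often each channel occurs. The sketch below does this with the standard library, using the channel strings copied from the output above; matching on the substring `oMerging` to find mergers is an illustrative choice, not a POSYDON convention.

```python
from collections import Counter

# Channel strings copied from the output above, ordered by binary index.
channels = [
    "ZAMS_oCE1_oMerging1_CC1_END",
    "ZAMS_oRLO1_CC1_CC2_END",
    "ZAMS_oRLO1_CC1_END",
    "ZAMS_oCE1_oMerging1_CC1_END",
    "ZAMS_oCE1_oMerging1_CC1_END",
    "ZAMS_oRLO1_CC1_oRLO2_CC2_maxtime_END",
    "ZAMS_oCE1_oMerging1_CC1_END",
    "ZAMS_oCE1_oMerging1_CC1_END",
    "ZAMS_oRLO1_CC1_oRLO2_CC2_END",
    "ZAMS_oRLO1_CC1_oRLO2_oCE2_oMerging2_END",
]

# Count occurrences of each channel, most frequent first.
channel_counts = Counter(channels)
for channel, count in channel_counts.most_common():
    print(f"{count:2d}  {channel}")

# Indices of binaries whose channel contains a merger event.
merged = [i for i, channel in enumerate(channels) if "oMerging" in channel]
print("merged binaries:", merged)
```

If `pop.formation_channels` is a pandas DataFrame, as the output suggests, `pop.formation_channels['channel'].value_counts()` should give the same counts directly.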


Next time you open this population file, the `formation_channels` will be available without having to be recalculated.

In [19]:
pop = Population('1e-04_Zsun_population.h5')
pop.formation_channels

binary_index,channel_debug,channel
0,ZAMS_oCE1_oMerging1_CC1_END,ZAMS_oCE1_oMerging1_CC1_END
1,ZAMS_oRLO1_CC1_CC2_END,ZAMS_oRLO1_CC1_CC2_END
2,ZAMS_oRLO1_CC1_END,ZAMS_oRLO1_CC1_END
3,ZAMS_oCE1_oMerging1_CC1_END,ZAMS_oCE1_oMerging1_CC1_END
4,ZAMS_oCE1_oMerging1_CC1_END,ZAMS_oCE1_oMerging1_CC1_END
5,ZAMS_oRLO1_CC1_oRLO2_CC2_maxtime_END,ZAMS_oRLO1_CC1_oRLO2_CC2_maxtime_END
6,ZAMS_oCE1_oMerging1_CC1_END,ZAMS_oCE1_oMerging1_CC1_END
7,ZAMS_oCE1_oMerging1_CC1_END,ZAMS_oCE1_oMerging1_CC1_END
8,ZAMS_oRLO1_CC1_oRLO2_CC2_END,ZAMS_oRLO1_CC1_oRLO2_CC2_END
9,ZAMS_oRLO1_CC1_oRLO2_oCE2_oMerging2_END,ZAMS_oRLO1_CC1_oRLO2_oCE2_oMerging2_END


## Selecting a sub-population

You might just want a small sub-selection of the full population, especially if you're working with large populations and multi-metallicity runs.

The `Population.export_selection()` function will export just the indices of the binaries you're interested in into a new file.
The simulated and underlying mass will remain the same, since they are dependent on the population run.

If we select just 2 binaries and export them, we create a new population containing only the binaries we're interested in.
In the [BBH analysis]() and [GRB analysis]() tutorials, we show how to perform a selection with multiple criteria and metallicities.



In [20]:
indices = [0,9]
pop.export_selection(indices, 'selected.h5')

In [21]:
selected = Population('selected.h5')
selected.mass_per_met

metallicity,simulated_mass,underlying_mass,number_of_systems
0.0001,493.801856,2777.055856,4


<br/><br/>

Feel free to explore the small binary population you've just created!

If you want to learn more about population synthesis and how to build more complex models, we advise continuing with the remaining tutorials and consulting the POSYDON documentation.

### Local MPI runs

To speed up population synthesis runs, you can run on a computing cluster, as described in [HPC Facilities](pop_syn), or you can distribute the population synthesis across multiple cores on your local machine using MPI.

To enable local MPI runs, open `population_params.ini` and change `use_MPI` to `True`.

It's important to note that you cannot have this option enabled for cluster runs!
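If you switch between local MPI runs and cluster runs often, you may prefer to toggle the flag from Python rather than by hand. The sketch below edits ini-style text in place while leaving comments and all other lines untouched. The helper `set_ini_value` is hypothetical, and the sample line layout is an assumption about how `use_MPI` appears in the file.

```python
import re

def set_ini_value(ini_text, key, value):
    """Replace the value of `key` in ini-style text, keeping everything else.

    Only the first line of the form `key = ...` is changed; a trailing
    inline comment on that line is preserved.
    """
    pattern = re.compile(
        rf"^([ \t]*{re.escape(key)}[ \t]*=[ \t]*)([^#\n]*?)([ \t]*(?:#.*)?)$",
        re.MULTILINE,
    )
    new_text, count = pattern.subn(
        lambda match: match.group(1) + value + match.group(3), ini_text, count=1
    )
    if count == 0:
        raise KeyError(f"{key} not found")
    return new_text

# Hypothetical excerpt; the exact layout of the real file may differ.
sample = "use_MPI = False # bool\nnumber_of_binaries = 10\n"
print(set_ini_value(sample, "use_MPI", "True"))
```

To apply it to a file, read the text, pass it through `set_ini_value`, and write the result back.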

We create a binary population simulation script to run the population:

In [None]:
%%writefile script.py
from posydon.popsyn.synthetic_population import PopulationRunner

if __name__ == "__main__":
    synth_pop = PopulationRunner("./population_params.ini")
    synth_pop.evolve()

This script can be launched locally with `mpiexec`, where `NR_processors` is the number of processors you would like to use.

In [None]:
mpiexec -n ${NR_processors} python script.py
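The same command can also be assembled from Python, for example to default the process count to the number of cores on your machine. This is a minimal sketch using only the standard library; the helper `build_mpi_command` is hypothetical, and it assumes `mpiexec` is available on your `PATH`.

```python
import os
import sys

def build_mpi_command(script, n_processes=None):
    """Return the argument list for launching `script` under mpiexec."""
    if n_processes is None:
        # os.cpu_count() can return None; fall back to a single process.
        n_processes = os.cpu_count() or 1
    # sys.executable keeps the run in the same Python environment.
    return ["mpiexec", "-n", str(n_processes), sys.executable, script]

command = build_mpi_command("script.py", n_processes=4)
print(" ".join(command))
# To launch it from Python: subprocess.run(command, check=True)
```

Note that `os.cpu_count()` reports logical cores, so you may want to pass a smaller number explicitly.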

This will create a folder for each metallicity in the population and store output of the parallel runs in it.

You will have to concatenate these runs manually into a single population file per metallicity, which can be achieved using the following code:

In [None]:
from posydon.popsyn.synthetic_population import PopulationRunner

synth_pop = PopulationRunner("./population_params.ini")
for pop in synth_pop.binary_populations:
    synth_pop.merge_parallel_runs(pop)