# Report: Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) Internship

Author: **Alessandro Salvatore Tramuto**

References: Hannu Karttunen et al., *Fundamental Astronomy*, sixth edition, 2017.

In this notebook I describe and summarize the topics of my internship and the work done with my tutor **Dr Rosaria Bonito** at **INAF OAPA** (Osservatorio Astronomico di Palermo).
- The first part of the report is the **state of the art**, where I list the theoretical prerequisites that I learned during the first part of the internship.

- A brief description of my experience in the **Rubin LSST collaboration** and in the **Stack Club** meetings follows.

- I then explain the basic use of the **Metric Analysis Framework** (MAF), which I learned and used in the Astro Data Lab Jupyter notebook portal (https://datalab.noirlab.edu/).

- I also describe the aspects of the **Rubin Science Platform** (RSP) that I used: the Portal and the Notebook aspects.

- Finally, I use this notebook to create multiple graphs, presented together and analysed simultaneously with **Bokeh**, which lets the reader interact with the graphs (e.g., to select data with a lasso tool, or to identify the same point in different graphs).

# State of the Art
Here I summarize the theoretical astronomy topics I needed to know to get started with the internship:

### 1. Stellar Spectra, Light Curves, Color-Color and Color-Magnitude Diagrams and the Hertzsprung-Russell Diagram:
A **stellar spectrum** reveals many physical properties of a star; it is the energy flux density as a function of frequency (or wavelength). Stellar spectra mostly show dark absorption lines and some bright emission lines, which depend on the star's chemical composition and other variables.
Before discussing the H-R diagram, we need to define some basic concepts. **Filters** are used to block most of the spectrum outside a well-defined passband, within which a very high signal-to-noise ratio can be obtained. Filters are used in every branch of astronomy, but they are most important for photometric studies. The figure below shows the Rubin LSST ideal filter passbands (https://community.lsst.org/t/lsst-filter-profiles/1463).

![filters_for_report.png](attachment:d4cd7abe-d6e9-4d02-ace0-599485f54c54.png)

The **Magnitude** is a unitless measure of the brightness of an object in a defined passband, expressed on a logarithmic scale. As the brightness increases, the magnitude decreases, reaching negative values for the brightest objects. There are two kinds of magnitude: the **Apparent Magnitude** measures the brightness of the object as seen from the Earth, whereas the **Absolute Magnitude** is the magnitude the object would have if it were placed at a standard distance (10 pc) from the observer.

The **Color Index** is a number that indicates the color of an object; for stars in particular, it gives information about the temperature. Low values correspond to blue (hot) stars and high values to red (cool) stars. To obtain the color index, it is necessary to measure the magnitude in two different filters. The **observed color index** is the difference between the apparent magnitudes in the two bands; the intrinsic (normal) color index is the same difference computed with the absolute magnitudes.
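
As a minimal sketch of the definitions above (not code from the internship), the snippet below computes an absolute magnitude from an apparent magnitude and a distance, and a color index as the difference of two magnitudes. It assumes the conventional 10 pc reference distance and ignores interstellar extinction; the function names and the sample values are hypothetical.

```python
import math


def absolute_magnitude(apparent_mag, distance_pc):
    """Absolute magnitude from the distance modulus, ignoring extinction.

    Assumes the conventional reference distance of 10 pc.
    """
    return apparent_mag - 5.0 * math.log10(distance_pc / 10.0)


def flux_ratio(mag_a, mag_b):
    """Flux of object A divided by flux of object B in the same passband."""
    return 10.0 ** (-0.4 * (mag_a - mag_b))


def color_index(mag_bluer_band, mag_redder_band):
    """Color index, e.g. g - r: smaller values mean a bluer (hotter) star."""
    return mag_bluer_band - mag_redder_band


# Hypothetical star: apparent magnitude 10 in some band, at 100 pc.
M = absolute_magnitude(10.0, 100.0)
# An object 5 magnitudes brighter (smaller magnitude) has 100 times the flux.
ratio = flux_ratio(5.0, 10.0)
# Hypothetical g and r magnitudes of the same star.
g_minus_r = color_index(15.2, 14.7)
print(M, ratio, g_minus_r)
```

The logarithmic scale is why a difference of 5 magnitudes corresponds to a factor of 100 in flux, and why the color index, being a difference of magnitudes, measures a flux ratio between two bands.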

The **Hertzsprung–Russell diagram** (HRD) is a scatter plot of stars showing the relationship between their absolute magnitudes or luminosities and their spectral classes or effective temperatures.

![Hertzsprung-Russel_for_report.png](attachment:20cb516d-dba3-4c2e-9946-dc4d9e2f4c16.png)

Stars of greater luminosity are toward the top of the diagram, and stars with higher surface temperature are toward the left side of the diagram.

The **Light Curve** is the plot of light intensity as a function of time, often measured in a specific band. Light curves can be periodic, for example when they record the transits of an extrasolar planet, or aperiodic, as in observations of eruptive variables.

The **Color-Color Diagram** is the plot of the color index from one pair of bands versus the color index from another pair (usually one band is involved in both axes). It is used to compare the apparent magnitudes of stars at different wavelengths.



### 2. Evolutionary time scales (nuclear, thermal and dynamical), the contraction of stars towards the Main Sequence and the Young Stellar Objects (class 0, 1, 2, 3) 

There are three basic **time scales**:
1. The nuclear time scale: the time a star takes to radiate away all the energy that nuclear reactions can release (the longest).
2. The thermal time scale: the time a star would take to radiate away all its thermal energy if nuclear energy production suddenly stopped.
3. The dynamical time scale: the time a star would take to collapse if the pressure supporting it against gravity were removed (the shortest).
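
The ordering of the three time scales can be checked with order-of-magnitude estimates for the Sun. This is a rough sketch, not a stellar model. It assumes the usual textbook approximations: about 10% of the mass is available for hydrogen burning and 0.7% of that mass is converted into energy; the thermal energy is of the order of half the gravitational energy $GM^2/R$; the collapse time is of the order of $\sqrt{R^3/(GM)}$. The constants are rounded and the function names are hypothetical.

```python
import math

# Rounded physical constants (SI units).
G = 6.674e-11      # gravitational constant
C = 2.998e8        # speed of light, m/s
M_SUN = 1.989e30   # solar mass, kg
R_SUN = 6.957e8    # solar radius, m
L_SUN = 3.828e26   # solar luminosity, W
YEAR = 3.156e7     # seconds in one year


def nuclear_time_scale(mass, luminosity):
    """Time to radiate the energy released by burning 10% of the mass."""
    return 0.007 * 0.1 * mass * C**2 / luminosity


def thermal_time_scale(mass, radius, luminosity):
    """Time to radiate away roughly half the gravitational energy."""
    return 0.5 * G * mass**2 / (radius * luminosity)


def dynamical_time_scale(mass, radius):
    """Approximate free-fall time if the pressure support vanished."""
    return math.sqrt(radius**3 / (G * mass))


t_nuc = nuclear_time_scale(M_SUN, L_SUN)
t_th = thermal_time_scale(M_SUN, R_SUN, L_SUN)
t_dyn = dynamical_time_scale(M_SUN, R_SUN)

print(f"nuclear:   {t_nuc / YEAR:.1e} yr")
print(f"thermal:   {t_th / YEAR:.1e} yr")
print(f"dynamical: {t_dyn / 60:.0f} min")
```

With these assumptions the solar values come out at roughly ten billion years, some tens of millions of years, and about half an hour respectively, which reproduces the ordering nuclear > thermal > dynamical.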

The process of **contraction of a protostar**:

When a cloud contracts, gravitational potential energy is converted into thermal energy and radiation. At first most of the radiation escapes, since the density and opacity of the cloud are very low, so the temperature does not increase. As the density and pressure rise near the centre of the cloud, so does the opacity. The cloud then absorbs more of the radiation and its temperature increases, which slows down the contraction in the centre. A sudden change occurs at $T = 1800\ \mathrm{K}$, because at this temperature the hydrogen molecules dissociate into atoms, consuming energy. This slows the growth of the temperature, which re-accelerates the contraction.
The same process is repeated at $T = 10^4\ \mathrm{K}$ and $10^5\ \mathrm{K}$, as first hydrogen and then helium is ionised. The formation of the protostar ends when most of the gas is ionised, in the form of plasma, and further evolution takes place on the thermal time scale.

**Young Stellar Objects** (YSO):

A YSO is a star in its early stage of evolution. This class consists of two groups of objects: protostars and pre-main-sequence stars.
As the surrounding cloud disperses and the accretion process ends, the protostar enters the pre-main-sequence (PMS) phase, in which its energy source is still gravitational potential energy and not hydrogen fusion as on the Main Sequence. There are no PMS stars with a mass greater than 8 $M_\odot$, since at such masses the conditions for nuclear reactions would already be reached.

To classify a YSO it is necessary to use the **Spectral Energy Distribution** (SED), namely the variation of the radiation intensity as a function of wavelength. This leads to the following classification:

![ygo_for_report.jpg](attachment:2fd63d9f-1ef4-4ea2-804d-b2eea7af62ac.jpg)

- Class 0 (core, protostar): the protostar is completely surrounded and obscured by gas and dust; the black-body contribution is almost zero.
- Class 1 (developed protostar): the cloud no longer obscures the protostar over a large solid angle, letting the black-body radiation escape from the cloud.
- Class 2: the disk radiation and the black-body radiation are almost equal, showing that the disk is heated by the PMS star. The disk temperature decreases with increasing distance from the star.
- Class 3: the black-body radiation prevails over the disk radiation, showing that the energy comes mostly from the star. At this phase the disk is very faint, and it has probably formed some planets too.

### 3. Associations, Open and Globular Star Clusters

**Associations** are groups of very young stars. Studying their internal motions, we can see that they are rapidly dispersing, because they contain so few stars that gravity cannot hold them together. There are three generations of associations: the oldest ones are the most extended, whereas the youngest ones are the densest.

A **Star Cluster** is a very dense group of stars. All the stars of a cluster are born in the same nebula, so they have the same chemical composition; as time passes, clusters tend to break apart.
There are two main cluster types:
- Globular Clusters: very dense star groups, each formed by about $10^5$ old stars. They typically have spherical symmetry and a central density about 10 times larger than in open clusters.
- Open Clusters: they contain up to a few thousand young stars. They are so called because most of the time they have no well-defined shape.

### 4. Eruptive variables, flare stars and nebular variables (T Tauri stars)

Eruptive variables show no regular pulsations; instead, sudden outbursts take place during which material is ejected. The brightness changes are due to sudden eruptions in the chromosphere or in the corona. The **flare stars** and some kinds of **nebular variables** are eruptive variables.

- **Flare stars**: they are young stars found in young star clusters or associations, characterized by sudden and irregular flare outbursts on their surface. The flares are caused by disturbances in the surface magnetic fields. They brighten in a few seconds and then fade away in minutes.

- **Nebular variables**: these are variable stars found in connection with interstellar clouds. The **T Tauri stars** are the most interesting of them; they are newly formed stars or stars still contracting towards the main sequence, and most of them have masses and temperatures very similar to the Sun's, even though they are sometimes bigger and brighter than the Sun. T Tauri stars are characterized by irregular brightness variations, and their spectra show that matter streams out from them. They are really hard to see, since they are usually inside dense gas clouds. About half of T Tauri stars have a protoplanetary disk, which might be a remnant of the nebula they formed from and which may lead to the formation of planets.

Other kinds of prerequisites:
1. Basics of Python programming language,
2. Use of GitHub to get all the Stack Club's tutorials (https://github.com/LSSTScienceCollaborations/StackClubCourse),
3. Use of the Portal aspect (tutorials on: https://www.youtube.com/watch?v=nKAqkOyKvAE&list=PLPINAcUH0dXY-xfGQ5JZQh_kyWDj8xS0E&index=2),
4. Basic use of Matplotlib (https://matplotlib.org/), Bokeh (https://docs.bokeh.org/en/latest/), HoloViews (http://holoviews.org/),
5. How to search for scientific articles using the Astrophysics Data System (https://ui.adsabs.harvard.edu/),
6. Use of Slack (https://slack.com/intl/it-it/) as a communication platform,
7. Use of Gather Town (https://www.gather.town/) for online meetings.

# LSST and Stack Club
### Legacy Survey of Space and Time (LSST)
As the title suggests, the internship is closely related to the Rubin Science Platform, which is used for the **Rubin LSST project**.
The LSST Science Collaborations (SC) comprise: AGNSC, DESC (Dark Energy SC), Galaxies, ISSC (Informatics and Statistics SC), SLSC (Strong Lensing SC), SMWLV (Stars, Milky Way and Local Volume), SSSC (Solar System SC), TVS (Transients and Variable Stars). These SC collaborate to make the best use of the **Vera C. Rubin Observatory** (https://www.lsst.org/scientists/science-collaborations), whose construction will finish in the next months.

For this purpose, the SC are working on the definition of a baseline and of the deep-drilling fields (http://astro-lsst-01.astro.washington.edu:8080/allMetricResults?runId=1) to guide the use of the telescope for 10 years, trying to reach a compromise between the requests of all the SC, which concern different astronomical topics.
In particular, I joined LSST for a limited period of time through the TVS SC, paying particular attention to non-degenerate eruptive variables, following the line of work of Dr Sara Bonito.

### Stack Club
The Stack Club is formed by Rubin LSST scientists. It consists of bi-weekly Zoom sessions used to work together and to help each other resolve any kind of issue. There is also a Slack channel dedicated to it, where people can ask questions outside the meetings.

At the start of every session, everybody describes what they are going to do and, if needed, asks the host for a breakout room. Each person then chooses whether to remain in the general room or to enter a breakout room, depending on what they want to work on, so that people working on the same topic can help each other. At the end of the meeting, everyone reports what they accomplished during the session.

# Metric Analysis Framework (MAF) 

MAF was developed to evaluate the simulated surveys generated by the Rubin LSST survey scheduler simulator **OpSim** (i.e. Operations Simulator). Its main purpose is to transform the OpSim output, which consists of tables with many parameters (columns) for each visit, into plots and statistics that help to evaluate the survey.
### What MAF does:

1. It connects the user with the OpSim survey simulation database.
2. It allows the user to query the database, applying constraints in order to select only the data wanted.
3. It slices the queried data, grouping them on the basis of the observing night or the position on the sky (so that we obtain the sliced data).
4. At this point, MAF can compute metrics and perform statistical analysis on the sliced data.

There are three principal slicers in MAF:
- **UniSlicer**: it gathers all the data in one group; it can be used, for example, to count the total number of visits;
- **OneDSlicer**: it groups visits in subsets depending on the value of a single parameter from the OpSim data;
- **HealpixSlicer**: it groups visits according to the healpixel (a well-defined region of the sky) in which they fall.
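
The grouping performed by the three slicers can be illustrated without MAF itself. The sketch below is a toy model in plain Python, not the MAF API: the visit records, the `night` and `ra` fields and the three helper functions are hypothetical, and the sky is split into simple right-ascension bands instead of real HEALPix pixels.

```python
from collections import defaultdict

# Hypothetical visits: observing night and right ascension in degrees.
visits = [
    {"night": 1, "ra": 10.0},
    {"night": 1, "ra": 200.0},
    {"night": 2, "ra": 15.0},
    {"night": 3, "ra": 205.0},
    {"night": 3, "ra": 12.0},
]


def uni_slice(data):
    """Put every visit in a single group."""
    return {"all": list(data)}


def one_d_slice(data, key):
    """Group visits by the value of one parameter (e.g. the night)."""
    groups = defaultdict(list)
    for visit in data:
        groups[visit[key]].append(visit)
    return dict(groups)


def sky_slice(data, band_width_deg=90.0):
    """Group visits by sky region; RA bands stand in for healpixels here."""
    groups = defaultdict(list)
    for visit in data:
        groups[int(visit["ra"] // band_width_deg)].append(visit)
    return dict(groups)


# A simple "count" metric applied to each slice.
count_all = {k: len(v) for k, v in uni_slice(visits).items()}
count_per_night = {k: len(v) for k, v in one_d_slice(visits, "night").items()}
count_per_region = {k: len(v) for k, v in sky_slice(visits).items()}
print(count_all, count_per_night, count_per_region)
```

The same counting metric gives a single number, one number per night, or one number per sky region, depending only on which slicer is used.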

### A typical MAF workflow
The most important features in the workflow for creating metrics are the Metric Bundles and the Metric Bundle Groups.
The **Metric Bundle** is the crucial operative function of MAF; it comes in the form:

`MetricBundle(metric1, slicer1, sqlconstraint)`

where `sqlconstraint` is a **constraint**, used to limit the visits queried from the database; `slicer1` is a **slicer**, as defined above; finally, `metric1` is the **metric**, the object that specifies the computation to execute on each slice.
Therefore, the Metric Bundle queries the data using the constraint, slices them as requested and computes the metric on each slice. The results can be plotted in graphs of various kinds.

By creating a dictionary of Metric Bundles, it is then possible to build a **Metric Bundle Group**, which works through all the metric bundles in the dictionary.
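
To make the roles of the metric, the slicer and the constraint concrete, here is a toy re-implementation of the idea in plain Python. It is not the MAF API: `ToyBundle`, `run_bundle_group`, the visit fields and the use of a Python predicate in place of an SQL constraint are all hypothetical stand-ins chosen for illustration.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical visits table: one dict per visit.
visits = [
    {"night": 1, "filter": "r", "airmass": 1.1},
    {"night": 1, "filter": "g", "airmass": 1.3},
    {"night": 2, "filter": "r", "airmass": 1.2},
    {"night": 2, "filter": "r", "airmass": 1.4},
]


class ToyBundle:
    """Ties together a metric, a slicer and a constraint (toy stand-in)."""

    def __init__(self, metric, slicer, constraint):
        self.metric = metric          # function: list of visits -> number
        self.slicer = slicer          # function: visit -> slice key
        self.constraint = constraint  # function: visit -> bool (plays the SQL role)

    def run(self, data):
        # 1. Query: keep only the visits that satisfy the constraint.
        selected = [v for v in data if self.constraint(v)]
        # 2. Slice: group the selected visits by slice key.
        slices = defaultdict(list)
        for v in selected:
            slices[self.slicer(v)].append(v)
        # 3. Compute: evaluate the metric on each slice.
        return {key: self.metric(group) for key, group in slices.items()}


def run_bundle_group(bundles, data):
    """Run every bundle in a dictionary and collect the results by name."""
    return {name: bundle.run(data) for name, bundle in bundles.items()}


bundles = {
    "r_visits_total": ToyBundle(
        len, lambda v: "all", lambda v: v["filter"] == "r"
    ),
    "r_mean_airmass_per_night": ToyBundle(
        lambda group: mean(v["airmass"] for v in group),
        lambda v: v["night"],
        lambda v: v["filter"] == "r",
    ),
}
results = run_bundle_group(bundles, visits)
print(results)
```

Keeping the three ingredients separate means that the same metric can be reused with a different slicer or constraint simply by adding another entry to the dictionary.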