# A Short Tour of an Astronomical Inference

Goals:

* Understand what is meant by "data", "noise", and "models"
* Gain some appreciation for what astronomical data is like, and what astronomers are typically trying to do

### References

* Ivezic Chapter 1, sections 1.1, 1.2, 1.4, 1.5, and Chapter 3, section 3.2.

## Data analysis

* What is data?

* How should we think about data in science?

You probably already have an unarticulated but likely strong sense of what data is: like many other obvious things, _you know it when you see it._

One of our tasks will be to define what we mean by data mathematically. We'll start with a short tour through a simple example of an astronomical data analysis, which will briefly introduce many key concepts.

## Data analysis in astronomy

* Propose observations
* Observe sky, collect and "reduce" data
* **Explore and summarize the data**
* **Hypothesize**, and **test**
* **Interpret,** conclude, speculate
* Report

> This course primarily concerns the parts of the investigation listed in **bold**.

## Learning from data

* Data analysis is _central_ to the scientific process: statistical inference is the mathematical formalization of _learning_.

* The formalism is important: hypothesizing, testing, and interpreting are all potentially _very messy._

## An example image dataset

* In optical, X-ray and gamma-ray astronomy, the most basic datasets are *images*

* Images can be 2D, from cameras, or 1D, from spectrographs, or 3D, from IFUs (integral field units). 

* Image data come packaged as an *array* of numbers, which we can visualize, and do calculations with.
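To make the "array of numbers" idea concrete, here is a minimal sketch in Python using NumPy. The pixel values are made up for illustration (they are not taken from any real observation), and the variable names are hypothetical.

```python
import numpy as np

# A tiny made-up 2D "image": each entry is the number of photons
# counted in one pixel (illustrative values only, not real data).
image = np.array([[0, 1, 0, 2],
                  [1, 5, 7, 1],
                  [0, 6, 9, 2],
                  [1, 0, 2, 0]])

print(image.shape)   # (rows, columns) of the pixel grid
print(image.sum())   # total number of counts in the image
print(image.max())   # value of the brightest pixel

# (row, column) location of the brightest pixel
brightest = tuple(int(i) for i in np.unravel_index(np.argmax(image), image.shape))
print(brightest)
```

Once the image is an array, visualizing it and doing calculations with it are both operations on that array.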

Let's look at some X-ray image data from the XMM satellite, for the galaxy cluster A1835.

<img src="../examples/XrayImage/figures/cluster_image.png" width=70%>

## What is "data"?

* Data are *constants* (usually numbers) 

* That we are *handed* (typically in a data file) 
       
* That *we hope to learn something from.*

## Uncertainty 

Here's a zoomed-in view of the central part of the A1835 XMM image:

<img src="../examples/XrayImage/figures/cluster_image_zoom.png" width=50%>

#### 5 minutes, in Pairs:
Why does this image look the way it does? Identify some features of it, and be prepared to explain what you don't know about it.

## Sources of uncertainty

* Noise: "statistical" uncertainty, random error

* Astrophysical sources: "signal", from the target and otherwise

* Instrumental effects: variable sensitivity, point spread function blurring, vignetting, artifacts, etc.

* Calibration: units of pixel values

> See [examples/XrayImage/FirstLook.ipynb](../examples/XrayImage/FirstLook.ipynb) for more details

## Coping with uncertainty

* Fundamentally, learning from data ("statistical inference") is about being able to make a mock dataset that matches, or "fits", the observed one, _to within the statistical uncertainties_

* Coping with statistical uncertainty means acknowledging that things could have been different: if we took the observation again we would get different pixel values.

* This thought leads us to the notion of a _probability distribution for the data_

## Noise

* The photons arriving in our pixels seem to have been emitted, and to have arrived, "at random", giving rise to a "noisy" image. (You may have seen such images on TV.)

* Ultimately, the source of this randomness is quantum mechanics: atoms do not emit photons at regular intervals

* We expect the total number of photons arriving in each pixel during the exposure time to be well described as being drawn from a probability distribution, whose functional form we can hope to first guess and then refine.
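As a sketch of what "drawn from a probability distribution" could look like in practice, the snippet below assumes that the counts in each pixel follow a Poisson distribution. That choice is a natural first guess for counting photons, not something established above, and the expected count of 3 photons per pixel is made up.

```python
import numpy as np

rng = np.random.default_rng(42)

# Assumed (made-up) expected number of photons per pixel during the exposure
expected_counts = np.full((5, 5), 3.0)

# Two "repeat observations" of the same sky: the expectation is identical,
# but the noise realization differs, so the pixel values differ.
observation_1 = rng.poisson(expected_counts)
observation_2 = rng.poisson(expected_counts)

print(observation_1)
print(observation_2)
print(observation_1.mean(), observation_2.mean())
```

Each call to `rng.poisson` plays the role of taking the observation again: the underlying expectation is unchanged, but the recorded integers are not.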

## Systematics

* Through the noise, we can see a variety of features in the image

* Understanding these features means being able to "predict" them: that is, to _generate_ mock images that have the same types of features.

* The feature we care about most is the cluster of galaxies in the center of the field

* Failure to account for the other features will introduce _systematic errors_ in our inferences

## Modeling data

* In order to generate mock data for comparison with our observations we need a _mathematical model_ 

* In practice, this model needs to be implemented in computer code.

* Writing this model involves making assumptions - about both the noise and the various signals in the data

* These assumptions are unavoidable
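A minimal sketch of such a model in code might look like the following. Every ingredient here is an assumption made for illustration: a circularly symmetric "beta model" for the cluster's surface brightness, a flat background, Poisson noise, and made-up parameter values. The function name `mock_image` is hypothetical, and this is not necessarily the model used elsewhere in the course.

```python
import numpy as np

def mock_image(shape, x0, y0, S0, rc, beta, background, rng):
    """Generate a mock counts image: cluster signal + flat background + Poisson noise."""
    y, x = np.indices(shape)
    r2 = (x - x0) ** 2 + (y - y0) ** 2
    # Assumed signal: circular beta-model surface brightness profile
    cluster = S0 * (1.0 + r2 / rc**2) ** (-3.0 * beta + 0.5)
    # Assumed background: the same expected counts in every pixel
    expected = cluster + background
    # Assumed noise: independent Poisson draws in each pixel
    return rng.poisson(expected), expected

rng = np.random.default_rng(1)
counts, expected = mock_image((64, 64), x0=32, y0=32, S0=50.0, rc=4.0,
                              beta=0.67, background=0.5, rng=rng)

print(counts.shape, counts.sum())
print(expected[32, 32], expected[0, 0])
```

Note how many choices had to be made just to write this down: the shape of the signal, the form of the background, and the distribution of the noise are all assumptions, each of which could be tested against the data.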

## You cannot do inference without making assumptions

## An example analysis

Let's take a quick tour through the [following paper](https://arxiv.org/abs/1509.01322), from 2016:

<img src="../examples/MantzEtal2016/title.png" width=80%>


<img src="../examples/MantzEtal2016/abstract.png" width=80%>

## Program

* **Observe** 40 clusters, producing X-ray images and spectra

* **Model** the variation in gas density and temperature with radius in each cluster, assuming spherical symmetry

* **Check** how well these simple models fit the image data

* **Summarize** each cluster with "measurements" of gas mass, overall temperature, total luminosity, total mass (assuming hydrostatic equilibrium)

* **Model** the population of clusters, using simple "scaling relations" between their total masses and gas masses, temperatures and luminosities, using these measurements

* **Check** how well this simple model fits the measurements
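To give a feel for the population-modeling step, here is a heavily simplified sketch. It assumes a power-law scaling relation between total mass and gas mass (a straight line in log-log space) with Gaussian scatter, generates mock "measurements" for 40 clusters, and fits the line by ordinary least squares. All numbers are made up, measurement uncertainties are ignored, and this is not the method used in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

n_clusters = 40
# Made-up log10 total masses (in solar masses)
log_mass = rng.uniform(14.0, 15.0, n_clusters)

# Assumed scaling relation: log10(gas mass) is linear in log10(total mass),
# with Gaussian intrinsic scatter (all values here are illustrative).
true_slope, true_intercept, scatter = 1.0, -0.9, 0.05
log_gas_mass = (true_intercept + true_slope * log_mass
                + rng.normal(0.0, scatter, n_clusters))

# Fit a straight line to the mock "measurements" by ordinary least squares
slope, intercept = np.polyfit(log_mass, log_gas_mass, 1)

# A simple check: the residuals should scatter around zero
residuals = log_gas_mass - (intercept + slope * log_mass)
print(slope, intercept, residuals.std())
```

Here the summaries of individual clusters play the role of the data for a second, population-level inference.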

## Modeling the cluster gas

## Checking the cluster models

## Summarizing/measuring each cluster

## Modeling the population

## Checking the population model

## Take-home messages

* Learning from data means making generative models 

* Matching models to observations allows us to cope with uncertainty 

* You cannot do inference without making assumptions

* Assumptions can, and should, be tested, with the data

* The result of one inference can be (summarized and) used as the data for a subsequent one