Skip to content

Commit

Permalink
add tutorial
Browse files Browse the repository at this point in the history
  • Loading branch information
LauraOlivera committed Apr 13, 2023
1 parent 53fa844 commit e1b1208
Showing 1 changed file with 257 additions and 0 deletions.
257 changes: 257 additions & 0 deletions examples/tutorials/data/hawc.py
Original file line number Diff line number Diff line change
@@ -0,0 +1,257 @@
"""
HAWC with Gammapy
=====================
Explore HAWC event lists and IRFs and perform the data reduction steps.
`HAWC <https://www.hawc-observatory.org/>`__ is an array of
water Cherenkov detectors located in Mexico that detects gamma-rays
in the range between hundreds of GeV and hundreds of TeV.
Gammapy recently added support of HAWC high level data analysis,
after export to the current `open data level 3
format <https://gamma-astro-data-formats.readthedocs.io/>`__.
The HAWC data is largely private. However, in 2022, a small
sub-set of HAWC Pass4 event lists from the Crab nebula region
was publicly `released <https://data.hawc-observatory.org/datasets/crab_events_pass4/index.php>`__.
This dataset is 1 GB in size, so only a subset will be used here as an example.
This notebook is a quick introduction to HAWC data analysis with Gammapy.
It briefly describes the HAWC data and how to access it, and then uses a
subset of it to produce a MapDataset, to show how the data reduction is performed.
The HAWC data release contains events where energy is estimated using
two different algorithms, referred to as "NN" and "GP" (see this `paper <https://iopscience.iop.org/article/10.3847/1538-4357/ab2f7d>`__
for a detailed description). These two event classes are not independent, meaning that
events are repeated between the NN and GP datasets. So these data should never
be analyzed jointly, and one of the two estimators needs to be chosen before
proceeding.
Once the data has been reduced to a MapDataset, there are no differences
in the way that HAWC data is handled with respect to data from any other
observatory, such as H.E.S.S. or CTA.
HAWC data access and reduction
------------------------------
This is how to access data and IRFs from the HAWC Crab event data release.
"""

import astropy.units as u
from astropy.coordinates import SkyCoord
import matplotlib.pyplot as plt
import numpy as np
from IPython.display import display
from gammapy.datasets import MapDataset
from gammapy.makers import MapDatasetMaker, SafeMaskMaker
from gammapy.data import DataStore, HDUIndexTable, ObservationTable
from gammapy.maps import Map, MapAxis, WcsGeom
from gammapy.estimators import ExcessMapEstimator

######################################################################
# Check setup
# -----------
from gammapy.utils.check import check_tutorials_setup
from gammapy.visualization import plot_theta_squared_table

check_tutorials_setup()

######################################################################
# Chose which estimator we will use

which = 'NN'


######################################################################
# A useful way to organize the relevant files are the index tables. The
# observation index table contains information on each particular observation,
# such as the run ID. The HDU index table has a row per
# relevant file (i.e., events, effective area, psf…) and contains the path
# to said file.
# The HAWC data is divided into different event types, classified using
# the fraction of the array that was triggered by an event, a quantity
# usually referred to as "fHit". These event types are fully independent,
# meaning that an event will have a unique event type identifier, which
# is usually a number indicating which fHit bin the event corresponds to.
# For this reason, a single HAWC observation has several HDU index tables
# associated to it, one per event type. In each table, there will be
# paths to a distinct event list file and IRFs.
# In the HAWC event data release, all the HDU tables are saved into
# the same FITS file, and can be accesses through the choice of the hdu index.


######################################################################
# Load the tables

data_path = "$GAMMAPY_DATA/hawc/crab_events_pass4/"
hdu_filename = 'hdu-index-table-' + which + "-Crab.fits.gz"
obs_filename = 'obs-index-table-' + which + "-Crab.fits.gz"

# there is only one observation table
obs_table = ObservationTable.read(data_path+obs_filename)

# we read the hdu index table of fHit bin number 6
fHit = 6
hdu_table = HDUIndexTable.read(data_path+hdu_filename, hdu=fHit)


######################################################################
# From the tables, we can create a Datastore

data_store = DataStore( hdu_table=hdu_table, obs_table=obs_table)

data_store.info()


######################################################################
# There is only one observation

obs = data_store.get_observations()[0]

######################################################################
# Select and peek events

obs.events.peek()


######################################################################
# Peek the energy dispersion

obs.edisp.peek()

######################################################################
# Peek the psf
obs.psf.peek()

######################################################################
# Peek the background for one transit
plt.figure()
obs.bkg.reduce_over_axes().plot(add_cbar=True)

######################################################################
# Peek the effective exposure for one transit
plt.figure()
obs.aeff.reduce_over_axes().plot(add_cbar=True)


######################################################################
# Data reduction into a MapDataset
# --------------------------------
#
# We will now produce a MapDataset using the data from one of the
# fHit bins. In order to use all bins, one just needs to repeat this
# process inside of a for loop that modifies the variable fHit.


######################################################################
# First we define the geometry and axes

# create the energy reco axis
energy_axis = MapAxis.from_edges([1.00,1.78,3.16,5.62,10.0,17.8,31.6,56.2,100,177,316] * u.TeV,
name="energy",
interp="log")


# and energy true axis
energy_axis_true = MapAxis.from_energy_bounds(1e-3, 1e4, nbin=140, unit="TeV", name="energy_true")


# create a geometry around the Crab location
geom = WcsGeom.create(skydir=SkyCoord(ra=83.63,dec=22.01, unit='deg', frame="icrs"),
width=6*u.deg,
axes=[energy_axis],
binsz=0.05)


######################################################################
# Define the makers we will use

maker = MapDatasetMaker(selection = ["counts", "background", "exposure", "edisp", "psf"])
safemask_maker = SafeMaskMaker(methods=['aeff-max'], aeff_percent=10)


######################################################################
# Create empty Mapdataset
dataset_empty = MapDataset.create(geom,
energy_axis_true= energy_axis_true,
name ='fHit ' + str(fHit),
reco_psf=True )
# run the map dataset maker
dataset = maker.run(dataset_empty, obs)

# The livetime info is used by the SafeMask maker to retrieve the
# effective area from the exposure. The HAWC effective area is computed
# for one source transit above 45º zenith, which is around 6h
# Note that since the effective area condition used here is relative to
# the maximum, this value does not actually impact the result
dataset.exposure.meta["livetime"] = "6 h"
dataset = safemask_maker.run(dataset)


######################################################################
# Now we have a dataset that has background and exposure quantities for
# one single transit, but our dataset might comprise more. The number
# of transits can be derived using the good time intervals (GTI) stored
# with the event list. For convenience, the HAWC data release already
# included this quantity as a map

transit_map = Map.read(data_path + 'irfs/TransitsMap_Crab.fits.gz')
transit_number = transit_map.get_by_coord(geom.center_skydir)

# Correct the quantities
dataset.background.data*=transit_number
dataset.exposure.data*=transit_number




######################################################################
# Check the dataset we produced
# -----------------------------
#
# We will now check the contents of the dataset


######################################################################
# We can use the .peek() method to quickly get a glimpse of the contents
dataset.peek()



######################################################################
# And make significance maps to check that the Crab is visible

excess_estimator = ExcessMapEstimator(correlation_radius='0.2 deg', selection_optional = [], energy_edges=energy_axis.edges)
excess = excess_estimator.run(dataset)

(dataset.mask*excess['sqrt_ts']).plot_grid(add_cbar=True, cmap='coolwarm', vmin=-5, vmax=5)
plt.show()

######################################################################
# Combining all energies
excess_estimator_integrated = ExcessMapEstimator(correlation_radius='0.2 deg', selection_optional = [])
excess_integrated = excess_estimator_integrated.run(dataset)

excess_integrated['sqrt_ts'].plot(add_cbar=True)
plt.show()


######################################################################
# Exercises
# ---------
#
# - Repeat the process for a different fHit bin
# - Repeat the process for all the fHit bins provided in the data
# release and fit a model to the result.
#


######################################################################
# Next steps
# ----------
#
# Now you know how to access and work with HAWC data. All other
# tutorials and documentation concerning 3D analysis and MapDatasets
# can be used from this step.
#

0 comments on commit e1b1208

Please sign in to comment.