Skip to content

jwuphysics/xSAGA

Repository files navigation

xSAGA: Extending the SAGA Survey

arXiv ADS

Identifying low-redshift galaxies with CNNs

We train a convolutional neural network (CNN) to distinguish low-redshift (z < 0.03) galaxies from the more numerous high-redshift objects. Each galaxy is given a CNN prediction (pCNN) betwen 0 and 1 that encodes how likely its redshift is z < 0.03.

Examples image cutouts of SAGA and xSAGA galaxies. Figure 1 from the xSAGA I paper.

Results

More massive host galaxies have more satellites.

We see that host galaxies with higher stellar masses also tend to have more satellites (see left panel of figure below). This is in line with expectations, but it is exciting to see such a strong signal!

Satellite radial profiles (un-normalized on the left, and normalized on the right) in bins of host galaxy stellar mass. Figure 5 from the xSAGA I paper.

The radial distributions of satellites do not vary with host properties.

In the right panel above, we see that the normalized radial distributions of CNN-selected satellites are surprisingly insensitive to their host galaxies' stellar masses. The shapes of the radial profiles also do not appear to vary with the host's morphology or the magnitude gap between the brightest satellite and host galaxy.

Satellites are most abundant around massive ellipticals.

In addition to the stellar mass dependence, we also find a modest increase in satellite richness around elliptical hosts (hatched shading in figure below) compared to around disky hosts (solid shading). The impact of morphology is stronger for more massive host galaxies.

Satellite radial profiles as a function of host stellar mass, separated into disky and elliptical morphologies. Figure 7 from the xSAGA I paper.

The satellite abundance correlates strongly with the magnitude gap between a host and its brightest satellite.

In the figure below, we now compute the total number of satellites within the projected virial radius, and plot it against the magnitude gap and host stellar mass.

The satellite abudance within the virial radius as a function of magnitude gap and host stellar mass. Figure 12 from the xSAGA I paper.

xSAGA confirms and extends key SAGA Survey results.

We compare the radial profiles of satellites around isolated host galaxies in the same stellar mass range as SAGA (1010–1011 M; see figure below). Our results are in excellent agreement.

 Part of Figure 6 from the xSAGA I paper.

Data

We feed image cutouts DESI Legacy Imaging Surveys as inputs to our CNN. Using the Legacy Survey RESTful API, we download grz (three-channel), 224×224 pixel images in JPG format. Example download scripts can be found in the src/ directory.

Redshifts from the Satellites Around Galactic Analogs (SAGA) Survey comprise the training set labels. While the SAGA Stage II redshifts are publicly available online, we use a larger redshift catalog that will be released in a future SAGA paper.

After we have trained the CNN, we can make predictions for a data set of 4.4 million photometrically selected candidates in SDSS/Legacy Survey (see paper for details). The CNN identifies over 100,000 objects as probable low-redshift galaxies, which vastly exceeds the number of spectroscopically confirmed low-redshift galaxies in existing surveys (e.g., SDSS, GAMA, or SAGA).

We define CNN-selected galaxies as satellite galaxies if they are within 300 projected kpc from a spectroscopically confirmed z < 0.03 host galaxy (from NASA-Sloan Atlas version 1_0_1). We compute the number of satellites around each host galaxy as a function of projected radius, while correcting for incompleteness, false positives, and non-satellite galaxy contamination (each of which is independently determined via cross-validation).

Figures and Code

All figures presented in our paper can be reproduced by running the cells in the paper-figures.ipynb Jupyter notebook. The notebook can be accessed through the nbviewer website. Note that the requirements in requirements_at_publication.txt must be used; otherwise there is a chance that the dependencies won't match.

The code used for this analysis can be found in xsaga/. Most of the relevant methods are located in satellites.py and radial_profiles.py.

All steps for training the CNN (using Pytorch and Fastai version 2) are in xsaga/cnn_train.py. Note that this relies on the deconvolution module and the hybrid deconvolution code, which are copied into the src/model/ directory.

Additional helper scripts are located in src/.

Citation

The paper is titled: "Extending the SAGA Survey (xSAGA) I: Satellite Radial Profiles as a Function of Host Galaxy Properties."

The authors are: John F. Wu (STScI), J. E. G. Peek (STScI/JHU), Erik J. Tollerud (STScI), Yao-Yuan Mao (Rutgers), Ethan O. Nadler (Carnegie/USC), Marla Geha (Yale), Risa H. Wechsler (Stanford/KIPAC/SLAC), Nitya Kallivayalil (U. Virginia), Benjamin J. Weiner (Steward/Arizona)

@ARTICLE{2022ApJ...927..121W,
       author = {{Wu}, John F. and {Peek}, J.~E.~G. and {Tollerud}, Erik J. and {Mao}, Yao-Yuan and {Nadler}, Ethan O. and {Geha}, Marla and {Wechsler}, Risa H. and {Kallivayalil}, Nitya and {Weiner}, Benjamin J.},
        title = "{Extending the SAGA Survey (xSAGA). I. Satellite Radial Profiles as a Function of Host-galaxy Properties}",
      journal = {\apj},
     keywords = {Dwarf galaxies, Galaxy evolution, Galaxy formation, Galaxy groups, Convolutional neural networks, 416, 594, 595, 597, 1938, Astrophysics - Astrophysics of Galaxies, Astrophysics - Instrumentation and Methods for Astrophysics},
         year = 2022,
        month = mar,
       volume = {927},
       number = {1},
          eid = {121},
        pages = {121},
          doi = {10.3847/1538-4357/ac4eea},
archivePrefix = {arXiv},
       eprint = {2112.01542},
 primaryClass = {astro-ph.GA},
       adsurl = {https://ui.adsabs.harvard.edu/abs/2022ApJ...927..121W},
      adsnote = {Provided by the SAO/NASA Astrophysics Data System}
}