-
Notifications
You must be signed in to change notification settings - Fork 0
/
chapter3.tex
1540 lines (1373 loc) · 97.6 KB
/
chapter3.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
\cleartoevenpage
\myepigraph
{Nature and Nature's laws lay hid in night:\\God said, Let Newton be! and all was light.}
{Alexander Pope}
\chapter{The Radio Sky at Meter Wavelengths: $m$-mode Analysis Imaging with the OVRO-LWA}
\label{chapter3}
\mypublishedas{thesis}{1538-3881-156-1-32}
\begin{bibunit}
\section*{Abstract}
A host of new low-frequency radio telescopes seek to measure the 21~cm transition of neutral
hydrogen from the early universe. These telescopes have the potential to directly probe star and
galaxy formation at redshifts $20 \gtrsim z \gtrsim 7$, but are limited by the dynamic range they
can achieve against foreground sources of low-frequency radio emission. Consequently, there is a
growing demand for modern, high-fidelity maps of the sky at frequencies below 200 MHz for use in
foreground modeling and removal. We describe a new wide-field imaging technique for drift-scanning
interferometers: Tikhonov-regularized $m$-mode analysis imaging. This technique constructs images
of the entire sky in a single synthesis imaging step with exact treatment of wide-field effects. We
describe how the CLEAN algorithm can be adapted to deconvolve maps generated by $m$-mode analysis
imaging. We demonstrate Tikhonov-regularized $m$-mode analysis imaging using the Owens Valley Radio
Observatory Long Wavelength Array (OVRO-LWA) by generating eight new maps of the sky north of
$\delta=-30^\circ$ with 15\arcmin angular resolution at frequencies evenly spaced between 36.528 and
73.152~MHz, and $\sim$800 mJy/beam thermal noise. These maps are a 10-fold improvement in angular
resolution over existing full-sky maps at comparable frequencies, which have angular resolutions
$\ge 2^\circ$. Each map is constructed exclusively from interferometric observations and does not
represent the globally averaged sky brightness. Future improvements will incorporate total power
radiometry, improved thermal noise, and improved angular resolution due to the planned expansion of
the OVRO-LWA to 2.6~km baselines. These maps serve as a first step on the path to the use of more
sophisticated foreground filters in 21~cm cosmology incorporating the measured angular and frequency
structure of all foreground contaminants.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\section{Introduction}
At redshifts $20 \gtrsim z \gtrsim 7$, the 21~cm hyperfine structure line of neutral hydrogen is
expected to produce a 10 to 100 mK perturbation in the cosmic microwave background (CMB) spectrum
\citep{2006PhR...433..181F, 2012RPPh...75h6901P}. The amplitude of this perturbation on a given line
of sight is a function of the neutral fraction of hydrogen, the baryon overdensity, the spin
temperature relative to the CMB temperature at the given redshift, and the line-of-sight peculiar
velocity of the gas. The spatial power spectrum of this perturbation is thought to be dominated by
inhomogeneous heating of the intergalactic medium (IGM) at $z\sim 20$ \citep{2014MNRAS.437L..36F},
and by growing ionized bubbles during the epoch of reionization (EoR) at $z\sim 7$, where a
detection can constrain the ionizing efficiency of early galaxies, the UV photon mean-free path, and
the minimum halo mass that can support star formation \citep{2015MNRAS.449.4246G}.
Current 21~cm cosmology experiments can be broadly separated into two classes: global signal
experiments that aim to detect the spectral signature of the cosmologically redshifted 21~cm
transition after averaging over the entire sky (otherwise known as the monopole), and power spectrum
experiments that incorporate angular information to attempt to measure the 3D spatial power spectrum
of cosmological 21~cm perturbations. Ongoing global signal experiments include EDGES
\citep{2017ApJ...847...64M}, LEDA \citep{2018MNRAS.478.4193P}, BIGHORNS \citep{2015PASA...32....4S},
SCI-HI \citep{2014ApJ...782L...9V}, and SARAS 2 \citep{2017ApJ...845L..12S}. Ongoing power spectrum
experiments include PAPER/HERA \citep{2015ApJ...809...61A, 2017PASP..129d5001D}, LOFAR
\citep{2017ApJ...838...65P}, and the MWA \citep{2016ApJ...833..102B, 2016MNRAS.460.4320E}. Recently,
EDGES reported the first detection of 21~cm absorption in the globally averaged sky signal
\citep{2018Natur.555...67B}.
Just as for CMB experiments, foreground removal or suppression is an essential component of both
classes of 21~cm cosmology experiments. The brightness temperature of the galactic synchrotron
emission at high galactic latitudes is measured by \citet{2017MNRAS.464.4995M} as
\begin{equation}
T \sim 300\,{\rm K} \times \left(\frac{\nu}{150\,{\rm MHz}}\right)^{-2.6}\,.
\end{equation}
Therefore, experiments conservatively need to achieve five orders of dynamic range against this
foreground emission before the cosmological signal can be measured. Current foreground removal
methods (for example, \citealt{2012ApJ...756..165P} and \citealt{2013MNRAS.429..165C}) rely on the
assumption that the foreground emission is spectrally smooth. The low-frequency radio sky is
composed of several components: galactic synchrotron emission, supernova remnants, radio galaxies,
free-free emission and absorption from \ion{H}{2} regions, and a confusing background of radio
sources. Ideally, a foreground removal strategy should be informed by the measured spatial
structure and frequency spectrum of all foreground components. For instance, CMB experiments
typically construct several maps at several frequencies to enable component separation. At low
frequencies, this possibility is limited by the availability of suitable high-fidelity sky maps on
angular scales ranging from tens of degrees to arcminutes.
Recently, a host of new low-frequency sky surveys have been conducted, including MSSS
\citep{2015A&A...582A.123H}, GLEAM \citep{2015PASA...32...25W}, and TGSS
\citep{2017A&A...598A..78I}. However, the primary data product generated by these surveys is a
catalog of radio point sources. At 45 MHz, \citet{2011A&A...525A.138G} created a map of the sky that
captures the diffuse emission with 5$^\circ$ resolution. The LWA1 Low Frequency Sky Survey
\citep[LLFSS;][]{2017MNRAS.469.4537D} similarly maps the sky at a range of frequencies between
35 and 80~MHz with resolution between 4.5$^\circ$ and 2$^\circ$.
The Global Sky Model \citep[GSM;][]{2008MNRAS.388..247D} is currently the most commonly used
foreground model. The GSM is a nonparametric interpolation of various maps between 10 MHz and 100
GHz. However, the majority of information contained in the GSM is derived at frequencies $>1.4$~GHz,
where the majority of the modern, high-fidelity input maps are located. At 408~MHz, the venerable
Haslam map \citep{1981A&A...100..209H, 1982A&AS...47....1H} covers the entire sky at $1^\circ$
resolution. Below 408~MHz, the GSM uses three input sky maps. \citet{2017MNRAS.464.3486Z}
constructed an improved GSM with five maps below 408~MHz, and \citet{2017MNRAS.469.4537D} used the
LWA1 to improve the GSM with their own sky maps. However, the GSM generally suffers from low
angular resolution ($\sim 5^\circ$) and systematic errors associated with instrumental artifacts in
the input maps. For instance, \citet{2017MNRAS.469.4537D} reported errors of $\pm 50\%$ between the
GSM and their own maps at 74 MHz, which they attribute to the increasing contribution of free-free
absorption and modifications to the synchrotron spectral index at low frequencies.
Wide-field interferometric synthesis imaging is a challenging computational problem, and it has been
particularly difficult to capture large angular scales $\gg 10^\circ$ and small angular scales $\ll
1^\circ$ in a single synthesis image. We will derive a new imaging technique---Tikhonov-regularized
$m$-mode analysis imaging---that allows a drift-scanning interferometer to image the entire visible
sky in a single coherent synthesis imaging step with no gridding and no mosaicking.
As a demonstration of this technique, we apply Tikhonov-regularized $m$-mode analysis imaging to the
Owens Valley Radio Observatory Long Wavelength Array (OVRO-LWA) and generate a series of new
low-frequency maps of the sky between 36.528 and 73.152~MHz. These maps capture the full sky
visible from OVRO with an angular resolution of $\sim 15$~arcmin. These new maps complement the
existing full-sky maps at these frequencies with greatly improved angular resolution.
We aim for these maps to inform foreground removal strategies in 21~cm cosmology, and we anticipate
additional ancillary science taking advantage of the combination of high fidelity and high
resolution of these maps, including but not limited to studies of the cosmic-ray emissivity at low
frequencies, searches for giant radio galaxies, and constraining the galactic synchrotron spectrum.
The maps will be made freely available online at the Legacy Archive for Microwave Background Data
Analysis (LAMBDA)\footnote{
\url{https://lambda.gsfc.nasa.gov/product/foreground/fg_ovrolwa_radio_maps_info.cfm}
}.
The structure of this paper is as follows. In \S\ref{sec:imaging}, we present Tikhonov-regularized
$m$-mode analysis imaging, a new imaging technique that allows us to image the entire visible sky in
one coherent synthesis imaging step with exact wide-field corrections. In \S\ref{sec:observations1}
we describe our observations with the OVRO-LWA. In \S\ref{sec:results1} we present the sky maps and
compare these maps against other low-frequency sky maps. In \S\ref{sec:error1}, we discuss some of
the sources of error present in the maps, and finally, in \S\ref{sec:conclusion1} we present our
conclusions.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\section{All-sky Imaging}\label{sec:imaging}
The goal of all imaging algorithms is to estimate the brightness of the sky $I_\nu(\hat r)$ in the
direction $\hat r$ and frequency $\nu$. A radio interferometer measures the visibilities
$V^{ij,pq}_{\nu}$ between pairs of antennas numbered $i$ and $j$ respectively, and between
polarizations labeled $p$ and $q$ respectively. We will neglect subtleties associated with polarized
imaging, so the Stokes~$I$ visibilities are constructed from the sum of the $pp$ and $qq$
correlations such that $V^{ij}_{\nu} = (V^{ij,pp}_{\nu}+V^{ij,qq}_{\nu})/2$. If the antennas are
separated by the baseline $\vec b_{ij}$, and $A_\nu(\hat r)$ describes an antenna's response to the
incident Stokes~$I$ radiation (here assumed to be the same for each antenna), then
\begin{equation}\label{eq:basic-imaging}
V^{ij}_\nu = \int_\text{sky}
A_\nu(\hat r) I_\nu(\hat r)
\exp\bigg(2\pi i \hat r\cdot\vec b_{ij}/\lambda\bigg) \,\d\Omega \, ,
\end{equation}
where the integral runs over the solid angle $\Omega$. Constructing an image from the output of a
radio interferometer consists of estimating $I_\nu(\hat r)$ given the available measurements
$V^{ij}_\nu$.
For later convenience, we will define the baseline transfer function $B^{ij}_\nu(\hat r)$ such that
\begin{equation}\label{eq:baseline-transfer-function}
V^{ij}_\nu = \int_\text{sky} B^{ij}_\nu(\hat r) I_\nu(\hat r) \,\d\Omega \, .
\end{equation}
The baseline transfer function defines the response of a single baseline to the sky and is a
function of the antenna primary beam, and baseline length and orientation.
Naively, one might attempt to solve Equation~\ref{eq:basic-imaging} by discretizing and
subsequently solving the resulting matrix equation. If the interferometer is composed of
$N_\text{base}$ baselines and measures $N_\text{freq}$ frequency channels over $N_\text{time}$
integrations, then the entire data set consists of $N_\text{base}N_\text{freq}N_\text{time}$ complex
numbers. If the sky is discretized into $N_\text{pix}$ pixels, then the relevant matrix has
dimensions of $(N_\text{base}N_\text{freq}N_\text{time})\times(N_\text{pix})$. For making
single-channel maps with the OVRO-LWA, this becomes a 5 PB array (assuming each matrix element
is a 64 bit complex floating point number). This matrix equation is therefore prohibitively large,
and solving Equation~\ref{eq:basic-imaging} by means of discretization is usually intractable,
although \citet{2017MNRAS.465.2901Z} demonstrated this technique with the MITEOR telescope.
Instead, it is common to make mild assumptions that simplify Equation~\ref{eq:basic-imaging} and
ease the computational burden in solving for $I_\nu(\hat r)$. For example, when all of the baselines
$\vec b_{ij}$ lie in a plane and the field of view is small, Equation~\ref{eq:basic-imaging} can be
well approximated by a two-dimensional Fourier transform \citep{2001isra.book.....T}. The
restriction on baseline coplanarity and field of view can be relaxed by using W-projection
\citep{2008ISTSP...2..647C}. Known primary beam effects can also be accounted for during imaging by
using A-projection \citep{2013ApJ...770...91B}.
\subsection{$m$-mode Analysis}\label{sec:mmode-analysis}
Transit telescopes can take advantage of a symmetry in Equation~\ref{eq:basic-imaging} that greatly
reduces the amount of computer time required to image the full sky with exact incorporation of
wide-field imaging effects. This technique, called $m$-mode analysis, also obviates the need for
gridding and mosaicking. Instead, the entire sky is imaged in one coherent synthesis imaging step.
We will briefly summarize $m$-mode analysis below, but the interested reader should consult
\citet{2014ApJ...781...57S, 2015PhRvD..91h3514S} for a complete derivation.
In the context of $m$-mode analysis, a transit telescope is any interferometer for which the
response pattern of the individual elements does not change with respect to time. This may be an
interferometer like the OVRO-LWA, where the correlation elements are fixed dipoles, but it may also
be an interferometer like LOFAR or the MWA if the steerable beams are held in a fixed position (not
necessarily at zenith). The interferometer also does not necessarily have to be homogeneous.
Heterogeneous arrays composed of several different types of antennas are allowed as long as care is
taken to generalize Equation~\ref{eq:basic-imaging} for a heterogeneous array.
For a transit telescope, the visibilities $V^{ij}_\nu$ are a periodic function of sidereal
time.\footnote{
This is not strictly true. Ionospheric fluctuations and non-sidereal sources (such as the Sun)
will violate this assumption. This paper will, however, demonstrate that the impact on the final
maps is mild.
}
Therefore, it is a natural operation to compute the Fourier transform of the visibilities with
respect to sidereal time $\phi\in[0,2\pi)$.
\begin{equation}
V^{ij}_{m,\nu} = \int_0^{2\pi} V^{ij}_\nu(\phi)\exp\bigg(-im\phi\bigg)\,\d\phi\,.
\end{equation}
The output of this Fourier transform is the set of $m$-modes $V^{ij}_{m,\nu}$ where
$m=0,\,\pm1,\,\pm2,\,\ldots$ is the Fourier conjugate variable to the sidereal time. The $m$-mode
corresponding to $m=0$ is a simple average of the visibilities over sidereal time. Similarly, $m=1$
corresponds to the component of the visibilities that varies over half-day timescales. Larger values
of $m$ correspond to components that vary on quicker timescales.
\citet{2014ApJ...781...57S, 2015PhRvD..91h3514S} showed that there is a discrete linear relationship
between the measured $m$-modes $V^{ij}_{m,\nu}$ and the spherical harmonic coefficients of the sky
brightness $a_{lm,\nu}$.
\begin{equation}\label{eq:m-mode-sum-equation}
V^{ij}_{m,\nu} = \sum_l B^{ij}_{lm,\nu} a_{lm,\nu}\,,
\end{equation}
where the transfer coefficients $B^{ij}_{lm,\nu}$ are computed from the spherical harmonic transform
of the baseline transfer function defined by Equation~\ref{eq:baseline-transfer-function}. These
transfer coefficients define the interferometer's response to the corresponding spherical harmonic
coefficients.
Equation~\ref{eq:m-mode-sum-equation} can be recognized as a matrix equation, where the transfer
matrix $\b B$ is block-diagonal:
\begin{equation}\label{eq:m-mode-matrix-equation}
\overbrace{\left(
\begin{array}{c}
\vdots \\
m\text{-modes} \\
\vdots \\
\end{array}
\right)}^{\b v}
=
\overbrace{\left(
\begin{array}{ccc}
\ddots & & \\
& \text{transfer matrix} & \\
& & \ddots \\
\end{array}
\right)}^{\b B}
\overbrace{\left(
\begin{array}{c}
\vdots \\
a_{lm} \\
\vdots \\
\end{array}
\right)}^{\b a}
\end{equation}
\begin{equation}
\b B = \left(\begin{array}{cccc}
m = 0 &&& \\
& m=\pm1 && \\
&& m=\pm2 & \\
&&& \ddots \\
\end{array}\right)
\end{equation}
The vector $\b v$ contains the list of $m$-modes and the vector $\b a$ contains the list of
spherical harmonic coefficients representing the sky brightness. In order to take advantage of the
block-diagonal structure in $\b B$, $\b v$ and $\b a$ must be sorted by the absolute value of $m$.
Positive and negative values of $m$ are grouped together because the brightness of the sky is
real-valued, and the spherical harmonic transform of a real-valued function has $a_{l(-m)} = (-1)^m
a_{lm}^*$.
In practice, we now need to pick the set of spherical harmonics we will use to represent the sky.
For an interferometer like the OVRO-LWA with many short baselines, a sensible choice is to use all
spherical harmonics with $l\le l_\text{max}$ for some $l_\text{max}$. The parameter $l_\text{max}$
is determined by the maximum baseline length of the interferometer. For an interferometer without
short spacings, a minimum value for $l$ might also be used. This $l_\text{min}$ parameter should be
determined by the minimum baseline length. A rough estimate of $l$ for a baseline of length $b$ at
frequency $\nu$ is $l \sim \pi b\nu/c$. Based on this estimate for the OVRO-LWA and other
computational considerations, we therefore adapt $l_\text{min}=1$ and $l_\text{max}=1000$ across all
frequencies. However, this choice of $l_\text{max}$ actually limits the angular resolution above
55~MHz, and therefore future work will increase $l_\text{max}$ to obtain better angular resolution.
The interferometer's sensitivity to the monopole ($a_{00}$) deserves special consideration.
\citet{2016ApJ...826..116V} proveed -- under fairly general assumptions -- that a baseline with
nonzero sensitivity to $a_{00}$ must also have some amount of cross-talk or common-mode noise. In
fact, the sensitivity to $a_{00}$ is proportional to a sum of these effects. For example, one way a
baseline can have nonzero sensitivity to $a_{00}$ is if the baseline is extremely short. In this
case, the antennas are so close together that voltage fluctuations in one antenna can couple into the
other antenna. In order to make an interferometric measurement of $a_{00}$, this coupling must be
measured and calibrated. Consequently, we set $a_{00}=0$ in our analysis. In the future, this
limitation will be addressed with the inclusion of calibrated total power radiometry.
The size of a typical block in the transfer matrix is
$(2N_\text{base}N_\text{freq})\times(l_\text{max})$. If each element of the matrix is stored as a
64 bit complex floating point number, a single block is 500 MB for the case of single-channel
imaging with the OVRO-LWA, which a modern computer can easily store and manipulate in memory.
However, with additional bandwidth, these blocks quickly become unwieldy; thus, as a first pass, the
analysis in this paper is restricted to single-channel imaging. Note also that for the OVRO-LWA,
$N_\text{base} \gg l_\text{max}$, so there are more measurements than unknowns in
Equation~\ref{eq:m-mode-matrix-equation}.
The key advantage of $m$-mode analysis is the block-diagonal structure of
Equation~\ref{eq:m-mode-matrix-equation}. The computational complexity of many common matrix
operations (e.g., solving a linear system of equations) is $\mathcal{O}(N^3)$, where $N$ is the
linear size of the matrix. By splitting the equation into $M$ independent blocks, the number of
floating point operations required to solve the linear system of equations is now
$\mathcal{O}(N^3M^{-2})$, because each block can be manipulated independently of the other blocks.
This computational savings is what makes this matrix algebra approach to interferometric imaging
feasible. For the data set presented in this paper, computing the elements of the transfer matrix
takes $\sim$10 hours per frequency channel on a 10-node cluster, but once the matrix has been
computed, the imaging process described in \S\ref{sec:mmode-imaging} takes $\sim$10 minutes, and the
deconvolution process described in \S\ref{sec:clean} was allowed to run for $\sim$10 hours.
\subsection{$m$-mode Analysis Imaging}\label{sec:mmode-imaging}
Imaging in $m$-mode analysis essentially amounts to inverting
Equation~\ref{eq:m-mode-matrix-equation} to solve for the spherical harmonic coefficients $\b a$.
The linear least-squares solution, which minimizes $\|\b v - \b B\b a\|^2$, is given by
\begin{equation}
\b{\hat a}_\text{LLS} = (\b B^*\b B)^{-1}\b B^*\b v\,,
\end{equation}
where $^*$ indicates the conjugate-transpose.
However, usually one will find that $\b B$ is not full rank, and hence $\b B^*\b B$ is not an
invertible matrix. For example, an interferometer located in the northern hemisphere will never see
a region of the southern sky centered on the southern celestial pole. The $m$-modes contained in the
vector $\b v$ must contain no information about the sky around the southern celestial pole, and
therefore the act of multiplying by $\b B$ must destroy some information about the sky. The
consequence of this fact is that $\b B$ must have at least one singular value that is equal to zero.
It then follows that $\b B^*\b B$ must have at least one eigenvalue that is equal to zero, which
means it is not an invertible matrix.
Another way of looking at the problem is that because the interferometer is not sensitive to part of
the southern hemisphere, there are infinitely many possible solutions to
Equation~\ref{eq:m-mode-matrix-equation} that will fit the measured data equally well. We will
therefore regularize the problem and apply an additional constraint that prefers a unique yet
physically reasonable solution.
\subsubsection{Tikhonov Regularization}
The process of Tikhonov regularization minimizes $\|\b v - \b B\b a\|^2 + \varepsilon\|\b a\|^2$ for
some arbitrary value of $\varepsilon > 0$ chosen by the observer. The solution that minimizes this
expression is given by
\begin{equation}\label{eq:tikhonov-solution}
\b{\hat a}_\text{Tikhonov} = (\b B^*\b B + \varepsilon\b I)^{-1}\b B^*\b v\,.
\end{equation}
Tikhonov regularization adds a small value $\varepsilon$ to the diagonal of $\b B^*\b B$, fixing the
matrix's singularity. By using the singular value decomposition (SVD) of the matrix $\b B = \b U \b
\Sigma \b V^*$, Equation~\ref{eq:tikhonov-solution} becomes
\begin{equation}
\b{\hat a}_\text{Tikhonov} = \b V (\b\Sigma^2 + \varepsilon \b I)^{-1}\b\Sigma \b U^*\b v\,,
\end{equation}
where
\[
\b\Sigma = \left(
\begin{array}{ccc}
\sigma_1 & & \\
& \sigma_2 & \\
& & \ddots \\
\end{array}
\right)\,.
\]
The diagonal elements of $\b\Sigma$ are the singular values of $\b B$. The contribution of each
singular component to the Tikhonov-regularized solution is scaled by $\sigma_i / (\sigma_i^2 +
\varepsilon)$, where $\sigma_i$ is the singular value for the $i$th singular component. Tikhonov
regularization therefore acts to suppress any component for which
$\sigma_i\lesssim\sqrt{\varepsilon}$. If $\sigma_i = 0$, the component is set to zero.
In practice, the measurement $\b v$ is corrupted by noise with covariance $\b N$. For illustrative
purposes, we will assume that $\b N=n\b I$ for some $n>0$. In this case, the covariance of the
Tikhonov-regularized spherical harmonic coefficients is
\begin{equation}
\b C = n \b V (\b\Sigma^2 + \varepsilon\b I)^{-2} \b\Sigma^2 \b V^*\,.
\end{equation}
Each singular component is scaled by a factor of $\sigma_i^2/(\sigma_i^2 + \varepsilon)^2$. In the
absence of Tikhonov regularization ($\varepsilon=0$), singular components with the smallest singular
values -- the ones that the interferometer is the least sensitive to -- actually come to dominate
the covariance of the measured spherical harmonic coefficients. Tikhonov regularization improves
this situation by down-weighting these components.
\subsubsection{L Curves}
\begin{figure}[t]
\includegraphics[width=\columnwidth]{figures/chapter3/lcurve}
\caption{
Example L curve computed from OVRO-LWA data at 36.528~MHz by trialing 200 different values
of the regularization parameter $\varepsilon$. The $x$-axis is the norm of the solution (in
this case, the spherical harmonic coefficients) given in arbitrary units, and the $y$-axis
is the least-squares norm given in arbitrary units. Where the regularization parameter is
small, the norm of the solution grows rapidly. Where the regularization parameter is large,
the least-squares norm grows rapidly.
}
\label{fig:lcurve}
\end{figure}
Tikhonov regularization requires the observer to pick the value of $\varepsilon$. If $\varepsilon$
is too large, then too much importance is placed on minimizing the norm of the solution and the
least-squares residuals will suffer. Conversely, if $\varepsilon$ is too small, then the problem
will be poorly regularized and the resulting sky map may not represent the true sky. Picking the
value of $\varepsilon$ therefore requires understanding the trade-off between the two norms.
This trade-off can be analyzed quantitatively by trialing several values of $\varepsilon$, and
computing $\|\b v - \b B\b a\|^2$ and $\|\b a\|^2$ for each trial. An example is shown in
Figure~\ref{fig:lcurve}. The shape of this curve has a characteristic L shape, and as a result, this
type of plot is called an L curve. The ideal value of $\varepsilon$ lies near the turning point, of
the plot. At this point a small decrease in $\varepsilon$ will lead to an undesired rapid increase
in $\|\b a\|^2$, and a small increase in $\varepsilon$ will lead to an undesired rapid increase in
$\|\b v - \b B\b a\|^2$.
In practice, the L curve should be used as a guide to estimate a reasonable value of $\varepsilon$.
However, better results can often be obtained by tuning the value of $\varepsilon$. For instance,
increasing the value of $\varepsilon$ can improve the noise properties of the map by down-weighting
noisy modes. Decreasing the value of $\varepsilon$ can improve the resolution of the map by
up-weighting the contribution of longer baselines, which are likely fewer in number. In this
respect, choosing the value of $\varepsilon$ is analogous to picking the weighting scheme in
traditional imaging where robust weighting schemes can be tuned to similar effect \citep{briggs}.
For the OVRO-LWA, we selected $\varepsilon = 0.01$ across all frequency channels. The distribution
of singular values of the transfer matrix with respect to $\sqrt{\varepsilon}$ is summarized in
Table~\ref{tab:summary}.
\subsubsection{Other Regularization Schemes}
The choice of applying Tikhonov regularization to $m$-mode analysis imaging is not unique. There
exists a plethora of alternative regularization schemes that could also be applied. Each
regularization scheme has its own advantages and disadvantages. For instance, Tikhonov
regularization is simple, independent of prior information, and sets unmeasured modes to zero (a
sensible expectation). We will now briefly discuss a few other alternatives.
The Moore--Penrose pseudo-inverse (denoted with a superscript $\dagger$) is commonly applied to find
the minimum-norm linear least-squares solution to a set of linear equations. This can be used in
place of Tikhonov regularization as
\begin{equation}
\b{\hat a}_\text{Moore-Penrose} = \b B^\dagger\b v\,.
\end{equation}
Much like Tikhonov regularization, the Moore--Penrose pseudo-inverse sets components with small
singular values (below some user-defined threshold) to zero. Components with large singular values
(above the user-defined threshold) are included in the calculation at their full amplitude with no
down-weighting of modes near the threshold. The essential difference between using the
Moore--Penrose pseudo-inverse and Tikhonov regularization is that the pseudo-inverse defines a hard
transition from ``on'' to ``off.'' Modes are either set to zero or included in the map at their full
amplitude. On the other hand, Tikhonov regularization smoothly interpolates between these behaviors.
Because of this, Tikhonov regularization tends to produce better results in practical applications.
If the measured $m$-modes have a noise covariance matrix $\b N \neq n\b I$ for some scalar $n$
(e.g., the interferometer is inhomogeneous), then the observer should minimize $(\b v-\b B\b a)^*\b
N^{-1}(\b v-\b B\b a) + \varepsilon\|\b a\|^2$. The noise covariance matrix $\b N$ is used to weight
the measurements such that
\begin{equation}
\b{\hat a}_\text{min variance} = (\b B^*\b N^{-1}\b B + \varepsilon\b I)^{-1}
\b B\b N^{-1}\b v\,.
\end{equation}
In the event that the observer has a prior map of the sky, $\|\b a - \b a_\text{prior}\|^2$ can be
used as the regularizing norm. This will use the prior map to fill in missing information instead of
setting these modes to zero. In this case, the minimum is at
\begin{equation}
\b{\hat a}_\text{with prior} = (\b B^*\b B + \varepsilon\b I)^{-1}
(\b B^*(\b v - \b B\b a_\text{prior}))
+ \b a_\text{prior}\,.
\end{equation}
If instead the observer has a prior expectation on the covariance of the spherical harmonic
coefficients, Wiener filtering can also be used. This technique is demonstrated for simulated
measurements by \citet{2016arXiv161203255B}.
Alternatively, we could opt to regularize the problem by enforcing smoothness in the sky maps. In
this case, the regularizing norm should be of the form $\|\nabla I(\hat r)\|^2$, where $\nabla I$ is
the gradient of the sky brightness in the direction $\hat r$. This is actually a generalization of
Tikhonov regularization, where the objective function is $\|\b v-\b B\b a\|^2 + \varepsilon\|\b A\b
a\|^2$ for some matrix $\b A$. The minimum is at
\begin{equation}
\b{\hat a}_\text{generalized} = (\b B^*\b B + \varepsilon\b A^*\b A)^{-1}\b B^*\b v\,.
\end{equation}
Finally, in many machine-learning applications the $L_1$-norm\footnote{
$\|\b a\|_1 = \sum_i |a_i|$
} is used in place of the usual $L_2$-norm in order to encourage sparsity in the reconstructed
signal. Applying this to $m$-mode analysis imaging would amount to minimizing $\|\b v-\b B\b a\|_2^2
+ \varepsilon\|\b a\|_1$. However, because we have decomposed the sky in terms of spherical
harmonics, the vector $\b a$ is not expected to be sparse. Consequently, the $L_1$-norm is generally
inappropriate for $m$-mode analysis imaging without an additional change of variables designed to
introduce sparsity.
\subsection{CLEAN}\label{sec:clean}
In traditional radio astronomy imaging, CLEAN \citep{1974A&AS...15..417H} is a physically motivated
algorithm that interpolates between measured visibilities on the $uv$ plane. In the absence of this
interpolation, gaps in the interferometer's $uv$ coverage are assumed to be zero, and -- in the
image plane -- sources are convolved with a point spread function (PSF) that is characteristic of
the $uv$ coverage. Fundamentally, the interferometer's PSF is determined by which modes were
assumed to be zero in the initial imaging process.
In $m$-mode analysis imaging, we assumed modes were zero in two separate ways.
\begin{enumerate}
\item We selected a set of spherical harmonic coefficients $a_{lm}$ to describe the
sky-brightness distribution. All modes with $l>l_\text{max}$ are neglected and assumed to be
zero.
\item Tikhonov regularization forces linear combinations of spherical harmonic coefficients with
$\sigma_i \lesssim \sqrt{\varepsilon}$ toward zero.
\end{enumerate}
As a consequence, the final map of the sky is not assembled from a complete set of spherical
harmonics. Therefore, just as in traditional imaging, $m$-mode analysis imaging produces dirty maps
in which sources are convolved with a PSF. This PSF can be improved by increasing the number and
variety of baselines, which increases the number of modes for which $\sigma_i \gg
\sqrt{\varepsilon}$. Alternatively, by collecting more data, the signal-to-noise ratio of the
measured $m$-modes increases, which allows the observer to lower the value of $\varepsilon$ without
increasing the noise in the maps. Finally, the CLEAN algorithm can be applied to interpolate some
of the missing information that was assumed to be zero.
The PSF of a dirty $m$-mode analysis map may be computed with
\begin{equation}\label{eq:psf}
\b a_\text{PSF}(\theta, \phi)
= (\b B^*\b B + \varepsilon\b I)^{-1}\b B^*\b B\b a_\text{PS}(\theta, \phi)\,,
\end{equation}
where $\b a_\text{PSF}(\theta, \phi)$ is the vector of spherical harmonic coefficients representing
the PSF at the spherical coordinates $(\theta, \phi)$, and $\b a_\text{PS}(\theta, \phi)$ is the
vector of spherical harmonic coefficients for a point source at $(\theta, \phi)$ given by
\begin{equation}
\b a_\text{PS}(\theta, \phi) = \begin{pmatrix}
\vdots \\
Y_{lm}^*(\theta, \phi) \\
\vdots \\
\end{pmatrix}
= \begin{pmatrix}
\vdots \\
Y_{lm}^*(\theta, 0)\times e^{im\phi} \\
\vdots \\
\end{pmatrix} \,.
\end{equation}
In general, the PSF can be a function of the right ascension and declination. However, point sources
at the same declination take the same track through the sky and (barring any ionospheric effects)
will have the same PSF. The PSF is therefore only a function of the declination. For example,
sources at low elevations will tend to have an extended PSF along the north--south axis due to
baseline foreshortening. For the OVRO-LWA antenna configuration (Figure~\ref{fig:antenna-layout}),
example PSFs at three separate frequencies are shown in Figure~\ref{fig:psf}. Adapting CLEAN for
$m$-mode analysis requires either precomputing Equation~\ref{eq:psf} at a grid of declinations, or
a method for rapidly evaluating Equation~\ref{eq:psf} on the fly.
For an interferometer with more baselines than spherical harmonics used in the maps (e.g., the
OVRO-LWA), $\b B^*\b B$ can be a much smaller matrix than the full transfer matrix $\b B$.
Therefore, precomputing $\b B^*\b B$ can allow the entire matrix to fit into memory on a single
machine. This greatly reduces the amount of disk I/O necessary for solving Equation~\ref{eq:psf}.
Additionally, we can precompute the Cholesky decomposition of $\b B^*\b B + \varepsilon\b I = \b
U^*\b U$, where $\b U$ is an upper triangular matrix. Inverting an upper triangular matrix is an
$\mathcal{O}(N^2)$ operation (instead of $\mathcal{O}(N^3)$ for a general matrix inverse).\footnote{
Instead of computing $\b A^{-1}$, we solve the linear equation $\b A\b x = \b b$ each time the
matrix inverse is needed so as to avoid numerical instabilities.
}
Equation~\ref{eq:psf} can then be rapidly evaluated from right to left as
\begin{equation}\label{eq:rapid-psf}
\b a_\text{PSF} =
\b U^{-1}\,\big(\b U^*\big)^{-1}\,\big(\b B^*\b B\big)\,\b a_\text{PS}\,.
\end{equation}
Furthermore, Equation~\ref{eq:rapid-psf} does not need to be separately evaluated for each CLEAN
component. Instead, we can identify $N$ CLEAN components, accumulate $\b a_\text{PS}$ for each
component, and evaluate Equation~\ref{eq:rapid-psf} on the accumulation. This can greatly reduce the
number of times this equation needs to be evaluated, but care must be taken to ensure that the $N$
components are not so close together that sidelobes from one may interact with another.
Altogether, the adaptation of CLEAN applied to the maps presented in this paper is summarized below.
\begin{algorithmic}[1]
\Require{$\b a$ is the solution to Equation~\ref{eq:tikhonov-solution}}
\Function{CLEAN}{$\b a$}
\Let{$\b M$}{$\b B^*\b B$}
\Let{$\b U$}{${\rm chol}(\b M + \varepsilon\b I)$} \Comment{Cholesky decomposition}
\While{noise in map $>$ threshold}
\State find $N$ pixels with the largest residual flux
\Let{$\b x$}{$\sum_{i=1}^N \,(\text{pixel flux}) \times \b a_\text{PS}(\theta_i, \phi_i)$}
\Let{$\b y$}{$\b U^{-1}\big(\b U^*\big)^{-1}\b M\b x$}
\Let{$\b a$}{$\b a - (\text{loop gain})\times\b y$}
\State record subtracted components
\EndWhile
\Let{$\b a$}{$\b a + (\text{restored components})$}
\State \Return{$\b a$}
\EndFunction
\end{algorithmic}
In summary, Tikhonov-regularized $m$-mode analysis imaging constructs a wide-field synthesis image of
the sky from a complete Earth rotation, and with exact treatment of wide-field effects. This is
accomplished by solving a regularized block-diagonal matrix equation
(Equation~\ref{eq:tikhonov-solution}). The solution to this equation generates a map where
sources are convolved with a PSF characteristic of the interferometer (a function of the frequency,
antenna response, and baseline distribution with a full Earth rotation). The CLEAN algorithm is
adopted to deconvolve the PSF and produce the final sky maps.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\section{Observations}\label{sec:observations1}
\subsection{The OVRO-LWA}
\begin{figure}[t]
\includegraphics[width=\columnwidth]{figures/chapter3/antenna-layout}
\caption{
Antenna layout for the OVRO-LWA. Black dots correspond to antennas within the 200 m diameter
core of the array. The 32 triangles are the expansion antennas built in early 2016 in order
to increase the longest baseline to $\sim1.5$ km. The red dots are core antennas that are
disconnected from the correlator in order to accommodate these antennas. The five crosses
are antennas equipped with noise-switched front ends.
}
\label{fig:antenna-layout}
\end{figure}
\begin{figure}[t]
\begin{tabular}{c}
\includegraphics[width=\textwidth]{figures/chapter3/psf+75} \\
\includegraphics[width=\textwidth]{figures/chapter3/psf+45} \\
\includegraphics[width=\textwidth]{figures/chapter3/psf+0} \\
\end{tabular}
\caption{
The $m$-mode analysis imaging PSF at three declinations (top row: $\delta=+75^\circ$, middle
row: $\delta=+45^\circ$, bottom row: $\delta=+0^\circ$) and three frequencies (left column:
36.528~MHz, middle column: 52.224~MHz, right column: 73.152~MHz). The PSF is computed by
evaluating Equation~\ref{eq:psf}. Above 55~MHz, the angular extent of the PSF does not
follow the expected scaling with frequency because the angular resolution is limited by the
selection of $l_\text{max}=1000$. The FWHM at $\delta=+45^\circ$ is listed in
Table~\ref{tab:summary}.
}
\label{fig:psf}
\end{figure}
The OVRO-LWA is a 288-element interferometer located at OVRO near Big Pine, California
\citep{hallinan_2017}. The OVRO-LWA is a low-frequency instrument with instantaneous bandwidth
covering 27 to 85~MHz and with 24~kHz channelization. Each antenna stand hosts two perpendicular
broadband dipoles so that there are $288\times2$ signal paths in total. These signal paths feed into
the 512-input LEDA correlator \citep{2015JAI.....450003K}, which allows the OVRO-LWA to capture the
entire visible hemisphere in a single snapshot image.
The 288 antennas are arranged in a pseudo-random configuration optimized to minimize sidelobes in
snapshot imaging (see Figure~\ref{fig:antenna-layout}). Of these 288 antennas, 251~are contained
within a 200~m diameter core, 32~are placed outside of the core in order to extend the maximum
baseline length to $\sim$1.5~km, and five are equipped with noise-switched front ends for calibrated
total power measurements of the global sky brightness. These antennas are used as part of the LEDA
experiment \citep{2018MNRAS.478.4193P} to measure the global signal of 21~cm absorption from the
cosmic dawn. In the current configuration, 32 antennas (64 signal paths) from the core are
disconnected from the correlator in order to accommodate the 32 antennas on longer baselines. A
final stage of construction will involve 64 additional antennas installed on long baselines out to a
maximum length of 2.6~km.
The data set used in this paper spans 28 consecutive hours beginning at 2017 February 17 12:00:00 UTC
time. During this time, the OVRO-LWA operated as a zenith-pointing drift-scanning interferometer.
The correlator dump time was selected to be 13~s such that the correlator output evenly divides a
sidereal day. Due to the computational considerations presented in \S\ref{sec:mmode-analysis}, eight
24~kHz channels are selected for imaging from this data set: 36.528, 41.760, 46.992, 52.224, 57.456,
62.688, 67.920, and 73.152~MHz. These particular channels are chosen due to their location at the
exact center of instrumental subbands.
\subsection{Complex Gain Calibration}\label{sec:gaincal}
The complex gain calibration is responsible for correcting per-antenna amplitude and phase errors.
This is accomplished using a sky model and a variant of alternating least-squares colloquially known
as ``Stefcal''
\citep{2008ISTSP...2..707M, 2014A&A...571A..97S}\footnote{
The calibration routine is written in the Julia programming language
\citep{doi:10.1137/141000671}, and is publicly available online
(\url{https://github.com/mweastwood/TTCal.jl}) under an open source license (GPLv3 or any later
version).
}.
Cyg~A and Cas~A are -- by an order of magnitude -- the brightest point-like radio sources in the
northern hemisphere at resolutions lower than 0.25$^\circ$. Therefore, the optimal time to solve for
the interferometer's gain calibration is when these sources are at high elevations. The antenna
complex gains are measured from a 22~minute track of data when Cyg~A and Cas~A are at high
elevations. The gains measured in this way are then used to calibrate the entire 28 hour data set.
The calibration sky model consists only of Cyg~A and Cas~A. The model flux of Cyg~A is set to the
\citet{1977A&A....61...99B} spectrum, while the flux of Cas~A is measured from the data itself (using
a preliminary calibration solved for with a fiducial Cas~A spectrum).
Calibrating in this manner generates approximately arcminute errors in the astrometry of the final
sky maps due to ionospheric refractive offsets during the time of calibration. These residual
errors in the astrometry are corrected post-imaging by registering the images with respect to all
Very Large Array Low-frequency Sky Survey Redux (VLSSr) \citep{2014MNRAS.440..327L} sources that are
bright ($>30$~Jy with a consistent flux density measured with the OVRO-LWA) and not too close to
other bright sources (at least $1^\circ$ separation).
Temperature fluctuations of the analog electronics generate 0.1~dB sawtooth oscillations in the
analog gain. These oscillations occur with a variable 15 to 17~minute period associated with HVAC
cooling cycles within the electronics shelter that houses these electronics. The amplitude of these
gain fluctuations is calibrated by smoothing the autocorrelation amplitudes on 45~minute timescales.
The ratio of the measured autocorrelation power to the smoothed autocorrelation power defines a
per-antenna amplitude correction that is then applied to the cross-correlations. Additionally, the
ambient temperature at the front-end electronics (located in a box at the top of each dipole)
fluctuates diurnally, which will generate diurnal gain fluctuations. At this time, no correction is
made for these diurnal gain fluctuations.
\subsection{Primary Beam Measurements}\label{sec:beam}
\begin{figure}[t]
\includegraphics[width=\textwidth]{figures/chapter3/beam}
\caption{
Empirical fits to the OVRO-LWA Stokes $I$ primary beam (the response of the $x$ and
$y$ dipoles has been summed) at three frequencies: 36.528 MHz (left panel), 52.224 MHz
(middle panel), and 73.152 MHz (right panel). The source tracks used to measure the beam
model are overlaid. From north to south, these tracks correspond to Cas~A, Cyg~A, 3C~123,
Tau~A, Vir~A, Her~A, 3C~353, and Hya~A. The fitting process is described in
\S\ref{sec:beam}, and residuals for Cyg~A and Cas~A are in Figure~\ref{fig:scintillation}.
}
\label{fig:beam}
\end{figure}
In order to generate wide-field images of the sky, the response of the antenna to the sky must be
known. Drift-scanning interferometers like the OVRO-LWA can empirically measure their primary beam
under a mild set of symmetry assumptions \citep{2012AJ....143...53P}. The symmetry assumptions are
necessary to break the degeneracy between source flux and beam amplitude when the flux of the source
is unknown. In this work, we assume symmetries that are apparent in the antenna design, but
real-world defects and coupling with nearby antennas will contribute toward breaking these
symmetries at some level. In particular, we assume that the $x$ and $y$ dipoles have the same
response to the sky after rotating one by $90^\circ$, and that the beam is invariant under
north--south and east--west flips.
We measure the flux of several bright sources (Cyg~A, Cas~A, Tau~A, Vir~A, Her~A, Hya~A, 3C~123, and
3C~353) as they pass through the sky and then fit a beam model composed of Zernike polynomials to
those flux measurements. We select the basis functions to have the desired symmetry ($Z_0^0$,
$Z_2^0$, $Z_4^0$, $Z_4^4$, $Z_6^0$, $Z_6^4$, $Z_8^0$, $Z_8^4$, and $Z_8^8$), and the beam amplitude
at zenith is constrained to be unity. See Figure~\ref{fig:beam} for an illustration of a fitted beam
model at several frequencies. This process is repeated for each frequency channel. Residuals for
Cyg~A and Cas~A can be seen in Figure~\ref{fig:scintillation}.
\subsection{Ionospheric Conditions}\label{sec:ionosphere-conditions}
\begin{figure}[t]
\includegraphics[width=\textwidth]{figures/chapter3/scintillation-refraction}
\caption{
Panels (a) and (b) show the measured apparent flux of Cyg~A and Cas~A at 36.528 MHz (red
points) and 73.152 MHz (blue points) as a function of time over the observing period. The
solid black curves show the expected flux computed using the empirical beam model fits. The
thermal noise contribution to each point is about 50 Jy. Cyg~A is occulted by the White
Mountains when it is low on the horizon to the east.
Panels (c) and (d) show the measured position offset of Cyg~A and Cas~A relative to their
true astronomical positions at 36.528 MHz (red line) and 73.152 MHz (blue line).
}
\label{fig:scintillation}
\end{figure}
\begin{figure}[t]
\includegraphics[width=\columnwidth]{figures/chapter3/vtec}
\caption{
Median vertical TEC within 200 km of OVRO during the time of the observation. The gray
shaded regions indicate times outside of the observing period. The gray vertical lines
indicate sunrise and sunset (as labeled).
}
\label{fig:vtec}
\end{figure}
The geomagnetic conditions during this time were mild. The Disturbance storm time (Dst) index, which
measures the $z$-component of the interplanetary magnetic field, was
$>-30$ nT during the entirety of the observing period.\footnote{
The Dst index was obtained from the World Data Center for Geomagnetism, Kyoto University
(\url{http://swdcwww.kugi.kyoto-u.ac.jp/}). Accessed 2017 July 25.
}
Following the classification scheme of \citet{2008GMS...181.....K}, a weak geomagnetic storm has
$\text{Dst} < -30$ nT. Stronger geomagnetic storms have $\text{Dst} < -50$ nT.
Despite the mild conditions, low-frequency interferometric observations are still affected by the
index of refraction in the ionosphere. Figure~\ref{fig:vtec} shows the median vertical total
electron content (TEC) above OVRO measured from GPS \citep{1999JASTP..61.1205I}. The median is
computed over all GPS measurements within 200 km of the observatory. Over the observing period, the
TEC smoothly varies from 20 TECU at midday to 5 TECU during the night. However, this measurement is
only sensitive to large-scale fluctuations in the ionosphere and does not capture small-scale
fluctuations.
Small-scale fluctuations are best characterized by source scintillation and refractive offsets.
Figure~\ref{fig:scintillation} shows the apparent flux and position offset of Cyg~A and Cas~A as a
function of time over the entire observing period. Both sources exhibit rapid scintillation on the
timescale of a single integration (13~s). For example, at 36.528~MHz, it is not unusual for
Cyg A to have measured flux variations of 50\% between adjacent 13 second integrations. The
variance at 36.528~MHz compared with the variance at 73.152~MHz is consistent with an ionospheric
$\nu^{-2}$ origin. The measured position offset of each source is a measurement of the ionospheric
phase gradient across the array. This varies on slower 10~minute timescales, with each source
refracting by as much as 20\arcmin (at 36.528 MHz) from its true astronomical position as waves
in the ionosphere pass through the line of sight. At 74~MHz on the VLA, \citet{2007ApJS..172..686K}
observed $\sim 1\arcmin$ refractive offsets during the night, and $\sim 4\arcmin$ offsets during the
day on similar $\sim10$~minute timescales, which is consistent with what is seen here. The impact
of these effects on the sky maps is simulated in \S\ref{sec:ionosphere}.
\subsection{Source Removal}\label{sec:source-removal}
\subsubsection{Cyg A and Cas A}
Due to the rapid and large ionospheric fluctuations seen in Figure~\ref{fig:scintillation}, CLEAN
cannot be relied on to accurately deconvolve bright sources. However, without removing bright
sources from the data, sidelobes from these sources will dominate the variance in the sky maps. At
74~MHz, Cyg~A is a 15,000~Jy source \citep{1977A&A....61...99B}. A conservative estimate for the
confusion limit at 74~MHz with a 15\arcmin beam is 1000~mJy \citep{2014MNRAS.440..327L}. Therefore,
we require that Cyg~A's sidelobes be at least $-45$~dB down from the main lobe to prevent Cyg~A's
sidelobes from dominating the variance in the image.
To achieve this dynamic range at low frequencies, it is important to account for propagation effects
through the ionosphere. In the weak scattering regime ($r_\text{diff} \gg r_\text{f}$, where
$r_\text{diff}$ is the diffractive scale of the ionosphere, $r_\text{f} = \sqrt{\lambda D / 2\pi}$
is the Fresnel scale, $\lambda$ is the wavelength, and $D$ is the distance to the ionosphere),
fluctuations within the ionosphere contribute amplitude and phase scintillations that can be
described by a direction-dependent complex gain calibration. This justifies the use of ``peeling,''
which incorporates a direction-dependent calibration to subtract sources in the presence of
ionospheric scintillation \citep[e.g.,][]{2008ISTSP...2..707M, 2015MNRAS.449.2668S}.
In the strong scattering regime ($r_\text{diff} \lesssim r_\text{f}$), the image of a point source
can ``break apart'' into multiple images or speckles \citep{2015MNRAS.453..925V}. Attempting to
peel a source in the strong scattering regime will lead to source-subtraction artifacts in the final
sky map. \citet{2016RaSc...51..927M} measured that from the location of LOFAR at 150 MHz, the
diffractive scale of the ionosphere is $>5$ km 90\% of the time. This implies that at 73 MHz, the
diffractive scale is typically $>2$ km, and at 36 MHz, the diffractive scale is typically $>1$ km.
These limits are comparable to the Fresnel scale for the OVRO-LWA (i.e., $r_\text{diff} >
r_\text{f}$), and therefore we do not generally expect to see strong scattering from the ionosphere.
Ionospheric conditions during the observing period were mild (see
\S\ref{sec:ionosphere-conditions}). However, we do observe scintillation and refractive-offset
events on the timescale of a single integration (13~s; see Figure~\ref{fig:scintillation}).
Consequently, we peeled Cyg~A and Cas~A from the data set using a new solution for each integration.
In addition, the largest angular scale of Cas~A is $\sim8\arcmin$, and the largest angular scale of
Cyg~A is $\sim2\arcmin$. With an $\sim10\arcmin$ resolution on its longest baselines at 73~MHz, the
OVRO-LWA marginally resolves both sources. A resolved source model is needed for both sources. We
fit a self-consistent resolved source model to each source. This is performed by minimizing the
variance within an aperture located on each source after peeling. By phasing up a large number of
integrations before imaging (at least 1 hour), it is possible to smear out the contribution of the
rest of the sky. We then use a nonlinear optimization routine \citep[NLopt Sbplx;][]{sbplx, nlopt}
to vary the parameters in a source model until the variance within the aperture is minimized. Cyg~A
is modeled with two Gaussian components, while Cas~A is modeled with three Gaussian components.
Ultimately, these multicomponent models are used to peel Cyg~A and Cas~A, but residual errors from
this model and from the ionosphere (particularly while these sources are at low elevations)
contribute residual artifacts that are largely localized to within 1$^\circ$ of each source.
\subsubsection{Other Bright Sources}
Other bright sources -- namely Vir~A, Tau~A, Her~A, Hya~A, 3C~123, and 3C~353 -- are also removed
from the visibilities prior to imaging. Because these sources are much fainter than Cyg~A and Cas~A,
we do not need resolved source models to be able to remove these sources from the visibilities
without residual sidelobes contaminating the image.
However, the ionosphere will cause these sources to scintillate and refract. The position and flux
of each source is measured separately in each channel and integration. The sources are then
subtracted from the visibilities using the updated position and flux of the source. The brightest of
these sources (Vir~A and Tau~A) are peeled using a direction-dependent calibration when they are at
high elevations.
\subsubsection{The Sun}
The Sun can be trivially removed from any map of the sky by constructing the map using only data
collected at night. A map of the entire sky can be obtained by using observations spaced 6 months
apart. However, the data set used in this paper consists of 28 consecutive hours. Fortunately, the
Sun was not active during this period, which could have greatly increased the difficulty involved in
subtracting the Sun.
We attempt to subtract the Sun from the data set with the goal of suppressing its sidelobes. The Sun
is well-resolved by the OVRO-LWA, and hence a detailed source model is needed. In fact, the optical
depth $\tau=1$ surface of the Sun changes with frequency, and as a consequence, a new model is needed
at each frequency. While we could fit a limited number of Gaussian components to Cyg~A and Cas~A,
this is insufficient for the Sun. Additionally, while most astronomical sources at these
frequencies have negative spectral indices, the Sun has a positive spectral index. Therefore, more
care will need to be taken in subtracting the Sun at higher frequencies than at lower frequencies.
The strategy used for removing the Sun below 55 MHz involves fitting a shapelet
\citep{2003MNRAS.338...35R} model to the Sun and subtracting without the use of direction-dependent
gains. The shapelet fitting is performed in the visibility space. Above 55~MHz, a model is fit to
the Sun by minimizing the residuals after peeling (in the same way that models are obtained for
Cyg~A and Cas~A). The Sun is then peeled from each integration using direction-dependent gains.
\subsection{Flux Scale}\label{sec:flux-scale}
\begin{figure}[t]
\includegraphics[width=\textwidth]{figures/chapter3/flux-scale}
\caption{
Measured fluxes (black points) of 11 sources plotted against the published spectra from
\citet{2017ApJS..230....7P} (solid line above 50 MHz, dotted line below 50 MHz),
\citet{2012MNRAS.423L..30S} (dashed line), and \citet{1977A&A....61...99B} (dot-dashed
line). Cas~A is compared against a spectrum assuming a secular decrease of 0.77\% per year
\citep{2009AJ....138..838H}.
}
\label{fig:flux-scale}
\end{figure}
The flux scale of the data was tied to the \citet{1977A&A....61...99B} spectrum of Cyg~A during gain
calibration. However, gain calibration is also a function of the beam model and the spectrum used
for Cas~A. Recent work by \citet{2012MNRAS.423L..30S} (hereafter SH12) using archival data from the
literature and \citet{2017ApJS..230....7P} (hereafter PB17) using the VLA has expanded the number of
low-frequency radio sources with calibrated flux measurements from one (Cyg~A) to 11 in total.
While the SH12 flux scale is valid between 30 and 300~MHz, the PB17 flux scale is somewhat more
limited because the lowest-frequency observations come from the VLA 4-band system. As a consequence,
the PB17 flux scale is not valid below 50 MHz.
Figure~\ref{fig:flux-scale} shows a comparison between flux measurements made using the all-sky maps
from this work and spectra from the aforementioned flux scales. Generally, the OVRO-LWA flux
measurements agree to between 5\% and 10\% of the SH12 spectra. Below 50 MHz, there can be substantial
departures with respect to the extrapolated PB17 spectra (e.g., 3C 286, 3C 295, and 3C 380), but
it is usually the case that we have much better agreement with the SH12 spectra. This indicates that
the PB17 spectra cannot be extrapolated below 50 MHz.
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\section{Results}\label{sec:results1}
\bgroup
\def\arraystretch{1.5}
\begin{sidewaystable}[p]
\centering
\begin{tabular}{cccccccccc}
\hline
\hline
& \tbf{$\b\nu$}
& \tbf{$\b\Delta\b\nu$}\footnote{
Bandwidth used to construct the map. As described in the text, each map is constructed
from a single frequency channel (24~kHz).
}
& \multicolumn3c{\tbf{FWHM}\footnote{
The full width at half maximum (FWHM) of the synthesized beam at the specified declination
(major axis $\times$ minor axis).
}}
&& \multicolumn2c{\tbf{Noise}\footnote{
Measured with a jackknife and splitting the data set into even- and odd-numbered
integrations. This estimate therefore includes all noise sources that act on the
timescale of a single 13~s integration (e.g., thermal, ionospheric, etc.).
}}
& \tbf{Fraction of Modes}\footnote{
Singular values of the transfer matrix compared with the value of the regularization
parameter $\varepsilon$ used while solving Equation~\ref{eq:tikhonov-solution}. As
discussed in the text, singular vectors with corresponding singular values $\sigma \ll
\sqrt{\varepsilon}$ are set to zero by the Tikhonov regularization procedure.
} \\
\cline{4-6}
\cline{8-9}
\tbf{\#}
& MHz & MHz
& $\delta=0^\circ$ & $\delta=+45^\circ$ & $\delta=+75^\circ$
&& K & mJy/beam
& with $\sigma>\sqrt{\varepsilon}$ \\
\hline
1 & 36.528 & 0.024 & $26.0'\times19.1'$ & $20.2'\times16.9'$ & $19.8'\times18.7'$ && 595. & 799. & 0.391 \\
2 & 41.760 & 0.024 & $23.3'\times17.5'$ & $18.5'\times16.0'$ & $18.3'\times17.4'$ && 541. & 824. & 0.480 \\
3 & 46.992 & 0.024 & $20.9'\times16.3'$ & $17.4'\times15.2'$ & $17.6'\times16.9'$ && 417. & 717. & 0.504 \\
4 & 52.224 & 0.024 & $18.7'\times15.2'$ & $16.2'\times15.0'$ & $16.0'\times15.8'$ && 418. & 814. & 0.535 \\
5 & 57.456 & 0.024 & $18.0'\times14.9'$ & $15.9'\times15.0'$ & $15.7'\times15.4'$ && 354. & 819. & 0.542 \\
6 & 62.688 & 0.024 & $17.8'\times15.0'$ & $15.8'\times14.9'$ & $15.7'\times15.4'$ && 309. & 843. & 0.540 \\
7 & 67.920 & 0.024 & $17.6'\times15.0'$ & $15.9'\times14.7'$ & $15.8'\times15.6'$ && 281. & 894. & 0.529 \\
8 & 73.152 & 0.024 & $18.6'\times15.1'$ & $16.8'\times14.6'$ & $16.6'\times16.1'$ && 154. & 598. & 0.512 \\
\hline \hline
\end{tabular}
\caption{A summary of the generated all-sky maps.}
\label{tab:summary}
\end{sidewaystable}
\egroup
\begin{sidewaysfigure}[p]
\centering
\includegraphics[width=\textwidth]{figures/chapter3/spw04}
\caption{
These eight panels illustrate (with a Mollweide projection and logarithmic color scale) the
eight full-sky maps generated with Tikhonov-regularized $m$-mode analysis imaging and the
OVRO-LWA. Each map covers the sky north of $\delta=-30^\circ$ with angular resolution of
$\sim15\arcmin$. Eight bright sources have been removed from each map (Cyg~A, Cas~A, Vir~A,
Tau~A, Her~A, Hya~A, 3C~123, and 3C~353). The small blank region near $l=+45.7^\circ$,
$b=-47.9^\circ$ corresponds to the location of the Sun during the observation period. A
detailed summary of the properties of each map is given in Table~\ref{tab:summary}.
}
\label{fig:channel-maps}
\end{sidewaysfigure}
\addtocounter{figure}{-1}
\begin{sidewaysfigure}[p]
\centering
\includegraphics[width=\textwidth]{figures/chapter3/spw06}
\caption{
continued
}
\end{sidewaysfigure}
\addtocounter{figure}{-1}
\begin{sidewaysfigure}[p]
\centering
\includegraphics[width=\textwidth]{figures/chapter3/spw08}
\caption{
continued
}
\end{sidewaysfigure}
\addtocounter{figure}{-1}
\begin{sidewaysfigure}[p]
\centering
\includegraphics[width=\textwidth]{figures/chapter3/spw10}
\caption{
continued
}
\end{sidewaysfigure}
\addtocounter{figure}{-1}
\begin{sidewaysfigure}[p]
\centering
\includegraphics[width=\textwidth]{figures/chapter3/spw12}
\caption{
continued
}
\end{sidewaysfigure}
\addtocounter{figure}{-1}
\begin{sidewaysfigure}[p]
\centering
\includegraphics[width=\textwidth]{figures/chapter3/spw14}
\caption{
continued
}
\end{sidewaysfigure}
\addtocounter{figure}{-1}
\begin{sidewaysfigure}[p]
\centering
\includegraphics[width=\textwidth]{figures/chapter3/spw16}
\caption{
continued
}
\end{sidewaysfigure}
\addtocounter{figure}{-1}
\begin{sidewaysfigure}[p]