Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VSO search for STEREO COR2 data does not return all available observations #7439

Open
AthKouloumvakos opened this issue Feb 8, 2024 · 4 comments
Labels
Bug Probably a bug net Affects the net submodule Remote Server Issue / VSO An issue that is caused by unexpected or incorrect behaviour on a VSO server.

Comments

@AthKouloumvakos
Copy link

Describe the bug

The issue is that the VSO search for STEREO COR2 data does not always return all the Total Brigtnessc (TB) COR2 fits files that are available on the STEREO site. STEREO SECCHI Level 0.5 FITS files for the two spacecraft are provided from https://stereo-ssc.nascom.nasa.gov/data/ins_data/secchi/L0/ and are organized into three directories: the \cal that contains the Calibration files, the \img that contains the “Brightness" images for EUVI, HI, COR instruments and the, \seq that contains the Polarized (PB) images for COR1 or COR2. Additionally, the \summary directory provides summary text files of the available data for each month and instrument. Making some tests it seems that for many days the TB COR2 images are completely missing from the search results.

To Reproduce

A simple test for two selected dates:

A) An example of a date that COR2 data are missing:

import astropy.units as u
import sunpy.map
from sunpy.net import Fido
from sunpy.net import attrs as a

time_range = a.Time('2020/03/08 00:00:00', '2020/03/09 00:00:00')
result = Fido.search(time_range, a.Source('STEREO_A'), a.Detector.cor2)

files = Fido.fetch(result)

my_map = sunpy.map.Map(files, sequence=True) 

polar, TB_files = [], []
for map_ in my_map:
    polar.append(map_.meta['polar'])
    if map_.meta['polar'] == 1001:
        TB_files.append(map_.meta['filename'])

count_TB = polar.count(1001)  # This 
count_PBs = len(polar) - count_TB

print(result)
print(f'Total brightness images {count_TB}')
print(f'Total brightness Filenames {TB_files}')
print(f'TPolarized Brightness {count_PBs}')

The result from the above is the following:

Results from 1 Provider:

72 Results from the VSOClient:
Source: http://vso.stanford.edu/cgi-bin/search
Total estimated size: 606.528 Mbyte

       Start Time               End Time         Source  ... Extent Type   Size 
                                                         ...              Mibyte
----------------------- ----------------------- -------- ... ----------- -------
2020-03-08 00:08:15.000 2020-03-08 00:08:37.000 STEREO_A ...      CORONA 8.03375
2020-03-08 00:08:45.000 2020-03-08 00:09:07.000 STEREO_A ...      CORONA 8.03375
2020-03-08 00:09:15.000 2020-03-08 00:09:37.000 STEREO_A ...      CORONA 8.03375
2020-03-08 01:08:15.000 2020-03-08 01:08:36.000 STEREO_A ...      CORONA 8.03375
2020-03-08 01:08:45.000 2020-03-08 01:09:07.000 STEREO_A ...      CORONA 8.03375
2020-03-08 01:09:15.000 2020-03-08 01:09:37.000 STEREO_A ...      CORONA 8.03375
2020-03-08 02:08:15.000 2020-03-08 02:08:37.000 STEREO_A ...      CORONA 8.03375
2020-03-08 02:08:45.000 2020-03-08 02:09:07.000 STEREO_A ...      CORONA 8.03375
2020-03-08 02:09:17.000 2020-03-08 02:09:37.000 STEREO_A ...      CORONA 8.03375
2020-03-08 03:08:15.000 2020-03-08 03:08:37.000 STEREO_A ...      CORONA 8.03375
                    ...                     ...      ... ...         ...     ...
2020-03-08 20:09:17.000 2020-03-08 20:09:37.000 STEREO_A ...      CORONA 8.03375
2020-03-08 21:08:15.000 2020-03-08 21:08:37.000 STEREO_A ...      CORONA 8.03375
2020-03-08 21:08:45.000 2020-03-08 21:09:07.000 STEREO_A ...      CORONA 8.03375
2020-03-08 21:09:15.000 2020-03-08 21:09:37.000 STEREO_A ...      CORONA 8.03375
2020-03-08 22:08:15.000 2020-03-08 22:08:37.000 STEREO_A ...      CORONA 8.03375
...
Total brightness images 0
Total brightness Filenames []
Polarized Brightness images 72

So for this specific date no TB images were found but according to the summary file https://stereo-ssc.nascom.nasa.gov/data/ins_data/secchi/L0/a/summary/sccA202003.img.c2 the availiable TB images for that date are the following:

       FileName                  DateObs          Tel  Exptime Xsize  Ysize  Filter  Polar   Prog   OSnum   Dest   FPS    LED     CMPRS   NMISS
===============================================================================================================================================
20200308_000800_d4c2A.fts | 2020/03/08 00:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_002400_d4c2A.fts | 2020/03/08 00:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_003900_d4c2A.fts | 2020/03/08 00:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_005400_d4c2A.fts | 2020/03/08 00:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_010100_d4c2A.fts | 2020/03/08 01:01:00 | COR2 |  36.0| 2048 | 2048 |  None |  1001 | Doub | 2142 | SSR1 |     |   None |  ICER3 |    0
20200308_010800_d4c2A.fts | 2020/03/08 01:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_012400_d4c2A.fts | 2020/03/08 01:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_013900_d4c2A.fts | 2020/03/08 01:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_015400_d4c2A.fts | 2020/03/08 01:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_020800_d4c2A.fts | 2020/03/08 02:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_022400_d4c2A.fts | 2020/03/08 02:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_023900_d4c2A.fts | 2020/03/08 02:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_025400_d4c2A.fts | 2020/03/08 02:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_030800_d4c2A.fts | 2020/03/08 03:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_032400_d4c2A.fts | 2020/03/08 03:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_033900_d4c2A.fts | 2020/03/08 03:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_035400_d4c2A.fts | 2020/03/08 03:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_040800_d4c2A.fts | 2020/03/08 04:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_042400_d4c2A.fts | 2020/03/08 04:23:59 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_043900_d4c2A.fts | 2020/03/08 04:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_045400_d4c2A.fts | 2020/03/08 04:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_050800_d4c2A.fts | 2020/03/08 05:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_052400_d4c2A.fts | 2020/03/08 05:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_053900_d4c2A.fts | 2020/03/08 05:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_055400_d4c2A.fts | 2020/03/08 05:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_060800_d4c2A.fts | 2020/03/08 06:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_062400_d4c2A.fts | 2020/03/08 06:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_063900_d4c2A.fts | 2020/03/08 06:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_065400_d4c2A.fts | 2020/03/08 06:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_070800_d4c2A.fts | 2020/03/08 07:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_072400_d4c2A.fts | 2020/03/08 07:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_073900_d4c2A.fts | 2020/03/08 07:38:59 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_075400_d4c2A.fts | 2020/03/08 07:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_080800_d4c2A.fts | 2020/03/08 08:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_082400_d4c2A.fts | 2020/03/08 08:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_083900_d4c2A.fts | 2020/03/08 08:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_085400_d4c2A.fts | 2020/03/08 08:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_090800_d4c2A.fts | 2020/03/08 09:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_092400_d4c2A.fts | 2020/03/08 09:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_093900_d4c2A.fts | 2020/03/08 09:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_095400_d4c2A.fts | 2020/03/08 09:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_100800_d4c2A.fts | 2020/03/08 10:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_102400_d4c2A.fts | 2020/03/08 10:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_103900_d4c2A.fts | 2020/03/08 10:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_105400_d4c2A.fts | 2020/03/08 10:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_110800_d4c2A.fts | 2020/03/08 11:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_112400_d4c2A.fts | 2020/03/08 11:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_113900_d4c2A.fts | 2020/03/08 11:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_115400_d4c2A.fts | 2020/03/08 11:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_120800_d4c2A.fts | 2020/03/08 12:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_122400_d4c2A.fts | 2020/03/08 12:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_123900_d4c2A.fts | 2020/03/08 12:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_125400_d4c2A.fts | 2020/03/08 12:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_130800_d4c2A.fts | 2020/03/08 13:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_132400_d4c2A.fts | 2020/03/08 13:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_133900_d4c2A.fts | 2020/03/08 13:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_135400_d4c2A.fts | 2020/03/08 13:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_140800_d4c2A.fts | 2020/03/08 14:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_142400_d4c2A.fts | 2020/03/08 14:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_143900_d4c2A.fts | 2020/03/08 14:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_145400_d4c2A.fts | 2020/03/08 14:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_150800_d4c2A.fts | 2020/03/08 15:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_152400_d4c2A.fts | 2020/03/08 15:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_153900_d4c2A.fts | 2020/03/08 15:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_155400_d4c2A.fts | 2020/03/08 15:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_160800_d4c2A.fts | 2020/03/08 16:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_162400_d4c2A.fts | 2020/03/08 16:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_163900_d4c2A.fts | 2020/03/08 16:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_165400_d4c2A.fts | 2020/03/08 16:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_170800_d4c2A.fts | 2020/03/08 17:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_172400_d4c2A.fts | 2020/03/08 17:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_173900_d4c2A.fts | 2020/03/08 17:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_175400_d4c2A.fts | 2020/03/08 17:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_180800_d4c2A.fts | 2020/03/08 18:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_182400_d4c2A.fts | 2020/03/08 18:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_183900_d4c2A.fts | 2020/03/08 18:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_185400_d4c2A.fts | 2020/03/08 18:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_190800_d4c2A.fts | 2020/03/08 19:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_192400_d4c2A.fts | 2020/03/08 19:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_193900_d4c2A.fts | 2020/03/08 19:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_195400_d4c2A.fts | 2020/03/08 19:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_200800_d4c2A.fts | 2020/03/08 20:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_202400_d4c2A.fts | 2020/03/08 20:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_203900_d4c2A.fts | 2020/03/08 20:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_205400_d4c2A.fts | 2020/03/08 20:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_210800_d4c2A.fts | 2020/03/08 21:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_212400_d4c2A.fts | 2020/03/08 21:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_213900_d4c2A.fts | 2020/03/08 21:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_215400_d4c2A.fts | 2020/03/08 21:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_220800_d4c2A.fts | 2020/03/08 22:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_222400_d4c2A.fts | 2020/03/08 22:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_223900_d4c2A.fts | 2020/03/08 22:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_225400_d4c2A.fts | 2020/03/08 22:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_230800_d4c2A.fts | 2020/03/08 23:08:00 | COR2 |   2.0|  256 |  256 |  None |  1001 | Doub | 1921 | SSR1 |     |   None |  ICER9 |    0
20200308_232400_d4c2A.fts | 2020/03/08 23:24:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0
20200308_233900_d4c2A.fts | 2020/03/08 23:39:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1693 | SSR1 |     |   None |  ICER3 |    0
20200308_235400_d4c2A.fts | 2020/03/08 23:54:00 | COR2 |   6.0| 2048 | 2048 |  None |  1001 | Doub | 1694 | SSR1 |     |   None |  ICER3 |    0

B) An example of a date that COR2 data are all availiable:

Doing the same for:

time_range = a.Time('2020/03/06 00:00:00', '2020/03/07 00:00:00')

we get:

Total brightness images 97
Total brightness Filenames ['20200306_000800_d4c2A.fts', '20200306_002400_d4c2A.fts', '20200306_003900_d4c2A.fts', '20200306_005400_d4c2A.fts', '20200306_010100_d4c2A.fts', '20200306_010800_d4c2A.fts', '20200306_012400_d4c2A.fts', '20200306_013900_d4c2A.fts', '20200306_015400_d4c2A.fts', '20200306_020800_d4c2A.fts', '20200306_022400_d4c2A.fts', '20200306_023900_d4c2A.fts', '20200306_025400_d4c2A.fts', '20200306_030800_d4c2A.fts', '20200306_032400_d4c2A.fts', '20200306_033900_d4c2A.fts', '20200306_035400_d4c2A.fts', '20200306_040800_d4c2A.fts', '20200306_042400_d4c2A.fts', '20200306_043900_d4c2A.fts', '20200306_045400_d4c2A.fts', '20200306_050800_d4c2A.fts', '20200306_052400_d4c2A.fts', '20200306_053900_d4c2A.fts', '20200306_055400_d4c2A.fts', '20200306_060800_d4c2A.fts', '20200306_062400_d4c2A.fts', '20200306_063900_d4c2A.fts', '20200306_065400_d4c2A.fts', '20200306_070800_d4c2A.fts', '20200306_072400_d4c2A.fts', '20200306_073900_d4c2A.fts', '20200306_075400_d4c2A.fts', '20200306_080800_d4c2A.fts', '20200306_082400_d4c2A.fts', '20200306_083900_d4c2A.fts', '20200306_085400_d4c2A.fts', '20200306_090800_d4c2A.fts', '20200306_092400_d4c2A.fts', '20200306_093900_d4c2A.fts', '20200306_095400_d4c2A.fts', '20200306_100800_d4c2A.fts', '20200306_102400_d4c2A.fts', '20200306_103900_d4c2A.fts', '20200306_105400_d4c2A.fts', '20200306_110800_d4c2A.fts', '20200306_112400_d4c2A.fts', '20200306_113900_d4c2A.fts', '20200306_115400_d4c2A.fts', '20200306_120800_d4c2A.fts', '20200306_122400_d4c2A.fts', '20200306_123900_d4c2A.fts', '20200306_125400_d4c2A.fts', '20200306_130800_d4c2A.fts', '20200306_132400_d4c2A.fts', '20200306_133900_d4c2A.fts', '20200306_135400_d4c2A.fts', '20200306_140800_d4c2A.fts', '20200306_142400_d4c2A.fts', '20200306_143900_d4c2A.fts', '20200306_145400_d4c2A.fts', '20200306_150800_d4c2A.fts', '20200306_152400_d4c2A.fts', '20200306_153900_d4c2A.fts', '20200306_155400_d4c2A.fts', '20200306_160800_d4c2A.fts', '20200306_162400_d4c2A.fts', '20200306_163900_d4c2A.fts', '20200306_165400_d4c2A.fts', '20200306_170800_d4c2A.fts', '20200306_172400_d4c2A.fts', '20200306_173900_d4c2A.fts', '20200306_175400_d4c2A.fts', '20200306_180800_d4c2A.fts', '20200306_182400_d4c2A.fts', '20200306_183900_d4c2A.fts', '20200306_185400_d4c2A.fts', '20200306_190800_d4c2A.fts', '20200306_192400_d4c2A.fts', '20200306_193900_d4c2A.fts', '20200306_195400_d4c2A.fts', '20200306_200800_d4c2A.fts', '20200306_202400_d4c2A.fts', '20200306_203900_d4c2A.fts', '20200306_205400_d4c2A.fts', '20200306_210800_d4c2A.fts', '20200306_212400_d4c2A.fts', '20200306_213900_d4c2A.fts', '20200306_215400_d4c2A.fts', '20200306_220800_d4c2A.fts', '20200306_222400_d4c2A.fts', '20200306_223900_d4c2A.fts', '20200306_225400_d4c2A.fts', '20200306_230800_d4c2A.fts', '20200306_232400_d4c2A.fts', '20200306_233900_d4c2A.fts', '20200306_235400_d4c2A.fts']
Polarized Brightness images 72

This result maches to all the availiable TB images from https://stereo-ssc.nascom.nasa.gov/data/ins_data/secchi/L0/a/summary/sccA202003.img.c2.

A more advanced test:

In the example below I am trying to find out for which dates during March 2022 there are data missing and how many images every month during 2020 are missing.

Some funcions first:

import pandas as pd
from sunpy.time import TimeRange
from datetime import datetime, timedelta

def get_files_from_ssc_summary(time):
    summary_url = r'https://stereo-ssc.nascom.nasa.gov/data/ins_data/secchi/L0/a/summary/sccA%Y%m.img.c2'

    tr = TimeRange(time[0], time[1])
    
    for times in pd.date_range(tr.start.strftime('%Y-%m-%d %H:%M:%S'), tr.end.strftime('%Y-%m-%d %H:%M:%S')):
        
        times = pd.date_range(tr.start.strftime('%Y-%m-%d %H:%M:%S'), tr.end.strftime('%Y-%m-%d %H:%M:%S'))
        
        summary_url_ = times.strftime(summary_url)  # sccA201105.img.h1

        data = pd.read_fwf(summary_url_[0],
                            header=None, colspecs=([0, 25], [28, 47], [50, 54], [108, 112]), skiprows=2,
                            names=['filename', 'date', 'telescope', 'Dest'])

        data = data.set_index([pd.to_datetime(data.iloc[:, 1], format='%Y/%m/%d %H:%M:%S')])
        data = data[(data.index >= tr.start.strftime('%Y-%m-%d %H:%M:%S')) &
                    (data.index <= tr.end.strftime('%Y-%m-%d %H:%M:%S'))]

        filenames_scc = []
        for f in data.itertuples():
            if f.Dest=='SSR1':
                filenames_scc.append(f.filename)

        return filenames_scc

def get_files_from_vso(time):
        time_range = a.Time(time[0], time[1])
        result = Fido.search(time_range, a.Source('STEREO_A'), a.Detector.cor2)

        filenames_vso = []
        for f in result['vso']:
            filenames_vso.append(f['fileid'].split('/')[-1])
        return filenames_vso

Then do a VSO search of data for a full month and compare them with the availiable files at SECCHI site:

start_date = datetime(2020, 3, 1, 0, 0, 0)
end_date = datetime(2020, 3, 31, 23, 59, 59)

time_intervals = []

date_format = '%Y/%m/%d %H:%M:%S'

# Iterate through each day in the month and generate the time intervals
current_date = start_date
while current_date <= end_date:
    end_of_day = current_date + timedelta(days=1) - timedelta(seconds=1)

    time_intervals.append(
        (
            current_date.strftime(date_format),
            end_of_day.strftime(date_format)
        )
    )

    current_date += timedelta(days=1)

# Print the result
for t_0, t_1 in time_intervals:
    filenames_scc = get_files_from_ssc_summary([t_0, t_1])
    filenames_vso = get_files_from_vso([t_0, t_1])
    i = 0
    for filenames_scc_ in filenames_scc:
        if filenames_scc_ in filenames_vso:
            i += 1
    print(f'Total brightness images missing(availiable) at VSO from {t_0} to {t_1}: {len(filenames_scc) - i} ({i})')

This gives:

Total brightness images missing(availiable) at VSO from 2020/03/02 00:00:00 to 2020/03/02 23:59:59: 0 (83)
Total brightness images missing(availiable) at VSO from 2020/03/03 00:00:00 to 2020/03/03 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/04 00:00:00 to 2020/03/04 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/05 00:00:00 to 2020/03/05 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/06 00:00:00 to 2020/03/06 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/07 00:00:00 to 2020/03/07 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/08 00:00:00 to 2020/03/08 23:59:59: 97 (0)
Total brightness images missing(availiable) at VSO from 2020/03/09 00:00:00 to 2020/03/09 23:59:59: 97 (0)
Total brightness images missing(availiable) at VSO from 2020/03/10 00:00:00 to 2020/03/10 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/11 00:00:00 to 2020/03/11 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/12 00:00:00 to 2020/03/12 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/13 00:00:00 to 2020/03/13 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/14 00:00:00 to 2020/03/14 23:59:59: 97 (0)
Total brightness images missing(availiable) at VSO from 2020/03/15 00:00:00 to 2020/03/15 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/16 00:00:00 to 2020/03/16 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/17 00:00:00 to 2020/03/17 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/18 00:00:00 to 2020/03/18 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/19 00:00:00 to 2020/03/19 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/20 00:00:00 to 2020/03/20 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/21 00:00:00 to 2020/03/21 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/22 00:00:00 to 2020/03/22 23:59:59: 97 (0)
Total brightness images missing(availiable) at VSO from 2020/03/23 00:00:00 to 2020/03/23 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/24 00:00:00 to 2020/03/24 23:59:59: 97 (0)
Total brightness images missing(availiable) at VSO from 2020/03/25 00:00:00 to 2020/03/25 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/26 00:00:00 to 2020/03/26 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/27 00:00:00 to 2020/03/27 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/28 00:00:00 to 2020/03/28 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/29 00:00:00 to 2020/03/29 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/30 00:00:00 to 2020/03/30 23:59:59: 0 (97)
Total brightness images missing(availiable) at VSO from 2020/03/31 00:00:00 to 2020/03/31 23:59:59: 0 (97)

There are days where no TB data returned at all, see for example the 2020/03/08.

Repeating the same for the full 2020 year in a mothly basis:

start_date = datetime(2020, 1, 1, 0, 0, 0)
end_date = datetime(2020, 12, 31, 23, 59, 59)

time_intervals = []

date_format = '%Y/%m/%d %H:%M:%S'

# Iterate through each month and generate the time intervals
current_date = start_date
while current_date <= end_date:
    end_of_month = (current_date.replace(day=1) + timedelta(days=32)).replace(day=1) - timedelta(days=1)

    time_intervals.append(
        (
            current_date.strftime(date_format),
            end_of_month.strftime(date_format)
        )
    )

    current_date = end_of_month + timedelta(days=1)

# Print the result
for t_0, t_1 in time_intervals:
    filenames_scc = get_files_from_ssc_summary([t_0, t_1])
    filenames_vso = get_files_from_vso([t_0, t_1])
    i = 0
    for filenames_scc_ in filenames_scc:
        if filenames_scc_ in filenames_vso:
            i += 1
    print(f'Total brightness images missing(availiable) at VSO from {t_0} to {t_1}: {len(filenames_scc) - i} ({i})')

This gives:

Total brightness images missing from VSO for 2020/01/01 00:00:00: 1092
Total brightness images missing from VSO for 2020/02/01 00:00:00: 489
Total brightness images missing from VSO for 2020/03/01 00:00:00: 807
Total brightness images missing from VSO for 2020/04/01 00:00:00: 598
Total brightness images missing from VSO for 2020/05/01 00:00:00: 812
Total brightness images missing from VSO for 2020/06/01 00:00:00: 2118
Total brightness images missing from VSO for 2020/07/01 00:00:00: 763
Total brightness images missing from VSO for 2020/08/01 00:00:00: 796
Total brightness images missing from VSO for 2020/09/01 00:00:00: 2071
Total brightness images missing from VSO for 2020/10/01 00:00:00: 737
Total brightness images missing from VSO for 2020/11/01 00:00:00: 1188
Total brightness images missing from VSO for 2020/12/01 00:00:00: 773

Screenshots

No response

System Details

  • OS: Mac OS 13.6.3
  • Arch: 64bit, (i386)
  • sunpy: Tested in 5.1.1 and 4.1.5
  • astropy: 5.3.2
  • numpy: 1.22.4
  • packaging: 23.2
  • parfive: 2.0.2

Installation method

pip

@nabobalis nabobalis added Bug Probably a bug net Affects the net submodule labels Feb 8, 2024
@nabobalis
Copy link
Contributor

Thank you for the detailed report. Can you try the web UI for the VSO and see if those return the files?

If that does, then we know its a bug in our own VSOClient or if we need to ask the VSO team themselves for advice.

@ejm4567
Copy link
Contributor

ejm4567 commented Feb 8, 2024

Yes, thanks for the detailed report. I've been looking into this issue, I can say it's an issue with the ingest script that adds SECCHI/COR2 data to the DB that the VSO uses for searching.

Also, the issue is broader than just 2020.
For example on 7/5/22 there are 169 files on disc, yet the DB has 249:

Match count = 105 DB only = 144 Disc only = 64 Total(DB) = 249/249 Total(disc) = 169/169

So some files are only on disc, others are only in the DB, and 105 are on both the disc and the DB.
I'm working now on determining the extent of the issue and what fixes are needed in the ingest script.

@AthKouloumvakos
Copy link
Author

No problem and thank you also for looking at this.

Also, the issue is broader than just 2020.

Yes, it seems that it is a broader issue. I had time to check 2020 and 2021. Almost 1/3 of the days for 2020 (133/365) are impacted and no TB images are available. For example:

Screenshot 2024-02-08 at 10 05 57 PM

For 2021 I find a similar situation and numbers. Note that I only searched this for TB images so I am not sure what happens to the PB images, if these files are all availiable or not.

So some files are only on disc, others are only in the DB, and 105 are on both the disc and the DB.

This is definitely an issue.

For 7/5/22 (I assumed this is July 5, 2022) what things should be according to the summary files is the following:

In sccA202207.img.c2 (the file that provides the TB images) there are: 169 files of which = 97 SSR1 + 72 SW
and in sccA202207.seq.c2 (PB images) there are: 144 flies of which = 72 SSR1 + 72 SW
Note that: SSR1=synoptic program, SW=Spaceweather/Beacon

Since VSO provides (I think) both TB and PB images at the same search and (probably) only the synoptic program (SSR1) images (since the SW are low-res and cannot be used scientifically) then for this date the VSO search should give

97 TB SSR1 + 72 PB SSR1 images = 169 images

I hope this helps.

@ejm4567
Copy link
Contributor

ejm4567 commented Mar 8, 2024

I've gotten to the bottom of the missing COR2 files. The first issue, as you've pointed out, is that the VSO only serves out TB, PB and SSR1 images, and filters out the SW (beacon) files.

The second is there was an error in the parsing script, most likely due to a change in the format of the log files from NRL that are parsed. The result was files were skipped and marked as MISSING.

I've got a fix in for the RE so going forward from 3/6/24 you should see the correct no. of COR2 files returned by the VSO.
I am working now on generalizing the parsing script to go back and sweep into the DB the remaining MISSING files.

I'll update here when that job is completed.

@nabobalis nabobalis added the Remote Server Issue / VSO An issue that is caused by unexpected or incorrect behaviour on a VSO server. label Mar 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Bug Probably a bug net Affects the net submodule Remote Server Issue / VSO An issue that is caused by unexpected or incorrect behaviour on a VSO server.
Projects
None yet
Development

No branches or pull requests

3 participants