syncer/ceos.pm -- Synchronize CEOS data with the GCIS
./sync.pl --syncer=ceos --url=https://data-stage.globalchange.gov
The CEOS syncer pulls platform, instrument and mission data using the "download to excel" function available from http://database.eohandbook.com.
It pulls missions, instruments and agencies. It ingests missions and instruments, as platforms and instruments. It also associates missions and instruments with GCIS organizations, using the CEOS agencies.
The CEOS syncer uses the ceos lexicon to translate identifiers within CEOS to GCIDs.
Platforms are extracted from http://database.eohandbook.com/database/missiontable.aspx.
Platforms are excluded if their status is not "mission complete" or "currently being flown".
The numeric CEOS identifier for a platform is used to identify the GCID for a platform. The short name is stored as an alternative identifier.
The following fields in GCIS are updated based on the CEOS information :
name, url, platform_type_identifier, start_date, end_date
Instruments are extracted from http://database.eohandbook.com/database/instrumenttable.aspx. Instruments are associated with platforms using the relationships described by CEOS. The association of a platform and an instrument is GCIS is called an "instrument instance", whereas an "instrument" refers to a class of instruments.
Instruments are included unless their status is "proposed", "being developed" or "no longer considered".
The numeric CEOS identifier for an instrument is used to identify the instrument. The short name is stored as an alternative identifier.
The following fields in GCIS are updated based on the CEOS information :
name, description (only if one does not exist), description_attribution
The description is generated from the "instrument technology" field.
The CEOS agencies are only used if an entry for an agency exists in the GCIS.
Agencies are associated with instruments or platforms using the role "contributor".