-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Review trips: platform_code (now platform_name) #17
Comments
The first question is answered in #12 (comment), where it is suggested to use ShipC. I will investigate how easy it is to map these. |
Summary of mapping:
Questions:
|
@peterdesmet you should use ICES platform code as the identifier/key in your system and only use an interger virtual key internally in your database, if you find that useful. The platform name is not unique cannot as such be used as an identifier of a given platform. The ICES platform request system is an international colaboration used globally. As seen at http://vocab.ices.dk/request, you can contact accessions@ices.dk if you want access to the platform request application where you can search for and request new codes for platform not allready in the system. All you need to do to request a code for a new platform is to provide enough metadata to identify the platform uniquely. Then a data manager will validate your information and assign a code. The code will be the key for all future references across the globe for the platform in question. No need to reinvent the whell here. |
just to clarify, these are not duplicates; the governance model distinguishes between instances of a vessel/hull. So although the name is not unique, the combination of key attributes will be - in most cases the commission date/decommission date are the defining instances of a vessel (platform code) with the same name/call sign |
Quite unlikely I am afraid... I have received (some) lookup tables from
JNCC, and in case of platform_code the missing values are simply not there.
Not sure how this is possible, but the actual original ESAS database is
somehow locked and Mark Lewis can't reach it. This should be solved soon,
and maybe there is some additional information there, but I would not count
on it.
Nicolas Vanermen
*Wetenschappelijk medewerker*
Instituut voor Natuur- en Bosonderzoek
Havenlaan 88, bus 73, 1000 Brussel
0486/361.582
…On Wed, Jan 6, 2021 at 12:26 PM Peter Desmet ***@***.***> wrote:
trips: platform_code indicates the platform code (ship call sign, etc)
with an integer code:
https://github.com/ices-tools-dev/esas/blob/0d294b4ea678e34f52409a16602f9133ac179dcf/_data/table-schemas/trips.yaml#L119-L130
Values are available at here
<https://github.com/ices-tools-dev/esas/blob/main/_data/vocabularies/platform_code.tsv>
.
- @ices-tools-dev/data-and-information
<https://github.com/orgs/ices-tools-dev/teams/data-and-information>
Does ICES already have a controlled vocabulary for this?
- @nicolasvanermen <https://github.com/nicolasvanermen> I notice a lot
of unknown values. How likely is it that these well eventually get meaning?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#17>, or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABYKGXN6ZFZBG2VMY5SK4G3SYRCFVANCNFSM4VXM3QSA>
.
|
@HjalteParner @neil-ices-dk thanks, then the core issue is: given that we a historic database that only contains platform/ship names such as |
In case of ter streep, the code 2465 is not used in the database. The same
goes for other obvious duplicates such as prins filip and prins albert. Can
you supply me with a list of duplicates?
Nicolas Vanermen
*Wetenschappelijk medewerker*
Instituut voor Natuur- en Bosonderzoek
Havenlaan 88, bus 73, 1000 Brussel
0486/361.582
…On Wed, Jan 6, 2021 at 6:42 PM Peter Desmet ***@***.***> wrote:
Summary of mapping:
- There are 458 platform_codes in total
<https://github.com/ices-tools-dev/esas/blob/main/_data/vocabularies/platform_code.tsv>
- 93 (20%) can be dumb mapped on name, but part of those are unique
values
- 50 (11%) are Temp x or UNKNOWN
- 315 (69%) cannot be dumb mapped on name.
Questions:
- @nicolasvanermen <https://github.com/nicolasvanermen> I notice some
duplicate values in the ESAS list, such as 11009 and 2465 for Ter
Streep and likely duplicates: Eldjan vs Eldjarn I guess we should
merge this if we adopt SHIPC codes? It would reduce the list by 29 values
- @ices-tools-dev/data-and-information
<https://github.com/orgs/ices-tools-dev/teams/data-and-information> I
notice many duplicate values in the SHIPC list, such as 14AT, 32A7,
572N, 90A7, CUAN for Antares. How should we differentiate between
those?
- @ices-tools-dev/data-and-information
<https://github.com/orgs/ices-tools-dev/teams/data-and-information> I
notice many values for UNKNOWN (like we have too). How can we assign
these correctly?
- @ices-tools-dev/data-and-information
<https://github.com/orgs/ices-tools-dev/teams/data-and-information>
How would we add the 315 unmapped ships? How are codes assigned?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#17 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABYKGXLVSOFLUXEVCFFH2I3SYSOJ5ANCNFSM4VXM3QSA>
.
|
we've actually done a similar exercise for our trawl survey datasets (@Osanna123) will remember this fondly; in that case we primarily looked at the date ranges of the data that related to the vessel name to map it to probable instance(s) of the vessel in the platform codes. |
'UNKNOWN' platforms can be mapped to the AAxx codes or the ZZ99 in the SHIPC list.
This would have to be a separate exercise, any additional info would be useful, at least years with data |
Discussed this with @nicolasvanermen and @EricStienen. Decided to make this a purely informal field named
@nicolasvanermen in your export:
|
The field is not mandatory, but if the field is to be 'informal', it's better to move it to notes. |
List of AA-codes: |
I have now reviewed the whole mapping list. To be discussed what is the best approach to move forward. |
Decisions after May 11 meeting with @Osanna123 and @nicolasvanermen:
|
Some codes that are not yet mapped (do not have
@Osanna123 without truly knowing how much work is involved, >=2010 and >100 trips seem reasonable: 59 codes to add |
OK, thanks! But I don't really get how the 'no filter' row and columns have
less ships compared to their filtered analogues.
Nicolas Vanermen
*Wetenschappelijk medewerker*
Instituut voor Natuur- en Bosonderzoek
Havenlaan 88, bus 73, 1000 Brussel
0486/361.582
…On Tue, May 11, 2021 at 1:49 PM Peter Desmet ***@***.***> wrote:
Some codes that are not yet mapped (do not have ok) do seem to be used a
lot or recently, making them valuable candidates to be added to SHIPC. Here
are the number of those codes based on the *use threshold* (numbers
created withwith OR, e.g. >=2000 OR >=20 trips):
threshold no filter >=20 trips >=50 trips >=100 trips
*no filter* 281 95 47 28
*>=2000* 95 145 114 103
*>=2005* 89 130 93 79
*>=2010* 39 119 76 59
*>=2015* 15 108 62 43
@Osanna123 <https://github.com/Osanna123> without truly knowing how much
work is involved, >=2010 and >100 trips seem reasonable: 59 codes to add
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#17 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ABYKGXKCN4JGGWOU3HBVWSLTNEKTHANCNFSM4VXM3QSA>
.
|
"No filter" is maybe a confusing term, but read |
|
trips: platform_code
indicates the platform code (ship call sign, etc) with an integer code:esas/_data/table-schemas/trips.yaml
Lines 119 to 130 in 0d294b4
Values are available at here.
The text was updated successfully, but these errors were encountered: