Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

3.1.1 Localities standardization #37

Closed
RicardoOrtizG opened this issue Oct 30, 2020 · 4 comments
Closed

3.1.1 Localities standardization #37

RicardoOrtizG opened this issue Oct 30, 2020 · 4 comments
Assignees

Comments

@RicardoOrtizG
Copy link

In section 2.2 localities there are some good guidelines for documentation of localities, but I see these are emphasized on the field documentation or digitalization to a database, I think that there is no recommendation for the standardization of localities for the georeferencing process. From SiB Colombia we divide the general georeference workflow into two phases, localities standardization and georeference perse (adding coordinates), the first one allows us to reduce the number of localities to georeference and to integrate localities from different sources in a georeference project, in this process some misspelling, redundancy, and wrong documentation are corrected in order to get a later good classification for localities.

For example, localities documented only with county information is adjusted, taken the values to the respective DwC element, and the locality leaves empty. With tools like OpenRefine we look for clusters of the same locality described in different ways or maybe with some misspelling errors and correct them.

¿There is a reason to consider this is no longer necessary or efficient the localities standardization to the georeference process?, otherwise, I suggest to include some recommendations in the document could be helpful, we have documented some here

@ArthurChapman
Copy link
Collaborator

I think we have covered this in §3.1 Georeferencing process and specifically in §3.1.1. In §2.2 we are describing how to record and document a locality - the georeferencing process is covered later in the document.

@tucotuco
Copy link
Collaborator

tucotuco commented Nov 23, 2020 via email

@tucotuco tucotuco self-assigned this Nov 28, 2020
@tucotuco tucotuco changed the title Localities standardization 3.1.1 Localities standardization Nov 29, 2020
tucotuco added a commit that referenced this issue Dec 29, 2020
Addresses #37 except for the reference, forthcoming.
@tucotuco
Copy link
Collaborator

Addressed in a979e8e and 5295ad5

@RicardoOrtizG
Copy link
Author

I'm very sorry to answer so late, unfortunately in the process we made in the past we didn´t take time of georeferencing with and without locality standardization, but for example, with a process like clusters in OpenRefine, the number of unique localities definitely reduces. The same happens when the localities were cleaned by higher geography, elevation, or even coordinates information that should not be in the locality. I understand the decision that John takes, but I want to know what kind of information could confirm that?, maybe I have it but need to processit. I 'm thinking in some examples of number of unique localities before and after the standardization. That could work?

And I wonder if you can change the citation to SiB Colombia and not SIB Colombia. Thanks a lot!.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants