Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consolidate phrasing around text data #648

Open
maelle opened this issue Mar 31, 2023 · 6 comments
Open

Consolidate phrasing around text data #648

maelle opened this issue Mar 31, 2023 · 6 comments
Assignees
Milestone

Comments

@maelle
Copy link
Member

maelle commented Mar 31, 2023

Caught by @eliocamp @paocorrales @yabellini

In the dev guide we have "text data" https://devdevguide.netlify.app/softwarereview_policies.html#package-categories

In the submission template we have "text analysis" https://github.com/ropensci/software-review/blob/main/.github/ISSUE_TEMPLATE/A-submit-software-for-review.md

What should it actually be? "text data analysis"? @noamross

cc @mpadge

@mpadge
Copy link
Member

mpadge commented Mar 31, 2023

My English grammar dictator would say, "Analysis of textual data," but that's clunky, so I'd tend for simply "text analysis" in both.

@noamross
Copy link
Contributor

I actually think that we should remove the geospatial and text data categories. Both were experimental built around specific interest groups that we had explicit support for and had active folks helping shepherd (Scott building packages for web geospatial formats, Lincoln and others in the R text analysis working group, which is no longer a thing). Thoughts, @ropensci/editors?

@yabellini
Copy link
Contributor

Some of the projects of our champions come from the geospatial world.

@noamross
Copy link
Contributor

Are they general geospatial data type manipulation or about access to or processing of geospatial data sets? The latter would definitely still be in scope. This is less of an issue in any case for geospatial than for text, which we always described as a pilot.

@yabellini
Copy link
Contributor

Both: one is an extension to a rgee package for accessing and processing GEE API from R. The other uses some geospatial dataset and processes that info (weather data, survey data)

@tdhock
Copy link

tdhock commented Mar 31, 2023

In machine learning we talk about "natural language processing" which is a kind of text data analysis.

@maelle maelle added this to the 1.0.0 milestone Mar 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants