Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Plants dataset #34

Closed
bavla opened this issue Mar 20, 2023 · 2 comments
Closed

Plants dataset #34

bavla opened this issue Mar 20, 2023 · 2 comments
Labels
question Question for the curators

Comments

@bavla
Copy link

bavla commented Mar 20, 2023

I came across the dataset Plants (https://archive-beta.ics.uci.edu/dataset/180/plants). When converting it into a two-mode network, I noticed that it contains an unknown state "gl" - is gl=dengl ? pe, I guess, is the Prince Edward Island?
The dataset description says Number of Instances: 22632, but there are 34781 lines!!??

vladimir.batagelj@fmf.uni-lj.si

@ap0nia ap0nia added the question Question for the curators label Jun 8, 2023
@ap0nia
Copy link

ap0nia commented Jun 8, 2023

@uci-ml-repo/curators

@rlongjohn
Copy link

Thank you for your questions! The number of instances for this dataset has been corrected to 34,781. As for your questions about the location abbreviations, I think your guesses for the meaning of "gl" and "pe" seem reasonable. After a quick glance at a few examples in the USDA Plants Database, it appears they do use "gl" as an abbreviation for Greenland, and those plants with "pe" (at least the few examples I looked at) do seem to be found in Prince Edward Island.

@ap0nia ap0nia closed this as completed Jul 10, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Question for the curators
Projects
None yet
Development

No branches or pull requests

3 participants