-
Notifications
You must be signed in to change notification settings - Fork 24
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SRDB import: count records with consecutive Study_midyear as single plot #183
Comments
I wrote the code to deal with that. I don't know yet what problem it can cause (I need to roll back all changes in ForC and run SRDB_import.R again to figure it out), so leaving this open for now. |
I found a tricky example related to this. This is data from site "Oak Ridge" We, as human, can see that stand with age 10 was samped in years 1997-2001 but the stand age was not updated along those years. Other plots seem to be independents. and stand age should be kept to what they are. I don't know how to code this to handle all potential cases.... quickly enough (to be done before I leave tonight) |
This is a file with all situations (273 sites) that my code is not addressing. Not all of them need to be fixed but most. (It is easier to look at the file in Excel) |
Got it. This is a tricky one to deal with. Some will obviously require manual review, but I'm wondering if there's any way to facilitate that/ code away some of the more common problems. |
Here's the start of a list I'm making as I assess these: Plots that show up on this list but are fine: a. Examples of legitimately independent plots, but handled correctly::
b. those where measurements were made in only one year:
c. cases where all plots are from the same study (I don't think you'd ever get slightly inconsistent ages from within the same study).
Examples that seem like they'd be solved by counting as the same plot if date is only 1/2 year off:
Those where 1+ plots has NAC for stand age:
Those that are legitimately complicated/ will require review of original pubs:
|
The 1/2 year issue seems like it may be easily solveable, and seems to be affecting a good number. Sometimes it would be solved by rounding up, other times by rounding down. |
There are a number where measurements were made in only one year. If all records are from the same study, they can be counted as legitimately separate plots and removed from the list. |
When one of the ages is NAC, its a likely duplicate with another plot but requires manual review. These should be imported as is and will need to be reviewed later. Please make a list of these. |
Can you show me an example where rounding 1/2 year would solve the problem? |
Actually, this isn't dealing with consecutive years; rather, it would come in where you decide whether plots were established in the same year. If Several examples below. In this case, if you count 1971 as the earlier year of measurement, there's a 24 year difference between the first and second measurements, and also a 24 year difference in stand age. One way to solve this (that would actually be better for plot names) would be to floor estimates of year established. We'd rarely say, "stand established in 1940.5" or "stand is 70.5" years old. This false precision is essentially what's causing a lot of these problems. Coweeta-WS6 ____________________________ Coweeta-WS13 ____________________________ Coweeta-WS17 ____________________________ Coweeta-WS18 ____________________________ |
ok, I'll use floor(Study_midyear) when attributing plot.name |
@ValentineHerr, I've updated the list above. Could you please try implementing those (hopefully all pretty easy) and see how the list looks? |
Site names are now listed in Note that the lists are smaller than earlier because I was looking at sites for which there was no records of interests before (they only had variables that are not mapped to ForC) |
Thanks. This reduces the number of sites that need review to a manageable number. |
I just found an example where a single plots is falsely counted as multiple plots. It is "Kurth_2014_isrr research site in Minnesota USA", "Temperate Deciduous Forest. Stand established around 1950/1951/1952". The same study had plots of age 0-1-2 (measured in 3 consecutive years) that are treated correctly. In SRDB, age 60 is given to all 3 instances of the older plot. |
@ValentineHerr, in cases where Study_midyear increases consecutively and Age stays the same, we have what the code thinks are separate plots when in fact we're dealing with a multi-year study at the same plot (example: Schanis). Please modify these such that we only get one plot, with age defined based on the first study date.
Please alert me if this causes unexpected problems. I think we've been through this issue before and decided to do it the way its currently done, but now that I'm looking at the import I'm realizing that this creates a lot of unwanted false separation of plots.
The text was updated successfully, but these errors were encountered: