Skip to content
This repository has been archived by the owner on Mar 18, 2024. It is now read-only.

Carbon flux: use latitude, longitude and day of year in predictions? #10

Closed
stsievert opened this issue Aug 16, 2018 · 6 comments
Closed

Comments

@stsievert
Copy link
Contributor

I'd like a better explanation of the motivation, and some domain knowledge to know what variables to exclude.

@stsievert
Copy link
Contributor Author

stsievert commented Aug 16, 2018

I see latitude + day of year as being most important because those variables determine how much sunlight is received.

Here are some experiments on the clustering algorithm when using latitude, longitude and day of year (DOY) (as well as the measured variable, carbon flux).

Without lat/lon With lat/lon
Without DOY nodoy-nolatlon nodoy-latlon
With DOY doy-nolatlon doy-latlon

The sites are colored by vegetation type, with a legend of legend

More detail on the clustering is present with more data received, naturally. I think this will help with prediction performance. I think longitude should be left out; I can see it as being confounding, though the clustering picks up on it.

@jbednar
Copy link
Collaborator

jbednar commented Aug 17, 2018

It's hard to see how the longitude will help anything, so I'd vote for not including it.

@ebo
Copy link
Collaborator

ebo commented Aug 17, 2018 via email

@jbednar
Copy link
Collaborator

jbednar commented Aug 17, 2018

Right; over a relatively small region of the globe, lon would be a a perfectly reasonable feature to include. But for a global model, it just seems more confusing than helpful, likely to lead to overfitting and poor generalization, unless recoded as something like "distance from the nearest coastline" or something else more meaningful at a global scale.

@ebo
Copy link
Collaborator

ebo commented Aug 17, 2018 via email

@jbednar
Copy link
Collaborator

jbednar commented Aug 17, 2018

Any and all advice welcome!!!

@jbednar jbednar closed this as completed Oct 12, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants