-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No support for *.op.gz files in reformat_GSOD()? #76
Comments
You're not crazy. I'm sorry, it did. When NCEI changed the formats of the data that they served I had to update GSODR a few months ago and |
I figured as much after poking around this repo. Thanks. I repointed my code to load data from the other source and not from ftp, but ran into a different issue, and now I'm just trying to simply use |
I remember my problem with In the old process, when I downloaded data from ftp, each
|
Ah, interesting. It's faster to download the entire .gz and sort it out after? How many stations are you fetching at once? I hadn't considered this case. I though either download all or just a few selected not many selected. |
My case is exactly in between. I pull 10 years worth of data for every station within a 25 km radius of every major city in the world. Here is the function that does it all for me:
|
Here is the repo, the code snippet above is from |
Awesome, thanks! I'll have a look and see if I can improve the package. |
Sweet! Let me know if I can contribute in any way. |
I've updated the internal functionality to check how many requests are being made. If the number of stations is greater than 10, GSODR will download the entire global annual file and sort out the needed files locally. If there are less than 10 stations, then it will download each requested station individually. I did a few tests to check how many individual requests were faster vs downloading the whole. The number is not exact due to things I can't control (Internet), but this should help in most cases to make it faster to request large numbers of stations that are not ALL stations or just a few. Line 155 in b1b6a5d
|
Sweet! I'll make sure to chuck my extra piece of code to pull full archives for my analysis on my next update. |
Thank you for letting me know about the bottleneck. You OK if I add you as a contributor to the package for the ideas/input? |
I would be honored! To be honest, I'd do a pull request on this item (and a couple of others), but I've never done a PR before, and was afraid to mess things up 🤷♂️ |
feel free. submit it and we can work through everything together after
…--
Dr Adam H. Sparks
Associate Professor of Field Crops Pathology | Centre for Crop Health
Institute for Life Sciences and the Environment (ILSE) | Research and Innovation Division
University of Southern Queensland | Toowoomba, Queensland | 4350 | Australia
Phone: +61746311948
Mobile: +61415489422
Mobile: +61415489422
On 14 Jan 2020, 04:55 +1000, Taras Kaduk ***@***.***>, wrote:
I would be honored! To be honest, I'd do a pull request on this item (and a couple of others), but I've never done before, and was afraid to mess things up 🤷♂️
—
You are receiving this because you modified the open/close state.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
I may be confusing something here, but I think that a while back,
reformat_GSOD()
worked with *.op.gz files obtained from NOAA ftp.It seems like this is no longer the case. Is something changed?
This is not an issue per se, it's a confusion. I couldn't rerun my code from a year ago, and I'm trying to see what changes do I need to make to the code to make it work.
The text was updated successfully, but these errors were encountered: