Implemented _get_time_for_url in GenericClient #3863

Closed
wants to merge 11 commits

Conversation

abhijeetmanhas
Contributor

Description

Fixes #3715
After this PR, there is no need to implement the _get_time_for_url() function separately in most of the individual clients.
I have implemented _get_time_for_url() in GenericClient so that it does not need to be re-implemented in every client.
Added a new member, crawler, to GenericClient, which is used to extract dates.

It might not be the best way, so I need suggestions from maintainers.
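To make the idea concrete, here is a minimal, self-contained sketch of the shared approach the PR describes. It is not sunpy code: extract_date_from_url is a hypothetical stand-in for the Scraper's date extraction, and plain datetime tuples stand in for sunpy TimeRange objects.

```python
from datetime import datetime, timedelta

def extract_date_from_url(url, pattern="%Y%m%d"):
    # Hypothetical stand-in for Scraper._extractDateURL: pull a
    # YYYYMMDD date out of the file-name portion of the URL.
    stem = url.rsplit("/", 1)[-1].split(".")[0]
    digits = "".join(ch for ch in stem if ch.isdigit())[:8]
    return datetime.strptime(digits, pattern)

def get_time_for_url(urls):
    # One shared implementation in the base class: each URL is mapped
    # to the full day implied by the date embedded in it, so individual
    # clients no longer need their own _get_time_for_url().
    almost_day = timedelta(days=1) - timedelta(milliseconds=1)
    starts = [extract_date_from_url(u) for u in urls]
    return [(t0, t0 + almost_day) for t0 in starts]
```

The one-day span baked into this sketch is exactly the assumption questioned later in the review.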

@pep8speaks

pep8speaks commented Mar 5, 2020

Hello @abhijeetmanhas! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! 🍻

Comment last updated at 2020-03-11 15:00:43 UTC

@abhijeetmanhas
Contributor Author

abhijeetmanhas commented Mar 5, 2020 via email

@nabobalis
Contributor

autopep8. I will not use it in the future.

On Thu, 5 Mar 2020 at 22:03, Nabil Freij wrote, commenting on sunpy/net/dataretriever/client.py in #3863: "What reformatter?"

I was not aware that autopep8 also touches documentation strings.

@abhijeetmanhas abhijeetmanhas force-pushed the bye_get_time branch 3 times, most recently from 356a976 to 2764016 Compare March 8, 2020 14:58
@abhijeetmanhas
Contributor Author

abhijeetmanhas commented Mar 8, 2020

Implemented this for all clients that use the scraper. Fixes #3814 too.

@abhijeetmanhas abhijeetmanhas marked this pull request as ready for review March 8, 2020 15:03
@abhijeetmanhas abhijeetmanhas requested review from a team as code owners March 8, 2020 15:03
@abhijeetmanhas
Contributor Author

@nabobalis I have made the changes you suggested, and the PR has passed all checks.

Member

@dpshelio dpshelio left a comment


For consistency, I would keep scraper instead of crawler, so it is clearer where the object comes from.

@abhijeetmanhas
Contributor Author

abhijeetmanhas commented Mar 9, 2020 via email

@abhijeetmanhas
Contributor Author

@dpshelio I have made the changes you suggested. Are any other changes expected at this point?

@abhijeetmanhas
Contributor Author

@dpshelio I have made the suggested changes.

```python
else:
    scraper = self.scraper
almost_day = TimeDelta(1 * u.day - 1 * u.millisecond)
times = [TimeRange(t0, t0 + almost_day)
         for t0 in map(scraper._extractDateURL, urls)]
```
Member


Why is this assuming one day? The base class should not be making assumptions about the form of the results returned by the clients.

Contributor Author


Yes, my doubts were exactly about this. What do the start time and end time in the response table represent? The data files returned by a query are always associated with timestamps, but if there are multiple files on the same date, what should the start and end times be set to? Should each file get a timerange of one day, or should the day be divided into timeranges (say there are 10 files for one day, so the day is split into 10 timeranges)?
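The second option raised in this question can be sketched as follows. This is purely illustrative (split_day_into_ranges is a hypothetical helper, not sunpy API): if n files share one date, divide the 24-hour span into n equal sub-ranges instead of giving every file the same full-day range.

```python
from datetime import datetime, timedelta

def split_day_into_ranges(day, n_files):
    # Divide the 24-hour span starting at `day` into n_files equal,
    # contiguous (start, end) sub-ranges, one per file.
    step = timedelta(days=1) / n_files
    return [(day + i * step, day + (i + 1) * step) for i in range(n_files)]
```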

Member


Yes, I don't think it should assume one day. I think the proper way to fix this is to modify the individual _extractDateURL methods to return an interval (ie. start time and end time) instead of currently just returning a start time. That way the length of an interval can be specified on a per-client basis.
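A rough sketch of this suggestion, with stdlib datetimes standing in for sunpy objects and hypothetical function names (these are not the actual _extractDateURL implementations): each client returns a (start, end) interval whose length matches its own cadence, so the base class never has to assume one day.

```python
from datetime import datetime, timedelta

def extract_interval_daily(url):
    # Hypothetical daily-cadence client: the URL names a date,
    # so the interval is that whole day.
    date = datetime.strptime(url.rsplit("/", 1)[-1][:8], "%Y%m%d")
    return date, date + timedelta(days=1)

def extract_interval_hourly(url):
    # Hypothetical hourly-cadence client: the URL names a date and
    # hour, so the interval is one hour; no one-day assumption is
    # needed in the base class.
    stamp = datetime.strptime(url.rsplit("/", 1)[-1][:11], "%Y%m%d_%H")
    return stamp, stamp + timedelta(hours=1)
```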

@ayshih ayshih added dataretriever net Affects the net submodule labels Apr 13, 2020
@nabobalis nabobalis added this to the 2.1 milestone Apr 15, 2020

Successfully merging this pull request may close these issues.

GenericClient.search can't know URLs' timeranges without additional function
7 participants