Api key #61

Merged
14 commits merged into saketkc:master on Sep 11, 2020
Conversation

Maarten-vd-Sande
Contributor

I decided to just implement the skeleton instead of leaving it up to you, see #60. This still needs the sleep durations changed, but I am not sure which sleep is for what, since some are 1/10th, 1/3rd, 1/2 of a second, etc.

import pysradb
print(pysradb.__version__)
print(pysradb.SRAweb(api_key="__secret__").sra_metadata("GSM1020644", detailed=True).experiment_alias.values)

works and does not crash (and takes the same amount of time as without a key).

@codecov

codecov bot commented Sep 8, 2020

Codecov Report

Merging #61 into master will decrease coverage by 0.01%.
The diff coverage is 66.66%.

@@            Coverage Diff             @@
##           master      #61      +/-   ##
==========================================
- Coverage   42.23%   42.22%   -0.02%     
==========================================
  Files           5        5              
  Lines        1030     1035       +5     
==========================================
+ Hits          435      437       +2     
- Misses        595      598       +3     
Impacted Files Coverage Δ
pysradb/sraweb.py 84.12% <66.66%> (-0.48%) ⬇️

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 5e54c4e...aeaa494.

@saketkc
Owner

saketkc commented Sep 8, 2020

Thanks @Maarten-vd-Sande, this looks great and is very helpful! I have not revisited the sleep time issues for a while, given that things looked mostly stable, but this PR should be a good opportunity to do so.

Might be helpful for @bscrow's PR #57 too. I would need some time to review this.

@Maarten-vd-Sande
Contributor Author

Glad that it is considered useful 😄 . Let me know if I can somehow help to move this forward.

@saketkc
Owner

saketkc commented Sep 8, 2020

Do you think we should propagate self.sleep_time into the retry functions? It is not being used anywhere at the moment. One idea would be to drop it and leave the retry block as is (it sleeps 0.5 seconds by default).
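For illustration, here is a minimal sketch of what propagating a configurable sleep into a retry helper could look like; the fetch_with_retries name and its signature are hypothetical, not pysradb's actual retry code:

import time
import requests

def fetch_with_retries(url, params, sleep_time=0.5, max_retries=3):
    # Hypothetical helper: retry a GET request, waiting sleep_time seconds
    # between attempts instead of a hard-coded 0.5 s.
    for attempt in range(max_retries):
        try:
            response = requests.get(url, params=params, timeout=30)
            response.raise_for_status()
            return response
        except requests.RequestException:
            if attempt == max_retries - 1:
                raise
            time.sleep(sleep_time)  # e.g. pass in self.sleep_time here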

@Maarten-vd-Sande
Contributor Author

I think the API key is not really necessary after all. I only used SRAweb with detailed=True, and I thought it took so "long" because pysradb looked up each sample separately (which it doesn't). I just discovered that the real reason it takes so long is checking for the ENA download link.

I changed the time.sleep calls to use self.sleep_time, so at least that is cleaned up. This works, and the small tests I did do not run into API limits. However, the API key is not really necessary, so we can also close this PR and ignore it :)

@bscrow
Collaborator

bscrow commented Sep 9, 2020

Yep @saketkc, including an API key for the search feature should speed up large queries for SRA metadata, since the current implementation retrieves metadata in batches of 300 entries. I'll test it out for the search module.
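As a rough illustration only (the batch helper and fetch_metadata call are hypothetical, not code from this PR or from #57), splitting record IDs into groups of 300 could look like:

def batch(record_ids, size=300):
    # Yield successive chunks of `size` IDs from the full list.
    for start in range(0, len(record_ids), size):
        yield record_ids[start:start + size]

# for chunk in batch(all_record_ids):
#     fetch_metadata(chunk)  # hypothetical per-batch metadata request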

@Maarten-vd-Sande
Contributor Author

Maarten-vd-Sande commented Sep 9, 2020

Seems like I keep on missing stuff in the code 👀. Where is this split (batches of 300) made, @bscrow?

@Maarten-vd-Sande Maarten-vd-Sande changed the title Skeleton for api key Api key Sep 9, 2020
@bscrow
Collaborator

bscrow commented Sep 9, 2020

@Maarten-vd-Sande this split is only implemented in the upcoming pysradb search feature (#57), which differs from the pysradb metadata feature in pysradb/sraweb.py in that it fetches metadata based on text queries instead of accession numbers.

@saketkc
Owner

saketkc commented Sep 10, 2020

I guess this is still useful if someone is interested in trying out the API key, and the current behavior remains unaffected.
I might expose this on the command line in the future.

What are your thoughts @Maarten-vd-Sande and @bscrow?

@Maarten-vd-Sande
Contributor Author

It doesn't hurt to have it, although it might mislead users into thinking it speeds up pysradb under "normal" usage. If it helps with the search function, then that's nice!

I have to say it requires more thorough testing to see if it hits the API limit, since I haven't really stress-tested it.

@saketkc saketkc merged commit 1eaa0a8 into saketkc:master Sep 11, 2020
@saketkc
Owner

saketkc commented Sep 11, 2020

I decided to merge it. In my experience, stress testing NCBI is hard (very unreliable behavior overall). Since it doesn't change the current behavior, and might potentially be useful (hopefully), we can keep it.

@saketkc
Owner

saketkc commented Sep 11, 2020

Many thanks for your contribution @Maarten-vd-Sande!

@saketkc saketkc mentioned this pull request Sep 11, 2020
@Maarten-vd-Sande
Contributor Author

Just an FYI, every once in a while I get:

Unable to parse xml: {"error":"API rate limit exceeded","api-key":"131.174.27.98","count":"4","limit":"3"}

Maybe the sleep time should be slightly increased to avoid this.
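One way to work around this, sketched under the assumption of a simple exponential backoff (the esummary_with_backoff helper below is illustrative, not pysradb's implementation):

import time
import requests

ESUMMARY_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esummary.fcgi"

def esummary_with_backoff(params, retries=5, sleep_time=0.5):
    # Illustrative only: retry an esummary request when NCBI reports
    # "API rate limit exceeded", doubling the wait before each retry.
    for attempt in range(retries):
        response = requests.get(ESUMMARY_URL, params=params, timeout=30)
        if "API rate limit exceeded" not in response.text:
            return response
        time.sleep(sleep_time * (2 ** attempt))  # waits 0.5 s, 1 s, 2 s, ...
    return response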

@saketkc
Owner

saketkc commented Nov 4, 2020

Thanks for catching this @Maarten-vd-Sande. Do you have an example I can test this against?

@Maarten-vd-Sande
Contributor Author

Let me see if I can cook something up :)
