-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Zarr at Scipy 2019 #20
Comments
Thanks @rabernat for raising this. I've been wanting to get to scipy for years and haven't managed it yet, so perhaps this is the year. I'd be happy to write an abstract, I'll do it as a text file and PR it back to this repo so hopefully it's easy for anyone to give comments before submission. |
@jakirkham could you suggest a use case from your domain? |
For the author list I'll include everyone who is a member of @zarr-developers/core-devs by default. Please let me know if you have any objection to being listed as an author. |
If anyone else would like to suggest a use case I'd be very happy to include it. Ideally it will be an example where zarr is already being used or being prototyped/evaluated, but if you have a potential use case that you're interested in using zarr for then I'd be interested to know too. |
@dazzag24 -- do you still use Zarr? |
Yep
…On Fri, 25 Jan 2019, 10:30 Zain Patel ***@***.*** wrote:
@dazzag24 <https://github.com/dazzag24> -- do you still use Zarr?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<https://github.com/zarr-developers/zarr/issues/396#issuecomment-457528538>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ABAcV6bd8JFXuKjlTB8ehYIf2TG61wfIks5vGty4gaJpZM4aMaXZ>
.
|
To get the ball rolling, I've made a PR (#397) with an initial draft of an abstract. I'd very much welcome and comments, suggestions or contributions. |
I have dozens of examples. Basically all of our data on the cloud is in zarr, will follow up in the PR. |
@alimanfoo, I'm not sure I'll be able to do too much on this before the deadline as I'm out of the country. That said, would be happy to chip in when I get back. Don't have a clear enough idea of my schedule to commit to being at SciPy this year. Will update once I do. |
Thanks @jakirkham. At this point just a sentence or two describing datasets
that you're storing in zarr would be all we need I think.
…On Tue, 29 Jan 2019, 08:21 jakirkham ***@***.*** wrote:
@alimanfoo <https://github.com/alimanfoo>, I'm not sure I'll be able to
do too much on this before the deadline as I'm out of the country. That
said, would be happy to chip in when I get back. Don't have a clear enough
idea of my schedule to commit to being at SciPy this year. Will update once
I do.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<https://github.com/zarr-developers/zarr/issues/396#issuecomment-458448565>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAq8QvSUrPilKbDHXAqDA3e088rcW01wks5vIASHgaJpZM4aMaXZ>
.
|
Hi folks, I did a bit of editing of the abstract following comments from @jhamman, I think it's in pretty good shape but would be cool if we could mention one more example of data being stored as Zarr. I think we literally just need a sentence. E.g., here is the current text of the results subsection:
I'd be extremely grateful if someone could volunteer a sentence describing another example. @jakirkham? @ambrosejcarr? @dazzag24? Any other comments welcome. I'm aiming to submit on Friday. |
Happy to contribute some examples. One or both of the below sentences could be used.
|
Wonderful, thanks @ambrosejcarr. Just heard deadline has been extended to February 15, so we have a few extra days to contemplate. |
I've pushed another commit to the abstract in #397 adding in the short summary, the HCA example from @ambrosejcarr, and an author list. In the author list I've included everyone who has contributed code or is a member of @zarr-developers/core-devs or who participated in the first zarr/n5 conference call or who contributed to the abstract. The list comprises @rabernat, @sbalmer, @ambrosejcarr, @tjcrone, @dazzag24, @martindurant, @funkey, @meggart, @jhamman, @shoyer, @jeromekelleher, @jakirkham, @alimanfoo, @joshmoore, @CSNoyes, @onalant, @constantinpape, @mzjp2, @mrocklin, @axtimwalde, @vincentschut, @shikharsg, @jmswaney, @ryan-williams. Apologies I did not know everyone's name or affiliation. If you would prefer not to be included in the author list or would like me to edit your name or affiliation please let me know asap. If you are not in this list but have contributed in some way to the project and would like to be included then please let me know (and apologies for not including you already). Submission deadline is noon CST so I'll submit in a couple of hours. |
The deadline is 11:59pm, so midnight, not noon. 😉 |
Ha, I should pay more attention! I'll probably still submit in a couple of hours, before I go home. |
FYI: I have submitted a proposal for Intake, and I believe there will be one for Dask (although I haven't heard for sure). Supposing that not all of these are accepted, perhaps we could merge some content, catalog of zarrs, dask-parallel processing of zarrs, etc. |
Any news on the zarr scipy talk? Was it accepted? |
Ok, it looks like they just listed the first three authors in alphabetical order. Is @alimanfoo planning to attend / present this? |
Yes good news, it was accepted. I'm planning to present.
…On Thu, 18 Apr 2019, 22:15 Ryan Abernathey, ***@***.***> wrote:
Ok, it looks like they just listed the first three authors in alphabetical
order. Is @alimanfoo <https://github.com/alimanfoo> planning to attend /
present this?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<https://github.com/zarr-developers/zarr/issues/396#issuecomment-484528957>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAFLYQUR5PR66WVKXL36G53PRB7A7ANCNFSM4GRRUXMQ>
.
|
For interest, here are the review comments. Also interesting is that I initially got a rejection notice, then a couple of hours later got an acceptance notice, so I guess we initially just missed the cut but then someone above us dropped out. If so suggests that scipy is really competitive, and we had a bit of luck. In any case, great to see the positive reviews. ----------------------- REVIEW 1 --------------------- This talk illustrates the use of Zarr with examples from several scientific domains. It is used for several interesting projects such as to store genome variation data from next-generation sequencing of natural populations of malaria parasites and mosquitoes and within the Human Cell Atlas project. It will be great for this conference and health care researchers will get to learn the utilization of Zarr. ----------------------- REVIEW 2 --------------------- The need for storage of tensor data is only increasing in applications involving parallel and distributed computing of large data sets such as used in machine learning applications. This paper presents an important and ongoing research effort in this area by a large group of accomplished data scientists. The abstract is clearly written, and the work is novel and significant. Although the topic covers a technical implementation, the focus on case studies of real-world problems is a strength. ----------------------- REVIEW 3 --------------------- The authors have proposed a talk on Zarr project for distributed and parallel computing. It is also related to the application of Python to life science (Malaria genomics and Human cell atlas) and climate studies. |
Btw does anyone have a suggestion for how to author the slides for the talk in a way that is amenable to putting in a PR and collaborating on? I suppose the only text-based PR-friendly format would be latex+beamer, haven't used it before but happy to have a go. Suppose could also be done with jupyter+reveal, although it's impossible to do line comments on a jupyter notebook. Failing that, google doc? |
I use reveal.js without Jupyter and like it
https://github.com/mrocklin/slides/tree/gh-pages
https://github.com/mrocklin/slides/blob/gh-pages/dask-short.md (this is
content in markdown)
https://github.com/mrocklin/slides/blob/gh-pages/dask-short.html (this is
boiler plate)
produce
http://matthewrocklin.com/slides/dask-short.html#/
…On Tue, Apr 23, 2019 at 11:30 AM Alistair Miles ***@***.***> wrote:
Btw does anyone have a suggestion for how to author the slides for the
talk in a way that is amenable to putting in a PR and collaborating on?
I suppose the only text-based PR-friendly format would be latex+beamer,
haven't used it before but happy to have a go.
Suppose could also be done with jupyter+reveal, although it's impossible
to do line comments on a jupyter notebook.
Failing that, google doc?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<https://github.com/zarr-developers/zarr/issues/396#issuecomment-485878099>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AACKZTDPAK6WO6O7OEQC7YLPR42RVANCNFSM4GRRUXMQ>
.
|
Who will be at SciPy? Also how long are people planning to be there? Finally would it be worth doing a sprint? |
I'll be there for the main conference.
…On Thu, 6 Jun 2019 at 18:53, jakirkham ***@***.***> wrote:
Who will be at SciPy? Also how long are people planning to be there?
Finally would it be worth doing a sprint?
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<https://github.com/zarr-developers/zarr/issues/396?email_source=notifications&email_token=AAFLYQQ2EHJTYAZRX4TX3A3PZE6H5A5CNFSM4GRRUXM2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODXDPPBQ#issuecomment-499578758>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAFLYQWEBE6P76IQP5P2HMDPZE6H5ANCNFSM4GRRUXMQ>
.
--
Alistair Miles
Head of Epidemiological Informatics
Centre for Genomics and Global Health
Big Data Institute
Li Ka Shing Centre for Health Information and Discovery
University of Oxford
Old Road Campus
Headington
Oxford
OX3 7LF
United Kingdom
Phone: +44 (0)1865 743596 or +44 (0)7866 541624
Email: alimanfoo@googlemail.com
Web: http://a <http://purl.org/net/aliman>limanfoo.github.io/
Twitter: @alimanfoo <https://twitter.com/alimanfoo>
Please feel free to resend your email and/or contact me by other means if
you need an urgent reply.
|
I'll be there for the main conference. @jhamman and I are also hosting an
xarray sprint on Saturday. Would gladly do that in conjunction with a zarr
sprint.
On Thu, Jun 6, 2019 at 7:25 PM Alistair Miles <notifications@github.com>
wrote:
… I'll be there for the main conference.
On Thu, 6 Jun 2019 at 18:53, jakirkham ***@***.***> wrote:
> Who will be at SciPy? Also how long are people planning to be there?
> Finally would it be worth doing a sprint?
>
> —
> You are receiving this because you were mentioned.
> Reply to this email directly, view it on GitHub
> <
https://github.com/zarr-developers/zarr/issues/396?email_source=notifications&email_token=AAFLYQQ2EHJTYAZRX4TX3A3PZE6H5A5CNFSM4GRRUXM2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODXDPPBQ#issuecomment-499578758
>,
> or mute the thread
> <
https://github.com/notifications/unsubscribe-auth/AAFLYQWEBE6P76IQP5P2HMDPZE6H5ANCNFSM4GRRUXMQ
>
> .
>
--
Alistair Miles
Head of Epidemiological Informatics
Centre for Genomics and Global Health
Big Data Institute
Li Ka Shing Centre for Health Information and Discovery
University of Oxford
Old Road Campus
Headington
Oxford
OX3 7LF
United Kingdom
Phone: +44 (0)1865 743596 or +44 (0)7866 541624
Email: ***@***.***
Web: http://a <http://purl.org/net/aliman>limanfoo.github.io/
Twitter: @alimanfoo <https://twitter.com/alimanfoo>
Please feel free to resend your email and/or contact me by other means if
you need an urgent reply.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<https://github.com/zarr-developers/zarr/issues/396?email_source=notifications&email_token=AAJEKJWC3ZLP3MJDMQEZWCTPZGMFXA5CNFSM4GRRUXM2YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGODXEOBZY#issuecomment-499704039>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAJEKJXO5UQ7DRJKNGPTFQ3PZGMFXANCNFSM4GRRUXMQ>
.
|
That could be good. Will also be there for the sprints. Related: There is an N-D image analysis sprint that overlaps with a few projects. I wonder if we can set it up so we have neighboring rooms or share the same room given that there will likely be a lot of common interest between participants. |
@hanslovsky will be at SciPy and present his ImgLib2 <-> Numpy bridge ImgLyb https://github.com/imglib/imglib2-imglyb and Paintera https://github.com/saalfeldlab/payntera |
Seeing as how SciPy 2022 will have Zarrish attendees, I feel it's safe to close this knowing we can always find it when/if we need it. |
I think someone should propose a talk about Zarr to Scipy. This would really help raise the profile of the project to a broad audience.
Deadline is Feb. 11:
https://www.scipy2019.scipy.org/talk-poster-presentations
The text was updated successfully, but these errors were encountered: