Template by lucyleeow · Pull Request #1 · sphinx-gallery/sample-project

lucyleeow · 2019-12-11T16:26:23Z

Basic template.

Should I add:

a simple module with a function or two and use it in the examples?
requirements file
example data and download the data in a template (suggested by @larsoner here)
pandas dataframe in example to show html being captured

Suggestions welcome.

larsoner · 2019-12-11T20:36:35Z

@lucyleeow you'll want to set up CircleCI to do a build so we can see artifacts

larsoner · 2019-12-11T20:37:10Z

(let's start with getting that to work so we can easily iterate and review, we'll want it as part of the template anyway)

lucyleeow · 2019-12-12T11:16:49Z

@larsoner how do I get circleci to work on this PR....? I added a basic circleci config.yml file, merged this into master and it ran on master...?

larsoner · 2019-12-12T12:43:51Z

You have to tell it to "build forked pull requests" in the settings on the circle site. Let me know if you can't find it and I can do it

lucyleeow · 2019-12-12T12:58:39Z

Thanks @larsoner. The output is here.

larsoner · 2019-12-12T14:38:26Z

Next steps it would be good to have:

some sample_project module with a function and a class
examples that use these
backreferences support
modified class.rst and function.rst showing how to add the backrefs
the pandas dataframe repr stuff

larsoner · 2019-12-12T14:38:56Z

Should we merge this, or wait until that stuff is in place? I'm fine either way

lucyleeow · 2019-12-12T14:40:01Z

I can add to this PR and merge as one. Thanks!

lucyleeow · 2019-12-13T17:55:46Z

@larsoner I am a little lost as to how why the function parameter and out parts is not formatted correctly. I think docstring is the same as functions in SG - is there a setting to set?

Also I don't know why an empty backreferences_dir examples file is made for the function but seems to work perfect for the class (at least locally).

larsoner · 2019-12-13T19:59:44Z

Probably autosummary and/or napolean would help here. Also maybe autosummary_generate=True config values, etc. Basically I would copy bits of the SG config until I found what fixed things.

lucyleeow · 2019-12-16T15:33:41Z

@larsoner The layout of the autodoc was fixed by adding 'sphinx.ext.napoleon' and 'sphinx.ext.intersphinx'. I did notice however, that pd.DataFrame was not linked via intersphinx (here). This lead me to check the SG example gallery, and it is not linked there either. Any idea why? Matplotlib plot links perfectly fine.

The backreferences file for the function in the SampleModule works if the function is specifically imported (i.e., from SampleName.module import fun) but an empty file is generated if just the ~~package~~ module is imported (import SampleName.module) and the function used in the example like this: package.module.fun()

lucyleeow · 2019-12-16T16:05:37Z

@larsoner backreferences works also if I import the function in the __init__.py file

lucyleeow · 2019-12-17T11:05:53Z

I'm not sure if this is a bug or if I am just doing it wrong. I think the case that doesn't work for me works in sklearn documentation.

larsoner · 2019-12-17T17:27:00Z

@lucyleeow these sound like bugs/limitations in the parsing code, somewhere in these functions:

https://github.com/sphinx-gallery/sphinx-gallery/blob/master/sphinx_gallery/backreferences.py#L62-L190

These are often a pain to debug, I think last time I needed to do it I added some if <condition>: raise RuntimeError in some helpful place, modified tinybuild, ran python -i -msphinx ... call to actually build the doc, and then when it hit my RuntimeError I did import pdb; pdb.pm(). It's ugly but it worked. I don't know of a nicer way to get a debug prompt for a Sphinx build, though one probably exists. For example you can probably call sphinx's main from within a script, with some break points set in your IDE. But I never bothered checking.

lucyleeow · 2020-01-13T16:44:37Z

Last thing I wanted to add was dealing with data files (discussed in sphinx-gallery/sphinx-gallery#565 (comment)).

Not sure how best to do this. My plan would be to download data from a url (with urllib2) and save to a SG_data folder - in the examples_dir? Do I need to give full path or relative path to the folder when doing this?

larsoner · 2020-01-13T16:46:31Z

What mne-python does (and I think it's modeled after nilearn?) is to download data by default to ~/mne_data, so maybe download it to ~/sg_data. This path then gets saved in a config file in the user's system to know where to look for the data. That way if they did in the first call sample.data_path('/home/larsoner/somewhere/else') and this downloaded the data to this non-standard location, then the next call to data_path = sample.data_path() will know to look in the non-standard location.

lucyleeow · 2020-01-13T17:08:34Z

This path then gets saved in a config file in the user's system to know where to look for the data.

How does this happen?

That way if they did in the first call sample.data_path('/home/larsoner/somewhere/else')

Sorry confused with what sample.data_path is. A function I define for downloading data?

lucyleeow · 2020-01-13T17:09:00Z

(I'm not very familiar with this area)

larsoner · 2020-01-13T17:15:28Z

See docs: https://mne.tools/dev/generated/mne.datasets.sample.data_path.html

Eventually it gets set with set_config https://github.com/mne-tools/mne-python/blob/master/mne/datasets/utils.py#L212

This code is dense because we have a lot of datasets, you probably want to start with something simpler if possible

lucyleeow · 2020-01-15T16:53:49Z

Not confident I've done it right but I added a module with functions to download data to a file and save this location in a config file. Copied sklearn.

larsoner · 2020-01-15T17:02:47Z

Looks like it's working!

This seems like a reasonable start to me. Things for an eventual todo (subsequent PRs when people want):

Add CircleCI caching
Add CircleCI config to download files ahead of time rather than in-script (i.e., assume people have downloaded data)
Add tqdm progress bar to download

Someday packages that use dataset downloading (mne, nilearn, sklearn) should think about outsourcing data file downloading duties to a new third-party module. I think all packages have benefitted greatly from sphinx-gallery being a package, having some dataset_downloader (datasetter?) package that we all could vendor or import would be nice. Less stuff to maintain and we all benefit from our fixes...

In any case, @lucyleeow I would say let's merge this and iterate in subsequent PRs

GaelVaroquaux · 2020-01-15T17:05:24Z

I think all packages have benefitted greatly from sphinx-gallery being a package, having some dataset_downloader (datasetter?) package that we all could vendor

+1 on a single-file package that we can vendor. There are a lot of goodies across the various packages. It might be hard to satisfy all, though.

larsoner · 2020-01-15T17:14:30Z

Optimistically I think that the sklearn/nilearn/MNE maintainers could fight amongst themselves long enough to make it happen :)

I guess a natural place for this to live would be sphinx-gallery/<name>.git. I might work on starting something like this with direct pushes, open a WIP PR to integrate it into in MNE, and then nilearn, then sklearn. Try to start easy and ascend the difficulty gradient :)

lucyleeow · 2020-01-15T17:22:34Z

@larsoner added your suggestions in an issue. I'll merge then!

lucyleeow · 2020-01-15T17:29:46Z

@larsoner to clarify:

Add CircleCI config to download files ahead of time rather than in-script (i.e., assume people have downloaded data)

How does CircleCI relate to whether people (user?) have downloaded data locally?

larsoner · 2020-01-15T17:36:24Z

I'm thinking about the script output, which could give download status updates and also gives a time estimate for running the script. If the data file is downloaded during the script run, then the time estimate of the example is affected as is the output (it would eventually show "downloading..." or whatever if we made the downloader actually give updates). Downloading all example data before building docs will more accurately reflect what a user will experience if they have already downloaded the data.

Also, downloading data first has been very helpful for MNE because then our 1.5 hr doc build will fail 10 minutes in if it can't download all of the necessary data, rather than it taking the full 1.5 hr for SG to fail and report the error.

lucyleeow · 2020-01-15T17:43:09Z

Yes that sounds useful. I was confused about how changing CircleCI affects doc building locally but I realise you mean building in CircleCI.

first template

b21f78b

Merge branch 'master' into template

a0c771b

[empty] trigger CI

f0b165a

lucyleeow added 3 commits December 13, 2019 18:49

add module and backreferences

083df70

add module

5dbfbca

add req to circleci config

f59438e

lucyleeow added 2 commits December 16, 2019 13:57

FIX add config for docstring

8457e40

import to fun to fix backref

b6a316f

import in init file

a131b1e

lucyleeow mentioned this pull request Jan 1, 2020

BUG backreferences not working for functions if not imported directly sphinx-gallery/sphinx-gallery#583

Closed

add download module

cf7fb07

amend examples, add download ex

968eda9

lucyleeow mentioned this pull request Jan 15, 2020

ENH Improve data downloading #3

Open

3 tasks

lucyleeow merged commit 5043ae3 into sphinx-gallery:master Jan 15, 2020

Conversation

lucyleeow commented Dec 11, 2019

Uh oh!

larsoner commented Dec 11, 2019

Uh oh!

larsoner commented Dec 11, 2019

Uh oh!

lucyleeow commented Dec 12, 2019

Uh oh!

larsoner commented Dec 12, 2019

Uh oh!

lucyleeow commented Dec 12, 2019

Uh oh!

larsoner commented Dec 12, 2019

Uh oh!

larsoner commented Dec 12, 2019

Uh oh!

lucyleeow commented Dec 12, 2019

Uh oh!

lucyleeow commented Dec 13, 2019

Uh oh!

larsoner commented Dec 13, 2019

Uh oh!

lucyleeow commented Dec 16, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

lucyleeow commented Dec 16, 2019

Uh oh!

lucyleeow commented Dec 17, 2019

Uh oh!

larsoner commented Dec 17, 2019

Uh oh!

lucyleeow commented Jan 13, 2020

Uh oh!

larsoner commented Jan 13, 2020

Uh oh!

lucyleeow commented Jan 13, 2020

Uh oh!

lucyleeow commented Jan 13, 2020

Uh oh!

larsoner commented Jan 13, 2020

Uh oh!

lucyleeow commented Jan 15, 2020

Uh oh!

larsoner commented Jan 15, 2020

Uh oh!

GaelVaroquaux commented Jan 15, 2020 via email

Uh oh!

larsoner commented Jan 15, 2020

Uh oh!

lucyleeow commented Jan 15, 2020

Uh oh!

lucyleeow commented Jan 15, 2020

Uh oh!

larsoner commented Jan 15, 2020

Uh oh!

lucyleeow commented Jan 15, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

lucyleeow commented Dec 16, 2019 •

edited

Loading