
Election tutorial #2041

Merged
9 commits merged into pyro-ppl:dev on Nov 6, 2019

Conversation

@ae-foster
Contributor

Here is the second tutorial for pyro.contrib.oed.

In this tutorial, we introduce the posterior estimator and show how it could be used to design a polling strategy that can lead to accurate predictions of the outcome of a US presidential election.

Collaborator

@martinjankowiak left a comment

this is great! some initial comments/questions:

  • @neerajprad should we move these data files (*.pickle) somewhere else?
  • @neerajprad what's the optimal way to link to the other OED tutorial?
  • at the top add some caveat about "We make a number of simplifying assumptions... exploratory..."?
  • "We will take historical election data 19760-2012 as our prior" => "We will use historical election data 1976-2012 to construct a plausible prior"
  • print out some of the dataframes instead of the whole thing where appropriate?
  • explain why your covariance is in logits space
  • missing word "we are assuming that people respond to the ...."
  • "# Now compute w according to the (approximate) electoral college formula" pull this deterministic calculation out as a helper function?
  • strange comment? " (This kind of posthoc analysis is, of course, only engaged in by people of low statistical morals.)"
  • "One such variational estimator is " + "the"
  • "This $q$ performs" arguably q doesn't perform anything although it might be used to perform something?
  • can you call the intermediates in compute_dem_probability h1, h2 etc
  • re: "def h(p)" why not use bernoulli.entropy
  • explain swing_score
  • why eig=False and use of ape?
  • why do the code snippet that starts best_eig = 0 programmatically?
  • i'm confused by what's going on in the second-to-last section
  • the final sentence reads a bit wonky
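
For the bernoulli.entropy point, I mean something along these lines (just a sketch; the example probability tensor is illustrative):

```python
import torch
from torch.distributions import Bernoulli

def h(p):
    # Binary entropy computed via the library distribution,
    # instead of a hand-rolled -p*log(p) - (1-p)*log(1-p).
    return Bernoulli(probs=p).entropy()

print(h(torch.tensor([0.1, 0.5, 0.9])))
```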

@neerajprad
Member

@neerajprad should we move these data files (*.pickle) somewhere else?

Even though these are small binaries, we should upload them to AWS like the other datasets and read them from there, rather than committing them in this PR.

@neerajprad what's the optimal way to link to the other OED tutorial?

For the time being, how about just pointing to the GitHub link? Once this is published, we can change those links to point to the website.

@jpchen
Member

jpchen commented Sep 19, 2019

what's the optimal way to link to the other OED tutorial?

look at the links in the other intro tutorials. iirc a normal md link, e.g. `[first tutorial](first_tutorial.ipynb)`, should work, and nbconvert will automatically create the correct html extensions. alternatively, you could hardcode the url pyro.ai/examples/tutorial_name.html

@martinjankowiak
Collaborator

@ae-foster when you get to this let's also fix the formatting errors in http://pyro.ai/examples/working_memory.html

@ae-foster
Contributor Author

ae-foster commented Oct 10, 2019

I've gone through and applied @martinjankowiak's suggestions.

why eig=False and use of ape?

Since w is defined only implicitly by the model, we need a large number of samples to compute the prior entropy, so it's better to estimate it once than to add extra noise when comparing designs. I can add a sentence to this effect, though it may confuse some people.
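
Concretely, writing APE for the average posterior entropy, the identity is

$$\mathrm{EIG}(d) \;=\; H\big[p(w)\big] \;-\; \mathbb{E}_{p(y \mid d)}\Big[H\big[p(w \mid y, d)\big]\Big] \;=\; H\big[p(w)\big] - \mathrm{APE}(d),$$

and since $H[p(w)]$ does not depend on the design $d$, ranking designs by APE is equivalent to ranking them by EIG while avoiding a separate noisy prior-entropy estimate for each design.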

i'm confused by what's going on in the second-to-last section

I rewrote this section. Have a look and see if it reads better.

Even though these are small binaries, we should upload them to AWS like the other datasets and read them from there, rather than committing them in this PR.

I take it you guys have to do that as you have the necessary AWS keys

a normal md link should work

I'm sticking with this then

let's also fix the formatting errors in http://pyro.ai/examples/working_memory.html

The only one I could see was a malformed numbered list near the beginning. I tried to fix this by throwing in some extra new lines, though it's hard to tell how it'll render. Were there other formatting errors we need to pick up?

@martinjankowiak
Collaborator

@neerajprad can you please help get the pickle files in this PR onto aws?

@martinjankowiak
Collaborator

@ae-foster looks good.

  • are you able to make docs locally? i think there's still some weird formatting
  • i still find the second-to-last section somewhat confusing. what is the message you're trying to get across?
  • re: "I can add a sentence to this effect, though it may confuse some people." i think you should explain this a bit in the text (although i guess you should keep the explanation short)

@ae-foster
Contributor Author

ae-foster commented Nov 4, 2019

Just pushed some more updates.

are you able to make docs locally? i think there's still some weird formatting

I went through the existing tutorial online and picked up a few other issues, which are now fixed.

i still find the second-to-last section somewhat confusing

I had another go at rewriting and simplifying this section. The general point is: having done the experimental design, we simulate actually running the experiment and conducting the data analysis, then compare the posterior to the prior. Nothing really intellectual here, just showing the complete experiment loop. If you think it's still confusing, it's no problem to drop this section altogether.
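
To spell out that flow with a toy stand-in (a single latent logit rather than the tutorial's actual model; the names `theta`, `y` and the numbers here are purely illustrative):

```python
import torch
import pyro
import pyro.distributions as dist
from pyro.infer import SVI, Trace_ELBO
from pyro.infer.autoguide import AutoDiagonalNormal
from pyro.optim import Adam

def model(n_respondents):
    # Toy polling model: one latent logit for the Democrat vote share.
    theta = pyro.sample("theta", dist.Normal(0., 0.2))
    with pyro.plate("respondents", n_respondents):
        return pyro.sample("y", dist.Bernoulli(logits=theta))

pyro.clear_param_store()
n = 1000
y_sim = model(n)  # 1. simulate running the chosen poll (a prior predictive draw)

# 2. condition on the simulated responses and fit a variational posterior
conditioned = pyro.condition(model, data={"y": y_sim})
guide = AutoDiagonalNormal(conditioned)
svi = SVI(conditioned, guide, Adam({"lr": 0.01}), Trace_ELBO())
for _ in range(2000):
    svi.step(n)

# 3. compare the fitted posterior over theta to the prior (theta is unconstrained,
# so the guide's posterior is directly over theta)
posterior = guide.get_posterior()
print("prior:     Normal(0.00, 0.20)")
print("posterior: Normal({:.2f}, {:.2f})".format(
    posterior.mean.item(), posterior.variance.sqrt().item()))
```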

i think you should explain this a bit in the text

I've added explanation as comments near where I put eig=False so it's clear what I'm referring to.

Overall, I think the tutorial is looking close to ready now. Did you get the files up on AWS?

@codecov-io

codecov-io commented Nov 4, 2019

Codecov Report

No coverage uploaded for pull request base (dev@31a72da).
The diff coverage is n/a.


@@          Coverage Diff           @@
##             dev    #2041   +/-   ##
======================================
  Coverage       ?   94.47%           
======================================
  Files          ?      201           
  Lines          ?    12781           
  Branches       ?        0           
======================================
  Hits           ?    12075           
  Misses         ?      706           
  Partials       ?        0


@martinjankowiak
Collaborator

looking great. a few last nits:

  • the 'previous tutorial' link and the 'working memory tutorial' link should point to http://pyro.ai/examples/working_memory.html
  • can you call the intermediates in compute_dem_probability something other than y
  • typos: 'If we have the option of first design the polling strategy using our prior'

i'll let you know once we have the AWS pickles sorted

@neerajprad
Member

@ae-foster, @martinjankowiak - The dataset files should be in https://d2hg8soec8ck9v.cloudfront.net/datasets/us_elections/ directory on aws.

e.g. https://d2hg8soec8ck9v.cloudfront.net/datasets/us_elections/us_presidential_election_data_test.pickle

@ae-foster
Contributor Author

Thanks, I've sorted those comments from @martinjankowiak and removed the data from the repo; instead we now have

!curl -sO "https://d2hg8soec8ck9v.cloudfront.net/datasets/us_elections/electoral_college_votes.pickle"
etc.

which should download the data to the right directory at the start

@martinjankowiak
Collaborator

great, thanks @ae-foster! can you please make the data downloading pythonic? see e.g. here
once that's done we'll get this merged!

@ae-foster
Contributor Author

Now using urlopen 👍
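
Roughly like this (a sketch; the `fetch` helper and the local caching are illustrative, the URLs are the CloudFront ones above):

```python
import os
from urllib.request import urlopen

BASE_URL = "https://d2hg8soec8ck9v.cloudfront.net/datasets/us_elections/"

def fetch(name):
    # Download a pickle from the CloudFront bucket unless it is already cached locally.
    if not os.path.exists(name):
        with urlopen(BASE_URL + name) as response, open(name, "wb") as f:
            f.write(response.read())
    return name

fetch("electoral_college_votes.pickle")
fetch("us_presidential_election_data_test.pickle")
```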

Collaborator

@martinjankowiak left a comment

great, thanks adam! i'll merge this as-is, although i notice some of the numbers have shifted a bit. did something change, is this just inherent stochasticity, or do things need to be trained for longer or what?

@martinjankowiak merged commit b7288f4 into pyro-ppl:dev on Nov 6, 2019
@ae-foster
Contributor Author

It's inherently stochastic because it depends on the simulated outcome of the experiment
