PL: Write 2020 workshops to a gsheet #33089
Conversation
Write serialized information about workshops to a gsheet. The workshops are CSD/CSP 5-day summer, CSF intro, and CSF deepdive. The dates are May 15th 2020 through August 31st 2020. The write is done 30 minutes after every second hour.
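The schedule described above ("30 minutes after every second hour") maps to a standard cron expression; how the entry is actually registered in this repo may differ, so treat this as an illustrative sketch:

```
# Run at minute 30 past every 2nd hour; the script itself handles
# the May 15 - Aug 31 date window.
30 */2 * * * bin/cron/summer_workshops_to_gdrive
```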
# Subsequent rows are for each workshop.
Pd::Workshop.where(subject: subjects).find_each.each do |w|
  if w.workshop_starting_date && w.workshop_starting_date >= Date.new(2020, 5, 15) && w.workshop_starting_date <= Date.new(2020, 8, 31)
    workshops << Api::V1::Pd::WorkshopDownloadSerializer.new(w).attributes.values
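The script-side filter above boils down to a date-window predicate. A minimal standalone sketch in plain Ruby (method and constant names are illustrative, not from the PR):

```ruby
require 'date'

# Date window from the PR description: May 15th 2020 through August 31st 2020.
SUMMER_WINDOW = Date.new(2020, 5, 15)..Date.new(2020, 8, 31)

# True when a workshop has a starting date that falls inside the window,
# mirroring the nil check and range comparison in the script.
def in_summer_window?(starting_date)
  !starting_date.nil? && SUMMER_WINDOW.cover?(starting_date)
end

puts in_summer_window?(Date.new(2020, 6, 1))  # inside the window
puts in_summer_window?(Date.new(2020, 9, 1))  # after the window
puts in_summer_window?(nil)                   # workshop with no start date
```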
Quick check on the cost of doing this filtering in the script instead of as part of our query:
- We're loading all workshops ever for these three subjects, rather than only those in a very specific range.
- Making the range part of the query would require a join against the first session and might be more likely to introduce a bug.
Given the number of rows we're working with, I suspect this is fine through August of this year. Instincts on when we'd want to rethink this?
Yes, I wanted to avoid the join and also adding actual SQL to do the range query, and did have a similar concern about complexity. It currently takes about 30 seconds to do the query, which returns 4341 workshops before we filter by date. Those workshops go back to 2016. I think that we could cope with this query even taking several minutes, which I think will give us a few more years of creating workshops.
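For reference, the join-based alternative being weighed might look roughly like the sketch below. This is NOT the PR's approach; the association and column names (a sessions association backed by a pd_sessions table with a start column) are assumptions, and as discussed it could introduce subtle bugs, e.g. around multi-session workshops:

```ruby
# Hypothetical query-side filter -- names are illustrative assumptions.
Pd::Workshop.
  where(subject: subjects).
  joins(:sessions).
  where(pd_sessions: {start: Date.new(2020, 5, 15)..Date.new(2020, 8, 31)}).
  distinct
```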
Sounds good - thanks for walking me through the napkin-math on this!
👍
I don't know the whole context here, but this looks very similar to something I did last year here -- if this is in place of that, it might be worth tearing that code out (and removing its cron schedule entry) while you're at it.
@bencodeorg Good catch. I've prepped a removal at #33094.
bin/cron/summer_workshops_to_gdrive
Outdated

workshops << Api::V1::Pd::WorkshopDownloadSerializer.new(Pd::Workshop.first).attributes.keys.map(&:to_s)

# Subsequent rows are for each workshop.
Pd::Workshop.where(subject: subjects).find_each.each do |w|
From the example at https://api.rubyonrails.org/classes/ActiveRecord/Batches.html, it seems it's not necessary to do both find_each and each?
Oops, good catch. That was left over from temporary testing.
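The redundancy works because find_each called without a block returns an Enumerator, so chaining .each still iterates the records once; the extra .each just adds noise. A plain-Ruby analogue (since find_each itself needs ActiveRecord):

```ruby
# Array#each without a block returns an Enumerator, just as find_each does,
# so .each.each iterates exactly the same elements as a single .each.
records = %w[a b c]

direct = []
records.each { |r| direct << r }

chained = []
records.each.each { |r| chained << r }  # .each with no block -> Enumerator

puts direct == chained  # both collect the same elements in the same order
```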
The destination tab is summer_workshops (auto), and we also write some information to a tab called summer_workshops_meta (auto). This writes to the same gsheet as #32609 and works very similarly.