
Do we want to track changes to package locations etc? #8

Closed
stephlocke opened this issue Oct 9, 2018 · 11 comments

@stephlocke
Contributor

Why: For Science!
How: Google Big Query or something as part of the build
Pros: Insight into CRAN
Cons: Could be judgey

@maelle
Member

maelle commented Oct 9, 2018

Technically, would the build on Travis update a Google BigQuery database that the take_snapshot function would then access, or would the BigQuery data be for archiving only? In either case, it'd be cool for each build not to query CRAN twice (once for the database and once for the dashboard). 🤔

@stephlocke
Contributor Author

BigQuery for archiving only, though it could also be a decent source if we wanted to read the dashboard from it.

@maelle
Member

maelle commented Oct 9, 2018

@stephlocke why BigQuery, btw?

@stephlocke
Contributor Author

It seemed like a cheap storage utility for this sort of simple accumulating data, plus it has some native ML capabilities built in, so we'd get to play 😉

@maelle
Member

maelle commented Oct 10, 2018

So we need to run code every hour to create the snapshot (with a bit more info, cf. #9) and send it to a BigQuery project. Probably with bigrquery + DBI.
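A minimal sketch of that hourly upload, assuming a BigQuery project named "cransays" with a dataset "history" (both names invented here) and assuming take_snapshot() returns a data frame of the current queue:

```r
library(DBI)

# bigrquery supplies the DBI driver; project and dataset names are assumptions
con <- dbConnect(
  bigrquery::bigquery(),
  project = "cransays",   # hypothetical GCP project name
  dataset = "history"     # hypothetical dataset name
)

# take one snapshot of the CRAN incoming queue (function mentioned above)
snapshot <- cransays::take_snapshot()

# append = TRUE accumulates rows in the table rather than overwriting it
dbWriteTable(con, "snapshots", snapshot, append = TRUE)

dbDisconnect(con)
```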

@maelle
Member

maelle commented Jan 6, 2020

The data is in the commit history of gh-pages at the moment, I suppose.

@stephlocke
Contributor Author

If dashboard.Rmd could append the data to a CSV as the start of our data-capture mechanism, that'd be good.

@maelle
Member

maelle commented Jan 6, 2020

So, two tasks here:

  • Use GH commits to retrieve past data.

  • Add the appending of a CSV to the data capture.
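The appending task could look something like this in dashboard.Rmd (a sketch; the file name "submission-history.csv" and the column layout are assumptions):

```r
# Hypothetical snapshot row; in practice this would come from the CRAN queue data
snapshot <- data.frame(
  snapshot_time = Sys.time(),
  package       = "somepackage",
  stringsAsFactors = FALSE
)

history_file <- "submission-history.csv"

# Write the header only the first time; afterwards, append rows without it
write.table(
  snapshot,
  file      = history_file,
  sep       = ",",
  row.names = FALSE,
  col.names = !file.exists(history_file),
  append    = file.exists(history_file)
)
```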

@stephlocke
Contributor Author

I'd focus on the append first. Looking through git versions of an HTML file (I believe we'd have to look at the compiled HTML file?) sounds a lot harder, and it'd be better to get fresh data capturing sooner. Backfilling is a nice-to-have!

@maelle
Member

maelle commented Jan 27, 2020

The workflow is rough at the moment: https://github.com/lockedata/cransays/blob/master/.github/workflows/master.yml (branch: https://github.com/lockedata/cransays/tree/history)

  • how do I create an orphan branch and then add CSV files to it without adding the rest? @stephlocke

  • add the actual submission time to the CSV.

Once the workflow is improved, add it to cron.yml.
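For the orphan-branch question, a sketch of one way to do it (demonstrated in a throwaway repo so nothing real is touched; the CSV name is invented):

```shell
# Set up a disposable demo repo with one tracked file on the default branch
cd "$(mktemp -d)"
git init -q .
git config user.email "ci@example.com" && git config user.name "ci"
echo "dashboard" > dashboard.Rmd
git add . && git commit -qm "main content"

# Create an orphan branch: it starts with no parent commits, but the index
# and worktree still carry the files from the previous branch
git checkout -q --orphan history

# Unstage everything inherited from the old branch, then remove the
# now-untracked files from the worktree (careful: clean -fd deletes files)
git rm -rfq --cached .
git clean -fdq

# Add only the CSV and commit it as the branch's first commit
echo "snapshot_time,package" > history.csv
git add history.csv
git commit -qm "start CSV history"

git ls-files   # only history.csv is tracked on this branch
```

From there, pushing the branch (`git push origin history`) would publish a history branch containing nothing but the CSV.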

@hadley
Contributor

hadley commented Sep 12, 2020

I think this was done in aa0ab6b

@maelle maelle closed this as completed Sep 14, 2020