Update the SQL script used for generating latest snapshot. by sophie4869 · Pull Request #703 · firebase/extensions

sophie4869 · 2021-07-21T11:49:40Z

The old script uses FIRST_VALUE and OVER, which sorts the entire changelog and finds the first record for each document. It can result in a memory issue when running BigQuery reading from the latest snapshot. (Resources exceeded during query execution: The query could not be executed in the allotted memory. Peak usage: 110% of limit. Top memory consumer(s): sort operations used for analytic OVER() clauses: 96%)

The updated script selects the maximum timestamp for each document_id, and joins back with the table by the latest timestamp instead.

The old script uses FIRST_VALUE and OVER, which sorts the entire changelog and finds the first record for each document. It can result in a memory issue when running BigQuery reading from the latest snapshot. (Resources exceeded during query execution: The query could not be executed in the allotted memory. Peak usage: 110% of limit. Top memory consumer(s): sort operations used for analytic OVER() clauses: 96%) The updated script selects the maximum timestamp for each document_id, and joins back with the table by the latest timestamp instead.

…estamp.

dackers86 · 2021-07-23T12:13:10Z

Thanks @sophie4869.

This will be tested internally and has been added to the tracker for reviewing.

…chema view. There's no need to find the latest value again.

dackers86 · 2021-08-10T13:02:42Z

@sophie4869 There is a slight delay for reviewing this, I am currently experiencing installation issues...

ext-firestore-bigquery-export-fsexportbigquery
{"@type":"type.googleapis.com/google.cloud.audit.AuditLog","status":{"code":3,"message":"Build failed: npm ERR! cipm can only install packages when your package.json and package-lock.json or npm-shrinkwrap.json are in sync. Please update your lock file with `npm install` before continuing.\nnpm ERR! \nnpm ERR! \nnpm ERR! Missing: @firebaseextensions/firestore-bigquery-change-tracker@^1.1.10\nnpm ERR! \n\nnpm ERR! A complete log of this run can be found in:\nnpm ERR! /www-data-home/.npm/_logs/2021-08-10T12_35_21_209Z-debug.log; Error ID: beaf8772"},"authenticationInfo":{"principalEmail":"219368645393@cloudservices.gserviceaccount.com"},"serviceName":"cloudfunctions.googleapis.com","methodName":"google.cloud.functions.v1.CloudFunctionsService.CreateFunction","resourceName":"projects/extensions-testing/locations/us-central1/functions/ext-firestore-bigquery-export-fsexportbigquery"}

This is perhaps related to #701

dackers86 · 2022-01-05T14:36:25Z

Hi @sophie4869.

The latest updates from the next branch will now fix npm errors when installing locally.

With these updates, CI should also now run the tests which also need to be updated to match the changes you have made.

The updates look great after reviewing on a test installation - If you can update the above we can look at getting this approved. Any questions, let me know!

dgilperez · 2022-03-14T13:43:53Z

+1 to solving this problem. I could work on this if a hand is required

dackers86 · 2022-03-14T14:17:20Z

Thanks @dgilperez. We are appreciate prs/updates from the community (and provide credit for contributions).

Otherwise this is still in our backlog to update/complete.

dgilperez · 2022-03-14T15:32:02Z

Thanks for the quick reply @dackers86. Shall I open a new PR as a fork from this one, or @sophie4869 do you want me to work on your fork (I guess you'll need to grant me permissions). I will do the former if there is no quick response from Sophie, if that's OK

sophie4869 · 2022-03-14T20:51:27Z

I can take a look by the end of the week. Does that work for you?

…

On Mon, 14 Mar 2022, 16:32 David Gil, ***@***.***> wrote: Thanks for the quick reply @dackers86 <https://github.com/dackers86>. Shall I open a new PR as a fork from this one, or @sophie4869 <https://github.com/sophie4869> do you want me to work on your fork (I guess you'll need to grant me permissions). I will do the former if there is no quick response from Sophie, if that's OK — Reply to this email directly, view it on GitHub <#703 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ABHYQDF5TB6D35I4ZNIVYITU75LX3ANCNFSM5AXZMTBA> . Triage notifications on the go with GitHub Mobile for iOS <https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675> or Android <https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>. You are receiving this because you were mentioned.Message ID: ***@***.***>

dgilperez · 2022-03-15T15:49:16Z

@sophie4869 I created a new PR at #915, with your changes rebased. I could not run the test suite (did not know how) and did not hook the extension to a real Firebase project either.

But I am running your queries in my BigQuery views with good results 👍

meyerovb · 2022-11-03T17:22:59Z

Will this cover if there are null time stamps or the latest entry for a document has two entries with the same timestamp?

cabljac · 2022-11-07T14:34:06Z

Closing this as reopened (these commits rebased on next) and tracked in a PR here: #1288

google-cla Bot added the cla: yes Author has signed the CLA label Jul 21, 2021

sophie4869 force-pushed the sophie-next branch from ffbf0ce to 35cee76 Compare July 21, 2021 13:47

Change to GROUP BY document_name instead for selecting the latest tim…

b2d7259

…estamp.

gen-schema-view actually takes a latest view to generate the latest s…

ddc062b

…chema view. There's no need to find the latest value again.

dackers86 added the needs: author feedback Pending additional information from the author label Jan 6, 2022

dgilperez mentioned this pull request Mar 14, 2022

Update the SQL script used for generating latest snapshot II #915

Merged

cabljac mentioned this pull request Nov 3, 2022

[firestore-bigquery-export] Resources exceeded during query execution error #757

Closed

cabljac mentioned this pull request Nov 7, 2022

fix: fix resource error caused by latest snapshot script #1288

Closed

cabljac closed this Nov 7, 2022

cabljac mentioned this pull request Nov 8, 2022

fix(firestore-bigquery-export): update snapshot script #1289

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update the SQL script used for generating latest snapshot.#703

Update the SQL script used for generating latest snapshot.#703
sophie4869 wants to merge 3 commits intofirebase:nextfrom
sophie4869:sophie-next

sophie4869 commented Jul 21, 2021

Uh oh!

dackers86 commented Jul 23, 2021

Uh oh!

dackers86 commented Aug 10, 2021

Uh oh!

dackers86 commented Jan 5, 2022

Uh oh!

dgilperez commented Mar 14, 2022

Uh oh!

dackers86 commented Mar 14, 2022

Uh oh!

dgilperez commented Mar 14, 2022

Uh oh!

sophie4869 commented Mar 14, 2022 via email

Uh oh!

dgilperez commented Mar 15, 2022

Uh oh!

meyerovb commented Nov 3, 2022

Uh oh!

cabljac commented Nov 7, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

sophie4869 commented Jul 21, 2021

Uh oh!

dackers86 commented Jul 23, 2021

Uh oh!

dackers86 commented Aug 10, 2021

Uh oh!

dackers86 commented Jan 5, 2022

Uh oh!

dgilperez commented Mar 14, 2022

Uh oh!

dackers86 commented Mar 14, 2022

Uh oh!

dgilperez commented Mar 14, 2022

Uh oh!

sophie4869 commented Mar 14, 2022 via email

Uh oh!

dgilperez commented Mar 15, 2022

Uh oh!

meyerovb commented Nov 3, 2022

Uh oh!

cabljac commented Nov 7, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants