Some rows are not replicated onto final table (pg source and bigquery destination) #7590
Labels
area/connectors
Connector related issues
community
connectors/destination/bigquery
connectors/destinations-warehouse
type/bug
Something isn't working
Environment
Current Behavior
I'm replicating Postgres from CloudSQL to BigQuery using logical replication with pgoutput. The sync mode is incremental + dedup + history, and I sync every 30 minutes.
Every day, some rows are not updated in the final table. In my example, I have a final order_items table. If I search the order_items_scd table for a specific order_id, I see 12 rows, but if I query the final order_items table, only 6 rows are returned.
What I found is that the rows that were not carried over to the final order_items table have _airbyte_end_at set to a UTC timestamp and their _airbyte_active_row field set to 0. However, these rows are still active and appear in the source database.
If I do a full refresh, the missing rows appear in the final order_items table. This happens every day (we are missing rows from 20+ order IDs in one day).
Expected Behavior
Those rows should appear.
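The discrepancy described under Current Behavior could be checked with something like the sketch below. It is only an illustration, not part of the original report: it takes rows exported from order_items_scd as plain dictionaries and lists the primary keys that have history rows but no active row, which are the candidates for rows missing from the final table. The column names _airbyte_active_row, _airbyte_end_at and order_id come from the report; the primary key column name id, the assumption that 1 marks the active row, and the sample data are hypothetical.

```python
from collections import defaultdict


def keys_without_active_row(scd_rows, key="id"):
    """Return primary keys whose SCD history contains no active row."""
    has_active = defaultdict(bool)
    for row in scd_rows:
        # Assumption: _airbyte_active_row is 1 for the current version
        # of a record and 0 for superseded versions.
        has_active[row[key]] |= row["_airbyte_active_row"] == 1
    return sorted(k for k, active in has_active.items() if not active)


# Hypothetical sample data: two items of one order; item 2 has only a
# closed-out history row, so it would be absent from the final table.
scd_rows = [
    {"id": 1, "order_id": 100, "_airbyte_active_row": 0,
     "_airbyte_end_at": "2021-11-01T10:00:00Z"},
    {"id": 1, "order_id": 100, "_airbyte_active_row": 1,
     "_airbyte_end_at": None},
    {"id": 2, "order_id": 100, "_airbyte_active_row": 0,
     "_airbyte_end_at": "2021-11-01T10:30:00Z"},
]

print(keys_without_active_row(scd_rows))  # prints [2]
```

Any key returned here that still exists in the source database would match the symptom reported above.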
Steps to Reproduce
Are you willing to submit a PR?
no
Reference
Initially raised on: https://airbytehq.slack.com/archives/C01MFR03D5W/p1635771024396100