This repository was archived by the owner on Nov 11, 2022. It is now read-only.
I'm defining a pipeline that reads logs exported from App Engine to PubSub using the Log Export feature. I wrote a function that parses each LogEntry. Every LogEntry carries its own timestamp, and I would like to use that timestamp for downstream windowing operations.
The problem I'm encountering is that PubsubIO attaches its own timestamp to the incoming data, and the timestamp I want to use from the LogEntry is earlier than that one. Attempting to set the element's timestamp to that of the LogEntry results in an error:
java.lang.IllegalArgumentException: Cannot output with timestamp 2015-10-09T20:06:21.991Z. Output timestamps must be no earlier than the timestamp of the current input (2015-10-09T20:06:25.669Z) minus the allowed skew (0 milliseconds). See the DoFn#getAllowedTimestmapSkew() Javadoc for details on changing the allowed skew.
So, I can change the allowed skew, but what I would really like is for PubsubIO to optionally not attach a timestamp, so that we can always use the timestamp from the LogEntry without having to specify the skew manually.
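To make the rule in the error message concrete, here is a minimal sketch in plain Java (no Dataflow SDK dependency) of the check as the message describes it: an output timestamp is accepted only if it is no earlier than the input timestamp minus the allowed skew. The class and method names (`TimestampSkewCheck`, `isOutputAllowed`, `minimumSkew`) are hypothetical and exist only for illustration; this is not the SDK's implementation. The two timestamps are the ones from the error above.

```java
import java.time.Duration;
import java.time.Instant;

// Hypothetical helper, for illustration only; not part of the Dataflow SDK.
final class TimestampSkewCheck {

    // Mirrors the rule stated in the error message:
    // output must be no earlier than (input - allowedSkew).
    static boolean isOutputAllowed(Instant inputTs, Instant outputTs, Duration allowedSkew) {
        return !outputTs.isBefore(inputTs.minus(allowedSkew));
    }

    // Smallest skew that would let outputTs pass the check above.
    // Returns zero when the output is not earlier than the input.
    static Duration minimumSkew(Instant inputTs, Instant outputTs) {
        Duration gap = Duration.between(outputTs, inputTs);
        return gap.isNegative() ? Duration.ZERO : gap;
    }

    public static void main(String[] args) {
        // Timestamp attached by PubsubIO (the "current input" in the error).
        Instant pubsubTs = Instant.parse("2015-10-09T20:06:25.669Z");
        // Timestamp taken from the LogEntry.
        Instant logEntryTs = Instant.parse("2015-10-09T20:06:21.991Z");

        // With the default skew of 0 ms the output is rejected.
        System.out.println(isOutputAllowed(pubsubTs, logEntryTs, Duration.ZERO)); // false

        // The LogEntry is 3678 ms older than the PubsubIO timestamp.
        System.out.println(minimumSkew(pubsubTs, logEntryTs).toMillis()); // 3678

        // A skew of 5 seconds would be enough for this particular element.
        System.out.println(isOutputAllowed(pubsubTs, logEntryTs, Duration.ofSeconds(5))); // true
    }
}
```

The sketch also shows why a fixed skew is awkward here: the gap between the LogEntry timestamp and the PubsubIO timestamp varies per message, so any value chosen has to cover the worst-case export delay.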