
Questions about Fault-Tolerance and Exactly-Once Delivery mechanisms #58

Closed
yarmiganosca opened this issue Oct 9, 2019 · 7 comments

@yarmiganosca

The current docs for this connector state

Messages will neither be duplicated nor silently dropped. Messages will be delivered exactly once, or an error message will be generated. If an error is detected while loading a record (for example, the record was expected to be a well-formed JSON or Avro record, but wasn’t well-formed), then the record is not loaded; instead, an error message is returned.

I have several questions about this:

  1. Can you explain what the mechanism is for ensuring that rows are only inserted once into the ingest table?
  2. How is that mechanism tolerant of tasks crashing and restarting at a particular offset? Or of having that partition assigned to another task while the first is down?
  3. How can I see these error messages?
  4. Will one row in a batch causing an error stop the rest of the batch?

Later in the same section you call attention to

Instances of the Kafka connector do not communicate with each other. If you start multiple instances of the connector on the same topics or partitions, then multiple copies of the same row might be inserted into the table. This is not recommended; each topic should be processed by only one instance of the connector.

  1. When you use the word "instance" here, does that mean connector registration, or connector task? (I'm pretty sure I know the answer to this one, but I just want to have it explicitly confirmed).
@raphaelauv

raphaelauv commented Apr 6, 2021

  1. From Snowflake support: every file sent from the connector to Snowflake (containing x rows of messages) is stored and kept in an S3 bucket (for AWS), and they compute its MD5 so they never ingest the same file twice.

So it's absolutely not enough for a stream process, where file size and content are not deterministic. Far from exactly-once!
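The gap between MD5-based file de-duplication and exactly-once can be sketched like this (a minimal illustration with hypothetical names, not the actual Snowpipe implementation): ingestion is keyed on the file's digest, so a byte-identical re-upload is skipped, but the same rows re-chunked into a differently sized file get a new digest and are ingested again, duplicating data.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.HashSet;
import java.util.Set;

// Minimal sketch (hypothetical names, not the real Snowpipe code) of
// MD5-keyed file de-duplication: a byte-identical re-upload is skipped,
// but the same rows chunked into a different file get a new digest and
// are ingested again, which is why this alone is not exactly-once.
public class Md5Dedup {
    private final Set<String> seenDigests = new HashSet<>();

    /** Returns true if the file was ingested, false if skipped as a duplicate. */
    public boolean ingest(byte[] fileContent) {
        try {
            MessageDigest md5 = MessageDigest.getInstance("MD5");
            StringBuilder hex = new StringBuilder();
            for (byte b : md5.digest(fileContent)) {
                hex.append(String.format("%02x", b));
            }
            return seenDigests.add(hex.toString()); // add() returns false on duplicates
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        Md5Dedup pipe = new Md5Dedup();
        byte[] fileA = "msg1\nmsg2\n".getBytes(StandardCharsets.UTF_8);
        // Same two rows plus one more, flushed after a restart as one bigger file:
        byte[] fileB = "msg1\nmsg2\nmsg3\n".getBytes(StandardCharsets.UTF_8);
        System.out.println(pipe.ingest(fileA)); // true:  new digest
        System.out.println(pipe.ingest(fileA)); // false: exact duplicate skipped
        System.out.println(pipe.ingest(fileB)); // true:  msg1/msg2 are now duplicated
    }
}
```

The third call is the failure mode described above: overlapping rows in a differently chunked file sail straight past digest-based de-duplication.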

  2. At-least-once could be assumed if offsets were committed ONLY once a file has been correctly received by Snowflake and Snowpipe has asserted that the file can be ingested (no S3 consistency error in the Snowpipe ingestor), and the file has not been deleted before ingestion by the unsafe file-cleaning thread launched by the connector.

  3. From the recent commits I can see that logs are not verbose and not well managed.

  4. A few issues mention silent errors in case of problems with Avro, so one would need to check the source code.

After a first look at the code:

There is a huge amount of really NOT elegant concurrent code (locks and shared data structures to track "current work") and WRONG concurrent code (values shared between threads that are neither atomic nor even volatile). There are absolutely no assertions on the concurrent operations, and there are some weird manual operations on the committed offsets. The logic is way too complex and over-engineered for what it should be.

  1. The documentation is really not good. I think, like you, that they mean "connector registration", not task, since it's okay to have X tasks; no partition will be consumed more than once.

Conclusion: this connector is really weakly written and tested.

@raphaelauv

raphaelauv commented Apr 14, 2021

To be exactly-once, the flush of events has to be deterministic (if the connector restarts, it always recreates the same files, with the same number of events from the same offsets, like msg 5144 to 5155).

But with a NON-deterministic time-based flush rule it's impossible to be exactly-once ->

return (System.currentTimeMillis() - this.previousFlushTimeStamp) >= (getFlushTime() * 1000);

and even this logic is buggy ->
#245

The flush should be based on the time of the events (see the Confluent docs -> https://docs.confluent.io/kafka-connect-s3-sink/current/index.html#exactly-once-delivery-on-top-of-eventual-consistency)

With that done, and if Snowpipe really never re-ingests the same file (based on file name and MD5), then it will be exactly-once.
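The contrast can be sketched like this (hypothetical names; a simplified model, not the connector's code): the wall-clock predicate quoted above depends on when the task happens to run, while an offset-based predicate is a pure function of the offsets, so a restarted task reproduces the same file boundaries.

```java
// Sketch (hypothetical names, not the connector's code) contrasting a
// wall-clock flush predicate with a deterministic, offset-based one.
public class FlushPolicy {
    static final long FLUSH_TIME_SEC = 120;
    static final long FLUSH_RECORD_COUNT = 12;

    /** Non-deterministic: the answer changes with wall-clock time, so a
     *  restarted task may cut files at different offsets. */
    static boolean shouldFlushByTime(long previousFlushTimestampMs) {
        return (System.currentTimeMillis() - previousFlushTimestampMs) >= FLUSH_TIME_SEC * 1000;
    }

    /** Deterministic: depends only on offsets, so a restart always
     *  recreates the same file, e.g. msg 5144 to 5155. */
    static boolean shouldFlushByOffset(long firstBufferedOffset, long currentOffset) {
        return currentOffset - firstBufferedOffset + 1 >= FLUSH_RECORD_COUNT;
    }

    public static void main(String[] args) {
        System.out.println(shouldFlushByOffset(5144, 5154)); // false: only 11 records buffered
        System.out.println(shouldFlushByOffset(5144, 5155)); // true:  flush file 5144-5155
    }
}
```

With the offset-based rule, replaying the same input after a crash yields byte-identical files, which is exactly what a name/MD5 de-duplication check needs to be effective.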

@raphaelauv

raphaelauv commented Apr 14, 2021

The connector does not wait for a Snowpipe confirmation of the ingestion of the files before internally updating the offsets to commit at the next flush call by the kafka-connect framework.

->

So if anything happens to this file before Snowpipe ingests it, or if the file is impossible to ingest, the data could be lost (since not every topic has unlimited retention).

From my understanding the connector is not even at-least-once.
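The gating being asked for here could be sketched like this (hypothetical names; a simplified model of a safe precommit, not the connector's actual logic): the offset reported back to the framework never advances past a file whose ingestion has not been confirmed, so a crash leads to replay instead of data loss.

```java
import java.util.TreeMap;

// Sketch (hypothetical names, not the connector's code) of offset gating:
// precommit only advances past files whose ingestion has been confirmed,
// so unconfirmed data is replayed after a crash instead of being lost.
public class CommittedOffsetGate {
    // Files uploaded but not yet confirmed: startOffset -> endOffsetExclusive.
    private final TreeMap<Long, Long> unconfirmed = new TreeMap<>();
    private long highWatermark = 0; // end of the latest uploaded file

    void fileUploaded(long start, long endExclusive) {
        unconfirmed.put(start, endExclusive);
        highWatermark = Math.max(highWatermark, endExclusive);
    }

    /** Called once ingestion of the file starting at `start` is confirmed. */
    void fileConfirmed(long start) {
        unconfirmed.remove(start);
    }

    /** Offset safe to report at precommit: never beyond the earliest
     *  still-unconfirmed file, so its records would be replayed. */
    long committableOffset() {
        return unconfirmed.isEmpty() ? highWatermark : unconfirmed.firstKey();
    }

    public static void main(String[] args) {
        CommittedOffsetGate gate = new CommittedOffsetGate();
        gate.fileUploaded(0, 100);
        gate.fileUploaded(100, 200);
        System.out.println(gate.committableOffset()); // 0:   nothing confirmed yet
        gate.fileConfirmed(0);
        System.out.println(gate.committableOffset()); // 100: second file still pending
        gate.fileConfirmed(100);
        System.out.println(gate.committableOffset()); // 200: everything confirmed
    }
}
```

The cost of this scheme is possible duplication on replay, which is the expected trade: at-least-once instead of best-effort.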

@sfc-gh-japatel
Collaborator

Hi @raphaelauv, thank you for giving the feedback and looking into the code in detail.

  1. The Kafka connector is not exactly-once - as you correctly pointed out, since the flush time is non-deterministic.
  2. Snowpipe will not re-ingest the file again, since there is de-duplication logic on the server side. But let's say, for instance, a file with offsets 0-100 is created and ingested, but precommit was not successful; there is a possibility that next time a file with offsets 0-99 is created. Since this is a new file for Snowpipe, there can be data duplication, but this is extremely rare.
  3. Regarding at-least-once: if there is any failure in Snowpipe, we would put the file in the table stage as failure recovery. (A table stage comes with every Snowflake table.)

CC: @sfc-gh-zli

@raphaelauv

raphaelauv commented Apr 23, 2021

Hello @sfc-gh-japatel

about 1 -> the documentation needs corrections

about 2 -> the connector is not exactly-once; what you do to LIMIT the duplication is of no interest.

about 3 -> It's a Snowflake connector, not an S3 connector. I expect my data to be available in a Snowflake table.

The connector should not commit offsets if some data is not ingested, because having a corrupted file in a bucket is not what I expect.

I already see 3 cases that make the connector not at-least-once:

  • If the connector does not deserialize events correctly (bug in the code of the connector), then data will be corrupted.

    And there is already such a case: if the schema registry is not available, for any reason, to answer this line:

    then by default the connector does not fail and skips the message by writing it to table_stage.
    break.on.schema.registry.error should be true by default.

  • if the file sent is corrupted by the network (let me guess: you do not assert the MD5 of the sent file)

  • if the file is moved or deleted by the connector or any other S3 policy

    and there is such a case: files are moved if Snowpipe has more than 1 hour of lag: Files older than 1 hour are moved to table stage #172


Conclusion: by default the connector should be at-least-once and not a best-effort connector, or say so in your documentation.
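The fail-fast behavior requested for the first case could be sketched like this (`break.on.schema.registry.error` is the property named above; everything else is a hypothetical illustration, not the connector's code): with the flag on, a deserialization failure stops the task so the record is retried later; with it off, the record is diverted to the table stage and the offset advances past it.

```java
import java.nio.charset.StandardCharsets;

// Sketch (hypothetical names, not the connector's code) of the two
// error-handling modes: breakOnError=true fails the task so the record
// is replayed (at-least-once); breakOnError=false diverts the record to
// the table stage and moves on, silently dropping it from the target table.
public class DeserializePolicy {
    static String handleRecord(byte[] record, boolean breakOnError) {
        try {
            return deserialize(record);
        } catch (RuntimeException e) {
            if (breakOnError) {
                throw e; // fail the task: offset not committed, record retried
            }
            return "sent-to-table-stage"; // best-effort: record skipped
        }
    }

    static String deserialize(byte[] record) {
        if (record == null) { // stand-in for "schema registry unavailable"
            throw new RuntimeException("schema registry unavailable");
        }
        return new String(record, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        System.out.println(handleRecord(null, false)); // sent-to-table-stage
        try {
            handleRecord(null, true);
        } catch (RuntimeException e) {
            System.out.println("task failed: " + e.getMessage());
        }
    }
}
```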

@sfc-gh-rcheng
Collaborator

Snowpipe does not guarantee exactly-once. Snowpipe Streaming does guarantee exactly-once.

Closing this issue out due to age - please reopen if further discussion is needed.

@raphaelauv

@sfc-gh-japatel it's about at-least-once: do you have a formal proof that the connector is at-least-once?
