RUBY-1228 Change Streams #888
Conversation
Force-pushed from 5c7d0b0 to 637e63f
I looked over this, and unfortunately I don't really have any useful feedback to provide. Everything seems fine to me.

Ok, thank you @saghm!
Force-pushed from 496d72e to 709737d
```ruby
process(get_more_operation.execute(@server))
else
read_with_retry do
```
I don't think getMores should be retried with the same cursor. The spec says to create a new cursor on a retryable error: https://github.com/mongodb/specifications/blob/master/source/change-streams.rst#resume-process
Right, the cursor created by the `ChangeStream` has the option `{ disable_retry: true }`, so the first branch, which doesn't wrap the `get_more` in the `read_with_retry` block, will be executed.
Thanks for the explanation.
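For context, the branching described above can be sketched roughly like this. This is a minimal illustration, not the driver's actual code: the `disable_retry` option name comes from the discussion, while the class and method names here are invented.

```ruby
# Hypothetical sketch: a cursor option that bypasses the retry wrapper,
# so resume logic can live in the change stream instead of the cursor.
class SketchCursor
  attr_reader :retries

  def initialize(options = {})
    @options = options
    @retries = 0
  end

  def get_more
    if @options[:disable_retry]
      execute_get_more                 # no retry here: the change stream resumes
    else
      read_with_retry { execute_get_more }
    end
  end

  private

  def execute_get_more
    :documents                         # stand-in for the real getMore operation
  end

  def read_with_retry
    yield
  rescue IOError
    @retries += 1
    yield                              # one retry on a network error
  end
end
```

With `disable_retry: true`, a network error would propagate immediately to the change stream instead of being retried at the cursor level.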
```ruby
# @since 2.5.0
#
# @yieldparam [ BSON::Document ] Each change stream document.
def each
```
I have a couple of questions. First, the spec says:

> A driver SHOULD attempt to kill the cursor on the server on which the cursor is opened during the resume process, and MUST NOT attempt to kill the cursor on any other server.

Does `close` ensure that the kill cursor request goes to the original server?

Second, `create_cursor` retries once and this method retries as well. That means it will retry on consecutive network errors. The spec says to retry only on the first network error; any errors that happen during the retry should be fatal. I think instead of `(@cursor || create_cursor!)` it should be something like `(@cursor || create_cursor_no_retry!)`. Do you agree, or am I misunderstanding the code?
> Does `close` ensure that the kill cursor request goes to the original server?

Yes. A cursor has a reference to the server (the `@server` variable). The kill cursors method called on the cursor at line 105 sends the operation to that server.
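A minimal sketch of that server pinning, with illustrative names rather than the driver's real classes: the server chosen at cursor creation is remembered and reused when the cursor is closed, matching the spec's requirement that the kill go only to the originating server.

```ruby
# Hypothetical sketch: the cursor pins the server it was created against,
# so close always targets the original server and never re-selects one.
class PinnedCursor
  attr_reader :server

  def initialize(server)
    @server = server          # remembered at creation time
  end

  def close
    kill_cursors_on(@server)  # always the original server
  end

  private

  def kill_cursors_on(server)
    server                    # stand-in for sending the killCursors operation
  end
end
```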
> Second, `create_cursor` retries once and this method retries as well. That means it will retry on consecutive network errors. The spec says to retry only on the first network error; any errors that happen during the retry should be fatal. I think instead of `(@cursor || create_cursor!)` it should be something like `(@cursor || create_cursor_no_retry!)`. Do you agree, or am I misunderstanding the code?

Yes, you're right. I shouldn't be retrying in the `create_cursor` method. I altered the logic a few commits ago; I believe it was correct before, so I'll change it back.
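The single-retry rule being discussed can be sketched as follows. This is a hedged illustration with invented names; the real resume process also re-runs the aggregation with a resume token, which is elided here.

```ruby
# Hypothetical sketch of "resume once": the first network error during a
# getMore triggers one resume attempt with a fresh cursor; a second
# consecutive error propagates to the caller as fatal.
class NetworkError < StandardError; end

class ResumeOnceStream
  def initialize(&create_cursor)
    @create_cursor = create_cursor
    @cursor = @create_cursor.call
  end

  def next_document
    @cursor.call
  rescue NetworkError
    @cursor = @create_cursor.call  # resume: new cursor, no retry wrapper
    @cursor.call                   # an error here is not rescued again
  end
end
```

A cursor factory whose first cursor fails with `NetworkError` and whose second succeeds yields a document after exactly one resume; two failing cursors in a row would raise out of `next_document`.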
```ruby
@change_stream_filters = pipeline
@options = options.freeze
@resume_token = @options[:resume_after]
create_cursor!
```
I thought I remembered reading that the initial aggregation command should not be retried, but I don't see it in the spec. I'll open a spec question.
Waiting for an answer: https://jira.mongodb.org/browse/SPEC-947
Thanks for pointing this out. I chose to retry it because we historically retry read operations in the driver (here is where we retry finds and aggregations, and here is where we retry a count, for example), and I thought it would be inconsistent not to retry the initial aggregation for a change stream. I'll see how the discussion goes in the spec ticket; thanks for opening it.
Thanks, this makes a lot more sense now that I know Ruby retries read operations already.
```ruby
# @since 2.5.0
def initialize(view, pipeline, options = {})
  @view = view
  @change_stream_filters = pipeline
```
Do you need to make a deep copy of the pipeline to ensure the user does not modify it after calling `watch`?

```ruby
pipeline = []
change_stream = coll.watch(pipeline, {})
pipeline << {'$project': {_id: 0}}
# Will the next cursor be created with [{'$changeStream': ...}, {'$project': {_id: 0}}]?
```

Maybe I'm being too paranoid...
haha, that is possible, I guess. Good call though, it wouldn't hurt to copy it.
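The aliasing in question is easy to demonstrate in plain Ruby, with no driver code involved: a shallow `dup` of the array already decouples it from later `<<` calls on the caller's side.

```ruby
# Demonstrates the aliasing concern: without a copy, mutating the user's
# pipeline array after watch would leak into later cursor re-creation.
user_pipeline = []

aliased = user_pipeline        # what storing the argument directly keeps
copied  = user_pipeline.dup    # shallow copy of the array itself

user_pipeline << { '$project' => { '_id' => 0 } }

aliased.length  # sees the caller's mutation
copied.length   # unaffected by it
```

Note that `dup` is shallow, so the stage hashes themselves are still shared; a deep copy would be needed to also guard against mutation of an individual stage.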
```ruby
private

def cache_resume_token(doc)
  unless @resume_token = doc[:_id]
```
Similarly, do you need to duplicate the _id to prevent a user from modifying it?
No, because ObjectIds are immutable and we are caching the ObjectId itself, not the document. If we were caching the document, then we might have to duplicate it. Thanks for checking though.
The `_id` here is a change stream resume token, which is a document.
Oh right, I forgot. hm, I guess I should duplicate it then. Thanks!
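To illustrate why a document-valued token needs copying, here is a plain Ruby sketch; the `_data` field name is illustrative, not taken from this PR.

```ruby
# A resume token that is a Hash can be mutated through the user's reference;
# caching doc[:_id] directly would see that mutation, while a dup would not.
doc = { :_id => { '_data' => 'token-bytes' } }

cached_alias = doc[:_id]       # same Hash object the user still holds
cached_copy  = doc[:_id].dup   # independent copy of the token document

doc[:_id]['_data'] = 'mutated'

cached_alias['_data']  # the cached token changed underneath us
cached_copy['_data']   # still the original value, safe for a later resume
```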
ChangeStream and #watch LGTM!
Force-pushed from 74a4055 to c3fec3e
Force-pushed from c3fec3e to 5cec119