Add `Transaction` object #1276

josenavas · 2015-06-19T16:13:00Z

This PR is the first of a series of PR to be able to modify Qiita DB to make use of transactions everywhere.

As discussed in the last meeting, instead of modifying the current queue system it will be better to add a Transaction object that encapsulates the transaction.

A few changes from the original design:

The placeholders, instead of being of the form {#:#} -> {query_idx:result_idx} are of the form {#:#:#} -> {query_idx:row_id:value_idx}. This allow for more flexibility and it is easier to reference an actual result. It also removes the need of flattening the results.
The execute method has a boolean parameter commit, optional and default True. This allows to execute all the current queries in the transaction without committing it, which is useful if you need an intermediate result of the transaction to execute some python code (e.g. instantiate an object to retrieve some other information needed for adding more queries to the DB)
The execute methods returns always all the results. You can retrieve then a specific result using the index, or using [-1] to retrieve the result of the last query. This last part was impossible before unless you knew how many results where returned by the last query.

The tests are expected to fail, since the rest of the code has not been modified. Those will be further PR.

…into improve-sql-queues-system

ElDeveloper · 2015-06-19T17:42:24Z

qiita_db/sql_connection.py

+        Parameters
+        ----------
+        sql : str
+            The sql query


minor sql -> SQL

josenavas · 2015-06-19T21:23:24Z

Thanks @ElDeveloper ! Comments addressed

ElDeveloper · 2015-06-19T21:42:29Z

Awesome, thanks! 👍

On (Jun-19-15|14:23), josenavas wrote:

Thanks @ElDeveloper ! Comments addressed

Reply to this email directly or view it on GitHub:
#1276 (comment)

squirrelo · 2015-06-20T15:30:52Z

qiita_db/test/test_sql_connection.py

+        self.assertEqual(obs._index, 0)
+        self.assertTrue(isinstance(obs._conn_handler, SQLConnectionHandler))
+
+    def test_replace_placeholders(self):


can you add a test here for when there are a few inserts or updates followed by another select, so for example res5a would be next in the results list, as query 5.

Basically we need a test for when you have a select, followed by inserts that use the select, then anther select, then inserts that use the second select.

josenavas · 2015-06-24T00:59:18Z

@squirrelo, exactly, but sometimes to generate those files you need the data in the DB. Transactions allow you to execute some queries, try to generate the files, and then commit them. There is no other way of doing this consistency that executing one (or many) SQL queries w/o committing and then commit after everything is in place...

squirrelo · 2015-06-24T01:04:30Z

Is there a way of querying inside the transaction cursor, as opposed to the database itself? That way you are querying as if the items are already committed, but they actually aren't. I know sqlalchemy can do this.

Alternately, if you need the query information, you can make the query info list available through a function, e.g. get_result(index) so you can use the SQL results without having to commit.

josenavas · 2015-06-24T01:10:23Z

@squirrelo What you're proposing is just executing the query and not commit it. You're saying this:

with connect(params) as con:
    with con.get_cursor() as cur:
        try:
            cur.execute("DO STUFF")
        except:
            con.rollback()
     if checks_pass:
         con.commit()
     else:
         con.rollback()

This the exact same thing that I'm proposing with the commit=False parameter, but the transaction object allows you to do this like this:

with Transaction('t') as t:
    t.add("DO STUFF")
    t.execute(commit=False)
    if not check_pass:
        t.rollback()

Don't let the terminology confuse you, you don't do anything "inside" a cursor, you do it directly to the DB, the difference is that you commit it or not.

squirrelo · 2015-06-24T01:12:26Z

Actually what I'm proposing is more like:

with Transaction('t') as t:
    t.add("DO STUFF")
    t.add("QUERY NEEDED STUFF")
    needed_info = t.get_results(t.index-1)
    # use needed info to make files

josenavas · 2015-06-24T01:14:30Z

@squirrelo

And how can I return the result if I don't execute the query w/o committing?
Your proposal, will be doing this internally:

def get_results(index):
    res = self.execute(commit=False)
    return res[-1]

squirrelo · 2015-06-24T01:29:37Z

I've found a thing that may fix all our problems: Tornado sessions object This locks a transaction thread until explicitly freed. If this works the way I think it does, we really should be using this as our backend instead of a straight connection like we do now.

josenavas · 2015-06-24T01:32:35Z

You're proposing 2 things:

Removing psycopg2 as a dependency
Coupling qiita_db with tornado.

I don't agree with this, we have already seen that is a bad idea coupling qiita_pet and qiita_db...

josenavas · 2015-06-24T01:37:51Z

Also, I'm not seeing anything about commit/rollback on that documentation... I don't even know how this would work...

squirrelo · 2015-06-24T01:38:42Z

That's fair. ISOLATION_LEVEL_SERIALIZABLE might be the best in this case then, which if I'm reading correctly would make the cursor essentially stall the database while running the execution. This is the only way I can see not having the issue of running execute(commit=False) and not worrying about the database changing before the actual database execute with commit.

josenavas · 2015-06-24T01:40:19Z

No, that would not solve the issue.

Let me resume the issue that we have in one sentence:
We are doing commits that we shouldn't do

That's the only problem that we're having, I don't know how else I can explain this.

squirrelo · 2015-06-24T01:41:49Z

I get that, but my worry is something else, like another user, starts a transaction and adds something we needed between the commit=false and the actual commit. If this is not a worry and I'm just being paranoid, then OK, but just want to make sure.

josenavas · 2015-06-24T01:44:42Z

The question is, how another user is going to add something that we need? If we need it, we shouldn't be relying in another user to add it.

Also, with commit = False, we are locking the tables, so no other user can modify those tables, which is the idea of SQL transaction.

squirrelo · 2015-06-24T01:45:07Z

OK, cool. I'm paranoid then. Carry on.

ElDeveloper · 2015-06-24T05:13:27Z

We can make clear in the documentation that it will commit automatically, ...

Seems like a fine solution to me!

josenavas · 2015-06-24T05:27:06Z

@ElDeveloper @squirrelo This is ready for another review.

ElDeveloper · 2015-06-24T14:18:04Z

qiita_db/sql_connection.py

+            elif self._queries:
+                # There are still queries to be executed, execute them
+                # It is safe to use the execute method here, as internally is
+                # wrapped in a tr/except and rollbacks in case of failure


ElDeveloper · 2015-06-24T14:55:56Z

BTW, 👍.

squirrelo · 2015-06-24T15:03:15Z

qiita_db/sql_connection.py

+    def __exit__(self, exc_type, exc_value, traceback):
+        # We need to wrap the entire function in a try/finally because
+        # at the end of the function we need to set _is_inside_context to false
+        try:


This try/finally can be removed by just moving self._is_inside_context = False tothe first line of this function. Regardless of what happens after, you are always going to be outside the context when exit is called.

Nope, I can't. If I move the self._is_inside_context = False to the beginning of the function, any call to the Transaction methods (rollback, execute and commit) done in this method will raise a RuntimeError

josenavas · 2015-06-24T15:48:27Z

Ready for another review!

Add `Transaction` object

josenavas added 11 commits June 16, 2015 10:49

Removing dumb comment

c86a582

Adding the Transaction object

5ae857a

Removing unused code from the SQLConnectionHandler

59234d3

Adding tests

e12918e

Adding missing tests

400821b

Fixing error messages

4b26d69

Fixing regex and adding commit parameter to execute

9c7f43f

Fixing some code and improving tests

8ff6e33

Adding commit parameters and commit and rollback methods

691157e

Fixing bug in partially executed transactions and added a specific test

2e3b3f7

Removing unused code

03cc701

josenavas added refactor priority: high labels Jun 19, 2015

josenavas added this to the Alpha 0.2 milestone Jun 19, 2015

This was referenced Jun 19, 2015

WIP: Improve SQL queues #1262

Closed

Fix environment_manager.py #1277

Closed

Merge branch 'improve-sql-queues' of https://github.com/biocore/qiita …

2b88f46

…into improve-sql-queues-system

ElDeveloper reviewed Jun 19, 2015
View reviewed changes

Addressing @ElDeveloper's comments

45da838

Addressing missing comment

d1fdfd1

squirrelo reviewed Jun 20, 2015
View reviewed changes

josenavas added 3 commits June 23, 2015 19:34

Executing queries at __exit__

6eec39b

Making sure that the queues are cleaned up on rollback and fixing tests

a36a4af

Improve docs

6e1959f

josenavas added 2 commits June 23, 2015 22:23

Making sure that the methods are only invoked inside the context

c7d1473

Improve documentation

aaa1f9b

ElDeveloper reviewed Jun 24, 2015
View reviewed changes

squirrelo reviewed Jun 24, 2015
View reviewed changes

josenavas added 2 commits June 24, 2015 08:42

Improving documentation

4170efb

Addressing comments

af1cf92

squirrelo added a commit that referenced this pull request Jun 24, 2015

Merge pull request #1276 from josenavas/improve-sql-queues-system

5f1c1b1

Add `Transaction` object

squirrelo merged commit 5f1c1b1 into qiita-spots:improve-sql-queues Jun 24, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `Transaction` object #1276

Add `Transaction` object #1276

josenavas commented Jun 19, 2015

ElDeveloper Jun 19, 2015

josenavas Jun 19, 2015

josenavas commented Jun 19, 2015

ElDeveloper commented Jun 19, 2015

squirrelo Jun 20, 2015

squirrelo Jun 20, 2015

josenavas Jun 20, 2015

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

josenavas commented Jun 24, 2015

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

ElDeveloper commented Jun 24, 2015

josenavas commented Jun 24, 2015

ElDeveloper Jun 24, 2015

josenavas Jun 24, 2015

ElDeveloper commented Jun 24, 2015

squirrelo Jun 24, 2015

josenavas Jun 24, 2015

josenavas commented Jun 24, 2015

Add Transaction object #1276

Add Transaction object #1276

Conversation

josenavas commented Jun 19, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

josenavas commented Jun 19, 2015

ElDeveloper commented Jun 19, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

josenavas commented Jun 24, 2015

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

josenavas commented Jun 24, 2015

squirrelo commented Jun 24, 2015

ElDeveloper commented Jun 24, 2015

josenavas commented Jun 24, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ElDeveloper commented Jun 24, 2015

Choose a reason for hiding this comment

Choose a reason for hiding this comment

josenavas commented Jun 24, 2015

Add `Transaction` object #1276

Add `Transaction` object #1276