Support for asyncio #6

anthony-tuininga · 2022-06-01T19:23:25Z

This is a continuation of the original request made on cx_Oracle: oracle/python-cx_Oracle#178.

The current status is that with the addition of the thin driver, adding suport for asyncio will be considerably simpler (and will only work in thin mode). If anyone has suggestions or recommendations on API, please share them!

kleysonr · 2022-06-01T19:51:44Z

+1

danizen · 2022-06-01T20:01:53Z

It isn't async per se but the key benefits of escaping from the GIL and better managing RDBMS connections that we need. As both @sharkguto and @P403n1x87 mentioned in oracle/python-cx_Oracle#178, it can be done by wrapping queries in coroutines.

That means that @cjbj and @anthony-tuininga don't need to write a new driver (note that async drivers don't follow PEP 249). I think a thin package that wraps oracledb in an opinionated way (e.g. using connection pools all the time, and get a connection just to do the request) could be used.

I guess there is a problem there for cursors and fetching additional results, but I'll leave it to @anthony-tuininga to figure this out.

anthony-tuininga · 2022-06-01T20:11:48Z

@danizen, I did make an attempt at simply wrapping cx_Oracle in asyncio coroutines -- but the performance of the result was poor (and that's being generous). As such I don't see any benefit to wrapping the thick driver at all -- other than the fact that you can use asyncio, I guess! My first attempt to generate an asyncio (thin) driver showed excellent performance in comparison. So that is the route that is being considered at this point! I'd like to see this implemented soon -- this year if at all possible! Unless I hear otherwise I'll probably follow the pattern used by asyncpg.

danizen · 2022-06-01T20:28:39Z

Thanks, @anthony-tuininga, I am looking forward to trying it out.

jiaulislam · 2022-09-29T13:07:55Z

I guess it's still not supported officially to get async oracle. 💔

anthony-tuininga · 2022-09-29T13:21:33Z

Not yet, no. Its definitely on the list, though! And I'd still like to see it done this year, yet, if at all possible.

anthony-tuininga · 2022-12-07T20:56:49Z

Just to give a bit of an update: I have started looking into this and ran into a bit of a roadblock interfacing asyncio with the Oracle database protocol -- but a solution has been found, thankfully! Thanks to that roadblock, getting it done this year is not going to happen any longer...but I am actively working on it (among other things), so hopefully I'll have something for you to look at in January.

One question for those of you following along: as mentioned earlier a simple wrap of the synchronous routines in a future (that executes in a thread pool) works but is about twice as slow as the synchronous version. Would it be helpful to include that as a fallback if the solution I mentioned earlier doesn't work for all database versions? Or would it be preferable to simply state that support isn't available in that case? Comments welcome!

srtucker · 2022-12-07T21:43:57Z

Thanks for the update. For your second option would it be a hard stop or would the async API still function, it just would affectively be synchronous?

anthony-tuininga · 2022-12-07T21:46:39Z

The options, I think, are

raise an exception and state that asyncio doesn't work (well) with this database version
fallback to the "works but is considerably slower" approach (putting synchronous calls into a thread pool for execution)

P403n1x87 · 2022-12-07T22:08:38Z

* fallback to the "works but is considerably slower" approach (putting synchronous calls into a thread pool for execution)

Slow queries could perhaps benefit from the process, rather than the thread, pool. But opting in, in this case, should require an active user request IMO. Sending a fast query to a process pool might actually be slower than using the thread pool, although I don't have numbers to back this up at the moment.

old-syniex · 2022-12-18T21:44:34Z

The options, I think, are

raise an exception and state that asyncio doesn't work (well) with this database version

fallback to the "works but is considerably slower" approach (putting synchronous calls into a thread pool for execution)

I would suggest to wrap the problematic versions in threadpool for consistency.

The community might will be able to suggest solutions

danizen · 2022-12-19T18:43:15Z

My personal preference would be to raise an exception, but it is not either or. You could raise a warning and then fallback to wrap in a threadpool. Users who want the threadpool behavior could ignore the warning, and users who want the wrapped behavior could catch the warning and raise some sort of system error.

ptekelly · 2023-01-24T11:28:41Z

any news on this?

jiaulislam · 2023-01-24T14:20:47Z

They are working on it. But I guess it will work only for thin driver not the thick mode. I think asyncio is also required for thick mode.

anthony-tuininga · 2023-01-24T14:42:16Z

Yes, we are working on it but also getting distracted by other projects! I can (and have as a proof of concept) implemented asyncio with thick mode -- but that was about 2-3 times slower than without asyncio, which sort of defeats the purpose, I think! Do you still want asyncio even if it is slower than regular synchronous mode? My experimentations with thin are much more promising.

ptekelly · 2023-01-24T14:56:45Z

for me thick doesn't matter - just using thin (at the moment at least)

jiaulislam · 2023-01-25T04:55:57Z

Yes, we are working on it but also getting distracted by other projects! I can (and have as a proof of concept) implemented asyncio with thick mode -- but that was about 2-3 times slower than without asyncio, which sort of defeats the purpose, I think! Do you still want asyncio even if it is slower than regular synchronous mode? My experimentations with thin are much more promising.

If I have a database on a remote can I use Thin mode ? I tried earlier but it was giving an exception saying require thick mode. I didn't dig up that issue much as I was in hurry with that project but I guess I have to look it up again if thin mode works in remote database.

cjbj · 2023-01-25T05:06:00Z

@jiaulislam yes you can connect in Thin mode to remote databases. This is off topic for this thread, so if you have questions about it, please start a new discussion.

vbadita · 2023-02-28T22:56:50Z

What would be the database versions which don't support asyncio ?

cjbj · 2023-02-28T23:05:09Z

No further discussions have occurred. @anthony-tuininga has been busy on the Thin node-oracledb driver.
What DB versions are you using?

ptekelly · 2023-03-01T07:04:10Z

No further discussions have occurred. @anthony-tuininga has been busy on the Thin node-oracledb driver. What DB versions are you using?

I'd be happy for thin node async - is there forum post about that - or better still a time frame?

cjbj · 2023-03-01T09:35:39Z

No time frame. This issue is the place to follow to get news.

ptekelly · 2023-03-01T09:40:27Z

ok thanks

danizen · 2023-03-01T21:07:55Z

Now that I am using asyncio more heavily, I think the main benefit from this would only be in managing the number of connections, and that can be addressed somewhat with DRCP configuration. One trick I am doing is to create a ThreadPoolExecutor and then using the asyncio.run_in_executor formulation to run synchronous code.

This means I can tune the ThreadPoolExecutor to have the same number of threads i expect in a connection pool, and it is quite functional.

If you couple that with DRCP, it gets yet more functional.

vbadita · 2023-03-09T21:02:29Z

Now that I am using asyncio more heavily, I think the main benefit from this would only be in managing the number of connections, and that can be addressed somewhat with DRCP configuration. One trick I am doing is to create a ThreadPoolExecutor and then using the asyncio.run_in_executor formulation to run synchronous code.

This means I can tune the ThreadPoolExecutor to have the same number of threads i expect in a connection pool, and it is quite functional.

If you couple that with DRCP, it gets yet more functional.

Thank you. I tried ThreadPoolExecutor with connection pooling and it seems it's working fine.

ptekelly · 2023-03-28T21:44:29Z

Hi - any update to this?

anthony-tuininga · 2023-05-18T16:55:22Z

@nickswiss, others have indeed come up with a "solution" that works for them (the aforementioned cx_Oracle_async) while waiting for me to implement asyncio support. Support is indeed planned and some progress has been made, but other higher priority items have interrupted that progress. There are a few small items remaining on that list but I hope to have those completed shortly (which are planned to go into 1.4) and then I can concentrate on asyncio support (which is planned for 2.0). Of course these plans are subject to change but we will inform you if that is the case.

Part of my efforts in the past few months have been on support for a thin mode driver for Node.js. That effort has shown that it is possible to implement a truly async model without the enhancemnts available in Oracle Database 23c -- so with that knowledge I hope to have asyncio support for all database versions, with the caveat that the Oracle Database 23c enhancement should improve performance. The performance without the Oracle Database 23c enhancement should still easily outperform the simple wrapper approach that cx_Oracle_async is using.

The current plan is for asyncio support to only be usable in thin mode.

WilliamStam · 2023-06-26T18:31:10Z

thank you for all your hard work! its super supper appreciated.

im also a +1 for asyncio natively to get the proper awaits in it. in the meen time. anyone have some code they could share to get it going? :P cant exactly block the gil while waiting for an oracle response. is it as easy as:

def make_oracle_call(sql, params):
    with oracle as e: # psudo code
        results = e.fetchall()
        
    return results


executor = ThreadPoolExecutor(max_workers=1)
a = executor.submit(functools.partial(make_oracle_call,sql,params))

Julian-Brendel · 2023-10-30T07:24:21Z

Hi All,

Just wanted to check in on this thread.

Do you have any update on the roadmap / plans for the async / 2.0 release?

cjbj · 2023-10-30T10:47:22Z

@Julian-Brendel some work is being done and management is aware we are treating this as a priority request, however other things do keep coming up so I'm not going to comment on timelines.

anthony-tuininga · 2023-11-22T18:29:22Z

See announcement #258 for details. Comments welcome here!

old-syniex · 2023-11-22T20:39:34Z

@anthony-tuininga I am suggestingto release version 2.0.0a1 for ease of testing purposes.

I propose aligning our oracledb design with the following asyncpg methods:
fetch
fetchrow
fetchval
execute
executemany

Adopting this approach would likely enhance the overall user-friendliness.

anthony-tuininga · 2023-11-22T20:48:34Z

@old-syniex, I presume you are referring to having these methods on the connection object -- thereby eliminating the need to create a cursor object and perform an execute/fetch on that? I agree that these would be more convenient in cases where the additional properties available on cursors are not required. Would you like to see this as an additional set of APIs? Or as a complete replacement?

As for creating a release, that is certainly a possibility which we will consider.

old-syniex · 2023-11-22T20:55:21Z

@anthony-tuininga
Yeah, having those methods on the connection object.
I would like it to be additional set of API, I can't find a reason why to remove the current API.

anthony-tuininga · 2023-11-22T21:17:55Z

Sure. I can see the advantage of doing that. Something like this would make sense to me:

async def execute(
    self,
    statement: str,
    parameters: Union[list, tuple, dict] = None
) -> Any:
    """
    Executes a statement and returns an AsyncCursor instance (if executing a query) or None if
    executing a non-query. Other options include returning the number of rows updated
    (for non-queries) or an ExecuteResult object which contains the information that would be on
    an AsyncCursor instance.
    """

async def executemany(
    self,
    statement: str,
    parameters: Union[list, int],
    batcherrors: bool = False,
    arraydmlrowcounts: bool = False
) -> None:
    """
    Similar to AsyncCursor.executemany() but doesn't require creating an AsyncCursor instance first.
    No return value but could also have an ExecuteResult object returned with information that would
    normally be an AsyncCursor instance.
    """

async def fetchone(
    self,
    statement: str,
    parameters: Union[list, tuple, dict] = None,
    rowfactory: Callable = None
) -> Any:
    """
    Executes a statement and returns the first row of the result set returned
    (or None, if no rows are fetched).
    """

async def fetchmany(
    self,
    statement: str,
    parameters: Union[list, tuple, dict] = None,
    num_rows: int = oracledb.defaults.arraysize,
    rowfactory: Callable = None
) -> list:
    """
    Executes a statement and returns the first <num_rows> rows of the result set.
    """

async def fetchall(
    self,
    statement: str,
    parameters: Union[list, tuple, dict] = None,
    arraysize: int = oracledb.defaults.arraysize,
    rowfactory: Callable = None
) -> list:
    """
    Executes a statement and returns all of the rows of the result set as a list.
    """

That makes it clear that they are the same as the cursor equivalents but without requiring a cursor. Thoughts?

syniex · 2023-11-24T06:12:04Z

Yes, this looks great.

anthony-tuininga · 2023-11-24T21:54:06Z

FYI, I just pushed changes to merge with the changes introduced in main as well as ensure that Python 3.7 and higher work correctly. I had inadvertently used a method that was only available in Python 3.11!

anthony-tuininga · 2023-11-27T23:29:19Z

FYI, I just pushed more changes to merge with the changes introduced in main and also added shortcut functions on the connection object as follows:

    async def callfunc(
        self,
        name: str,
        return_type: Any,
        parameters: Union[list, tuple] = None,
        keyword_parameters: dict = None,
    ) -> Any:
        """
        Call a function with the given name.
            
        This is a shortcut for creating a cursor, calling the stored function
        with the cursor and then closing the cursor.
        """

    async def callproc(
        self,
        name: str,
        parameters: Union[list, tuple] = None,
        keyword_parameters: dict = None,
    ) -> list:
        """
        Call a procedure with the given name.

        This is a shortcut for creating a cursor, calling the stored procedure
        with the cursor and then closing the cursor.
        """
    async def execute(
        self, statement: str, parameters: Union[list, tuple, dict] = None
    ) -> None:
        """
        Execute a statement against the database.

        This is a shortcut for creating a cursor, executing a statement with
        the cursor and then closing the cursor.
        """
    async def executemany(
        self, statement: Union[str, None], parameters: Union[list, int]
    ) -> None:
        """
        Prepare a statement for execution against a database and then execute
        it against all parameter mappings or sequences found in the sequence
        parameters.

        This is a shortcut for creating a cursor, calling executemany() on the
        cursor and then closing the cursor.
        """
    async def fetchall(
        self,
        statement: str,
        parameters: Union[list, tuple, dict] = None,
        arraysize: int = None,
        rowfactory: Callable = None,
    ) -> list:
        """
        Executes a query and returns all of the rows. After the rows are
        fetched, the cursor is closed.
        """

    async def fetchmany(
        self,
        statement: str,
        parameters: Union[list, tuple, dict] = None,
        num_rows: int = None,
        rowfactory: Callable = None,
    ) -> list:
        """
        Executes a query and returns up to the specified number of rows. After
        the rows are fetched, the cursor is closed.
        """
    async def fetchone(
        self,
        statement: str,
        parameters: Union[list, tuple, dict] = None,
        rowfactory: Callable = None,
    ) -> Any:
        """
        Executes a query and returns the first row of the result set if one
        exists (or None if no rows exist). After the row is fetched the cursor
        is closed.
        """

WilliamStam · 2023-11-28T07:41:46Z

on behalf of pretty much everyone. thank you soooo much for this.

cjbj · 2023-11-28T07:48:06Z

@WilliamStam thank you. Please make sure you hammer on it and give us feedback!

cjbj · 2023-12-05T01:31:04Z

How is everyone going with python-oracledb asyncio testing? Are there any issues that we should know about, or suggestions that you want to make?

syniex · 2023-12-05T05:49:08Z

@cjbj is there any chance to create a prerelease?

cjbj · 2023-12-05T06:02:41Z

There's a chance :) Let me sync with Anthony - unless he does it before my day starts tomorrow.

syniex · 2023-12-05T20:51:24Z

There's a chance :) Let me sync with Anthony - unless he does it before my day starts tomorrow.

I hope you will async with Anthony ;)

anthony-tuininga · 2023-12-06T18:45:38Z

@syniex, we discussed this and agreed that we will release version 2.0 with asyncio support as it currently stands (probably some time next week). The feedback that has been received so far has all been positive. We will include a note that the asyncio support is under review and subject to change based on feedback after use in the real world! :-) That allows us to get out the other enhancements and bug fixes at the same time. So stay tuned!

anthony-tuininga · 2023-12-12T18:54:11Z

Asyncio support is now in main in preparation for the release of version 2.0.

anthony-tuininga · 2023-12-19T23:16:04Z

And version 2.0.0 has now been released! Thanks for your patience and let us know if you find anything that needs to be changed with the asyncio support.

cjbj · 2023-12-19T23:34:56Z

Yay.

FWIW I did a quick blog post here.

WilliamStam · 2024-01-12T10:20:15Z

so finally getting round to performance stuff in my projects. interestingly the oracledb async is marginly faster than the sync every single time i ran the stupid thrown together benchmark script (a few ms but still.. its consistently "faster"

(i was testing the execute part here not the connection / whatever else parts. and tried to do apples vs apples. im aware of connection.execute() but i felt that would be "unfair" to the sync part)

https://gist.github.com/WilliamStam/b9bed409e3a754bf05accb95d04bb54e

also.. ps.. &^@#$*&^ sqlalchemy :(

anthony-tuininga · 2024-01-12T14:19:36Z

Thanks for sharing! Is the "swearing" at SQLAlchemy because of performance or something else?

WilliamStam · 2024-01-12T20:50:51Z

lol yeah. having issues with SA and oracle but nothing for this thread

sqlalchemy/sqlalchemy#10874

anthony-tuininga added the enhancement New feature or request label Jun 1, 2022

anthony-tuininga mentioned this issue Jun 1, 2022

Support for asyncio oracle/python-cx_Oracle#178

Closed

oracle deleted a comment from danizen Jun 1, 2022

psvenk mentioned this issue Mar 19, 2023

RiiTS psvenk/fireroad-warehouse#1

Merged

cjbj mentioned this issue Oct 26, 2023

Add support for connecting with LDAP in Thin Mode #111

Open

anthony-tuininga mentioned this issue Nov 22, 2023

python-oracledb support for asyncio #258

Closed

anthony-tuininga added the patch available label Nov 23, 2023

anthony-tuininga added a commit that referenced this issue Dec 12, 2023

Added support for asyncio (#6).

12040fb

anthony-tuininga closed this as completed Dec 19, 2023

Support for asyncio #6

Support for asyncio #6

Comments

anthony-tuininga commented Jun 1, 2022

kleysonr commented Jun 1, 2022

danizen commented Jun 1, 2022

anthony-tuininga commented Jun 1, 2022

danizen commented Jun 1, 2022

jiaulislam commented Sep 29, 2022

anthony-tuininga commented Sep 29, 2022

anthony-tuininga commented Dec 7, 2022

srtucker commented Dec 7, 2022

anthony-tuininga commented Dec 7, 2022

P403n1x87 commented Dec 7, 2022

old-syniex commented Dec 18, 2022

danizen commented Dec 19, 2022

ptekelly commented Jan 24, 2023

jiaulislam commented Jan 24, 2023

anthony-tuininga commented Jan 24, 2023

ptekelly commented Jan 24, 2023

jiaulislam commented Jan 25, 2023

cjbj commented Jan 25, 2023

vbadita commented Feb 28, 2023

cjbj commented Feb 28, 2023

ptekelly commented Mar 1, 2023

cjbj commented Mar 1, 2023

ptekelly commented Mar 1, 2023

danizen commented Mar 1, 2023

vbadita commented Mar 9, 2023

ptekelly commented Mar 28, 2023

anthony-tuininga commented May 18, 2023

WilliamStam commented Jun 26, 2023

Julian-Brendel commented Oct 30, 2023

cjbj commented Oct 30, 2023

anthony-tuininga commented Nov 22, 2023

old-syniex commented Nov 22, 2023

anthony-tuininga commented Nov 22, 2023

old-syniex commented Nov 22, 2023 • edited Loading

anthony-tuininga commented Nov 22, 2023 • edited Loading

syniex commented Nov 24, 2023

anthony-tuininga commented Nov 24, 2023

anthony-tuininga commented Nov 27, 2023

WilliamStam commented Nov 28, 2023

cjbj commented Nov 28, 2023

cjbj commented Dec 5, 2023

syniex commented Dec 5, 2023

cjbj commented Dec 5, 2023

syniex commented Dec 5, 2023

anthony-tuininga commented Dec 6, 2023

anthony-tuininga commented Dec 12, 2023 • edited Loading

anthony-tuininga commented Dec 19, 2023

cjbj commented Dec 19, 2023

WilliamStam commented Jan 12, 2024

anthony-tuininga commented Jan 12, 2024

WilliamStam commented Jan 12, 2024 • edited Loading

old-syniex commented Nov 22, 2023 •

edited

Loading

anthony-tuininga commented Nov 22, 2023 •

edited

Loading

anthony-tuininga commented Dec 12, 2023 •

edited

Loading

WilliamStam commented Jan 12, 2024 •

edited

Loading