feat: relationships loading #105

gazorby · 2023-11-30T01:36:33Z

Pull Request Checklist

New code has 100% test coverage
(If applicable) The prose documentation has been updated to reflect the changes introduced by this PR
(If applicable) The reference documentation has been updated to reflect the changes introduced by this PR
Pre-Commit Checks were run and passed
Tests were run and passed

Description

Expose SQLAlchemy relationship loading techniques through the repository interface.

The idea is to have SQLAlchemy loading styles more integrated into the repository API, via a single .load() repository method that sets the relationships to load on the repository model.

Here is the proposed API:

# Get an author and load all their books
author = await AuthorRepo(session=session).load(books=True).get_one(name="J.R.R Tolkien")
assert all(isinstance(book, BookModel) for book in author.books)

# Calling .load() only affects the next query, so no relationship will be explicitly loaded here
author = await AuthorRepo(session=session).get_one(name="J.R.R Tolkien")
print(author.books[0].title)  # May raise an error if books is not configured to be lazy loaded on the mapper side

# Go one step deeper by loading publisher too
# Child relations are chained with the '__' separator
author = await AuthorRepo(session=session).load(books__publisher=True).get_one(name="J.R.R Tolkien")
assert all(isinstance(book, BookModel) for book in author.books)
assert all(isinstance(book.publisher, PublisherModel) for book in author.books)

# Ellipsis (...) loads all nested relationships under (and including) the specified one
author = await AuthorRepo(session=session).load(books=...).get_one(name="J.R.R Tolkien")
assert all(isinstance(book, BookModel) for book in author.books)
assert all(isinstance(book.publisher, PublisherModel) for book in author.books)
assert all(isinstance(book.publisher.company, CompanyModel) for book in author.books)

# We can customize how a relationship is loaded by passing a SQLAlchemy relationship loading style
# https://docs.sqlalchemy.org/en/20/orm/queryguide/relationships.html#summary-of-relationship-loading-styles
author = await AuthorRepo(session=session).load(books="subqueryload").get_one(name="J.R.R Tolkien")
assert all(isinstance(book, BookModel) for book in author.books)

# Prevent all relationships from being loaded (overriding mapper configuration), and only load books
load_config = SQLAlchemyLoadConfig(default_strategy="raiseload")
author = await AuthorRepo(session=session).load(load_config, books=True).get_one(name="J.R.R Tolkien")
assert all(isinstance(book, BookModel) for book in author.books)

# You can pass a SQLAlchemyLoad object if you want
# to set a default set of relationships to be loaded on a repository
repo = AuthorRepo(session=session, load=SQLAlchemyLoad(books=True))

author = await repo.get_one(name="J.R.R Tolkien")
assert all(isinstance(book, BookModel) for book in author.books)

repo.session.expunge_all()  # expunge_all() is a regular (non-awaitable) method on AsyncSession

authors = await repo.list()
for author in authors:
    assert all(isinstance(book, BookModel) for book in author.books)

It also makes relationship loading more "composable", since SQLAlchemyLoad instances are standalone objects that can be reused.
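To make the two behaviours described above concrete (the '__' separator and .load() affecting only the next query), here is a minimal sketch in plain Python. It is not the PR's implementation: `ToyRepo` and `parse_load_kwargs` are hypothetical names, and the "query" simply returns the load configuration it would have used.

```python
from __future__ import annotations

from typing import Any, Dict, Optional, Tuple

# A relationship path such as ("books", "publisher") mapped to its load setting.
LoadPaths = Dict[Tuple[str, ...], Any]


def parse_load_kwargs(**kwargs: Any) -> LoadPaths:
    """Split 'books__publisher'-style keys into relationship paths."""
    return {tuple(key.split("__")): value for key, value in kwargs.items()}


class ToyRepo:
    """Hypothetical stand-in for a repository; it records options instead of querying."""

    def __init__(self, default_load: Optional[LoadPaths] = None) -> None:
        self._default_load: LoadPaths = dict(default_load or {})
        self._next_load: Optional[LoadPaths] = None

    def load(self, **kwargs: Any) -> "ToyRepo":
        # Stored for the next query only; returning self allows chaining.
        self._next_load = parse_load_kwargs(**kwargs)
        return self

    def get_one(self, **filters: Any) -> LoadPaths:
        # Consume the one-shot config, falling back to the repository default.
        active = self._next_load if self._next_load is not None else self._default_load
        self._next_load = None
        # A real repository would turn these paths into loader options here.
        return active


repo = ToyRepo()
first = repo.load(books__publisher=True).get_one(name="J.R.R Tolkien")
second = repo.get_one(name="J.R.R Tolkien")
```

In this sketch `first` carries the `("books", "publisher")` path while `second` is empty, which mirrors the "only affects the next query" rule.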

Close Issue(s)

add test case

cofin · 2023-11-30T01:48:52Z

This is a cool idea! You are on a roll this week!

Is it possible to get the tests working for 3.8?

cofin · 2023-11-30T01:51:22Z

advanced_alchemy/repository/_async.py

 from advanced_alchemy.repository._util import get_instrumented_attr, wrap_sqlalchemy_exception
 from advanced_alchemy.repository.typing import ModelT
 from advanced_alchemy.utils.deprecation import deprecated

 if TYPE_CHECKING:
    from collections import abc
    from datetime import datetime
+    from typing import Self


This may need to be typing_extensions for us to get 3.8 support?

Yes, just fixed it, but EllipsisType does not seem to be available before 3.10
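One possible way to keep an `EllipsisType` name usable on older interpreters is a version-gated fallback. This is a sketch under the assumption that only the standard library is available; `loads_everything` is a hypothetical helper, and the PR itself later moved the import into a TYPE_CHECKING block instead.

```python
from __future__ import annotations  # annotations stay unevaluated, so "X | Y" is safe on 3.8

import sys

if sys.version_info >= (3, 10):
    from types import EllipsisType
else:
    # types.EllipsisType was added in Python 3.10; type(Ellipsis) is the same class.
    EllipsisType = type(Ellipsis)


def loads_everything(value: bool | str | EllipsisType) -> bool:
    """Return True when ``...`` was passed to request all nested relationships."""
    return value is Ellipsis
```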

cofin · 2023-11-30T01:53:59Z

advanced_alchemy/repository/_async.py

@@ -169,6 +193,9 @@ async def add(
        """
        with wrap_sqlalchemy_exception():
            instance = await self._attach_to_session(data)
+            if self._load:
+                await self._flush_or_commit(auto_commit=True)


Should the auto_commit follow what was sent in from the method or actually be True?

I think it should stay True since the following self._refresh_with_load() emits a select to get back the newly inserted rows with loaded relationships.

Just thinking through this a bit more, and it's still not totally clear.

We definitely need a flush so that the inserted row is loaded into the relationship. As long as it isn't a new session though, we don't have to commit to make that happen. However, I'm not quite following why the commit is necessary.

Is there a simple use case that I can walk through to visualize why the commit over a flush is needed?
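As a rough illustration of the flush-versus-commit distinction being discussed, the sketch below uses the standard library's sqlite3 module rather than a SQLAlchemy session. It is an analogy only, not the PR's code: the uncommitted INSERT plays the role of a flush, and the table and column names are made up for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE author (id INTEGER PRIMARY KEY, name TEXT)")
conn.commit()

# Comparable to a flush: the INSERT is sent, but the transaction stays open.
conn.execute("INSERT INTO author (name) VALUES (?)", ("J.R.R Tolkien",))
pending = conn.in_transaction

# A SELECT on the same connection already sees the uncommitted row.
visible_before_commit = conn.execute("SELECT name FROM author").fetchall()

# Rolling back discards the row; after a commit this would no longer be possible.
conn.rollback()
visible_after_rollback = conn.execute("SELECT name FROM author").fetchall()
conn.close()
```

Within a single connection the pending row is readable without a commit, which is the behaviour the question above relies on; whether the same holds for the repository's refresh step depends on it reusing the same session.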

gazorby · 2023-11-30T02:01:17Z

Actually, it comes from topsport-com-au/starlite-saqlalchemy#304, after some iterations ;)

provinzkraut · 2023-11-30T17:34:32Z

This is a great addition @gazorby!

However, I have concerns about the interface. The pipeline style doesn't really fit in with the rest of the repositories, as it's the only place this would be used. Furthermore, the SQLAlchemyLoad, in combination with the pipeline pattern on the repository, comes quite close to just remodeling SQLAlchemy's query building, if we think a few iterations and feature additions ahead here.

Another concern is the "blanket keyword argument" style, which isn't great for type checking (and testing), and is something that, in this case, SQLAlchemy would do better than we would here.

I would propose that for now, the interface for this stays functional, which could maybe look something like this:

# Get an author and load all their books
author = await AuthorRepo(session=session).get_one(name="J.R.R Tolkien", load=Author.books)
assert all(isinstance(book, BookModel) for book in author.books)

author = await AuthorRepo(session=session).get_one(
  name="J.R.R Tolkien", 
  load=[Author.books, BookModel.publisher]
)
assert all(isinstance(book, BookModel) for book in author.books)
assert all(isinstance(book.publisher, PublisherModel) for book in author.books)

# use loaders from sqla
author = await AuthorRepo(session=session).get_one(
  name="J.R.R Tolkien",
  load=subqueryload(Author.books)
)
assert all(isinstance(book, BookModel) for book in author.books)

LonelyVikingMichael · 2024-02-13T23:51:46Z

Hi @gazorby

I wasn't sure if you'd be continuing with this, so I'd made my own, admittedly lazier POC here #130 - I've just now noticed your recent activity.
Aside from agreeing with @provinzkraut's comments above, what I really like about SQLAlchemy's loader strategies is the fine-grained control over deeply nested data; more specifically, the load_only method comes to mind.

If we take for example the relationship of Author -> Books -> Chapters, we can do the following with pure SQLAlchemy:

result = session.execute(
    select(Author).options(
        selectinload(Author.books).selectinload(Book.chapters).load_only(Chapter.name)
    )
)

This gives us the opportunity to emit a more lightweight query in the context of the book hierarchy if the Chapter table for example had multiple other columns of metadata, but in this particular context, we're only interested in the name.

I also often make use of has() and any() to refine results, e.g.

# find out about authors and only their books containing "foo" in the title
select(Author).options(selectinload(Author.books.any(Book.title.ilike("%foo%"))))

The native approach makes a lot of sense to me here, let me know your thoughts?

gazorby · 2024-02-14T07:43:11Z

Hi @LonelyVikingMichael!

I refactored my codebase with a new implementation exposing the API @provinzkraut suggested, which I also agree with. It brings several benefits over my previous iteration, not least being fully typed and much lighter.

So now I can use plain SQLAlchemy loaders or pass a nested list of relationships and let the repository generate the loaders:

author_repo.get(id, load=Author.books)
author_repo.get(id, load=[(Author.books, Book.chapters), Author.publisher])
author_repo.get(id, load=selectinload(Author.books).selectinload(Book.chapters).load_only(Chapter.name))
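A sketch of how the first two `load` shapes above might be normalized before loaders are generated. `normalize_load` is a hypothetical helper, not the code from the refactor, and plain strings stand in for mapped attributes such as `Author.books`. Passing through ready-made loader options (the third form) is omitted.

```python
from typing import Any, List, Sequence, Tuple, Union

# A single attribute, or a tuple describing a chain of nested relationships.
LoadItem = Union[Any, Tuple[Any, ...]]


def normalize_load(
    load: Union[LoadItem, Sequence[LoadItem], None],
) -> List[Tuple[Any, ...]]:
    """Turn a single attribute, a tuple chain, or a list of either into a list of paths."""
    if load is None:
        return []
    if isinstance(load, tuple):
        return [load]
    if isinstance(load, list):
        return [item if isinstance(item, tuple) else (item,) for item in load]
    return [(load,)]


single = normalize_load("Author.books")
nested = normalize_load([("Author.books", "Book.chapters"), "Author.publisher"])
```

Each resulting path could then be turned into one chained loader, for example a `selectinload` per element, although the strategy choice is left open here.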

Will update the PR when I have some time

gazorby · 2024-04-07T21:06:44Z

supplanted by #157

gazorby added 8 commits November 27, 2023 18:22

refactor: wip

830559a

test(integration): fix repository tests

5a51e55

fix(load): path conflicts

2d226a2

refactor(load): remove base load

e96a4d1

test(load): add some integration test cases

96f8910

test(load): load

74b5490

add test case

fix(load): calling load should only affect next query

7f6cec2

refactor(load): typing

5e8ae4f

gazorby requested review from a team as code owners November 30, 2023 01:36

sourcery-ai bot mentioned this pull request Nov 30, 2023

feat: relationships loading (Sourcery refactored) #106

Closed

fix(typing): fix Self import

a395db0

cofin reviewed Nov 30, 2023

gazorby added 3 commits November 30, 2023 03:24

fix(typing): move EllipsisType to TYPE_CHECKING block

5655a35

fix(typing): fix SQLALoadStrategy type definition

aae19a8

fix(typing): import TypeAlias from typing_extensions

d40773f

gazorby marked this pull request as draft February 7, 2024 14:37

gazorby closed this Apr 7, 2024
