feat(core): allow using dataloader for references and collections #4321

darkbasic · 2023-05-05T10:03:25Z

This is currently a very basic implementation so that we can start talking about where we want this headed. I left out the collection dataloader as well as options handling and kept it as simple as possible. The biggest priority right now is being able to build mikro-orm on ppc64le to actually test it: lerna/lerna#3676
I've tried to build just core but I had no luck as well:

niko@talos2 ~/devel/mikro-orm/packages/core $ yarn build
command not found: rimraf

codecov · 2023-05-05T10:22:53Z

Codecov Report

Patch coverage is 100.00% of modified lines.

Files Changed	Coverage
packages/core/src/EntityManager.ts	`100.00%`
packages/core/src/entity/Collection.ts	`100.00%`
packages/core/src/entity/Reference.ts	`100.00%`
packages/core/src/enums.ts	`100.00%`
packages/core/src/utils/Configuration.ts	`100.00%`
packages/core/src/utils/DataloaderUtils.ts	`100.00%`
packages/core/src/utils/index.ts	`100.00%`

📢 Thoughts on this report? Let us know!.

packages/core/src/EntityManager.ts

packages/core/src/entity/Reference.ts

B4nan · 2023-05-05T10:29:09Z

The biggest priority right now is being able to build mikro-orm on ppc64le to actually test it: lerna/lerna#3676

Why would you need to build anything to test it? Write tests, they use ts-node, nothing needs to be build, or what am I missing? Tests will be needed either way, and you can run them in the CI so you are not blocked now.

Never heard of ppc64le, so idk how could I help you with that.

darkbasic · 2023-05-05T10:40:57Z

Why would you need to build anything to test it? Write tests, they use ts-node, nothing needs to be build, or what am I missing? Tests will be needed either way, and you can run them in the CI so you are not blocked now.

Because I already have tons of tests in my project which already makes use of an external library version of the dataloder.
They verify the result, test performance, etc so being able to start with them would be an headstart before I implement proper tests in mikro-orm itself.

B4nan · 2023-05-05T10:46:56Z

Would turborepo be any better? We could switch to that for the task running, and keep lerna only for publishing.

darkbasic · 2023-05-05T10:51:10Z

Would turborepo be any better? We could switch to that for the task running, and keep lerna only for publishing.

It was, but unfortunately not anymore:
vercel/turbo#2616
vercel/turbo#1891 (comment)

They are planning to add ppc64le support back but I fear it's not going to happen anytime soon.

darkbasic · 2023-05-05T10:57:03Z

@B4nan if I could manage to find where that unsupported error comes from I could try to build my own ppc64 binary (or even emulate the x86 one with qemu-user) but unfortunately it's not so obvious:

niko@talos2 ~/devel $ git clone https://github.com/lerna/lerna.git
Cloning into 'lerna'...
remote: Enumerating objects: 43257, done.
remote: Counting objects: 100% (1411/1411), done.
remote: Compressing objects: 100% (814/814), done.
remote: Total 43257 (delta 744), reused 1131 (delta 563), pack-reused 41846
Receiving objects: 100% (43257/43257), 29.57 MiB | 4.05 MiB/s, done.
Resolving deltas: 100% (32033/32033), done.
niko@talos2 ~/devel $ cd lerna/
niko@talos2 ~/devel/lerna $ grep -ir "Unsupported architecture" .
niko@talos2 ~/devel/lerna $ grep -ir "architecture" .
./libs/commands/publish/src/lib/create-temp-licenses.ts:    // (give up on 32-bit architecture to avoid fs-extra warning)
./website/docs/features/project-graph.md:For Lerna (and Nx) to run tasks quickly and correctly, it creates a graph of the dependencies between all the projects in the repository.  Exploring this graph visually can be useful to understand why Lerna is behaving in a certain way and to get a high level view of your code architecture.
./website/src/components/about-lerna.tsx:    text: "Lerna comes with a powerful interactive workspace visualizer, helping you understand the architecture of your workspace.",
niko@talos2 ~/devel/lerna $ grep -ir "unsupported" .
./.git/hooks/fsmonitor-watchman.sample:	die "Unsupported query-fsmonitor hook version '$version'.\n" .
./.eslintrc.json:        "node/no-unsupported-features/es-syntax": "off",
./libs/e2e-utils/src/lib/fixture.ts:        throw new Error(`Unsupported package manager: ${this.packageManager}`);
./libs/e2e-utils/src/lib/fixture.ts:        throw new Error(`Unsupported package manager: ${this.packageManager}`);
./libs/e2e-utils/src/lib/fixture.ts:        throw new Error(`Unsupported package manager: ${this.packageManager}`);

darkbasic · 2023-05-05T14:40:13Z

Apparently the lerna error comes from nx: https://github.com/nrwl/nx/blob/master/packages/nx/src/native/index.js#L235

Do we actually use the nx stuff? I didn't even know nx was a thing in lerna...

B4nan · 2023-05-05T14:43:27Z

Do we actually use the nx stuff? I didn't even know nx was a thing in lerna...

It is used as the default task runner now. Maybe this could be even replaced with yarn? Lerna is used basically just for task running in topological order, and for versioning/publishing (which happens in CI only).

darkbasic · 2023-05-05T15:02:01Z

It is used as the default task runner now. Maybe this could be even replaced with yarn?

The problem is that using yarn as a task runner it doesn't find the root dependencies from inside the workspaces:

niko@talos2 ~/devel/mikro-orm $ yarn workspace @mikro-orm/core run build
command not found: rimraf

B4nan · 2023-05-05T15:05:44Z

I think there was some option to allow that?

-T,--top-level
Check the root workspace for scripts and/or binaries instead of the current one

Hmm, but that's for scripts, not dependencies...

darkbasic · 2023-05-05T15:08:27Z

Hmm, but that's for scripts, not dependencies...

Unfortunately yes, it would simply run the top level script.

darkbasic · 2023-05-08T13:52:08Z

yarnpkg/berry#5429

merceyz · 2023-05-08T14:21:50Z

That's not what the documentation says.

Check the root workspace for scripts and/or binaries instead of the current one
https://yarnpkg.com/cli/run

Binaries in the documentation is what you're referring to as dependencies.

If you change

mikro-orm/packages/core/package.json

Line 53 in 9ebdd87

"clean": "rimraf ./dist",

to yarn run -T rimraf ./dist it will use rimraf from the root workspace.

darkbasic · 2023-05-08T14:54:30Z

yarn run -T rimraf ./dist it will use rimraf from the root workspace.

@merceyz looks like it's working, but unfortunately I cannot manage to get the plugin-workspace-tools to run the build scripts in the correct dependency order.

Workspaces use a wildcard:

  "workspaces": [
    "packages/*"
  ],

yarn workspaces foreach run build runs them in alphabetical order I think, which leads to failure because it needs to build the packages it depends on first. Looking at the docs I've found the -t option (yarn workspaces foreach -t run build) but it doesn't lead to the correct order either. I've looked for issues in the bug tracker and I've found this: yarnpkg/yarn#7150

Unfortunately it's not clear whether it's for berry or yarn v1 and it looks like a won't fix basically. Am I missing something?

EDIT: @merceyz nevermind, it looks like the correct option was --topological-dev.

darkbasic · 2023-05-08T17:52:36Z

I've finally managed to build mikro-orm using yarn's own task runner and incorporate it into my project via portals. The only minor annoyance so far has been having to change @mikro-orm/core/typings imports to @mikro-orm/core/dist/typings, but portals are so convenient that it's a minor tradeoff I'm willing to pay. So far it passes all my tests, so tomorrow I'll add basic support for dataloading collections. From there we can add the remaining options and start adding some tests into mikro-orm itself.

B4nan · 2023-05-09T06:14:45Z

change @mikro-orm/core/typings imports

You shouldn't be using any deep imports, what types are you missing that are not exported from the root of the package?

darkbasic · 2023-05-09T08:19:52Z

import type {
  EntityKey,
  EntityProps,
  ExpandProperty,
  ExpandScalar,
  FilterValue2,
  Loaded,
  Query,
  Scalar,
} from "@mikro-orm/core/dist/typings";

import type { EntityKey, IWrappedEntityInternal } from "@mikro-orm/core/dist/typings";

These are mostly for the find dataloader (a couple of them for the collection dataloader maybe).
I would have needed more (FilterValue, Query, FilterObject, Compute, ObjectQuery, FilterQuery...) but the find dataloader still doesn't cover 100% of the original types so I'm using some custom slightly smaller subtypes.

Apart from the dataloader I didn't have to use deep imports elsewhere in my app. I suggest to ignore them for the moment, I'll add a stub commit for the find dataloader and we decide if this is something which might be worth merging alongside with the ref and collection ones. If we decide to merge it we won't need these exports on the root package anymore, otherwise we can reason about exporting them.

B4nan · 2023-05-09T09:43:08Z

Ok, we could also export those types under some namespace, e.g. InternalTypes.FilterValue, to allow working with them in such context.

darkbasic · 2023-05-10T15:22:23Z

I've added a basic version of the Collections dataloader. It's not as straightforward as the Reference one, because for collections we have to filter the results to re-assign them to the original collections, but it shouldn't be too hard to understand either. Let me know if something is not clear. I've tested it against my own project test suite and it works well.

P.S.
Unfortunately I couldn't create an incremental commit because I had to rebase it to drop commits from #4331 (which is necessary for testing).
As soon as the PR to use yarn as a task runner gets merged it should be a little bit easier to follow the history.

darkbasic · 2023-05-11T19:20:34Z

I've pushed the find dataloader as well. This one is quite a bit more complex because it basically tries to optimize any kind of possible query into the smallest possible number of queries. It doesn't always manage to be faster so it's not something that we want to enable by default for every query, but amazingly it manages to be faster in some real world graphql scenarios (at least in my application). Once you get the grasp out of it it's not that complex (it's the second time I've rewritten it and I've focused on keeping things simple while achieving worthwhile performance), but feel free to ask me all the questions you want including a full detailed explanation if needed. I wanted to make it capable of possibly covering the whole set of operators, but I don't want to do so at the further expense of performance and I'm basically gradually adding more whenever I find the right use case in my own application. I think that even having a slightly tighter scope compared to the normal find could still be fine because that covers 90% of the use cases and you can always create your own specialized dataloader for complex queries. I use to benchmark it case by case and enable it for the queries that I know will suffer a lot from the GraphQL inherent N+1 nature. Not sure if we want to merge it, it's definitely useful but might never reach the full operators coverage (and maybe that's not even necessary).

packages/core/src/EntityManager.ts

… Collection.load

…_meta.className for performance

…f load

darkbasic · 2023-08-30T14:37:29Z

@B4nan the lock file has been regenerated and documentation has been created. I've also added a new global option to surgically enable each dataloader.

B4nan

two final things before we merge

packages/core/src/utils/Utils.ts

packages/core/src/utils/Configuration.ts

packages/core/src/enums.ts

darkbasic · 2023-09-09T10:26:30Z

@B4nan done, should be ready to merge.

docs/docs/dataloaders.md

packages/core/src/entity/Collection.ts

B4nan

all right, thanks for holding on with me!

) Closes #266

B4nan reviewed May 5, 2023

View reviewed changes

darkbasic force-pushed the v6-dataloader branch from f47af45 to 8b29d99 Compare May 10, 2023 15:17

darkbasic changed the title ~~feat: initial ref dataloader implementation~~ feat: initial reference and collection dataloader implementations May 10, 2023

darkbasic force-pushed the v6-dataloader branch from 8b29d99 to adcf873 Compare May 11, 2023 19:03

B4nan reviewed May 12, 2023

View reviewed changes

packages/core/src/EntityManager.ts Outdated Show resolved Hide resolved

B4nan force-pushed the v6 branch 4 times, most recently from ffd989a to 854a974 Compare May 14, 2023 21:41

darkbasic force-pushed the v6-dataloader branch from adcf873 to 83f4a9d Compare May 20, 2023 08:24

darkbasic added 12 commits August 30, 2023 11:25

feat(dataloader): implement collections init, test Reference.load and…

a19d9cd

… Collection.load

chore: remove redundant async/await

9178ecf

feat(dataloader): use EntityMetadata objects as Map keys instead of _…

820fe84

…_meta.className for performance

fix(dataloader): misc test fixes

efc9c15

chore(dataloader): move tests into features

bb922b8

fix(col-dataloader): move dataloder option into InitOptions

224e62e

fix(dataloader): remove unnecessary await

c3742cd

feat(col-dataloader): split getColBatchLoadFn into several methods

0cb640f

fix(dataloader): check the generated queries and use unwrap instead o…

7a5d748

…f load

feat(col-dataloader): revert to the default load strategy

6a59382

feat(dataloader): add global switch

8f57575

docs: add dataloader

8ad1481

darkbasic mentioned this pull request Sep 7, 2023

Combine concurrent identical queries into one. #2470

Closed

B4nan requested changes Sep 8, 2023

View reviewed changes

packages/core/src/utils/Utils.ts Outdated Show resolved Hide resolved

packages/core/src/utils/Configuration.ts Outdated Show resolved Hide resolved

packages/core/src/enums.ts Outdated Show resolved Hide resolved

darkbasic added 2 commits September 9, 2023 12:24

chore: move dataloader utils into their own file

30b1507

feat(dataloader): allow boolean values

8019b5f

B4nan reviewed Sep 9, 2023

View reviewed changes

docs/docs/dataloaders.md Outdated Show resolved Hide resolved

B4nan reviewed Sep 9, 2023

View reviewed changes

packages/core/src/entity/Collection.ts Outdated Show resolved Hide resolved

darkbasic added 2 commits September 9, 2023 13:03

docs: use correct dataloader enums

bd03c04

chore(dataloader): take advantage of existing em variable

9339fa1

B4nan approved these changes Sep 9, 2023

View reviewed changes

B4nan changed the title ~~feat: initial reference and collection dataloader implementations~~ feat(core): allow using dataloader for references and collections Sep 9, 2023

B4nan merged commit 53b9ca1 into mikro-orm:v6 Sep 9, 2023
9 checks passed

B4nan pushed a commit that referenced this pull request Oct 21, 2023

feat(core): allow using dataloader for references and collections (#4321

8b0025a

) Closes #266

B4nan mentioned this pull request Oct 21, 2023

MikroORM v6 #4731

Merged

22 tasks

B4nan pushed a commit that referenced this pull request Oct 25, 2023

feat(core): allow using dataloader for references and collections (#4321

d6b4bd6

) Closes #266

B4nan pushed a commit that referenced this pull request Nov 2, 2023

feat(core): allow using dataloader for references and collections (#4321

8169c59

) Closes #266

B4nan pushed a commit that referenced this pull request Nov 5, 2023

feat(core): allow using dataloader for references and collections (#4321

8f4790f

) Closes #266

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(core): allow using dataloader for references and collections #4321

feat(core): allow using dataloader for references and collections #4321

darkbasic commented May 5, 2023

codecov bot commented May 5, 2023 •

edited

B4nan commented May 5, 2023

darkbasic commented May 5, 2023

B4nan commented May 5, 2023

darkbasic commented May 5, 2023

darkbasic commented May 5, 2023

darkbasic commented May 5, 2023

B4nan commented May 5, 2023

darkbasic commented May 5, 2023

B4nan commented May 5, 2023 •

edited

darkbasic commented May 5, 2023

darkbasic commented May 8, 2023

merceyz commented May 8, 2023 •

edited

darkbasic commented May 8, 2023 •

edited

darkbasic commented May 8, 2023

B4nan commented May 9, 2023

darkbasic commented May 9, 2023

B4nan commented May 9, 2023

darkbasic commented May 10, 2023 •

edited

darkbasic commented May 11, 2023 •

edited

darkbasic commented Aug 30, 2023

B4nan left a comment

darkbasic commented Sep 9, 2023

B4nan left a comment

feat(core): allow using dataloader for references and collections #4321

feat(core): allow using dataloader for references and collections #4321

Conversation

darkbasic commented May 5, 2023

codecov bot commented May 5, 2023 • edited

Codecov Report

B4nan commented May 5, 2023

darkbasic commented May 5, 2023

B4nan commented May 5, 2023

darkbasic commented May 5, 2023

darkbasic commented May 5, 2023

darkbasic commented May 5, 2023

B4nan commented May 5, 2023

darkbasic commented May 5, 2023

B4nan commented May 5, 2023 • edited

darkbasic commented May 5, 2023

darkbasic commented May 8, 2023

merceyz commented May 8, 2023 • edited

darkbasic commented May 8, 2023 • edited

darkbasic commented May 8, 2023

B4nan commented May 9, 2023

darkbasic commented May 9, 2023

B4nan commented May 9, 2023

darkbasic commented May 10, 2023 • edited

darkbasic commented May 11, 2023 • edited

darkbasic commented Aug 30, 2023

B4nan left a comment

Choose a reason for hiding this comment

darkbasic commented Sep 9, 2023

B4nan left a comment

Choose a reason for hiding this comment

codecov bot commented May 5, 2023 •

edited

B4nan commented May 5, 2023 •

edited

merceyz commented May 8, 2023 •

edited

darkbasic commented May 8, 2023 •

edited

darkbasic commented May 10, 2023 •

edited

darkbasic commented May 11, 2023 •

edited