Running simultaneous queries with relations stalls typeorm #4738

oskarious · 2019-09-12T14:51:01Z

Issue type:

[ ] question
[x] bug report
[ ] feature request
[ ] documentation issue

Database system/driver:

[ ] cordova
[ ] mongodb
[ ] mssql
[ ] mysql / mariadb
[ ] oracle
[x] postgres
[ ] cockroachdb
[ ] sqlite
[ ] sqljs
[ ] react-native
[ ] expo

TypeORM version:

[ ] latest
[x] @next
[ ] 0.x.x (or put your version here)

I ran into an issue where typeorm would just stop responding occasionally, and when looking at the postgres stats I noticed that typeorm kept 10 (the size of the pool) connections open and in idle and didn't seem to ever close them afterwards.

The connections showed up as Idle | WaitEvent: Client: ClientRead.

Some more investigations seem to indicate that it the issue presents itself when trying to find entities with relations (with the getManager().find(Entity, options) syntax).

Increasing the pool size can help mitigate the problem, and using the query builder works as a workaround, but is obviously something we'd want to avoid as much as possible in favour of shorter, and clearer syntax.

I don't believe it's a config issue either

let entities = ['./dist/api/src/entities/**/*.js'];

module.exports = {
  type: 'postgres',
  synchronize, // True locally
  ssl,
  host,
  database,
  username,
  password,
  entities,
  timezone: 'Z',
  logging,
  logger,
  maxQueryExecutionTime: 1000,
  extra: {
    poolSize, // 10
    idleTimeoutMillis: 5000, // Drop connections that are stalled
    connectionTimeoutMillis: 10000, // Drop connections that are stalled
  },

  cli: {
    entitiesDir: 'api/src/entities/**',
    migrationsDir: 'api/src/migration',
    subscribersDir: 'api/src/subscriber',
  },
};

Steps to reproduce or a small repository showing the problem:

/* eslint-disable @typescript-eslint/no-use-before-define */
import { createConnection, Entity, getManager, ManyToOne, OneToMany, PrimaryGeneratedColumn } from 'typeorm';

@Entity()
export class Entity2 {
  @PrimaryGeneratedColumn()
  id!: number;

  @ManyToOne(() => Entity1, ent1 => ent1.ent2)
  ent1!: Entity1;
}

@Entity()
export class Entity1 {
  @PrimaryGeneratedColumn()
  id!: number;

  @OneToMany(() => Entity2, ent2 => ent2.ent1)
  ent2!: Entity2[];
}

const getWithRelations = () => {
  getManager()
    .find(Entity1, { relations: { ent2: true } })
    .then(e => {
      console.log(e);
    });
};

const getSimple = () => {
  getManager()
    .find(Entity1)
    .then(console.log);
};

const getWithQb = () => {
  getManager()
    .getRepository(Entity1)
    .createQueryBuilder('ent')
    .leftJoinAndSelect('ent.ent2', 'ent2')
    .getMany()
    .then(console.log);
};

(async () => {
  await createConnection();

  const parent = await getManager().save(new Entity1());

  const child1 = new Entity2();
  child1.ent1 = parent;
  await getManager().save(child1);

  const child2 = new Entity2();
  child2.ent1 = parent;
  await getManager().save(child2);

  for (let i = 0; i < 35; i++) {
    getWithRelations(); // Will stall
  }

  for (let i = 0; i < 35; i++) {
    getSimple(); // Runs as expected
  }

  for (let i = 0; i < 35; i++) {
    getWithQb(); // Runs as expected
  }
})();

The text was updated successfully, but these errors were encountered:

johnbjurulf · 2019-11-26T09:39:18Z

Got the same issue :-(

jwhitmarsh · 2020-09-11T15:24:38Z

Is there any alternative solution to this problem, other than those suggested by OP?

oskarious · 2020-09-23T11:09:49Z

@jwhitmarsh
For reference, I have started using the query builder instead. Haven't checked out the latest versions of typeorm, but last time I checked (a couple of months ago) it was still happening. Interested in hearing if anyone else has any alternatives as well!

falahati · 2021-01-25T18:48:55Z

Same thing here; especially if you have a queue for operations; in our case, it is the RabbitMQ and requests from other microservices that can overwhelm the service when it is down for a while; like when stuck due to this issue; therefore pushing it to stay in this state forever.
Decreasing the database timeout can mitigate the problem, but you better have your operations in a transaction and being able to gracefully reject or redone it, otherwise, it's a risk to the integrity of the data stored.

Please consider taking a look into this issue as it almost makes it impossible to write a real world application with the Postgres driver.

falahati · 2021-01-30T21:57:17Z

My problem was that I tried to use .find() from outside of a transaction using Repository while being in a transaction; since the transaction takes a connection, and the second nested call takes another connection; it would create a deadlock as a result of a race condition if 10 transactions ware started together and then each tried to use .find() and therefore opening another connection before any connection in the pool is free.
Solved simply by doing everything database related using the instance of entity manager provided by the transaction.

I also tried to reproduce the provided sample code here and it works as expected. So I don't know if this specific issue still exists.

Stene3 · 2022-07-01T16:01:27Z

Same happenned to me. Seems like issue with relations loading when relationLoadStrategy of type query is used. Single queries works, but more concurrently connections cause typeorm, and most probably all connection in pool, to be stalled. Any idea what could be the reason? I was trying to debug but no luck yet.

mkeemon · 2022-08-11T22:11:13Z

@Stene3 I have been running into the same issue using relationLoadStrategy: 'query'. The connection pool seems to be at the heart of the issue. I increased the max pool size to 50 against my local Postgres instance, and no longer run into the app locking up. Not a solution by any means, but helpful in debugging.

edeesis · 2023-01-30T17:18:45Z

We just ran into this issue as well, thought it might've run out of memory, but the only thing that makes sense is the connection pool issue. App just stopped responding to requests.

adrien2p · 2023-05-11T14:50:06Z

UP!

scr4bble · 2023-08-04T12:04:08Z

We just ran into this issue as well.
We are using MySQL though, not PostgeSQL.
TypeORM version "0.3.10"
Using parallel tasks from (nestjs/)bull. Each of the parallel tasks executes find() method to query the database while also fetching some of the entity relations.
It started happening when we switched relationLoadStrategy to 'query' with intention to improve the performance.

joelybahh · 2023-09-21T05:45:41Z

Same issue with us, we rolled out an API performance update via this at the time "magical" setting, however, we get the same thing, basically is like our API is on a timer before it falls over. Its a function app on Azure, so once this issue arises, the ENTIRE function needs to be restarted to create a new instance of typeORM.

Our workaround that doesn't crash the API is a poolSize of 100, this only works because our DB connection errors out first, which thankfully, isn't completely breaking the typeORM manager, and throws the appropriate errors so it will start working again as they clear out.

darius-00 · 2023-09-28T07:58:43Z

Same issue here. Using NestJS with TypeORM, Postgresql. Running more concurrent queries than a number in poolSize specified causes full stall from any further database queries.
Query:
this.usersRepository.findOne({ where: { 'id': userId } relations: ['posts'] })

SELECT * FROM pg_stat_activity;
While typeorm works as expected connections in pg_stat_activity appears and disappears. But as soon as you do 100 concurrent queries all those connections in pg_stat_activity stays there in idle state and server becomes unresponsive.

joelybahh · 2023-09-29T01:51:20Z

@darius-00 I've experienced exactly the same issue, then an entire server restart is required. I've found some "hidden" config options that are helping, but still unusual as no errors are thrown when the limit is reached, its like typeORM just silently dies with no way to reboot without entire restart.

My hunch is that as we are in a "serverless" api environment, maybe its initialising the datasource more than once so even with max set, its only max for that data source instance, but am unsure.

Here's some extra config that prevented everything falling over at the very least for us (So far):

const extra = {
    ssl: DB_DEV ? false : true,
    max: DB_MAX ? Number(DB_MAX) : DEFAULT_MAX,
    poolSize: DB_POOL_SIZE ? Number(DB_POOL_SIZE) : DEFAULT_POOL_SIZE,
    connectionTimeoutMillis: DB_CONNECTION_TIMEOUT_MILLIS
        ? Number(DB_CONNECTION_TIMEOUT_MILLIS)
        : DEFAULT_CONNECTION_TIMEOUT_MILLIS,
    query_timeout: DB_QUERY_TIMEOUT ? Number(DB_QUERY_TIMEOUT) : DEFAULT_QUERY_TIMEOUT,
    statement_timeout: DB_STATEMENT_TIMEOUT
        ? Number(DB_STATEMENT_TIMEOUT)
        : DEFAULT_STATEMENT_TIMEOUT,
};

export const AppDataSource = new DataSource({
    type: "postgres",
    host: DB_HOST,
    port: Number(DB_PORT),
    username: DB_USERNAME,
    password: DB_PASSWORD,
    database: DB_DATABASE,
    extra,
    logging: DB_LOG ? JSON.parse(DB_LOG) : true,
    synchronize: false,
    entities: entities,
    subscribers: [],
    migrations: migrations,
    ...rest,
});

joelybahh · 2023-09-29T01:54:45Z

When I set max to 5, for example, I observe the pg_stat_activity increments connections by 5 for EVERY request to the API, making me unsure if max means max connections per query, or max connections in general. Documentation around configuration options for postgres specifically seem light/non-existent. The datasource is only being initialised once from what we can tell so unless we're missing some other nuance of serverless, unsure best starting point to debug this.

clintonb · 2023-09-29T02:38:19Z

max comes from node-postgres. You're creating a pool with the specified number of connections. The connections are opened when the DataSource initializes the connection. You should expect to see those connections idle, unless you are making simultaneous connections through that one DataSource.

If you have n instances of your service running, you will end up with n * max connections.

darius-00 · 2023-09-29T10:42:35Z

poolSize and max config for me did the same thing. Just to make little better I increased poolSize to 100. It's much better than default 10, but still any user with js script
for(let i = 0; i < 200; i++){ fetch('/api/test') }
inside browsers dev console can make server inaccessible until it is restarted manually.
Strange thing is that sometimes I cannot recreate problem even with 1k http requests. After server is restarted, problem can occur with just 200 requests. After next restart it can be good again. After another restart it can happen again.

KamalAman · 2023-10-13T20:01:30Z

My problem was that I tried to use .find() from outside of a transaction using Repository while being in a transaction; since the transaction takes a connection, and the second nested call takes another connection; it would create a deadlock as a result of a race condition if 10 transactions ware started together and then each tried to use .find() and therefore opening another connection before any connection in the pool is free. Solved simply by doing everything database related using the instance of entity manager provided by the transaction.

I also tried to reproduce the provided sample code here and it works as expected. So I don't know if this specific issue still exists.

The root cause for our deadlock was caused by caused by the N+1 Problem in GraphQL creating dead-lock with query managers + repository queries e.g. 10 query managers were being created, but then we needed some additional data from the database and used a repository query, causing the dead lock with the maxPool size of 10. Query number 11 may not pass go and it creates a deadlock.

The solution was to ensure each top level service function only interacts with the database using the same query manager.

dgonzalezcuyna · 2023-11-20T15:32:45Z

+1

ajubin · 2023-11-28T14:20:42Z

it seems to be related to #10481

scr4bble · 2023-11-28T22:08:21Z

it seems to be related to #10481

Yes, the new issue is well described. Hopefully it gets more attention. It's quite important feature.

gauravl-tevaeralabs · 2024-01-09T08:32:27Z

UP!

imnotjames added bug driver: postgres labels Oct 6, 2020

balazsbencs-sc mentioned this issue Jun 21, 2023

(Bug) Eager loading not working with find and relationLoadStrategy = join (but working with query strategy) #9139

Open

This was referenced Aug 4, 2023

Loading entities with query strategy and connection limit set to 1 for mysql driver will cause dead lock #9298

Closed

.setRelationLoadStrategy("query") missing #8866

Open

evereq mentioned this issue Dec 28, 2023

[Enhancement] Optimize some critical queries with Query Builder and create indexes ever-co/ever-gauzy#7407

Open

casheeeewnuts mentioned this issue Jan 16, 2024

fix: Hangup when load relations with relationLoadStrategy: query #10630

Merged

7 tasks

pleerock closed this as completed in #10630 Jan 26, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Running simultaneous queries with relations stalls typeorm #4738

Running simultaneous queries with relations stalls typeorm #4738

oskarious commented Sep 12, 2019

johnbjurulf commented Nov 26, 2019

jwhitmarsh commented Sep 11, 2020

oskarious commented Sep 23, 2020

falahati commented Jan 25, 2021 •

edited

falahati commented Jan 30, 2021 •

edited

Stene3 commented Jul 1, 2022

mkeemon commented Aug 11, 2022

edeesis commented Jan 30, 2023 •

edited

adrien2p commented May 11, 2023

scr4bble commented Aug 4, 2023

joelybahh commented Sep 21, 2023 •

edited

darius-00 commented Sep 28, 2023 •

edited

joelybahh commented Sep 29, 2023 •

edited

joelybahh commented Sep 29, 2023

clintonb commented Sep 29, 2023

darius-00 commented Sep 29, 2023

KamalAman commented Oct 13, 2023

dgonzalezcuyna commented Nov 20, 2023

ajubin commented Nov 28, 2023

scr4bble commented Nov 28, 2023

gauravl-tevaeralabs commented Jan 9, 2024

Running simultaneous queries with relations stalls typeorm #4738

Running simultaneous queries with relations stalls typeorm #4738

Comments

oskarious commented Sep 12, 2019

johnbjurulf commented Nov 26, 2019

jwhitmarsh commented Sep 11, 2020

oskarious commented Sep 23, 2020

falahati commented Jan 25, 2021 • edited

falahati commented Jan 30, 2021 • edited

Stene3 commented Jul 1, 2022

mkeemon commented Aug 11, 2022

edeesis commented Jan 30, 2023 • edited

adrien2p commented May 11, 2023

scr4bble commented Aug 4, 2023

joelybahh commented Sep 21, 2023 • edited

darius-00 commented Sep 28, 2023 • edited

joelybahh commented Sep 29, 2023 • edited

joelybahh commented Sep 29, 2023

clintonb commented Sep 29, 2023

darius-00 commented Sep 29, 2023

KamalAman commented Oct 13, 2023

dgonzalezcuyna commented Nov 20, 2023

ajubin commented Nov 28, 2023

scr4bble commented Nov 28, 2023

gauravl-tevaeralabs commented Jan 9, 2024

falahati commented Jan 25, 2021 •

edited

falahati commented Jan 30, 2021 •

edited

edeesis commented Jan 30, 2023 •

edited

joelybahh commented Sep 21, 2023 •

edited

darius-00 commented Sep 28, 2023 •

edited

joelybahh commented Sep 29, 2023 •

edited