What is the improvement or update you wish to see?
We have a repo with a few tasks in various sub-directories. We build bioinformatics software, and one of our sub-directories sometimes contains 100+ GB of test datasets.
I expected to be able to run a dev task like this one:
    "auth:dev": {
      "cache": false,
      "persistent": true
    },
And, because cache is false, I expected turbo not to scan the directory.
Instead, turbo tries to compute checksums of the entire repository content, including the test files. This blows up memory and crashes computers (it quickly reaches ~64GB of RAM).
I can stop this by setting an inputs glob for the dev task:
    "auth:dev": {
      "cache": false,
      "inputs": ["src/**"],
      "persistent": true
    },
But this was not intuitive. If cache is false, why should I need to tell turbo to only look under src/ for files to hash?
Secondly, even with this above configuration, running "npx turbo run auth:dev" from the root of the monorepo also blew up memory, and after much troubleshooting I fixed it by adding
to my root turbo.json, because the daemon appears to checksum every file in the repository regardless of the inputs definitions of sub-tasks and the value of globalDependencies.
The documentation for the daemon parameter says "Turborepo runs a background process to pre-calculate some expensive operations.", but it would be helpful to document, and to be able to control, which files the daemon is going to checksum. Checksumming every file in the repo is quite surprising and, in this case, disruptive when there is no way to control it.
Is there any context that might help us understand?
You can recreate this by creating a monorepo that has a sub-directory with a persistent, cache:false task. In some sub-directory, add ~100+GB of files and try to run the dev task with the default settings. Memory explodes.
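
The reproduction steps above could be scripted roughly as follows. This is only a sketch under assumptions, not a verified reproduction. The function name `scaffoldRepro`, the package path `apps/auth`, and the file `data/big.bin` are all hypothetical placeholders. The generated turbo.json uses a top-level `tasks` key, which may need to be `pipeline` on older turbo versions, and the root package.json may need a `packageManager` field. The large file is created by extending an empty file, which is typically sparse on disk, so it may not behave exactly like real datasets.

```typescript
import * as fs from "node:fs";
import * as os from "node:os";
import * as path from "node:path";

// Write a small JSON file with stable formatting.
function writeJson(file: string, value: unknown): void {
  fs.writeFileSync(file, JSON.stringify(value, null, 2) + "\n");
}

// Scaffold a minimal monorepo with one persistent, cache:false task and one
// large file in the same package. All names here are hypothetical.
// Returns the path of the large file.
export function scaffoldRepro(
  root: string = fs.mkdtempSync(path.join(os.tmpdir(), "turbo-repro-")),
  bigFileBytes: number = 100 * 1024 ** 3, // ~100 GB logical size
): string {
  const pkgDir = path.join(root, "apps", "auth");
  fs.mkdirSync(path.join(pkgDir, "src"), { recursive: true });
  fs.mkdirSync(path.join(pkgDir, "data"), { recursive: true });

  // Root workspace manifest (assumes npm-style workspaces).
  writeJson(path.join(root, "package.json"), {
    name: "turbo-repro",
    private: true,
    workspaces: ["apps/*"],
  });

  // The task from this issue, with default settings (no "inputs").
  // Assumption: "tasks" is the right key for the installed turbo version.
  writeJson(path.join(root, "turbo.json"), {
    tasks: {
      "auth:dev": { cache: false, persistent: true },
    },
  });

  // A package whose dev script simply stays alive.
  writeJson(path.join(pkgDir, "package.json"), {
    name: "auth",
    scripts: { "auth:dev": "node src/index.js" },
  });
  fs.writeFileSync(
    path.join(pkgDir, "src", "index.js"),
    "setInterval(() => {}, 1000);\n",
  );

  // Create an empty file, then extend it to the requested logical size.
  const bigFile = path.join(pkgDir, "data", "big.bin");
  fs.closeSync(fs.openSync(bigFile, "w"));
  fs.truncateSync(bigFile, bigFileBytes);
  return bigFile;
}
```

After calling `scaffoldRepro()`, running "npx turbo run auth:dev" from the generated root directory should exercise the same default-settings path described above.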
Does the docs page already exist? Please link to it.
No response