
Copy in serial? #34

Closed
jamestalmage opened this issue Jul 22, 2016 · 10 comments · Fixed by #69

Comments

@jamestalmage

It seems really unlikely to me that copying a bunch of files in parallel would speed anything up. Indeed, I would not be surprised to find it slowed things down. Increasing the number of files read simultaneously seems likely to cause lots of seek delays on HDDs, and SSDs should be able to saturate their bandwidth regardless of how many files are being copied at once.

It's probably worth running some benchmarks to prove this, but if there were any advantage to parallel copies, I think you would have seen operating-system commands and GUIs doing them.

I guess technically you could be copying from multiple slow network-mounted drives, in which case parallelization might make sense. But I think that's a very small (possibly non-existent) percentage of users.

@sindresorhus
Owner

I think it really depends on what you copy. My theory is that lots of small files would be faster to copy serially, especially with sync fs methods, but with large files we could see a speed-up with parallelization. Only a wild guess though, so would need some benchmarks. At the very least, we should limit the concurrency.

@YurySolovyov
Contributor

> At the very least, we should limit the concurrency.

We can start by replacing Promise.all with https://github.com/sindresorhus/p-all and choosing some finite number for concurrency.
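The idea can be sketched with a hand-rolled limiter standing in for p-all; `copyOne` and the concurrency value of 8 below are illustrative, not the project's actual API:

```javascript
// A minimal concurrency limiter, mimicking what p-all provides.
// `tasks` is an array of functions that each return a promise;
// at most `concurrency` of them are in flight at any one time.
function pAll(tasks, {concurrency = Infinity} = {}) {
	return new Promise((resolve, reject) => {
		const results = new Array(tasks.length);
		let nextIndex = 0;
		let active = 0;
		let settled = 0;

		if (tasks.length === 0) {
			resolve(results);
			return;
		}

		const launch = () => {
			// Start tasks until the cap is reached or none remain.
			while (active < concurrency && nextIndex < tasks.length) {
				const index = nextIndex++;
				active++;
				Promise.resolve()
					.then(() => tasks[index]())
					.then(value => {
						results[index] = value;
						active--;
						settled++;
						if (settled === tasks.length) {
							resolve(results);
						} else {
							launch();
						}
					}, reject);
			}
		};

		launch();
	});
}

// Instead of Promise.all(files.map(copyOne)), cap the parallelism:
//   pAll(files.map(file => () => copyOne(file)), {concurrency: 8});
```

Note the tasks must be *functions* returning promises, not promises themselves; a promise created eagerly is already running, so wrapping in a thunk is what makes the cap meaningful.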

@sindresorhus
Owner

@YurySolovyov Yup. We should do some benchmarking on this, but I think (os.cpus().length || 1) * 2 or (os.cpus().length || 1) * 3 is generally a good default.

@YurySolovyov
Contributor

@sindresorhus I'm just not sure where the bottleneck is. If it's CPU threads, then yes, that's reasonable; if it's the number of file descriptors, it should be something much higher, like 64 or 128.

@sindresorhus
Owner

Hence why we need benchmarks.

@YurySolovyov
Contributor

YurySolovyov commented Oct 26, 2016

@sindresorhus do you have experience writing good ones? (I'd like to learn if so.)

@kevva
Contributor

kevva commented Oct 26, 2016

@YurySolovyov, you could use https://github.com/logicalparadox/matcha for example.

@schnittstabil
Collaborator

schnittstabil commented Oct 26, 2016

@YurySolovyov If you want to read some code: globby uses matcha (bench.js)

I don't fully trust matcha's results, but I can neither remember why nor name a good alternative – good benchmarking tools are extremely rare, in every language and on every platform.

Maybe @jamestalmage had similar concerns when he handcrafted the AVA benchmarks (great work, btw). However, I would prefer some library, even if it ends up being matcha.

@kevva
Contributor

kevva commented Oct 26, 2016

There is https://github.com/bestiejs/benchmark.js too.

@sindresorhus
Owner

I think the easiest solution here is to just add p-all and expose a concurrency option, which defaults to (os.cpus().length || 1) * 2.
