nodejs-performance

Welcome to my NodeJs performance example. In this repository, I demonstrate how some scenarios can cause performance bottlenecks in NodeJs and how to improve performance of our NodeJs apps. Let's get started.

First things first

To start things off, let's illustrate how NodeJs execution can be blocked using a simple contrived situation. Say we have an ExpressJs app with two endpoints:

const express = require('express');

const app = express();

app.get('/', (req, res) => {
    res.send('Performance example');
});

app.get('/delay', (req, res) => {
    delay(9000);
    res.send('Delay example');
});

app.listen(3000, () => {
    console.log('Server listening on port 3000');
});

We can define our delay function as follows:

function delay(duration){
    const startTime = Date.now();
    while(Date.now() - startTime < duration){
        // This is not an I/O activity, event loop is blocked!
    }
}

We know that NodeJs uses a thread pool and the operating system to handle I/O processes and these do not block handling other requests. When the I/O tasks are done, their callbacks are put in the Job Queue so the event loop can process them on the next clock tick. In our delay() function however, our server is actually blocked because the while loop is not making any I/O call and is just waiting until a condition is met. If we visit our browser with the network tab on, we see that the request to localhost:3000/ takes around 20ms depending on your environment. However, if we call the /delay endpoint, we have to wait for around 9 seconds (assuming we call delay with 9000) for the page to load. We can also try to load /delay in one browser tab and / url in a second tab. Notice how the / endpoint now takes around 7 to 8 seconds to load. This is because the timer endpoint is blocking the server. Compare this with an async example like this:

app.get("/delay-async", (req, res) => {
  setTimeout(() => {
    res.send("Async delay example");
  }, 9000);
});

Now if we load /delay-async on one tab and / on a second tab, the second tab loads fast as usual. This is all to illustrate the blocking nature of NodeJs when we aren't performing async operations.

Real life examples

JSON.stringify and JSON.parse functions are examples of function calls that can take a long time to execute. These can execute in milliseconds even for large objects, however, if you have a server that takes in many requests, this can add up especially if these requests all make use of these JSON functions which is common when logging objects. if a call to stringify takes 10ms and your request takes 10ms, you have doubled the response time of the request. Other function to watch out for are sorting very large arrays, as these can start to get slow. Node's crypto module also has functions designed to execute slowly so it's harder for your passwords to be guessed. These are used to create hashes of user's passwordds.

Improving performance

Generally, in dealing with overloaded servers, we want to divide the work up and spread the load. Knowing that unlike Jave and C#, Node is a single threaded runtime. What we do is to run multiple processes side by side to share the work. Our requests can be spread across multiple NodeJs processes. This simple technique allows NodeJs to make full use of all the CPUs on your machine. How do we achieve this: enter the Node cluster module. It allows us to create copies of our server process that each run the server code side by side in parallel. The code for this part can be found in the cluster branch.

Using the cluster module, NodeJs has a master process and this process can fork child processes using the fork() method. The worker processes run the same code as the master but we can differentiate which process is running a particular request using the isPrimary property.

We can now adjust our code by importing the cluster module (const cluster = require('cluster');) and using it like so:

console.log('running server.js');
if(cluster.isPrimary){
    console.log(`Master ${process.pid} is running`);
    cluster.fork();
    cluster.fork();
} else{
    console.log(`Worker ${process.pid} started`);
    app.listen(3000);
}

To demonstrate that every worker node runs the exact same code, we add a console.log call before the cluster code. Now when we run npm start, we see that it logs '' 3 times, one for the master node and one each for the 2 worker nodes.

Now when we repeat our 2 tab trick by running /delay in one tab and / in another, the second request returns immediately with a different process ID. Also, try running the same /delay request in both tabs at the same time and see how in about 9 seconds, both requests complete. Ensure to disable cache in the network tab so that browsers like chrome don't wait for one request to complete before running the second.

Improving our cluster performance

How many forks should we create? The answer the number of physical or logical cores in our CPU. We can find this out by using the os module to find the number:

const NUMCPUs = os.cpus().length;
for(let i = 0; i < NUMCPUs; i++){
    cluster.fork();
}

I have 8 logical cores in on my computer and that should vary on different systems. Now we have maximised the number of cores we can use. Now we can open the /delay in multiple tabs and they'll each take around 9 seconds to complete the requests. It should now take up to 9 concurrent requests to slow down our server.

Using PM2

PM2 is a great tool for handling clusters automatically. With PM2, we can use clusters without directly using the cluster module. After installing pm2 NPM package and removing all the cluster code, we can just run the following on the terminal:

pm2 start server.js -i max

Making requests on multiple browser tabs now shows the process IDs for the different processes handling our requests. Remember, PM2 uses the cluster module under the hood. We can do pm2 list to show the current status of our server and pm2 logs to get a real time view of what's being logged in our server in real time.

PM2 has so many other handy features like sending logs to a file and log rotation. We can take advantage of these to manage our clusters:

pm2 start server.js -l logs.txt -i max

pm2 show 0

The show command accepts the ID of a worker process and gives us more information about its running. We can also stop and start individual processes by their id: pm2 stop 7.

The pm2 monit commit shows us a live dashboard of the status of each server process as they accept incoming requests. We can see the CPU usage go up and down as requests come in.

Lastly, if our server code has changed, we want to ensure a zero downtime restart. Using pm2 restart server, there will be a point where the server will be unavailable to users. We instead use the reload command which restarts the individual processes one by one, keeping at least one process running at all times.

pm2 reload server

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
README.md		README.md
logs.txt		logs.txt
package-lock.json		package-lock.json
package.json		package.json
server.js		server.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

nodejs-performance

First things first

Real life examples

Improving performance

Improving our cluster performance

Using PM2

About

Releases

Packages

Languages

onubrooks/nodejs-performance

Folders and files

Latest commit

History

Repository files navigation

nodejs-performance

First things first

Real life examples

Improving performance

Improving our cluster performance

Using PM2

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages