Pavel Romanov

The Ultimate Guide to Cron Jobs in Node.js

Pavel Romanov — Tue, 30 Apr 2024 12:15:47 GMT

One of the most common features of each application is task scheduling. For example, an application needs to send email reports about some operations once every 8 hours to a certain number of users.

While it is a common task, people in the Node.js community still get confused about how to implement it, what options we have to create and manage those tasks, and the pros and cons of each option.

In this article, well answer all those questions, giving you a clear picture of the tools you have at your disposal.

Terminology

The world of task scheduling uses different terms that basically mean the same thing: jobs or tasks that the application performs on a schedule.

Many people call those cron jobs. However, this term can cause more confusion, especially if youre just starting your journey on the backend.

For example, cron is a UNIX-based scheduler. What does it have to do with your application if you're running it on a Windows server?

A better term for it would be scheduled tasks. The name itself is crystal clear; you dont have to second-guess what it stands for.

This term will be used throughout the rest of the article.

Factors to consider before choosing a task scheduling approach

Before exploring the different tools and approaches, it's important to understand your application's specific requirements. Considering the following factors will help you to make an informed decision.

Application and infrastructure scale
High-frequency tasks
Long-running tasks
Task stacking

Let's explore each of these factors in more detail.

Application and infrastructure scale

You have to understand the scale of your application and underlying infrastructure. Itll allow you to make a reasonable decision on what approach to choose.

To name a few cases:

A single instance of application is running on a single server
Multiple instances of application are running on a single server
Multiple instances of applications are running on multiple servers

It is not limited only to those 3. You might have a different setup and infrastructure configuration. The point is that you have to understand it before deciding on the task scheduling approach.

High-frequency tasks

If you need to run tasks at intervals shorter than 1 minute, your options will be more limited. Some scheduling solutions don't support high-frequency tasks out of the box at all or require additional workarounds.

Consider the minimum interval between tasks that your application requires, and ensure that the chosen solution fits those needs.

Long-running tasks

Long-running tasks introduce their own set of challenges. These tasks can exacerbate issues like memory leaks, which may not be apparent in applications without long-running processes. When implementing long-running tasks, consider the following challenges:

Debugging complexity: Long-running tasks can be harder to debug due to their extended runtime and potential interactions with other parts of the system.
Maintenance and updates: Careful planning is required when performing maintenance or updates on applications with long-running tasks to ensure minimal disruption and proper handling of in-progress tasks.
Resource management: Long-running tasks consume system resources over an extended period. Proper resource management, such as memory and CPU usage, is crucial to avoid performance degradation and memory leaks.

Tasks stacking

Task stacking occurs when a new task is started before the previous one has been completed. This can happen when the scheduled interval is shorter than the task's execution time.

Task stacking can lead to resource contention, performance degradation, and unexpected behavior. When selecting a scheduling solution, consider how it handles task stacking and whether it provides mechanisms to prevent or manage such situations.

Tasks scheduling approaches

While all scheduled tasks share the common characteristics of having a specified execution time and associated code, they differ in how and where they are scheduled and managed. In this section, we'll explore the various approaches to scheduling and managing tasks, including:

UNIX-based scheduling with cron
Runtime scheduling
Runtime scheduling with persistence
Cloud-based scheduling solutions

Each approach has its own pros and cons, which we'll discuss in detail.

UNIX-based scheduling with cron

Cron is a long-standing and widely-used scheduler in UNIX-based systems. It allows you to schedule tasks, known as "cron jobs," to run at specific intervals of time.

The interface through which you schedule tasks to cron is called crontab (cron table). Crontab uses a specific syntax to define the schedule.

Cron runs a cron daemon, a single process responsible for managing all the scheduled tasks. When a new task is scheduled, the daemon starts monitoring it with a frequency of 1 minute. When a task is due to run, cron creates a separate process for that task and executes it.

Pros:

Flexibility: Cron is a versatile tool for running various types of scripts and programs on a scheduled basis.
Process isolation: Each cron job runs in its own separate process, providing a level of isolation and minimizing the impact of one task on others.
Reliability: Cron has been around since 1975 and has proven to be a reliable and stable solution for task scheduling.

Cons:

Single machine limitation: By default, cron is limited to a single machine, which can make it harder to scale for larger workloads.
Task stacking: If a task takes longer to execute than its scheduled interval, multiple instances of the task may start running concurrently, leading to resource contention. Cron doesn't have built-in mechanisms to prevent this.
Limited fault tolerance: Cron doesn't provide built-in error handling functionality. If a task fails, cron wont automatically retry it. You need to implement your own error handling and retry mechanisms.

Runtime scheduling

Runtime scheduling refers to scheduling tasks directly within the application code. This approach is facilitated by various ready-to-use libraries in the Node.js ecosystem, such as node-cron, corn, croner, and others. While these libraries may have different implementation details, they all operate on the same fundamental principle.

Under the hood, these libraries typically use setTimeout or setInterval functions to schedule tasks. When your application starts, the scheduled jobs are initialized and set to run at their specified intervals.

Pros:

Simplicity: Runtime scheduling is easy to set up and get running, as it doesn't require any additional infrastructure or external dependencies.
Quick development: For simple use cases or prototypes, runtime scheduling allows you to implement task scheduling quickly without the overhead of more complex solutions.

Cons:

Resource contention: Not all solutions execute tasks in a separate process/thread. This means that if you run CPU-intensive logic inside a task, you have to be aware of particular implementation details to ensure that those tasks wont block the main thread.
Deployment challenges: When deploying a new version of the application, all scheduled tasks will be rescheduled because all timers run inside of the same process as the application code (unless you separate it into a dedicated, independent process). It results in delayed task execution.
Scalability limitations: As the application scales beyond a single instance, managing runtime-scheduled tasks becomes increasingly difficult. Each application instance runs the same code and schedules the same tasks, leading to duplication and conflicts.
Task stacking: If a scheduled task takes longer to execute than its specified interval, multiple instances of the task may start running concurrently. This can result in resource contention and unexpected behavior, especially for long-running tasks.

Runtime scheduling can be a good fit for simple applications or prototypes where you want to test and visualize scheduled jobs quickly. However, it may not be the most suitable approach for production-grade applications that require scalability, reliability, and efficient resource utilization.

In this case, both task data and timers reside within the application.

Runtime scheduling with persistence

Runtime scheduling with persistence builds upon the basic runtime scheduling approach and introduces a persistence mechanism to store and manage scheduled tasks.

By incorporating a persistence layer, the application can store information about scheduled tasks in a database or a persistent storage system. This allows the application to track and recover tasks even after a restart or a new deployment, ensuring that tasks are executed on time.

Two notable libraries in the Node.js ecosystem provide runtime scheduling with persistence: Bull (or BullMQ) and Agenda.

Bull is a popular library that uses Redis, an in-memory data store, as its persistence mechanism.

Agenda is another library that provides runtime scheduling with persistence using MongoDB. It's important to note that the Agenda may not be actively maintained, with the last major release dating back to November 2022 (as of May 2024).

Unlike Ageda, Bull is actively maintained. The downside of Bull is it uses in-memory databases. If your server goes down, all jobs are lost.

Pros:

Persistence: Runtime scheduling with persistence ensures that scheduled tasks are not lost during application restarts or deployments. The persistence mechanism allows you to recover and resume tasks from where they left off.
Scalability: Abstracting task management from the main application and storing tasks in a separate persistence layer makes it easier to scale the application in the future. Multiple application instances can share the same persistence layer and coordinate task execution.
No task stacking: When using Bull, the queue-based structure ensures that tasks are executed in a reliable and orderly manner. New tasks are only started after the previous ones are completed, preventing all problems related to task stacking.

Cons:

Complexity: Implementing runtime scheduling with persistence requires additional setup, configuration, and learning compared to basic runtime scheduling. You need to set up and manage the persistence layer (e.g., Redis or MongoDB) and integrate it with your application.
Dependency on external systems: Relying on external systems like Redis or MongoDB introduces additional points of failure. If the persistence layer experiences issues or downtime, it can affect the task scheduling workflow.

Runtime scheduling with persistence offers a more robust and reliable approach compared to basic runtime scheduling. It addresses the issue of losing scheduled tasks during restarts and deployments and provides better scalability options. However, it also introduces additional complexity and dependencies on external systems.

With this approach, we move task information to the persistence layer. The application is now only responsible for running timers and executing tasks themselves (in case we still run everything in a single application instance).

Cloud-based scheduling solutions

The other option is to move to the cloud. In this case, you dont need to use any of the previous solutions to create, schedule, and persist tasks.

Popular cloud providers, such as Amazon Web Services (AWS) and Google Cloud Platform (GCP), offer dedicated services for scheduling tasks. For example, AWS provides the EventBridge Scheduler, while GCP offers the Cloud Scheduler.

Pros:

Decoupled architecture: Cloud-based scheduling solutions decouple the scheduling logic from your application code. This separation of concerns makes it easier to scale and maintain your application independently of the scheduling infrastructure.
Scalability and reliability: Cloud providers offer highly scalable and reliable scheduling services. They handle the underlying infrastructure, ensuring that tasks are executed on time and with high availability. You don't need to worry about managing the scheduling infrastructure yourself.
Flexibility and integration: Cloud-based scheduling solutions often provide flexible scheduling options, such as cron-based scheduling or more advanced scheduling patterns. They also integrate well with other cloud services, allowing you to build complex workflows and data pipelines.

Cons:

Learning curve: Cloud-based scheduling solutions require an understanding of a specific cloud platform and its services.
Cost considerations: While cloud-based scheduling solutions offer scalability and convenience, they come with associated costs. As your usage grows, the cost of using these services may increase. It's important to carefully evaluate the pricing models and estimate the long-term costs based on your application's needs.
Complexity in code sharing: If your application relies on code sharing between the main application and the scheduled tasks, using a cloud-based scheduling solution makes it increasingly harder to share the code. You may need to package and deploy your task code separately from your main application, which can require additional configuration and deployment processes.

Conclusion

In the world of Node.js task scheduling, there is no one-size-fits-all solution.

When deciding on a task scheduling approach for your application, it's critical to evaluate your specific needs and trade-offs carefully. Consider the scalability and reliability requirements, the complexity of setup and maintenance, and the potential costs involved.

Be open to exploring different options, adapting as your needs evolve, and finding the solution that best fits your application's goals and constraints.

Resource management in Node.js: the good, the bad and the worst

Pavel Romanov — Sun, 21 Apr 2024 12:15:31 GMT

In the previous article on resource management in Node.js, we covered the options available to manage resources. However, the previous article only provides a general overview.

In this article, well see the pros and cons of using each of them. Spoiler: some of the options dont make much sense to use.

Node.js CLI options

The Node.js CLI has two options for managing heap sizes: --max-old-space-size and --max-semi-space-size.

Trade-offs

Those options cannot regulate everything. Here a just a few cases where CLI options wont work for you.

Spawned process. Whenever you spawn a new process via child_process.spawn, youre creating a new instance of V8 alongside the process. It doesnt inherit all of the options from the process that it was spawned from automatically. Sure, you can pass them manually, but now you have to be aware of each process and its memory consumption in the system.

Addons. Addons are not directly related to JavaScript. In fact, they are C++ libraries accessible through JavaScript. This means the code running inside those libraries is not affected by V8 restrictions, at least in this case.

I/O operations. I/O operations are handled by the libuv library and it means were facing C++ again. And not only that. When reading a large file, the result is passed as a buffer into the callback:

import fs from 'node:fs';// The data is Buffer object.fs.readFile('large-file.txt', (err, data) => {  if (err) {    console.error('Error reading file:', err);  } else {    console.log('File contents:', data.toString());  }});

Even when we have the buffer instance inside of JavaScript, the memory allocated for this buffer resides outside of the V8 heap and is, therefore, unaffected by the options.

When to use

Use those when you want to specifically imply the limits of JavaScript-related code consumption to a single process.

They make garbage collection more efficient. When memory consumption gets closer to the size that you provided, the garbage collection runs more frequently, resulting in less memory usage.

PM2 process manager

PM2 is a popular JavaScript process manager. It has a feature to restart a certain process based on the memory it consumes. You can achieve this by specifying the max_memory_restart option.

Trade-offs

The first and most significant tradeoff is how the process manager manages memory consumption. It has a separate worker process that checks memory consumption every 30 seconds.

This means it takes up to 30 seconds for the process that ran out of memory to be detected and restarted. These 30 seconds can cost a lot, up to the whole server crash.

The other is inherited problems that come with directly using PM2, such as:

Overselling clustering that leads to cumbersome workflow.
Attempts to go beyond a simple process manager and deliver on those attempts purely, compared to the alternatives.
Licensing issues.

Ill soon publish an article that goes deep into the PM2 problems.

When to use

To be honest, I dont see any good reason to use PM2 at this point except only one specific use-case that Ill describe in the upcoming article.

User limits

User limit is the way to set limits to resources based on the user's role in a unix-based system.

Trade-offs

It is hard to scale. This type of limitation is especially challenging to scale if we want fine-grained control over specific processes or groups of processes.

Managing a large number of users, each with its own limits, quickly becomes complex.

When to use

It is an excellent option to use as a second line of defense. For example, if you have some way of managing resource consumption of a process or process group but want to be sure that if something goes wrong, you have a backup plan.

Control groups

Control groups are meant to manage resource consumption on the system level, similar to user limits in their focus on resource management.

Trade-offs

The only major problem I see with control groups is the configuration process. You might rightly say skill issues. However, the lack of clear interfaces through which we can configure them, like a single configuration file that resides next to the application code, makes it increasingly hard to manage.

One more issue could be a lack of complete isolation. With control groups, youre still pretty much working on the same machine, with the shared file system, network, and any other resources.

When to use

If you have enough skill and understanding of how they work, you can use them whenever you want. They are flexible enough to deliver on most of the tasks related to resource management.

Containers

Container is an abstraction that goes further than control groups. It allows allocating resources for a group of processes and almost completely isolates them in terms of file system, network, etc.

Trade-offs

The main trade-off of containers is the abstraction itself. It adds more complexity to the whole workflow.

Such complexity results in:

Higher resource usage. Creating a container requires more resources than making a simple control group.
Performance problems in high-performance applications. As a result of high resource consumption, you can run into performance issues.
Learning curve.

When to use

Despite container trade-offs in particular cases, it is still the best solution for resource management we have so far. Here are just a few benefits that you get by using them:

Granular control. They heavily restrict resource usage (using control groups for it). Each container is isolated from one another, making it harder to break things up.
Better tooling. Tooling around containers allows you to configure containers for specific needs easily. Moreover, If youre using some IDE with tools like Docker, it will hint you commands that you can use and highlight the ones that you cant, making the experience even better.
Great isolation. Containers provide a great isolation for applications running inside of them.

Conclusion

There are many options for resource management of Node.js applications. It all comes down to understanding the trade-offs of each approach and your specific needs.

In general, I would stick with containers whenever possible. Even if you don't know about them much, it is a great opportunity to learn.

Node Resource Management

Pavel Romanov — Sun, 14 Apr 2024 14:52:28 GMT

It is crucial to avoid issues like the one I encountered after joining a Node.js project. The project had strange code that simultaneously started multiple CPU and memory-demanding processes whenever we deployed a new application version. This approach caused high resource consumption and crushed crucial processes due to insufficient resources. Vertical scaling temporarily solved the problem, but it wasn't enough.

To avoid such issues, we implemented strategies to manage the processes and change our scaling approach, including correctly managing each process's resources to prevent excessive resource consumption.

In this article, you'll learn about tools you can use to properly manage resources like memory, CPU, and others in a Node.js application.

Overview

We have several options to manage resource consumption in Node.js applications, including:

Limiting V8 memory heap size: By setting a specific limit on heap size, we can make the garbage collection mechanism more efficient and avoid running out of memory.
Using PM2: PM2 can restart a process whenever it reaches some memory usage cap.
OS-specific features: Each OS provides a set of specific features and utilities that limit resource usage.
Container: You can use containers to limit resource usage of applications inside the container.

Now, lets look at each of those options in more detail.

Managing V8 memory heap size via CLI options

We can manipulate the memory heap size of the V8 JavaScript engine that Node.js uses by default. Two available options are --max-old-space-size and --max-semi-space-size.

These two options are directly related to the garbage collection process.

V8 uses a generational garbage collection strategy. The initial heap consists of two parts: young and old generations.

As you can see, the young generation consists of two parts called semi-spaces. The --max-semi-space-size option allows us to configure the size of these semi-spaces.

All newly created objects first get into the young generation heap. If an object survives a certain number of garbage collections, it is promoted to the old generation. The objects in semi-space are short-lived and get either promoted or destroyed relatively fast.

On the other hand, you might have many objects that live for a long time and aren't destroyed by a garbage collector. Thats where the --max-old-space-size option comes in handy. When the old size generation gets closer to its limit, the garbage collection process runs more frequently to clear up the space.

Here is an example command where we limit the size of the old space generation to 2GB.

node -max-old-space-size=2048 app.js

The size is set in megabytes.

Memory limits with PM2

PM2 is a popular Node.js process manager. It provides the ability to restart a particular process based on the memory consumed by a process.

The process manager has a specific configuration option called max_memory_restart.

The max_memory_restart tells PM2 to restart the process whenever it reaches the memory usage limit. Here is an example of using it directly within CLI:

pm2 start api.js max_memory_restart 500M

In this example, we tell PM2 to restart a process whenever memory consumption reaches 500M.

Memory is the only resource that we can manage through PM2.

OS-specific options

This section covers features of Linux or any other Unix-based OS to run Node.js. However, Windows has similar mechanisms. You can find Windows alternatives in each section without any further details.

User limit

Windows alternative - Windows System Resource Manager (WSRM).

User limit is a system command that allows setting limits on the number of resources used per user. It has two types of limits: soft and hard.

Soft limits are the ones that current users might manage and increase or decrease as needed. The hard limit tells up to which numbers the soft limit can be raised. It is configured exclusively on the root level and is needed to add an extra layer of protection.

As an example, you can set a CPU time limit with the following commands:

# Set hard limit of CPU usage to 50 seconds per processulimit -H -t 50# Set soft limit of CPU usage to 25 seconds per processulimit -S -t 25

Control group

Windows alternative - Job Object

The control group (cgroup) is an alternative feature for managing resource usage. It serves a similar purpose to user limits (resource control), but the approach is different.

The control group is an abstraction that allows users to group different processes and allocate a specific number of resources, such as CPU, memory, or file I/O, to each group. The control group ensures that processes under this group will only use the allocated resources.

Here is an example of how to create a control group and add a process to it:

# Create a new cgroupsudo cgcreate -g memory:/my_node_group# Set a memory limit to the groupsudo echo 536870912 > /sys/fs/cgroup/memory/my_node_group/memory.limit_in_bytes # Add process to the cgroupsudo echo  > /sys/fs/cgroup/memory/my_node_group/tasks

Despite the number of processes you add to this group, they will be collectively limited to 512MB of memory.

Isolate and Control Resource Using Containers

Containers are similar to control groups; in fact, containers use control groups to isolate resources and set strict limits to them. However, they go beyond just resource management and provide a spectrum of additional features like:

Network isolation
Filesystem isolation
Isolation of hostname and domain name
User and user group isolation
And others

As you can see, containers offer much more than resource management. They are much closer to a virtual machine (VM).

At the same time, tooling around containers is superior, thanks to projects like Docker and Podman. You can seamlessly create new containers and set their parameter, including resource limits.

This tooling makes it easier to use and manage containers than to do so manually with control groups. Thats why you usually see containers instead of plain control groups in real-world projects.

The control groups catch

There is a catch when using control groups directly or by any other containerization solution that uses control groups.

Currently, there is no way to know the exact constraints from within the Node itself, at least in a straightforward way. For example, if we set a limit of 1 CPU per container but have 5 in total on the machine, Node only knows about those 5, not the one it is running in.

In theory, this behavior is already fixed in Node.js 20.x. However, people are still running into the issue, which means it is still present.

If you ever need to make the Node application aware of particular resource constraints, the most straightforward approach is to declare those inside the Node application itself. This duplicates the sources of truth, but we can clearly respect the configuration without any additional overhead.

Conclusion

Whether you're preventing crashes, ensuring smooth multi-application environments, or optimizing performance, you now have the tools to manage Node.js resources effectively.

They range from Node CLI-specific, like passing options to limit heap sizes and using PM2 process manager, to more ubiquitous, like user limits, control groups, and containers.

In the next article, we'll examine the pros and cons of each approach and determine which is probably the best.

Node.js cluster module. You probably don't need it

Pavel Romanov — Sun, 07 Apr 2024 13:40:18 GMT

When you start learning more about the Node, you might encounter a thing called cluster module.

You may initially be confused about the difference between simply spawning processes and using clusters to do so at least I was.

But there is a difference, and it is significant. Actually, it is so significant that you don't need to use the cluster module in most of the cases.

In this article we'll look into more details about the cluster model. What is it, how it works, and answer the main question: "Why are you probably good without using it?"

What is a cluster?

First, let's understand a cluster and the difference between it and simply spawning new processes manually.

A cluster is simply an abstraction over a group of processes that is glued with some network features.

A cluster consists of four main components:

The main process
Worker processes
Inter-process communication (IPC)
Load-balancing mechanism

You can see that the cluster is, by default, provided with a load-balancing mechanism, unlike a group of manually created processes.

There is also a slight distinction in how the process created by cluster is different from the one created manually. Here is what the documentation says about it

server.listen({fd: 7}) Because the message is passed to the primary, file descriptor 7 in the parent will be listened on, and the handle passed to the worker, rather than listening to the worker's idea of what the number 7 file descriptor references.
server.listen(handle) Listening on handles explicitly will cause the worker to use the supplied handle, rather than talk to the primary process.
server.listen(0) Normally, this will cause servers to listen on a random port. However, in a cluster, each worker will receive the same "random" port each time they do listen(0). In essence, the port is random the first time, but predictable thereafter. To listen on a unique port, generate a port number based on the cluster worker ID.

Cluster structure

Now, let's explore the cluster components in more detail.

Main process

The main process is responsible for creating a cluster. It creates child processes called worker processes or simply workers.

The main process doesnt directly process any of the requests coming to the server. It's responsibility to manage the cluster and distribute incoming requests between worker processes.

Worker processes

Worker processes are the workhorses. They are responsible for processing requests and giving responses to the clients.

Inter-Process Communication (IPC)

IPC is the glue of clustering. It enables processes to exchange information, such as their health status and different data, and handle errors properly.

For example, we can spin up a new worker process in case some of them are crushed because we have information about the crush. That way, the cluster can always maintain the required number of workers.

Load-balancing

Load-balancing, built into the cluster, prevents overloading a single worker. Cluster can employ two main load-balancing strategies, which we will see later in this article.

How does it work all together?

Were ready to dive into the details of how the cluster components work together.

Creating a simple server

To better understand the concept, well create a simple server using the cluster module.

import { fork, isPrimary } from 'node:cluster';import { createServer } from 'node:http';import { availableParallelism } from 'node:os';if (isPrimary) {  const numbuerOfCPUs = availableParallelism();  for (let i = 0; i < numbuerOfCPUs; i++) {    fork();  }} else {  createServer().listen(8000);}

Lets break it down. We start by checking whether the current process is the main one (primary). We use the same file to start both the main process and all of the worker processes, so we have to check what kind of process were in before going any further.

Here is how the initial state of the application looks like:

Here is the picture after creating the worker processes:

In case a process is main, we check how many CPUs are available using the availableParallelism function. It is a tiny wrapper around the libuv library function with the same name. Were doing so to utilize the maximum resources available at the moment for cluster creation. It might not be ideal for all use cases, and you have to play with it to see what number best suits your needs.

As a simple example, where the number returned from availableParallelism might not be the best for you is having continuous running processes for external jobs like sync with external services and others. We have to take those processes into account while creating a cluster so both of them, cluster workers and other running processes, can be as efficient as possible.

Lastly, in the else block, we write code that all worker processes will run. In this case, it is creating an HTTP server on the 8000 port.

Notice that while were passing the same exact port for each of the workers that we create, it doesnt mean that every worker will run on the 8000 port. Only the main process establishes a connection.

Requests handling

How the server handles incoming requests heavily depends on the load-balancing strategies in a cluster. The cluster employs two main load-balancing strategies: round-robin and shared handle.

There are three main factors that dictate which type of load balancing will be used:

Server configuration. The cluster can be configured via the NODE_CLUSTER_SCHED_POLICY environment variable, which takes two values: rr and none.
Operating system. On Windows, Node.js uses the shared handle by default because round-robin doesnt perform well due to OS-specific constraints.
Connection type. A shared handle is used for UDP connections. The reason is simple: UDP is connectionless, and round-robin works only with TCP connections, like HTTP.

Round-robin (default) strategy. The round-robin strategy is used by default. This type of load-balancing relies on the round-robin algorithm to distribute incoming requests between worker processes. The main process plays the managers role and runs the algorithm to make it possible. The following picture shows how it works:

Shared connection (handle). The second type of load balancing is a shared handle. The main process establishes a connection with a port and creates the handle, which is simply a reference to the port where the connection is established. After that, the main process shares this handle with all of the worker processes. All of this happens during a cluster initialization process.

The difference with shared handles is that the main process no longer acts as a manager. It doesnt distribute the incoming requests by itself; instead, the underlying OS mechanism fulfills this role:

In both cases, the worker process is responsible for fulfilling the requests and sending back the response.

Cluster vs traditional pipelines

One of the main topics when it comes to scalability is the difference between using the clustering module and traditional pipelines (the process of building and delivering your applications to the final users) like Docker, Kubernetes, Nginx, and others.

Scalability rate

When we talk about scaling via cluster, we always refer to a single server. Were trying to scale our application within one server only by using multiple processes. This approach is prone to failure if the whole server experiences troubles.

Using traditional pipelines, were not limited to a single server. In fact, we can employ as many servers as we need to keep our application up and running.

Management

Node.js developers usually manage clusters themselves. They monitor the server load, balance it properly, and scale whenever needed.

On the other hand, we have separate engineering roles and teams that manage traditional pipelines. This field is huge, and it is beneficial to have specialized people who apply the best tools and practices to ensure the best delivery and responsiveness of applications.

Flexibility

Using cluster only, we're very limited in options for load-balancing strategies, deployment strategies, and reactions to failures. Sure, we can write something custom, like a custom load-balancing strategy, but do you really want to spend time on it?

On the other hand, traditional pipelines already provide everything you might need in terms of load-balancing, deployment, and others. You just have more options, and those options are objectively better, and you don't have to write anything from scratch.

Conclusion

The cluster module was meant to solve a wide range of tasks, such as scaling, making applications more resilient, performing tasks in an isolated environment without blocking the main thread, and load-balancing incoming requests.

At the same time, external tooling can solve most of those problems when it comes to scaling. Were left with a single reason to use the module we want to ensure the isolation of running tasks inside different processes.

But you can do it manually by spawning different processes.

The only case where I see the module is useful is if your team is small and you dont have the time and resources to invest in building a full pipeline with all the tooling for load-balancing, deploying, and scaling.

Overall, if you have extra time or people who can configure the pipeline, I strongly recommend doing so instead of relying on cluster to scale your application.

Understanding Node.js Threads

Pavel Romanov — Sun, 31 Mar 2024 12:47:37 GMT

Node.js is full of surprises, especially when it comes to threads. The first thing you might hear about it is, There is only one thread. After digging a bit deeper, we can discover that Node.js is, at the very least, not single-threaded.

But what are all those threads? Do you know the difference between them? At first, it might be counterintuitive, but different threads are meant for different purposes. Writing and reasoning about the code you encounter becomes hard without properly understanding those differences.

In this article, we will build a mental model you can rely on while working with Node.js threads. The article assumes you already have some basic knowledge of how Node.js operates and its high-level architecture.

To better understand threads, we must understand what resource-related operations we can perform in Node.js.

By resources-related operations, I mean operations that, for the most part, depend on one of the following hardware resources:

Input and output or I/O
CPU
Different kinds of memory

The list is not limited to these three, and it can go on and on. However, were primarily interested in the first two: I/O and CPU.

Every operation in a system uses some resources. For example, a function can read a file's content and then do heavy calculations based on it. First, we use the I/O resource to read the file and then the CPU to perform calculations.

Some operations rely heavily on one kind of resource. These operations are called *-bound, where * means any type of resource. If the operation mostly relies on the CPU, and the better the CPU gets, the faster the operation completes, we can call it CPU-bound. The same can be applied to I/O-bound operations.

I/O-bound operations

I/O-bound operations are the ones that involve input and output of any kind in a system.

For example, imagine we have some Excel file that we want to read and do something with it afterward. To achieve that, we can use file system API provided by Node.js. The following code does the job:

import { readFile } from 'node:fs/promises';const userExcelDoc = await readFile('./todo-list.excel');

Notice that we dont do any heavy lifting here. The platform APIs nicely abstract everything from us. Our job is to call the function and pass a valid file path thats it.

But what if we have 2 or 3 large files that we want to read at the same time? Are they going to block the main thread? Not really. We have a great mechanism for handling I/O operations in Node.js that well look at in a minute.

CPU-bound operations

CPU-bound operations are the ones that involve any logic with a high demand for CPU resources.

To demonstrate CPU-bound operation, we dont need much, just a simple cycle that runs billions of times:

for (let = 0; i < 20000000000; i++) {  // do something}

Here we have it, the operation that requires a lot of CPU resources. If we run this code in our main thread, the whole server will be frozen until it's done.

Unlike I/O-bound operations, we dont have a low-level platform API that does the heavy lifting for us. Nobody will allocate a separate thread for it automatically, so we have to find another way to do so by ourselves.

Thread types in Node.js

Now, were ready look at different types of threads in Node.js, here we go!

Libuv threads

Libuv is the library that provides an event loop, which makes it possible to perform asynchronous operations on a server. At the same time, it enables us to interact with different OS by providing a high-level abstraction.

This library has 3 main parts: thread pool, event loop, and callback queue.

The thread pool has 4 threads allocated by default, and the size can be increased up to 1028 threads. The thread pool is used to run three types of operations:

File system operations
DNS functions (getaddrinfo and getnameinfo)
User-defined code (when using libuv API directly)

One more thread comes from the event loop itself. The library strictly allocates only a single thread to run the event loop.

We can see that only the libuv library alone can create up to 1029 (1028 thread pool + 1 event loop) threads if needed, and were not even counting the one that it might use internally.

When talking about I/O operations in Node.js, the libuv library is the exact component that handles them.

JavaScript engine

The default JS engine shipped with Node.js is V8, but it is a pluggable part that can be switched to JavaScriptCore, SpiderMonkey, or any other.

Despite specific JS engine implementation, it has its own threads. Here is a list of operations that those threads can be allocated for:

Garbage collection
Compilation of JavaScript
System tasks like monitoring and others.

Those are a few examples. The main idea is that the engine can freely create new threads whenever it needs them to function properly and is not limited to a specific number.

Addons and Node-API

The addons and Node-API are basically a bridge that allows you to write custom C++ and C code and plug it into your Node.js application.

You might be using addons or Node-API yourself. If youre not using them directly, some of the dependencies that youre using might do so.

It looks similar to the V8 engine case, where we dont know the exact number of threads for an addon to function properly. Each addon can spawn multiple threads and use multiple resources whenever it needs to.

Worker threads

Last but not least is the worker threads module. Its unique feature is the ability to create new threads directly from JavaScript code. No other components or APIs of Node.js give such power.

Here is an example of how to create a thread using the module:

const { Worker } = require('node:worker_threads');const worker = new Worker('./worker.js');

The Worker class is a synonym for an independent thread in this case. Because of this, it is common to hear that threads created by Worker class are referred to as workers or "worker threads" instead of just threads.

The worker can encapsulate CPU-bound operations in JavaScript without blocking the main thread. As an example, we can refer to the billion iterations loop example that we mentioned before.

Conclusion

While working with Node.js, we'll encounter two primary resource-related tasks: CPU-bound and I/O-bound operations.

Node.js, as a platform, uses many different types of threads to make it work properly all together. Separate components of the platform create as many threads as they need to function properly.

In essence, all of those threads are the same from the perspective of an operating system. However, they have different roles and responsibilities regarding the platform as a whole.

Differences between declarative and imperative programming

Pavel Romanov — Fri, 22 Sep 2023 11:48:35 GMT

There are two main paradigms in programming: imperative and declarative. Every other paradigm, such as reactive, functional, and procedural, is just a subset of one of these two. Often, you hear that declarative code is better than imperative code and that you should give it preference. But why? Is declarative code always better than imperative code? Can it bring more problems than it solves? We'll answer all of these questions next.

The paradigms

First of all, lets look at both paradigms to have a better understanding of how they look and behave. That way, we can evaluate them later and see their good and bad sides.

Imperative programming

Imperative programming is a programming paradigm that focuses on a step-by-step logic execution. The code in this style looks like a set of precise instructions on what needs to be done to get the desired result.

Example of imperative code in JavaScript.

const numbers = [1, 2, 3, 4, 5];function sum(list) {  let result = 0;  for (let i = 0; i <= list.length; i++) {    result += list[i];  }  return result;}

In this code sum function contains step-by-step instructions on how to achieve the exact result:

Create the accumulator variable.
Start a loop and set the stop condition to the length of the list.
Take an element by the index from the list and add it to the accumulator.
Return accumulator.

Each step is carefully guided, and we need to be very precise. If we set the wrong loop condition, it won't work. If we increment the loop variable i incorrectly, it won't work, and so on.

Declarative programming

Declarative programming is a programming paradigm that focuses on the final result rather than a step-by-step logic compared to imperative programming. The code in this paradigm looks like a description of a final result that we want to get instead of a precise path to it.

Example of declarative code in Javascript.

const numbers = [1, 2, 3, 4, 5];function sum(list) {  return list.reduce((sum, currentNumber) => sum + currentNumber, 0);}

Notice the difference from imperative code. We do not manually declare a variable for the accumulator (even though we pass it as a second argument, which eliminates the need for manual declaration), and we do not control the iteration process. Instead, we describe the outcome we are interested in.

Problems of declarative paradigm

With all its glossiness, code written in a declarative way has its drawbacks, which are usually overlooked. This can seriously affect the overall architecture and codebase, making the code less readable and maintainable.

Efficiency

In most cases, declarative code is an abstraction over imperative code. If the abstraction has a poor design, memory leaks, or performance issues, our codebase will suffer from it as well. This is especially true if we are talking about an abstraction that doesn't come with a standard library of a language.

Even if a declarative abstraction doesn't have any design or memory leak problems, it might still lack efficiency compared to the imperative approach. Why? Because there might be no way of writing a more efficient algorithm for this abstraction and, therefore, one may have to sacrifice efficiency for the sake of a good abstraction.

Less control

Usually, if we can abstract some details and make the code more readable, it is a good choice, but not always. If we take the example of the declarative sum function, we can clearly see that there is no way we can somehow affect the iteration process. For example, if we want to stop the iteration of the array when there is a negative number and return the sum of all numbers before this number, we simply cannot do that. Sure, we can create some flag variable outside, which allows us to control the flow of the function, but it is not a declarative way at all.

Overall, the main problem of the declarative code is the abstraction itself.

Is imperative always bad?

One of the main selling points of the declarative paradigm is code readability. However, this doesn't mean that imperative code is unreadable. It all depends on how one writes it. For example, if we have imperative code that filters an array of numbers and sums all the numbers from the array, it can be written in two ways.

const numbers = [1, 2, 3, 4, 5];function performTaksWith(list) {  // code where we filter the numbers   const filteredNumbers = [];  for () {}  // code where we summ numbers  let result = 0;  for () {}}

Every part of the logic is placed in one place, making it difficult to read and modify as needed. However, the readability can be improved if we write it in a slightly different way.

const numbers = [1, 2, 3, 4, 5];function performTaskWith(list) {  const filteredNumbers = filterNumbers(list);  return sum(filteredNumbers);}function filterNumbers(numbers) {  // code where we filter number numbers.}function sum(list) {  // code where we sum the numbers.}

This version is much easier to read and, therefore, maintain. It is still written in an imperative paradigm because we are providing step-by-step instructions on what needs to be done.

Conclusion

There are two main programming paradigms: imperative and declarative. Usually, code written in a declarative paradigm tends to be shorter and more readable. However, it doesn't mean that it is always better. Declarative constructions are abstractions over some imperative code. They might have bad designs or memory problems that will leak into our apps. On the other hand, imperative programs can be written in a clear and readable way, making them easy to maintain and reason about.

Reactive programming patterns

Pavel Romanov — Mon, 11 Sep 2023 13:42:45 GMT

We already know what reactive programming is (if you don't, check out this article). In this article, we will look at some of the most common design patterns used in reactive programming and what common design principle they all have that enables the actual reactivity.

List of the patterns

Here is the list of the patterns that we are going to explore:

Observable/Observer
Event emitter
Signal

These are the most frequently used ones when it comes to reactive programming and shed some light on the whole picture of this world.

Observable and observer

Those two patterns go hand in hand. There is no point in having one without another. The idea behind them is pretty simple. There is a list of clients that want to know about some changes. The Observable can notify them about those changes, but it is required to establish a connection between the Observable and the clients. That's where Observer comes into play. It is basically the contract of the Observable that the client has to fulfill to get notification about the changes.

Here is what a simple observer looks like.

class Observer {  public notify() {}}

That's right, there is only one method and nothing else.

Observable in this regard looks more complex.

class Observable {  private subscribers: Observer[];  public subscribe() {}  public unsubscribe() {}  public notifyObservers() {}}

Whenever some client wants to get updates from a particular Observable, he provides an observer interface. It can be a basic function, object, or an observer class.

observable.subscribe(new Observer(...))observable.subscribe(function() {...})

The important part here is to follow the contractor of the observer that the observable declares. If it only accepts classes, you can't use functions, and vice versa.

Under the hood, the Observable adds the new observer to the list of subscribers.

class Observable {  // other class fields  private subscribers: Observer[];  public subscribe(observer: Observer) {    this.subscribers.push(observer);  }}

Whenever the Observable updates its value, it notifies each Observer from the list of subscribers.

class Observable {  // other class fields  private subscribers: Observer[];  public notifySubscribers(value) {    // iterates over the the array of subscribers    // and calls notify(value) method with the new value  }}

Event emitter

The name itself gives us a hint about this pattern. The primary focus of the pattern is events and their handlers. Here is the structure of a simple event emitter.

class EventEmitter {  private eventsMap: Map<string, EventHandler[]>;  public addEventHandler(event, handler) {};  public removeEventHandler(event, handler) {};  public notify(event) {};}

If someone wants to add a new event/handler to the emitter, he passes an event name alongside the handler that is called whenever this event is fired.

class EventEmitter {  // other class fields  private eventsMap: Map<string, EventHandler[]>;  public addEventHandler(event, handler) {    // 1. check if there is such event in the `eventsMap`    // 2. if there is no such event add it    // 3. add handler for this event  }}

Whenever this particular event is triggered, all handlers are called one by one.

class EventEmitter {  // other class fields  private eventsMap: Map<string, EventHandler[]>;  public notify(event) {    // 1. Check if event registered in the map    // 2. If there is no such event exit    // 3. If there is such event call its handlers  }}

Notice that the notify method can accept not only the event itself but also some payload that event handlers of this particular event require.

Signal

Signal is a bit different from the previous patterns in terms of structure.

class Signal {  private children: Node;  private value: string;  public getValue() {}  public setValue() {}}

Basically, it represents a tree, and all subscribers/children of the signal are stored as child nodes of the root node. Whenever the signal has some value updates, it traverses the tree of children to update them respectively.

Note that changes in a particular signal only affect its children but not the whole tree structure. More precisely, it affects all dependent signals. I won't dive any further in this article in this pattern. Maybe there will be a separate one in the future.

Summary of the patterns

While all of the three patterns look different, they actually have the common principle lying behind all of them: store depending entities and update them whenever some action happens. This general abstraction over those patterns allows us to see the picture behind a particular pattern and switch between them simultaneously.

Conclusion

There are multiple patterns used to build the code in a reactive paradigm, such as Observable/Observer, Event emitter, and Signal. We saw their simple implementations and how they function to achieve reactivity. While they look quite different, all of these patterns have a common ground: they all have to store subscribers and notify/update them whenever a new value comes. This high-level abstraction model allows us to see through all the patterns and quickly navigate through them.

Introduction to Reactive Programming

Pavel Romanov — Sat, 29 Jul 2023 08:11:38 GMT

You've probably heard something about reactive programming before, but do you know what it means? It is one of the topics that bring more confusion than clarity when you try to google it. There are a lot of examples out there, but most of them have a hard time conveying the core idea clearly and simply without any references to a particular framework, library, or platform. We'll explore what it is and how it works. Stay tuned!

Common misconceptions

Before diving into the topic, let's briefly review what reactive programming is not and the difference between it and the topics that it is usually confused with.

Reactive system

The first on the list is the reactive system. The main idea behind the reactive system is architecture. It is a high-level abstraction that enables us to build reliable distributed systems. This concept even has its reactive manifesto where key concepts of such systems are highlighted. If you are interested in this topic Amazon has an extended list of reactive manifesto and it is a good starting point.

Event-driven programming

The two programming paradigms enable us to work with asynchronous logic. Although event-driven programming shares some similarities with reactive programming, they have conceptually different ideas behind them. The main idea of event-driven programming is events and their handlers while reactive programming focuses on asynchronous data streams. Well discover more details and differences between these two in upcoming articles.

Exploring reactive programming

Now, when we know what it isnt we can talk about what it is. The main building block of reactive programming is asynchronous data stream and it is all about their interconnections. To make it easier to understand lets break it down with a help of an example.

Excel sheet

As an example, we will use an Excel sheet and a very simple formula to demonstrate how things work. Imagine that we have a formula for calculating net profit from some sale after taxes. It might look like this.

You can see that formula is pretty simple (for those curious of you the actual formula looks like this: "sell price x (1 - taxes rate)" where 1 is 100% and "taxes rate" is a decimal value). Let's fill it with some values.

We can already spot the asynchronous data stream here! Well, there are at least 3 of them: the "Sell price" cell value, the "Taxes" cell value and the "Net result" cell value. The "Net result" cell value is calculated by taking the values of the other 2 cells and performing calculations based on the formula.

So far, so good. But why are they called async data streams, and what does it even mean? You can see a data stream as a value over time. If we compare it to a simple variable, the variable would represent a value at a specific point in time (whenever we decide to read its value).

The asynchronous part comes from the way we work with these data streams. We are "asking" the value itself to tell us whenever it changes. For those familiar with JavaScript it might sound somewhat similar to promises - and it is! Different patterns describe such behavior which we will dive into later.

Back to our example. The next step is to change the "Sell price" cell value and see what is going to happen.

It is not only changing its value but informing everyone interested, in this case, the "Net result" cell. After receiving a new value from the "Sell price" cell it recalculates its value.

Benefits

There are strong reasons why this approach gaining traction. Here are a few:

Code becomes more granular and less coupled. We can write specific logic related to the "Net result" cell from the example only where it belongs. There is no need to keep logic for all cells in one place.
Switch from imperative to declarative code. In short, the declarative style allows us to write less verbose code that focuses on the result rather than the path to it.

Conclusion

Reactive programming is the paradigm that focuses on asynchronous data streams. Those data streams can be perceived as a value over time compared to simple variables that stand for value at a certain point in time. Such data streams inform interested consumers about new values allowing them to execute specific logic based on it. The paradigm has great benefits in terms of granular and less coupled code as well as imperative style.

Pavel Romanov

The Ultimate Guide to Cron Jobs in Node.js

Terminology

Factors to consider before choosing a task scheduling approach

Application and infrastructure scale

High-frequency tasks

Long-running tasks

Tasks stacking

Tasks scheduling approaches

UNIX-based scheduling with cron

Runtime scheduling

Runtime scheduling with persistence

Cloud-based scheduling solutions

Conclusion

Resource management in Node.js: the good, the bad and the worst

Node.js CLI options

Trade-offs

When to use

PM2 process manager

Trade-offs

When to use

User limits

Trade-offs

When to use

Control groups

Trade-offs

When to use

Containers

Trade-offs

When to use

Conclusion

Node Resource Management

Overview

Managing V8 memory heap size via CLI options

Memory limits with PM2

OS-specific options

User limit

Control group

Isolate and Control Resource Using Containers

The control groups catch

Conclusion

Node.js cluster module. You probably don't need it

What is a cluster?

Cluster structure

Main process

Worker processes

Inter-Process Communication (IPC)

Load-balancing

How does it work all together?

Creating a simple server

Requests handling

Cluster vs traditional pipelines

Scalability rate

Management

Flexibility

Conclusion

Understanding Node.js Threads

Types of resources-related operations

I/O-bound operations

CPU-bound operations

Thread types in Node.js

Libuv threads

JavaScript engine

Addons and Node-API

Worker threads

Conclusion

Differences between declarative and imperative programming

The paradigms

Imperative programming

Declarative programming

Problems of declarative paradigm

Efficiency

Less control

Is imperative always bad?

Conclusion

Reactive programming patterns

List of the patterns

Observable and observer

Event emitter

Signal