Managing multiples projects with Lerna and Yarn Workspaces

7 settembre 2019 · Lettura di 10 minuti

Contributor

Verdaccio is a project with a big ecosystem, composed by several projects, each one with its own configurations and ways to manage. This fact transforms a simple maintenance operation, like updating a common dependency, into a real hard work.

We saw a problem, these configurations entropy made harder to work with all the projects. So, we needed to simplify and unify them to make it easier. We need a monorepo.

A monorepo is a project configuration to manage a collection of dependencies in a simple and unified way. There are many examples out there that Javascript developer use nowadays, like Babel, Create React App or Material UI.

Now, we are proud to announce our monorepo, our big ecosystem joined in only one repository. This article is the first part of a series of articles where we will try to explain our motivation about to set up by your own, improve the management and workflows (CI, code quality, etc).

Background and tools

A year ago, Juan and I met in Madrid, Spain, and were talking about the roadmap for Verdaccio 4, the scope of the projects and more. We had some concerns about the Verdaccio ecosystem we want to build, such amount of repositories requires hard work for maintenance by each one with their own dependencies, scripts, configurations, etc.

We decided to unify all configurations, because handle several repositories would not be realistic and would have all things replicated in all repositories. In order to achieve our goal, and we found Lerna and Yarn Workspaces.

As a first step, we created the basic architecture of the monorepo and moved the first dependencies in, an ESLint config and a Babel preset.

After Verdaccio 4 release, Typescript migration and more, we saw that many projects shares the same architecture, so it could be a good moment to achieve our goals.

There are more tools for this purpose, but we will focus on Lerna and Yarn Workspaces.

Lerna

Lerna is a tool to manage several Javascript projects with multiple packages (called monorepos), optimizing the workflows around them.

You can install it globally with npm install --global lerna, yarn global add lerna or your favourite package manager, to run commands with lerna <command>. Also, if you don't want to install it, you can use package runners such as npx.

Yarn Workspaces

Yarn Workspaces is a way to setup package architecture where all packages dependencies are installed together with a single yarn install.

This involves two things that you could not see at the first moment.

All the packages in the workspace uses a common lockfile as a single source of truth.
If a package has a dependency on other package in the workspace, they are linked without affecting your global environment.

Since Yarn 1.0, this feature is enabled by default, you only need a root package.json to setup them.

Creating the monorepo

The initial setup is really simple, you only need to create a new repository and run lerna init to initialize the monorepo. This will generate files like package.json or lerna.json (packages structure is shown as example, but not generated).

lerna.json
package.json
packages/
├── pkg1
│   └── package.json
├── pkg2
│   └── package.json
└── pkg3
    └── package.json

Let's see the main configuration for lerna.json and root package.json.

lerna.json

After initial setup, this file will looks like:

{
  "packages": ["packages/*"],
  "version": "0.0.0"
}

packages: this array defines the location for all packages that conforms the monorepo. They could be explicit (packages/pkg1) or, if a folder has several packages, you can use * wildcard. In our case, we use core/*, plugins/* and tools/*.
version: the version of the packages. It could be a semver value (called fixed mode) or independent if you want to let packages define their own version. Take care that independent mode will create one tag for each package that will be published with its own version. We prefer fixed mode to keep all packages with the same version and reduce headaches to users.

Other interesting settings are:

npmClient: you can define in you want to use Yarn, Npm or your favourite client.
useWorkspaces: if you want to enable Yarn Workspaces, you will have to tell it to Lerna, setting this option to true.
stream: if you want to have the output of a child process inmediately in the console, you have to enable this option. Also, this will prefix each line with the package name that generate them.

package.json

After initial setup, you need some configuration to allow Lerna work with Yarn Workspaces:

workspaces: this is the array where we define the packages that are part of the workspace. In a simple way, this is the same you have defined in lerna.json under the packages key.
private: as the root package should be a simple container, you should keep it with true value to not publish it.

Later, you will learn how to define more settings in the root package.json.

Creating and importing packages

You have a monorepo, but it's not useful at this moment. Let's create and import some packages.

lerna create

To create new packages, you can use lerna create <package_name> like you would do with npm init or yarn init. The wizard will ask you for some fields like package description, author or license. Other way to give that information is using command options (--description, --author, etc).

Then, you have the package ready for work with it, add stuff like Babel or ESLint, dependencies and scripts to package.json...

It's important to say that if your package is a scoped package, you have to add the next codeblock in package.json, but do not add it if your package is not scoped, because lerna publish will fail in this case:

"publishConfig": {
  "access": "public"
}

lerna import

If you have a project you want to import, you don't have to create a new one in the monorepo, you can import it using lerna import <path_to_project>.

This command will read all the Git history from the project specified and apply commit-by-commit into your monorepo to avoid losing the original history. If there were commits with conflicts, the import process will fail, but CLI propose you to use the --flatten option to bypass it. Also, if you want to keep original authors and committers, you can pass --preserve-commit option.

After the import completes, you can remove unnecessary stuff like CI settings, old scripts, hoisted devDependencies...

Managing dependencies and devDependencies

Each package will contain their own dependencies and devDependencies like if the package is not in a monorepo, but there are some interesting things you can do with devDependencies, hoist them in the root package.json.

Let's see with an example, pkg1 and pkg2 defines ESLint as devDependency, so you have defined that in two packages, duplicating the definition and management. You can extract it from both and set it as devDependency in the root package.json. This way, all the projects that need it will have it available. Now, we are going to add ESLint to pkg3, you only have to add its own configuration, because package has been hoisted right now.

But the same doesn't work with dependencies, because they are needed when the package is published.

A good practice is to hoist every devDependency so they will be available for every package, except in two cases:

A package needs a specific version of the package. In this case, you can have a root definition for all the packages and the specific version for the package that requires it. This will create a node_modules for the specific package, but not another yarn.lock.
Those devDependencies that are part of the workspace must not be defined in the root package.json, because if you do that, you will create a cycle. An example is if pkg1 and pkg2 define pkg3 as devDependency, it could not be defined in the root because pkg3 will depend with itself.

Running scripts

Like dependencies, each package will have their own scripts, so you should define them in their specific package.json.

But what happens when you want to run scripts for many packages at the same time? You don't need to extract them to the root package.json because they will contain specific arguments/options for each project. You can invoke scripts or commands from the root package using two Lerna commands, lerna run and lerna exec.

The first, lerna run <script>, will perform the script provided looking for what packages have it defined in their packages.json. This command is useful when you want to build package or run tests, because not all packages would have them defined.

lerna-run-example

The second, lerna exec <command>, will execute the command (not script) in all the packages. This is useful if you want to run tools like ESLint in all the packages and you have it installed globally. In this case, the command invoked must be in your system PATH (ls, cat, npm binaries, etc).

lerna-exec-example

Both commands shares options like --scope=<packages> and --ignore=<packages>, where the first will run only in packages specified and the last will ignore them.

Versioning the monorepo

As we mentioned in lerna.json section there are two versioning ways for packages in monorepos: fixed mode and independent mode.

We will focus in fixed mode because:

We want to use the same version for all the packages in the ecosystem.
Independent mode creates one git tag for each package and version in each release. With the example before, in a release we will create three tags, pkg1@0.1.0, pkg2@2.1.0, pkg3@0.5.2. In larger codebases you will create more than 10 tags at the same time.

In fixed mode, the version set in the lerna.json is a reference for all the packages but, if a package has no changes between releases, that package will not be published except when the version bump is major (from X.Y.Z to X+1.0.0).

Because Lerna is going to manage the versioning, you should change your mind to use lerna version and lerna publish commands.

lerna version will update version for the packages that has changes from the release (you can review with lerna changed). This will launch a wizard except if you pass the version or a semver keyword (major, minor, patch, etc) as the first argument or use Conventional Commits (see below). This command updates version in all affected packages, commit changes, create tags and pushes to the remote automatically.
Also, you can use --conventional-commits option if your commits follows Conventional Commits spec, automating the changelog generation. Additionally, for Github and Gitlab users, you can use --create-release <github | gitlab> to create release with changes. Keep in mind that you have to provide an auth token (GH_TOKEN or GL_TOKEN).
lerna publish will act as lerna version and publish packages if you don't provide from-git or from-package arguments, or only publish if provide one. You would like to use from-git to version first, using Git as a single source of truth.

Conclusion and thanks

Lerna and Yarn Workspaces are a great combination for creating monorepos. In this first part, you have learned how to setup a monorepo, add packages, improve dependency management, scripts and versioning. In the next chapters, you will see more configuration and tooling (and their settings for monorepos), and how to automate some things using Continuous Integration tools.

We have to thank the teams and community behind both projects, specially Henry Zhu and Daniel Stockman for Lerna, and Maël Nison for Yarn. Also, but not less important, we want to thank all the people that makes Verdaccio possible, contributing, donating, documenting, and more.

If you 😍 Verdaccio as we do, helps us to grow more donating to the project via OpenCollective.

Thanks for support Verdaccio ! 👏👏👏👏.

Background and tools​

Lerna​

Yarn Workspaces​

Creating the monorepo​

lerna.json​

package.json​

Creating and importing packages​

lerna create​

lerna import​

Managing dependencies and devDependencies​

Running scripts​

Versioning the monorepo​

Conclusion and thanks​