项目作者: hchauvin

项目描述 :
Monorepo administration
高级语言: Python
项目地址: git://github.com/hchauvin/monorepo-tools.git
创建时间: 2019-09-13T13:25:05Z
项目社区:https://github.com/hchauvin/monorepo-tools

开源协议:MIT License

下载


monorepo-tools: Monorepo administration

CircleCI

monorepo-tools aims at offering a collection of tools to administrate a
monorepo. Monorepos have
many advantages for closed-source systems
as compared to separate repos, and are a sound evolution or starting point for projects
in need of large-scale code refactoring, collaboration, and ease of code
reuse. A monorepo correctly set up can diminish friction both for
fledging startups, and for companies maintaining, evolving or migrating
legacy projects: they can be introduced at all stages of a product’s lifecycle.

The tools can be either consumed using a Command-Line Interface (CLI),
or programmatically. They are written in Python, packaged in a runnable ZIP
file, and compatible with Python 2.7 and Python 3.7. Prepackaged runnable
ZIP files are available
on the release page.
Tests are continuously run on Windows, Linux, and Mac OSX.

Right now, monorepo-tools only offers one subcommand, import, but other
commands will follow. The scope will be vendoring, open sourcing part of
a monorepo with an OSS-monorepo sync, and related topics. We plan on
open-sourcing separately some work on continuous integration and
deployment pipelines for monorepos, as CI/CD is out-of-scope for this project.
Currently only Git is supported as a Version Control System and no plan
is made to extend support to other VCS such as Mercurial.

Installation

For CLI use, please go to the release page
and download the appropriate ZIP bundle for your platform. For Windows,
please make sure that you have Python 2 or Python 3 installed. On Windows, we
recommend installing Python using Chocolatey (respectively,
with choco install python2 and choco install python). Usage can be queried with:

  1. python monorepo_tools.zip --help

For programmatic access, use bazel and import
this project in your workspace.

monorepo_tools-import

  1. usage: monorepo_tools import [-h] --individual_repos INDIVIDUAL_REPOS
  2. --dest_branch DEST_BRANCH --monorepo_path
  3. MONOREPO_PATH
  4. Import individual repos into a monorepo
  5. optional arguments:
  6. -h, --help show this help message and exit
  7. --individual_repos INDIVIDUAL_REPOS
  8. Path to python module that exports one function,
  9. individual_repos, that takes the destination branch
  10. name as an argument
  11. --dest_branch DEST_BRANCH
  12. The destination branch to import into
  13. --monorepo_path MONOREPO_PATH
  14. The local path to the monorepo (it is created if it
  15. does not exist)

Note that incremental update of an existing monorepo is supported, just
set --monorepo_path to a clone.

See ./import_into/individual_repos.py for an example for --individual_repos.

The strategy for import is “merge unrelated history then move”: for each
individual repo, we create in the monorepo a branch that is the result
of pulling the unrelated history from the requested branch in the individual
repo. This history is directly taken from the individual repo, without
any transformation, meaning that the commit SHA1 are the same, which helps
for traceability and auditing. Additionally, because there is no
transformation, the import is faster than other strategies (see below).
The files in this branch are moved to the appropriate subdirectory of the
monorepo (and these moves are committed), then this branch is merged into
the destination monorepo branch. This way, import introduces two
additional commits per individual repo and destination branch: a move,
and a merge. Additionally, import provides the first commit in the
monorepo branch (with the message “Initial monorepo commit”), onto which
the individual repos are grafted. With this strategy, commit history is best
viewed in date order, not ancestor order.

Alternatives

While researching import, other strategies and tools were looked at. We
specifically wanted a tool that would allow the complete import of histories,
and autonomy of the monorepo from the separate repos. Therefore, Git
submodules
and git-subrepo were taken
out of the picture, as they work by maintaining references to the separate repos.

Next, Copybara was
considered. However, its iterative filtering strategy is a huge
performance issue for large separate repos, and it was quickly abandoned,
as a full migration of the repos we were considering would take Copybara
many days to perform.

git-stitch-repo
was also considered. It nicely uses git-fast-import and git-fast-export
to combine linear histories into one linear history, which could be cleaner
than our “merge unrelated history then move” (as it comes with merge “nonlinearities”).
However, we found out that git-stitch-repo gave wrong results for nonlinear histories, as
the commits were sometimes not correctly stitched. The project, written in Perl, had not
been maintained for years. We also decided that Git history
rewriting was too difficult to get right for the mixed benefits
of enforcing a linear Git history. That’s why we went back to the
very simple strategy than ended up being import and didn’t try to patch
git-stitch-repo instead.

License

monorepo-tools is licensed under The MIT License.

FOSSA Status