Skip to content

Instantly share code, notes, and snippets.

@SKempin
Last active December 28, 2024 06:42
Show Gist options
  • Save SKempin/b7857a6ff6bddb05717cc17a44091202 to your computer and use it in GitHub Desktop.
Save SKempin/b7857a6ff6bddb05717cc17a44091202 to your computer and use it in GitHub Desktop.
Git Subtree basics

Git Subtree Basics

If you hate git submodule, then you may want to give git subtree a try.

Background

When you want to use a subtree, you add the subtree to an existing repository where the subtree is a reference to another repository url and branch/tag. This add command adds all the code and files into the main repository locally; it's not just a reference to a remote repo.

When you stage and commit files for the main repo, it will add all of the remote files in the same operation. The subtree checkout will pull all the files in one pass, so there is no need to try and connect to another repo to get the portion of subtree files, because they were already included in the main repo.

Adding a subtree

Let's say you already have a git repository with at least one commit. You can add another repository into this respository like this:

  1. Specify you want to add a subtree
  2. Specify the prefix local directory into which you want to pull the subtree
  3. Specify the remote repository URL [of the subtree being pulled in]
  4. Specify the remote branch [of the subtree being pulled in]
  5. Specify you want to squash all the remote repository's [the subtree's] logs

git subtree add --prefix {local directory being pulled into} {remote repo URL} {remote branch} --squash

For example:

git subtree add --prefix subtreeDirectory https://github.com/newfivefour/vimrc.git master --squash

This will clone https://github.com/newfivefour/vimrc.git into the directory subtreeDirectory.


Pull in new subtree commits

If you want to pull in any new commits to the subtree from the remote, issue the same command as above, replacing add for pull:

git subtree pull --prefix subtreeDirectory https://github.com/newfivefour/vimrc.git master --squash


Updating / Pushing to the subtree remote repository

If you make a change to anything in subtreeDirectory the commit will be stored in the host repository and its logs. That is the biggest change from submodules.

If you now want to update the subtree remote repository with that commit, you must run the same command, excluding --squash and replacing pull for push.

git subtree push --prefix subtreeDirectory https://github.com/newfivefour/vimrc.git master


Subtree issues

  • It isn't readily apparent that part of the main repo is built from a subtree
  • You can't easily list the subtrees in your project
  • You can't, at least easily, list the remote repositories of the subtrees
  • The logs are slightly confusing when you update the host repository with subtree commits, then push the subtree to its host, and then pull the subtree.

Other than that, they're looking nicer than submodules.

Amended from original articles:

  1. https://newfivefour.com/git-subtree-basics.html
  2. https://docs.acquia.com/articles/using-git-subtrees-instead-git-submodules
@sankartn
Copy link

@DougLeonard Thanks a lot for guiding with a very clear explanation. Thumbs up!

@tmillr
Copy link

tmillr commented Jul 31, 2022

Thanks for this. I'm just wondering how this is any better than submodules when it has several issues and isn't even apart of core git? It's also implemented with hacky solutions and external changes to the subrepo are not tracked by git fetch nor git status? It seems to me like the only benefit is one less option that you have to pass when doing git clone and seems to offer little convenience over simply doing what subtree does, but manually (something like git pull --commit -s subtree remote refspec). I've also heard that rebases can become more work/more confusing with subtrees involved. At least with submodules git status will notify you if your submodule is behind yeah? Am I missing something?

@KuangJie7
Copy link

Is there a way to extract folders or files in one git repo(mono-repo) as subtree/subtrees of another git repo?

For example, mono-repo A:

mono-repo A
  - .git
  - packages
    - module1
    - module2
    - module3

I want to define A/packages/module1 and A/packages/module3 as subtrees of repo B so that I can track updates from repo A.
In this case, repo B:

repo B
  - .git
  - sub-modules -> attached with repo A
    - module1
    - module3

Then I can take use of source code from repo A in repo B. Any idea to achieve this?

@chris-hatton
Copy link

chris-hatton commented Dec 4, 2023

I've been using submodules for years and failing to understand the hate; yes they add a bit of maintenance overhead, but it's easy to see how they work and you get the benefit of a single source-of-truth for each component.
Now that I need to widen out the usage to a larger team I thought I would give subtrees a spin; but the experience is awful I can't imagine why these are billed as friendlier than submodules. The pull/push just doesn't work as advertised; I had git telling me the tip was behind HEAD and that's why I couldn't push, even though it wasn't, I ended up having to delete and recreate the subtree several times, and on one occasion the entire history of the consuming repo got pushed into the subtree repo 🤦 I'm not an inexperienced Git user, and no doubt I'm 'holding it wrong' somehow - but friendlier than submodules it is not...
Backing away hard and thinking how to most gently intro submodules to the team instead.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment