- Add a filter to git config by running the following command in bash inside the repo:
git config filter.strip-notebook-output.clean 'jupyter nbconvert --ClearOutputPreprocessor.enabled=True --to=notebook --stdin --stdout --log-level=ERROR'
-
Create a
.gitattributes
file inside the directory with the notebooks -
Add the following to that file:
*.ipynb filter=strip-notebook-output
After that, commit to git as usual. The notebook output will be stripped out in git commits, but it will remain unchanged locally.
This is useful if you sometimes want to add specific notebooks with their cell outputs intact to git, while still having the default behavior of clearing out cells.
- When adding to git a notebook whose cell outputs you want to keep, instead of the usual
git add <path to your notebook>
command, use this:git -c filter.strip-notebook-output.clean= add <path to your notebook>
Wow, this worked a treat!
I tend to run Jupyter in a Docker container. So, if you want to run the Notebook output cleaning preprocessor using a Jupyter container instead, all you need to do is tweak the git config like this:
I've set it as a global config parameter, so that it applies to all repos.