So here\'s an interesting situation when using git and python, and I\'m sure it happens for other situations as well.
Let\'s say I make a git repo with a folder /foo
My solution is more compatible with git: Git removes only enpty directories where any file has been deleted by checkout. It doesn't search the complete workcopy tree. That is useful for big repositories or repositories with a very big ignored tree, like virtual environments by tox package for testing many different with Python versions etc.
My first implementation explains the principle very clearly: Only pyc files related to files under version control are cleaned. It's for reasons of efficiency and unwanted side effects.
#!/bin/bash
# A hook that removes orphan "*.pyc" files for "*.py" beeing deleted.
# It doesn not clean anything e.g. for .py files deleted manually.
oldrev="$1"
newrev="$2"
# ignored param: branchcheckout="$3"
for x in $(git diff --name-only --diff-filter=DR $oldrev..$newrev | grep "\.py$")
do
if test -a ${x}c && ! test -a ${x}; then
rm ${x}c
fi
done
The post-checkout hook receive the three useful parameters that allow to get known exactly which files have been deleted by the git checkout, without searching the complete tree.
After reading the question I rewrote my hook code to Python and extended it according to your requirements about empty directories.
My complete short source code (Python) is in
https://gist.github.com/hynekcer/476a593a3fc584278b87#file-post-checkout-py
The doc string:
"""
A hook to git that removes orphan files "*.pyc" and "*.pyo" for "*.py"
beeing deleted or renamed by git checkout. It also removes their empty parent
directories.
Nothing is cleaned for .py files deleted manually or by "git rm" etc.
Place it to "my_local_repository/.git/hooks/post-checkout" and make it executable
"""
The problem with *.pyc files is not important for Python 3, because *.pyc files in __pycache__ can not be executed without the related *.py* file in its parent directory.
No change directory is necessary, because hooks are started everytimes in the root of the repository.
__pycache__ are cleaned completely, because they are never important (don't take part in any binary distribution) and also for high efficiency because deleting by parts __pycache__/some_name.*.pyc could be slow.