I was trying to reproduce the few tutorial steps from:
https://github.blog/2020-01-17-bring-your-monorepo-down-to-size-with-sparse-checkout
Which was made fo
I believe I found the reason for this. Commit f56f31af0301 to Git changed the implementation of sparse-checkout so that, when you have an uninitialized working tree (as you would right after running git clone --no-checkout), running git sparse-checkout init will not check out any files into your working tree. In previous versions, the command would actually check out files, which could have unexpected effects given that you wouldn't have an active branch at that point.
The relevant commit, f56f31af0301 was included in Git 2.27, but not in 2.25. That accounts for why the behavior you see is not the behavior shown on the web page you're trying to follow. Basically, the behavior on the web page was a bug that nobody realized was a bug at the time, but with Git 2.27, it has been fixed.
This is explained very well, I think, in the message for commit b5bfc08a972d:
So...that brings us to the special case: a git clone performed with
--no-checkout. As per the meaning of the flag,--no-checkoutdoes not check out any branch, with the implication that you aren't on one and need to switch to one after the clone. Implementationally,HEADis still set (so in some sense you are partially on a branch), but
- the index is "unborn" (non-existent)
- there are no files in the working tree (other than
.git/)- the next time
git switch(orgit checkout) is run it will run unpack_trees withinitial_checkoutflag set to true.It is not until you run, e.g.
git switch <somebranch>that the index will be written and files in the working tree populated.With this special
--no-checkoutcase, the traditionalread-tree -mu HEADbehavior would have done the equivalent of acting likecheckout-- switch to the default branch (HEAD), write out an index that matchesHEAD, and update the working tree to match. This special case slipped through the avoid-making-changes checks in the originalsparse-checkoutcommand and thus continued there.After
update_sparsity()was introduced and used (see commit f56f31a ("sparse-checkout: use new update_sparsity() function", 2020-03-27)), the behavior for the--no-checkoutcase changed: Due to git's auto-vivification of an empty in-memory index (seedo_read_index()and note thatmust_existis false), and due tosparse-checkout'supdate_working_directory()code to always write out the index after it was done, we got a new bug. That made it so thatsparse-checkoutwould switch the repository from a clone with an "unborn" index (i.e. still needing aninitial_checkout), to one that had a recorded index with no entries. Thus, instead of all the files appearing deleted ingit statusbeing known to git as a special artifact of not yet being on a branch, our recording of an empty index made it suddenly look to git as though it was definitely on a branch with ALL files staged for deletion! A subsequent checkout or switch then had to contend with the fact that it wasn't on aninitial_checkoutbut had a bunch of staged deletions.
Here is a solution that will populate only files in the root folder:
$ git clone --filter=blob:none --sparse https://github.com/derrickstolee/sparse-checkout-example
Then subsequent sparse-checkout calls work like a charm.
Still no idea why the tutorial is broken.