Advanced Git Tutorial — Print View

1

Branches, HEAD, and Detached HEAD

🎯 You will learn to

Explain why branch creation is O(1) — no files get copied.
Tell attached from detached HEAD by reading .git/HEAD.
Anticipate where orphaned commits come from, setting up the reflog rescue.

📚 The 15-step arc

Phase	Steps	What you build
Foundations	1–3	Mental model: branches are pointers; commits are immutable hashed snapshots
Daily tools	4–7	Stash, cherry-pick, blame, bisect — used weekly on real teams
History rewriting	8–11	Rebase, interactive rebase, squash-merge, revert — when to use each
Submodules	12–14	Nested repos, the gitlink, six-step publish ceremony
Capstone	15	Compose 5+ tools under pressure with no hand-holding

Steps 1–3 are foundational — every later step refers back. After Step 7, take a break before Step 8 (spacing helps consolidation).

Why this tutorial exists

You already know init, add, commit, branch, merge, remotes. This tutorial lifts the hood — object database, refs, HEAD — so every “scary” command becomes a safe, predictable pointer move.

Two antipatterns to retire on sight:

Antipattern	What it looks like
Blind-testing	Typing random `add/commit/push/pull` permutations until errors stop
Burning down the repo	Deleting the folder, copying files out, re-cloning, force-pushing

Both come from an inaccurate mental model. Each step fixes one piece.

Prerequisite self-check

Answer from memory. Any shaky? Revisit the basic tutorial.

New file is red in git status. State name? Command to green?
After a commit + one more edit, what does bare git diff compare?
main and feature have diverged. Can merge feature fast-forward?
Teammate pushed a buggy commit to shared main. reset --hard + force-push, or revert?
Staged a .env with secrets. Does adding to .gitignore now help?

Expected answers

Untracked → git add stages it.
Working tree vs. index. Index matches HEAD (nothing staged), so you see unstaged edits.
No — diverged branches need a merge commit with two parents.
git revert. Additive; doesn’t break teammates’ clones.
No. .gitignore only blocks future tracking. Use git rm --cached + rotate the secret.

Task 1: Prove a branch is a 41-byte pointer

Predict first: what’s in .git/refs/heads/main? A commit list? A snapshot?

cd /tutorial/myproject
cat .git/refs/heads/main
cat .git/refs/heads/feature-divide
cat .git/HEAD

Each branch file is one line — a commit SHA. HEAD is ref: refs/heads/main — a pointer to a pointer.

@startuml
branch main:
  A "Initial commit"
  B "Add add function"
head main
@enduml

That indirection lets commit advance the branch pointer while HEAD auto-follows — no HEAD rewrite needed.
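The pointer-to-a-pointer idea can be sketched as a toy model in Python. This is a minimal illustration, not how Git is implemented: the dictionary stands in for the files under .git/refs/heads/, the SHAs are made-up placeholders, and the helper names `resolve_head` and `commit` are hypothetical.

```python
# Toy model of .git/HEAD and .git/refs/heads/*: plain strings, no real repo.
# The SHAs below are made-up placeholders, not real commit IDs.
refs = {"refs/heads/main": "a" * 40 + "\n"}   # branch file: 40 hex chars + newline
head = "ref: refs/heads/main\n"               # attached HEAD: a pointer to a pointer


def resolve_head(head_text, refs):
    """Follow an attached HEAD through its branch file to a commit SHA."""
    branch = head_text.strip()[len("ref: "):]
    return refs[branch].strip()


def commit(new_sha, head_text, refs):
    """Model a commit while attached: only the branch file is rewritten."""
    branch = head_text.strip()[len("ref: "):]
    refs[branch] = new_sha + "\n"


branch_file_size = len(refs["refs/heads/main"].encode())
before = resolve_head(head, refs)
commit("b" * 40, head, refs)
after = resolve_head(head, refs)

print(branch_file_size)   # 41
print(before != after)    # True
print(head.strip())       # ref: refs/heads/main
```

In this model the `head` string never changes, yet resolving it yields the new commit after `commit()` runs. That is the sense in which HEAD "auto-follows".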

Task 2: Detach HEAD and feel the difference

git switch --detach HEAD~1
cat .git/HEAD        # now a raw 40-char SHA, not a ref

Detached HEAD = HEAD pinned to a commit, not a branch. Watch the graph: HEAD floats on the commit node itself.
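As a rough sketch, the attached/detached distinction comes down to what the text of .git/HEAD looks like. The function name `head_state` and the sample SHA are hypothetical; the two input strings mimic what `cat .git/HEAD` prints in each state.

```python
import re


def head_state(head_text):
    """Classify the contents of a .git/HEAD file as attached or detached."""
    text = head_text.strip()
    if text.startswith("ref: "):
        # Symbolic ref: HEAD names a branch file.
        return ("attached", text[len("ref: "):])
    if re.fullmatch(r"[0-9a-f]{40}", text):
        # Raw 40-char SHA: HEAD is pinned to a commit.
        return ("detached", text)
    raise ValueError("unrecognized HEAD contents: %r" % text)


attached = head_state("ref: refs/heads/main\n")
detached = head_state("0123456789abcdef0123456789abcdef01234567\n")  # made-up SHA

print(attached)      # ('attached', 'refs/heads/main')
print(detached[0])   # detached
```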

Museum-archive analogy. You can read any document, but notes left without a label have nowhere to go when you leave. git switch -c <name> is that label.

Any commit you make here is anchored to no branch. Leaving with git switch orphans it. The next step shows how to rescue orphans.

Cleanup

git switch main

✍️ Before moving on (30-second self-test)

Without scrolling up, answer:

How many bytes is a branch?
What’s the physical difference between attached and detached HEAD?

Got both? You’ve internalized the schema this whole tutorial rests on.

Branch Internals & Detached HEAD — Knowledge Check

Min. score: 80%

1. What is a Git branch, physically speaking?

A full copy of all project files stored separately
This answer treats a branch like a folder copy — branch creation never duplicates files; only one ref file is written.
A 41-byte text file in .git/refs/heads/ containing one commit SHA
A separate network connection to a remote server
This answer confuses a branch with a remote; branches are local refs in .git/refs/heads/, not network connections.
A compressed archive of your working directory
This answer treats a branch like a backup archive rather than a single-line pointer file.

A branch is just a tiny file holding a 40-character SHA. That is why git switch -c new-branch is instantaneous — Git does not copy files, it writes one line of text.

2. Which statements about Detached HEAD state are true? (Select all that apply)

HEAD points directly to a commit hash instead of a branch name
Any commit you make in detached HEAD is automatically saved to the nearest branch
This answer assumes Git auto-anchors detached-HEAD commits to a branch — it doesn’t, which is the whole reason orphans happen.
Commits made here are orphaned unless you anchor them with a branch before leaving
.git/HEAD contains a 40-char SHA rather than a line like ref: refs/heads/main

Detached HEAD stores a raw SHA in .git/HEAD. No branch tracks commits made here. Before leaving, create a branch (git switch -c rescue) — otherwise the commits become orphaned and the next step’s reflog is your recovery path.

3. Why can HEAD point to a branch name rather than a commit SHA?

Branch names are shorter and easier to type
This answer mistakes ergonomics for mechanism — the indirection exists so commits can update the branch ref without rewriting HEAD.
Symbolic refs let git commit update the branch pointer and HEAD automatically stays current through indirection
Git requires it for compatibility with GitHub
This answer assumes a remote constraint drives a local design decision; symbolic refs predate any hosting service.
It prevents you from committing on the wrong branch
This answer treats indirection as a safety feature; it’s actually a mechanism for cheap commits, not a guard rail.

The pointer chain is HEAD → refs/heads/branch → commit. A commit only needs to rewrite the branch file — HEAD dereferences through it. This indirection is the engineering reason branches are cheap.

4. You want to inspect a commit from last week without risking any accidental edits. Which is the safest approach?

git switch --detach <old-sha> — look around; return to main with git switch main
git reset --hard <old-sha> — rewinds your current branch to that point
This answer confuses inspection with rewinding — reset --hard mutates the current branch’s pointer, the opposite of read-only inspection.
git checkout <old-sha> . — overwrite files in working directory
This answer overwrites working-tree files without moving HEAD; not destructive to history but stomps your editor state.
rm -rf .git && git clone <old-sha> — start fresh
This answer is the “burning down the repo” antipattern — far more destructive than the read-only detached HEAD it replaces.

git switch --detach puts you in detached HEAD at any commit without moving a branch pointer. You can look around freely; git switch main returns you with nothing changed. git reset --hard would rewrite your current branch, which is destructive. git checkout <sha> . overwrites files without moving HEAD.

5. Put in order the steps to safely inspect an old commit and return to normal operation. (arrange in order)

Correct order:

git log --oneline main # find the old commit SHA
git switch --detach <old-sha> # enter detached HEAD at it
cat calculator.py # look around, read freely
git switch main # return; no changes made

Distractors (not used):

git reset --hard <old-sha> # DANGER: rewinds main
git checkout . # DANGER: overwrites working dir
git branch -f main <old-sha> # DANGER: moves branch pointer

Detached HEAD is the safe inspection mode — HEAD anchored to the commit, no branch pointer moved. The distractors all modify state (main’s pointer, working directory), which is the opposite of inspection.

2

Rescuing Lost Work with git reflog

🎯 You will learn to

Recover commits lost to bad rebases, hard resets, and detached-HEAD orphans.
Tell what git log --all can see from what git reflog can see.
Know reflog’s limits — it’s local, and disappears with the clone.

🤔 Predict first

You make an experimental commit in detached HEAD, then git switch main away without creating a branch. Can git log --all find that commit? Can anything?

`log --all` vs `reflog` — the load-bearing distinction

	`git log --all`	`git reflog`
Walks	Commits reachable from refs	Every position HEAD occupied
Sees orphans?	No (unreachable = invisible)	Yes (reachability irrelevant)
Shared across clones?	Yes	No — local only

Task 1: Deliberately lose work

cd /tutorial/myproject
git switch --detach HEAD
echo "# experimental note" >> calculator.py
git add calculator.py && git commit -m "Experimental: add note in detached HEAD"
git switch main
git log --all --oneline      # the Experimental commit is GONE from this view

It’s orphaned: no ref reaches it, so log --all never visits it.

Task 2: Find the orphan

git reflog

Each line: <sha> HEAD@{n}: <action>: <description>.

Expression	Meaning
`HEAD@{0}`	where HEAD is now
`HEAD@{1}`	where HEAD was one move ago
`HEAD@{n}`	n moves ago

The detached-HEAD commit is at HEAD@{1}.

Task 3: Anchor it with a branch

git branch rescued-work HEAD@{1}
git log rescued-work --oneline

The universal recipe: git reflog → note the SHA or HEAD@{n} → git branch <name> <sha> anchors it as reachable. Works for dropped commits after interactive rebase, botched resets, failed rebases — any “lost” commit that’s still in .git/objects.
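The difference between the two views, and what the rescue changes, can be sketched with a toy graph in Python. This is an illustration under simplifying assumptions: commits are single letters rather than SHAs, `reachable` is a hypothetical stand-in for the walk that log --all performs, and the `reflog` list mirrors the three HEAD moves from Task 1, newest first.

```python
# Toy commit graph: commit -> list of parents (placeholders, not real SHAs).
parents = {"A": [], "B": ["A"], "X": ["B"]}   # X was committed in detached HEAD on top of B
refs = {"refs/heads/main": "B"}               # no ref points at X
reflog = ["B", "X", "B"]                      # HEAD@{0}, HEAD@{1}, HEAD@{2}


def reachable(refs, parents):
    """Commits a ref-based walk can see: start at each ref, follow parents."""
    seen, stack = set(), list(refs.values())
    while stack:
        commit = stack.pop()
        if commit not in seen:
            seen.add(commit)
            stack.extend(parents[commit])
    return seen


visible_before = reachable(refs, parents)
orphans = [c for c in reflog if c not in visible_before]

# Models: git branch rescued-work HEAD@{1}
refs["refs/heads/rescued-work"] = reflog[1]
visible_after = reachable(refs, parents)

print(sorted(visible_before))   # ['A', 'B']
print(orphans)                  # ['X']
print(sorted(visible_after))    # ['A', 'B', 'X']
```

Nothing in `parents` changes during the rescue. Only one new entry in `refs` is added, which is enough to make X reachable.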

git reflog — Knowledge Check

Min. score: 80%

1. You made three commits in detached HEAD state, then ran git switch main without creating a branch. A teammate asks if the commits are lost. What do you tell them?

Yes, they are permanently gone — detached HEAD commits are deleted by git switch
This answer assumes git switch deletes data — it only moves HEAD; the commit objects stay in .git/objects/ until git gc decides to prune them.
They are currently orphaned but still in the object database; git reflog can find their SHA, and git branch rescue <sha> makes them reachable again
Git automatically migrated them to the main branch when we switched
This answer assumes Git auto-attaches detached commits to a branch on switch — it doesn’t, which is exactly why we need the rescue.
They are stored in a hidden recovery folder we can copy from
This answer invents a recovery folder that doesn’t exist; orphans live in the same .git/objects/ as every other commit.

Orphaned commits remain in .git/objects/ until git gc prunes them. git reflog shows every position HEAD has been at, including the orphaned one. git branch rescue <sha> rescues the work.

2. In one sentence, why can git reflog show commits that git log --all cannot?

git log --all walks only reachable commits from refs (branches, tags); an orphan is reachable from no ref. git reflog logs every HEAD position regardless of reachability, so it remembers SHAs even after all references to them are gone.
git reflog is a newer, more powerful version of git log
This answer treats reflog as “log v2” — they’re different tools answering different questions (graph traversal vs. HEAD-position diary).
git log --all is broken by default
This answer assumes a default tool is broken; both work correctly, they differ only in what they walk.
git reflog reads from the remote server; git log only from local
This answer confuses reflog with remote history; reflog is purely local, while log --all walks any ref including remote-tracking ones.

This is the load-bearing distinction. git log --all is a graph traversal starting at refs; an unreachable commit is invisible to it. git reflog is a literal diary of HEAD positions — reachability is irrelevant. Internalize this or later destructive commands will feel unpredictable.

3. What does HEAD@{2} mean?

The second commit on the current branch
This answer confuses HEAD@{2} (a reflog position) with HEAD~2 (a parent walk along commit history).
Where HEAD was two movements ago in the reflog — usable anywhere Git expects a commit ref
The second parent of HEAD (like HEAD^2)
This answer confuses HEAD@{2} with HEAD^2 — three similar-looking suffixes (~, ^, @{}) all mean different things.
A tag named HEAD@{2}
This answer treats @{n} as part of a name rather than reflog syntax.

HEAD@{n} is reflog syntax — n movements back in the HEAD-position log. Different from HEAD~n (n commits back along first-parent chain) and HEAD^n (nth parent of HEAD). Three similar-looking but semantically different suffixes — get them wrong and you will end up at a different commit than you intended.

4. Reflog is local only. Which of these destroys your reflog and the rescue path with it? (Select all that apply)

rm -rf .git (or deleting the whole clone directory)
A fresh git clone of the same repo from the remote
git push --force to a remote
This answer confuses remote history with local reflog; force-pushing rewrites the remote branch but leaves your local .git/logs/ (and its rescue path) untouched.
Default reflog expiry once the configured retention window passes without activity

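Reflog lives in .git/logs/. Destroying .git/ (option 1) takes the reflog with it. A fresh clone (option 2) gets its own new reflog that records only the clone itself; none of your old entries come with it. Expiry (option 4) is configurable via gc.reflogExpire (default 90 days) and gc.reflogExpireUnreachable (default 30 days, the window that applies to entries for orphaned commits). A force-push (option 3) rewrites the remote’s branch but doesn’t touch your local reflog, so your local rescue path is still intact.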

5. Put these steps in the correct order to rescue an orphaned commit you just made in detached HEAD. (arrange in order)

Correct order:

git switch main # leave detached HEAD
git reflog # find the SHA of the orphaned commit
git branch rescued-work HEAD@{1} # anchor it with a branch
git log rescued-work --oneline # verify it is reachable again

Distractors (not used):

git log --all --oneline # would NOT show the orphan
git checkout <sha> # needs the SHA you just lost
git push origin rescued-work

The canonical rescue recipe. Distractor 1 would fail silently — log --all cannot see orphans. Distractor 2 needs the SHA, which was lost when the terminal scrolled. Distractor 3 is a separate sharing concern — irrelevant to rescue.

6. [Revisit Step 3 — preview] In the next step you will see that commits are content-addressable and immutable. Given that, what does git branch rescue <orphan-sha> actually do to the commit object?

It copies the commit into a new location
This answer treats git branch as a copy operation; it only writes a 41-byte ref file — the commit object itself never moves.
It does nothing to the commit object — it only writes a 41-byte file (.git/refs/heads/rescue) containing the SHA. The commit itself never moves; it just becomes reachable again via the new ref
It rewrites the commit’s SHA so it becomes reachable
This answer assumes Git mutates commits to make them reachable; SHAs are content-addressable and immutable, so any rewrite would be a different commit.
It restores the commit from a backup
This answer invents a backup mechanism; the commit was never deleted, only unreferenced.

Same mechanic you learned in the previous step. Creating a branch is one fwrite() of 41 bytes. Rescue doesn’t move commits; it makes them reachable. This is why rescue is instantaneous regardless of commit size — a concept the next step formalizes as Git’s object model.

3

Relative Commit Addresses & Git's Object Database

🎯 You will learn to

Name any commit without a SHA using HEAD~n, BRANCH^, and rev-parse.
Prove Git’s history model is snapshot-based — commits point to trees that point to blobs holding full file bytes — by hashing content directly.
Predict that a single trailing space changes the entire SHA chain — and say why that matters for blame later.

🚪 This is the threshold step

Step 3 is the conceptual hinge of the whole tutorial. Every later step (rebase, cherry-pick, bisect, submodules) becomes obvious or mysterious depending on whether the object model clicks here.

If it doesn’t click on the first read, that’s expected — threshold concepts (Meyer & Land) are transformative (they reframe the whole domain) and troublesome (they resist quick mastery). Re-read, re-run the hashing experiment, sleep on it. Most learners need two passes. The recall prompt at the bottom is your self-check.

Relative references

Expression	Meaning
`HEAD~n`	n commits back along first-parent chain
`BRANCH^`	shorthand for `BRANCH~1`
`BRANCH^2`	second parent of a merge commit

@startuml
branch main:
  A "Oldest commit"
  B "main~2"
  C "HEAD~1"
  D "HEAD / main"
head main
@enduml
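The difference between ~ and ^ shows up only at merge commits, so a toy graph with one merge makes it concrete. The sketch below is an illustration, not Git's resolver: commit names are single letters, and `tilde` and `caret` are hypothetical helper names.

```python
# Toy graph: commit -> ordered list of parents. Names are placeholders, not real SHAs.
parents = {
    "A": [],
    "B": ["A"],
    "C": ["B"],
    "F": ["B"],          # a side branch forked from B
    "M": ["C", "F"],     # merge commit: first parent C, second parent F
}


def tilde(commit, n):
    """commit~n: follow the FIRST parent n times."""
    for _ in range(n):
        commit = parents[commit][0]
    return commit


def caret(commit, n=1):
    """commit^n: pick the n-th parent (1-based) of this one commit."""
    return parents[commit][n - 1]


print(tilde("M", 2))   # B
print(caret("M"))      # C
print(caret("M", 2))   # F
```

Here M~2 walks two generations back along first parents, while M^2 stays one generation back and selects the other side of the merge.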

Task 1: Practice

cd /tutorial/myproject
git rev-parse HEAD        # current SHA
git rev-parse HEAD~1      # parent
git rev-parse main        # same as HEAD

Task 2: Prove content-addressability

Every object in .git/objects/ is addressed by the SHA-1 of its content (prefixed with a short type-and-size header). Three object kinds:

Object	Stores
blob	Raw file bytes (no filename)
tree	Directory: filename → blob/tree SHA
commit	Tree SHA + parent SHAs + author + message

Hash the same bytes in two unrelated repos:

echo "hello world" | git hash-object --stdin
cd /tmp && git init -q bob-repo && cd bob-repo
echo "hello world" | git hash-object --stdin
cd /tutorial/myproject

Identical 40-char SHA. Same bytes → same hash, always, everywhere. That’s why Git deduplicates across branches and history for free.
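The same hash can be reproduced without Git at all, which shows that it depends only on the bytes. The sketch below assumes the default SHA-1 object format; the function name `git_blob_sha1` is hypothetical.

```python
import hashlib


def git_blob_sha1(content: bytes) -> str:
    """SHA-1 of a blob: the header 'blob <size>' plus a NUL byte, then the raw bytes."""
    header = b"blob " + str(len(content)).encode() + b"\0"
    return hashlib.sha1(header + content).hexdigest()


# `echo "hello world"` emits the text plus a trailing newline.
sha = git_blob_sha1(b"hello world\n")
print(sha)   # 3b18e512dba79e4c8300dd08aeb37f8e728b8dad
```

No repository path, branch name, or user identity goes into the calculation, which is why the two repos above print the same value.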

Task 3: Byte-exact means byte-exact

Predict: hashing "hello world " with one trailing space — same SHA?

printf 'hello world \n' | git hash-object --stdin

Different. One whitespace byte → new blob SHA → new tree SHA → new commit SHA. That’s why reformatter commits (Step 6) mask real authorship: to Git a whitespace-only tweak is still a changed line, so the reformatting commit becomes the last one to touch it.
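The knock-on effect can be sketched by hashing a one-file tree and a commit by hand. This is a simplified illustration assuming the SHA-1 object format; the file name note.txt, the author line, and the message are made-up fixed values so that only the file content varies.

```python
import hashlib


def obj_sha1(kind: bytes, body: bytes) -> bytes:
    """Raw 20-byte SHA-1 of a Git object: '<kind> <size>' + NUL + body."""
    header = kind + b" " + str(len(body)).encode() + b"\0"
    return hashlib.sha1(header + body).digest()


def commit_chain(file_bytes: bytes):
    """Return (blob, tree, commit) hex SHAs for a one-file snapshot."""
    blob = obj_sha1(b"blob", file_bytes)
    # One tree entry: '<mode> <name>' + NUL, followed by the raw 20-byte blob SHA.
    tree = obj_sha1(b"tree", b"100644 note.txt\0" + blob)
    # Fixed, made-up identity and timestamp (hypothetical values).
    who = b"Example <example@example.invalid> 0 +0000"
    commit_body = (
        b"tree " + tree.hex().encode() + b"\n"
        + b"author " + who + b"\n"
        + b"committer " + who + b"\n"
        + b"\n"
        + b"demo\n"
    )
    commit = obj_sha1(b"commit", commit_body)
    return blob.hex(), tree.hex(), commit.hex()


clean = commit_chain(b"hello world\n")
spaced = commit_chain(b"hello world \n")   # one extra trailing space
changed = [a != b for a, b in zip(clean, spaced)]
print(changed)   # [True, True, True]
```

The tree embeds the blob SHA and the commit embeds the tree SHA, so a one-byte difference at the bottom surfaces at every level above it.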

✍️ Before moving on (the unifying invariant)

Close this and answer from memory:

“What’s the one property of existing commit objects that lets every later step in this tutorial work?”

The invariant (peek only after attempting)

Existing commit objects are immutable. Git changes history by creating new objects and/or moving references — never by editing old commits in place.

Every Git command falls into one of these operation categories:

Operation type	Examples	What changes
Create immutable objects	`hash-object -w`, `commit`, `stash`, `cherry-pick`, `revert`	New blob / tree / commit objects
Move refs	`branch`, `reset`, fast-forward `merge`, finalizing a successful `rebase`	Branch / ref points to a different commit
Update index	`add`, staging resolved conflicts, `merge --squash`	Staging area changes
Update working tree	`switch`, `restore`, `checkout`, `stash pop`, `submodule update`	Files on disk change
Transfer objects/refs	`fetch`, `push`, `pull`	Local/remote object/ref sets change

Most everyday commands combine categories (e.g., commit creates a commit object and moves a branch ref, leaving the index matching the new HEAD). The point isn’t that operations are pure; it’s that no operation rewrites existing commit objects. Whenever a later step feels confusing, ask: what objects is this creating? what refs is it moving? what’s still in .git/objects that I could recover?

Relative Addresses & Object Database — Knowledge Check

Min. score: 80%

1. You want the commit two before main. Which reference is correct?

main^2
This answer confuses ^2 (the second parent of a merge) with ~2 (two commits back along the first-parent chain).
main~2
main-2
This answer invents a syntax — Git uses ~ and ^, never - for relative addressing.
main..2
This answer confuses .. (a range expression) with single-commit addressing.

main~2 walks back 2 commits along the first-parent chain. main^2 means the second parent of a merge commit — completely different. main-2 and main..2 are not valid syntax.

2. What does git hash-object do?

Uploads the file to GitHub
This answer confuses a local plumbing command with a network operation; hash-object never touches a remote.
Computes the SHA-1 Git would assign to the content — letting you prove that identical content always produces the same hash
Encrypts the file with a cryptographic hash
This answer confuses cryptographic hashing (one-way fingerprint) with encryption (reversible).
Compresses the file for storage
This answer confuses content hashing (identity) with compression (size reduction).

git hash-object is the low-level plumbing that every commit uses internally. Because identical content always yields the same SHA, Git deduplicates identical files across the entire history for free.

3. Which statements about Git objects are true? (Select all that apply)

A commit object stores a reference to a tree, not the files themselves
A blob stores file content without the filename
Trees map filenames to blob or sub-tree SHAs
Git stores file differences (deltas) between commits, not full snapshots
This answer confuses Git’s snapshot-based history model with its later pack-file storage layer; delta compression is a storage optimization, not the model.

Git’s history model is snapshot-based: each commit points to a tree, which points to blobs holding full file content. Storage may later be packed and delta-compressed (git gc produces pack files using delta encoding) without changing the model — the abstraction commits expose is always whole snapshots. Filenames live in tree objects, not in blobs, so two files with identical content share one blob.

4. [Revisit Step 1] You are in detached HEAD at a commit that is 4 back from main. Which command prints that SHA without copying it from git log?

git rev-parse main~4
git log main --limit 4
This answer invents a flag — git log uses -n or --max-count, not --limit.
git checkout main~4
This answer moves HEAD instead of just printing the SHA; the prompt asked for the SHA without checkout.
git show main~4
This answer prints the full commit diff; git show is overkill when you only need the resolved SHA.

git rev-parse is the universal ‘ref → SHA’ translator. It accepts relative expressions (main~4), the symbolic HEAD, abbreviated SHAs (a73f), and branch or tag names. git show displays the full commit diff, not just the SHA. git log --limit is not valid syntax.

5. Why does git branch feature complete in milliseconds even on a 10-GB repo?

Git compresses the whole working directory in parallel
This answer assumes branch creation involves heavyweight work; it’s just a fwrite() of 41 bytes — no traversal, no compression.
It just writes one 41-byte file (.git/refs/heads/feature) containing the current commit SHA — no files are copied
Branches are stored on the remote server, not locally
This answer confuses local branches with remote refs; local branch creation is purely local and offline-capable.
Git skips adding the branch until you commit to it
This answer assumes lazy initialization; the branch is created and ready immediately, no commit required.

A branch creation is a single fwrite() of 41 bytes. No copying, no traversal, no network. Once you see branches as tiny pointer files, their speed and cheapness stops being mysterious.

6. [Revisit Steps 1-2] You enter detached HEAD at an old commit, make one exploratory commit, and switch away without creating a branch. In terms of Git objects, what happens to that commit?

Git deletes the commit object immediately
This answer assumes Git is eager about cleanup; objects survive until git gc decides to prune unreachable ones.
The commit object still exists in .git/objects but is unreachable from any ref, so git gc will eventually prune it
The commit is moved into the .git/orphans directory
This answer invents a .git/orphans directory; orphans live in the same .git/objects/ as everything else.
The commit is converted into a blob
This answer assumes Git mutates the object type; commits stay commits, they just lose their refs.

Objects in Git live until garbage collection. An orphaned commit is not ‘deleted’ — it is just unreachable from any ref. git reflog still records HEAD’s path through it, which is how git branch rescue <sha> can rescue it. This links Step 2’s reflog safety net to Step 3’s object-model view: unreachable ≠ deleted.

7. Put in order the commands that prove Git is content-addressable (same bytes → same SHA, across unrelated repos). (arrange in order)

Correct order:

echo "hello world" | git hash-object --stdin # in your main repo
# note the 40-char SHA — call it SHA-A
cd /tmp && git init -q other && cd other # fresh unrelated repo
echo "hello world" | git hash-object --stdin # same bytes
# output is IDENTICAL — call it SHA-B; SHA-A == SHA-B

Distractors (not used):

git push origin main # sharing has nothing to do with hashing
git config user.email alice # author metadata is in commits, not blobs
git commit -m "hello" # commit SHA includes parent/time; not the demo

Content-addressability is a property of bytes hashed, independent of repo, branch, or user. Distractor 1 (push) is irrelevant — hashes are local. Distractor 2 (email) affects commit objects but not blob hashes. Distractor 3 (commit) creates a commit object whose SHA depends on parent + time + author — not the cleanest demo of blob deduplication.

4

Saving Work Temporarily with git stash

🎯 You will learn to

Context-switch cleanly mid-feature without polluting history with WIP commits.
Pick pop vs. apply correctly.
Diagnose the classic “stash missed my new file” footgun.

Scenario

You’re mid-feature when your lead yells “hotfix on main, now!” Your options without stash are all bad: WIP commit (pollutes history), git restore (destroys work), or stay put (can’t isolate the fix).

git stash is the escape hatch.

🤔 Predict first

After git stash, where does your in-progress work end up — in the index, in the working tree, in a private commit, or deleted? And what will git status say about your working tree?

Task 1: See the dirty tree

A half-finished power function is already sitting in calculator.py:

def power(a, b):
    # TODO: add input validation
    return a ** b

cd /tutorial/myproject
git status
git diff

Task 2: Stash it

git stash
git status           # clean!
git stash list       # your WIP is here

💡 How stash works internally (Step 3 callback)

A stash is a merge commit at refs/stash — first parent is HEAD at stash time, second parent records the index (and a third parent records untracked files when you use -u). Same object model as every other commit, which is why git stash apply <sha> works on any historical stash.

Task 3: Do the hotfix on a dedicated branch

git switch -c hotfix-divide-zero

In the editor, append to calculator.py:

def safe_divide(a, b):
    """Divide a by b, raising ValueError on zero denominator."""
    if b == 0:
        raise ValueError("Cannot divide by zero")
    return a / b

git add calculator.py
git commit -m "Hotfix: add safe_divide to prevent zero-division errors"
git switch main
git merge hotfix-divide-zero --no-edit
git branch -d hotfix-divide-zero

Task 4: Restore your WIP

git stash pop
git stash list       # empty — pop removed it

pop = apply + drop. Use apply instead if you want to keep the stash (e.g. to apply it on multiple branches).
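The pop = apply + drop relationship can be sketched with a plain list standing in for the stash list. This is a toy model with hypothetical function names; it ignores conflicts and everything else that happens when a stash is actually applied.

```python
# Toy stash stack: index 0 is the newest entry, like stash@{0}.
stash = ["WIP on main: power()"]


def apply(stack):
    """Return the top entry but leave it on the stack."""
    return stack[0]


def drop(stack):
    """Remove the top entry without returning it."""
    del stack[0]


def pop(stack):
    """apply followed by drop."""
    entry = apply(stack)
    drop(stack)
    return entry


first = apply(stash)
second = apply(stash)            # the same entry can be applied again
left_after_apply = len(stash)

popped = pop(stash)
left_after_pop = len(stash)

print(first == second)     # True
print(left_after_apply)    # 1
print(left_after_pop)      # 0
```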

📋 Full stash cheat sheet (other flags)

Command	Effect
`git stash`	Save tracked mods + staged; clean tree
`git stash pop`	Restore and drop the top stash
`git stash apply`	Restore but keep the stash
`git stash drop`	Delete without applying
`git stash push -m "msg"`	Save with a message
`git stash -u`	Also include untracked files

Gotcha: plain git stash skips untracked (never-add-ed) files. Use -u to include them — the most common stash footgun.
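The footgun is easier to remember as a filter over file states. The sketch below is a simplified model, not Git's logic: `stash_selection` is a hypothetical name, the status labels are invented for the example, and ignored files are left out entirely.

```python
def stash_selection(status, include_untracked=False):
    """Return the paths a stash would capture in this simplified model.

    `status` maps path -> one of 'modified', 'staged', 'untracked'.
    """
    wanted = {"modified", "staged"}
    if include_untracked:
        wanted.add("untracked")   # models the -u flag
    return sorted(path for path, state in status.items() if state in wanted)


status = {"calculator.py": "modified", "feature.py": "untracked"}

plain = stash_selection(status)
with_u = stash_selection(status, include_untracked=True)

print(plain)    # ['calculator.py']
print(with_u)   # ['calculator.py', 'feature.py']
```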

Task 5: Finish the feature

Edit calculator.py so power has real validation, then commit (message must include “power”):

def power(a, b):
    """Return a raised to the power of b."""
    if not isinstance(a, (int, float)) or not isinstance(b, (int, float)):
        raise TypeError("Arguments must be numbers")
    return a ** b
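A quick way to sanity-check the validation before committing is to call the function with one valid and one invalid input. The snippet below repeats the function from above so that it runs on its own; the variable names are arbitrary.

```python
def power(a, b):
    """Return a raised to the power of b."""
    if not isinstance(a, (int, float)) or not isinstance(b, (int, float)):
        raise TypeError("Arguments must be numbers")
    return a ** b


result = power(2, 10)

try:
    power("2", 3)
    rejected = False
except TypeError:
    rejected = True

# bool is a subclass of int in Python, so this check lets True/False through.
bool_result = power(True, 2)

print(result)        # 1024
print(rejected)      # True
print(bool_result)   # 1
```

Whether booleans should count as numbers is a design choice the exercise leaves open; the check as written accepts them.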

Solution

myproject/calculator.py

"""A simple calculator module."""
def add(a, b): return a + b

def divide(a, b): return a / b

def safe_divide(a, b):
    """Divide a by b, raising ValueError on zero denominator."""
    if b == 0:
        raise ValueError("Cannot divide by zero")
    return a / b

def power(a, b):
    """Return a raised to the power of b."""
    if not isinstance(a, (int, float)) or not isinstance(b, (int, float)):
        raise TypeError("Arguments must be numbers")
    return a ** b

Commands

cd /tutorial/myproject && git switch main
git reset --hard HEAD
git clean -fdq
while git stash list 2>/dev/null | grep -q .; do git stash drop -q 2>/dev/null || break; done
(git branch -D hotfix-divide-zero 2>/dev/null; true) && git switch -c hotfix-divide-zero
printf '\ndef safe_divide(a, b):\n    """Divide a by b, raising ValueError on zero denominator."""\n    if b == 0:\n        raise ValueError("Cannot divide by zero")\n    return a / b\n' >> calculator.py
git add calculator.py && git commit -m 'Hotfix: add safe_divide to prevent zero-division errors'
git switch main && git merge hotfix-divide-zero --no-edit
git branch -D hotfix-divide-zero 2>/dev/null; true
printf '\ndef power(a, b):\n    """Return a raised to the power of b."""\n    if not isinstance(a, (int, float)) or not isinstance(b, (int, float)):\n        raise TypeError("Arguments must be numbers")\n    return a ** b\n' >> calculator.py
git add calculator.py && git commit -m 'Add power function with input validation'

git stash: snapshots tracked modifications and staged changes into a stash commit in .git/refs/stash, then resets the working tree to match HEAD. Untracked files are not included unless you use git stash -u.
git stash pop: applies the top stash and removes it. Conflicts surface exactly like merge conflicts. If the pop conflicts, Git keeps the stash entry: resolve the conflicts, git add the files, then run git stash drop yourself.
Why not just git commit -m "WIP"? A WIP commit pollutes shared history if pushed. The stash is private, local, and temporary — no risk of shipping half-baked work.
Internal: a stash is stored as a merge commit reachable via refs/stash. Its first parent is HEAD at stash time; its second parent is a commit recording the index state. With git stash -u, a third parent records the untracked files. This is why git stash apply <sha> works even on a stash that is no longer in the stash list, as long as its commit object still exists.

git stash — Knowledge Check

Min. score: 80%

1. You are mid-edit on app.py when your lead asks for an urgent hotfix on main. You have NOT staged your changes yet. Which approach keeps your tree clean for the hotfix without losing your in-progress work?

git restore app.py — discards the edits
This answer destroys the in-progress work the question wants to preserve.
git commit -m 'WIP' — commits the half-finished changes
This answer pollutes shared history if pushed; the prompt asks to keep the tree clean without losing work.
git stash — saves modifications to a temporary holding area and cleans the tree
git reset --hard — resets everything so you can restart later
This answer wipes the in-progress changes entirely — the opposite of preserving them.

git stash is built for this: it saves tracked modifications and staged changes to a private stack and resets the tree so you can context-switch cleanly. Recover the work with git stash pop.

2. What does git status report immediately after git stash?

Your modified files appear staged and ready to commit
This answer confuses stash with staging — stash moves changes to a private commit at refs/stash, not the index.
Your modified files appear untracked — Git has forgotten about them
This answer assumes Git forgets the changes; they’re safe in a stash commit, the working tree just matches HEAD.
nothing to commit, working tree clean
An error, because git status cannot be run after stashing
This answer assumes stash breaks git status; status works fine after stashing.

git stash resets the working tree to match HEAD — so git status reports clean. Your changes are safe in the stash commit at refs/stash.

3. Difference between git stash pop and git stash apply?

pop restores AND removes the stash entry; apply restores but keeps the entry
pop works only on the oldest entry; apply works on any entry
This answer invents a constraint; both verbs default to the newest entry (stash@{0}) and accept any entry by name. The difference is whether the entry is removed afterward.
apply creates a new commit; pop does not
This answer confuses applying a stash with creating a commit; neither pop nor apply produces a new commit.
They are identical
This answer ignores the actual difference (whether the entry survives or is dropped).

Use pop for the usual workflow. Use apply when you want the same stash on multiple branches — the entry stays in the list until you manually git stash drop.
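The difference is easy to see in a toy model. The sketch below treats the stash list as a plain Python list whose index 0 stands for stash@{0}; the function names are made up for illustration and this is not how Git stores stashes internally. It also skips one real-world detail: when git stash pop hits a conflict, Git keeps the entry instead of dropping it.

```python
# Toy model of the stash list: index 0 plays the role of stash@{0} (newest).
# Illustration only; real stashes are commits anchored at refs/stash.

def stash_push(stack, changes):
    """Save changes as the newest entry."""
    stack.insert(0, changes)

def stash_apply(stack, index=0):
    """Return the entry's changes and leave the list untouched."""
    return stack[index]

def stash_pop(stack, index=0):
    """Return the entry's changes and remove the entry from the list."""
    return stack.pop(index)

stack = []
stash_push(stack, "edit to app.py")
stash_push(stack, "edit to util.py")

applied = stash_apply(stack, 1)  # older entry, still listed afterwards
popped = stash_pop(stack)        # newest entry, gone afterwards
print(applied, "|", popped, "|", stack)
```

Both verbs accept any index; only pop shrinks the list.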

4. You ran git stash but your brand-new file feature.py (never git add-ed) is still there. Why?

The file is too large to stash
This answer invents a size restriction — stash has no size limit; the issue is tracked vs. untracked.
By default, git stash only stores tracked file changes; untracked files need git stash -u
git stash needs a message for untracked files; use git stash push -m
This answer confuses message annotation with the inclusion of untracked files; -u is the relevant flag, not -m.
Git ignores files starting with .
This answer confuses .gitignore rules with stash’s tracked-files default; dot-files aren’t ignored by default.

Plain git stash only captures what Git is tracking — modified tracked files and staged changes. For brand-new files, use git stash -u (--include-untracked).
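A minimal sketch of that default, assuming a working tree modeled as a dict of path to content and a set of tracked paths. The stash function and its include_untracked parameter are hypothetical stand-ins for git stash and -u; ignored files (which need -a) are left out of the model.

```python
def stash(working_tree, tracked, include_untracked=False):
    """Split a dirty working tree into (stashed, left_behind)."""
    stashed, left_behind = {}, {}
    for path, content in working_tree.items():
        if path in tracked or include_untracked:
            stashed[path] = content
        else:
            left_behind[path] = content   # untracked files stay on disk
    return stashed, left_behind

dirty = {"app.py": "modified", "feature.py": "brand new"}
tracked = {"app.py"}

stashed, left = stash(dirty, tracked)                              # plain git stash
stashed_u, left_u = stash(dirty, tracked, include_untracked=True)  # git stash -u
print(left, left_u)
```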

5. [Evaluate] A teammate says: ‘I never use stash — I just commit with WIP and squash later.’ Best evaluation?

WIP is strictly better — stashes can be accidentally dropped
This answer ignores the cost of WIP commits polluting shared history if pushed.
Equivalent — personal preference only
This answer flattens a real distinction (private/local stash vs. potentially-pushed WIP commits) into mere preference.
Both work; WIP pollutes shared history if pushed and requires disciplined squash, stash stays private/local. Mature teams have conventions for both
WIP is wrong — Git rejects commits with that message
This answer assumes Git rejects messages — it doesn’t; the practice is just discouraged, not blocked.

Both preserve work. The difference is visibility. Pushed WIP commits enter shared history and degrade git log, git bisect, and code review. Stash is private — no pollution, but you can forget it. Neither is universally right.

6. [Revisit Step 1] You stashed on main, then switched to a commit with git switch --detach HEAD~2 to inspect old code. What is the safe way to recover the stash?

Run git stash pop here — stash works in any HEAD state
This answer ignores the conflict-on-pop risk in detached HEAD where there’s no branch to anchor unresolved changes.
Return to a named branch first (git switch main), then git stash pop — applying a stash in detached HEAD risks creating orphaned state if conflicts arise
The stash is lost — it was tied to main
This answer assumes stashes are tied to the branch where they were created; they aren’t — refs/stash is independent.
Run git stash apply detached to redirect it
This answer invents a syntax — git stash apply takes a stash ref, not a branch state name.

Stashes are not tied to a branch — but a conflicting pop in detached HEAD leaves you with unresolved changes and nothing anchoring them. Always return to a named branch before popping.

7. [Revisit Step 3] Where does Git physically store a stash entry?

As a plain text file in .git/stash.txt
This answer invents a stash.txt file; stash uses the regular Git object database, not a separate text store.
As a single merge commit reachable via refs/stash (with HEAD and an index-state commit as its parents) — the same object model as any other commit
In RAM only; restarting the shell loses it
This answer assumes stash is volatile; it survives reboots because it’s a real commit anchored at refs/stash.
In the remote repository
This answer confuses local stash with remote sync; stash is local-only by design.

A stash is a proper commit in the object database, anchored by refs/stash. This is why git stash survives across terminals and reboots, and why git stash apply <sha> works with any historical stash. Same object model as Step 3 — stash is not a special case.

8. Put in order the complete “stash → hotfix → resume” workflow from Task 3 of this step. (arrange in order)

Correct order:

git stash # save WIP, clean working tree
git switch -c hotfix-xyz # dedicated branch for the hotfix
git commit -am "Hotfix: ..." # fix + commit on the hotfix branch
git switch main && git merge hotfix-xyz --no-edit
git branch -d hotfix-xyz # clean up the merged hotfix branch
git stash pop # restore your WIP on main

Distractors (not used):

git commit -m "WIP" # pollutes history if pushed
git restore . # discards WIP permanently
git stash drop # would throw away the WIP
git push origin stash # you cannot push a stash

The canonical context-switch sequence. Each distractor is a common novice mistake — committing WIP pollutes shared history; git restore destroys work; dropping the stash before popping loses it; stashes are local-only (no push). Learn this six-line sequence as a unit.

5

Cherry-Pick: Copy One Specific Commit

🎯 You will learn to

Pick cherry-pick for one-commit backports; reject it for many-commit integration.
Resolve a cherry-pick conflict end-to-end (same marker dance as merge — different final verb).
Explain why the copied commit has a new SHA (apply Step 3’s object model).

Scenario

Lead: “The absolute helper on experimental is useful on main too. Bring that one commit over — leave the half-baked multiply behind.”

🤔 Predict first

Cherry-pick produces a new commit with a new SHA. What happens to the original commit on experimental — does it move, get rewritten, vanish, or stay put unchanged?

cherry-pick <sha> replays one commit’s patch on top of HEAD as a new commit (new parent → new SHA, same message + diff).

Task 1: Inspect

The pre-built experimental has two commits: a half-baked experimental_multiply, and a reusable absolute.

cd /tutorial/myproject
git log experimental --oneline

You only want the second commit.

Task 2: Cherry-pick the tip

A branch name resolves to its tip commit — no SHA copy needed:

git switch main
git cherry-pick experimental
git log --oneline

A new commit Add absolute value function sits on main with a different SHA from the original. Same patch, new parent → new SHA.

💡 Schema check (Step 3 callback). Cherry-pick creates a new immutable object and moves the branch pointer to it. The original commit on experimental is untouched — Git never edits commits in place. This pattern repeats in every step from here on.

🔍 Contrast — what’s not like cherry-pick. git branch foo at the same commit creates zero new objects (just a 41-byte ref file). Both move pointers; only cherry-pick also creates a new commit. That’s why branch creation is instant and cherry-pick can fail with a conflict.
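The contrast can be checked with a toy object database. This is a sketch under simplifying assumptions: a commit is just (parent, patch), its id is a SHA-1 over those two strings, and the names objects, refs, branch and cherry_pick are invented for the example rather than taken from Git.

```python
import hashlib

objects = {}   # toy object database: id -> (parent_id, patch)
refs = {}      # toy refs: branch name -> commit id

def commit(parent, patch):
    """Store a commit and return its id, derived from parent + patch."""
    oid = hashlib.sha1(f"{parent}\n{patch}".encode()).hexdigest()
    objects[oid] = (parent, patch)
    return oid

def branch(name, target):
    """Create a branch: one new ref, zero new objects."""
    refs[name] = target

def cherry_pick(onto_branch, source_commit):
    """Replay source_commit's patch on top of onto_branch as a new commit."""
    _, patch = objects[source_commit]
    refs[onto_branch] = commit(refs[onto_branch], patch)
    return refs[onto_branch]

root = commit(None, "initial")
refs["main"] = commit(root, "document add")
refs["experimental"] = commit(root, "add absolute")

before = len(objects)
branch("foo", refs["main"])
after_branch = len(objects)       # unchanged: a branch is only a pointer

original = refs["experimental"]
copy = cherry_pick("main", original)
after_pick = len(objects)         # one more object: the copied commit
print(before, after_branch, after_pick, copy != original)
```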

Task 3: Produce and resolve a conflict

Make the same line differ on both branches:

On main, edit calculator.py so def add(a, b): return a + b becomes:

def add(a, b):
    """Return the sum of two numbers."""
    return a + b

git add calculator.py && git commit -m "Document add function"

On experimental, change the same line differently:

git switch experimental

Edit to:

def add(a, b): return a + b  # simple addition

git add calculator.py && git commit -m "Inline comment on add"
git switch main
git cherry-pick experimental      # CONFLICT
git status

You’ll see <<<<<<< / ======= / >>>>>>> in the file. Conflicts are not failures — Git is asking a human to combine two valid changes.

Edit the block to keep both sides:

def add(a, b):
    """Return the sum of two numbers."""
    return a + b  # simple addition

git add calculator.py
git cherry-pick --continue     # NOT `git commit` — use the cherry-pick verb

🆘 Stuck on the conflict?

Open calculator.py and find the <<<<<<< / ======= / >>>>>>> block.
The block has two halves: above ======= is what you have (HEAD), below is what’s coming in (the cherry-picked commit).
Edit so the result keeps the docstring and the inline comment, then delete all three marker lines.
git add calculator.py → git cherry-pick --continue.
To bail at any point: git cherry-pick --abort resets cleanly.
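Before running git add, it can help to confirm that no marker line survived the edit. The helper below is a hypothetical convenience, not part of Git (git diff --check gives a similar warning); the label after >>>>>>> in the sample text is made up.

```python
def find_conflict_markers(text):
    """Return (line_number, line) pairs that still look like conflict markers."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        if line.startswith(("<<<<<<< ", ">>>>>>> ")) or line == "=======":
            hits.append((number, line))
    return hits

conflicted = '''<<<<<<< HEAD
def add(a, b):
    """Return the sum of two numbers."""
    return a + b
=======
def add(a, b): return a + b  # simple addition
>>>>>>> incoming-commit
'''

resolved = '''def add(a, b):
    """Return the sum of two numbers."""
    return a + b  # simple addition
'''

print(find_conflict_markers(conflicted))
print(find_conflict_markers(resolved))
```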

Cherry-Pick — Knowledge Check

Min. score: 80%

1. What does git cherry-pick <sha> do?

Permanently moves the commit from its original branch to the current branch
This answer assumes cherry-pick mutates the source branch; it’s a copy, not a move — the source commit is untouched.
Computes the patch of that commit vs its parent and creates a new commit on the current branch applying the same patch
Merges the entire branch containing that commit
This answer confuses single-commit cherry-pick with whole-branch merge.
Renames the commit on the current branch
This answer treats cherry-pick as a metadata edit; it actually creates a new commit object with a new SHA.

Cherry-pick replays one commit as a new commit on HEAD. The source commit is unchanged. The new commit has the same patch and message but a new parent and therefore a new SHA.

2. You cherry-pick commit abc123 from experimental onto main. Afterwards, what is on experimental?

abc123 has been removed from experimental
This answer assumes cherry-pick removes the source commit; cherry-pick is purely additive and doesn’t touch the source.
abc123 is still on experimental, unchanged — cherry-pick copies, not moves
experimental now points to main
This answer assumes branches got rewired; cherry-pick only adds to the current branch.
experimental has been deleted
This answer assumes cherry-pick is destructive on the source; it isn’t.

Cherry-pick is a copy operation. The source commit stays where it is. Two commits with the same patch now live in two branches with different SHAs.

3. During a cherry-pick, Git reports a conflict. Which sequence correctly completes it?

Delete the conflicted file and run git cherry-pick --continue
This answer destroys the work instead of resolving the conflict.
Resolve conflicts in the file, git add the file, then git cherry-pick --continue
Run git cherry-pick --force to override the conflict
This answer invents a flag; Git has no --force option for cherry-pick.
Run git commit --amend to finalize the cherry-pick
This answer breaks the cherry-pick flow — --amend rewrites the previous commit, not the cherry-pick in progress.

Standard conflict resolution: edit the file to remove the <<<<<<< / ======= / >>>>>>> markers, git add to mark it resolved, then git cherry-pick --continue. The original commit message is reused; after a conflict Git normally opens the editor pre-filled with it, and recent Git versions accept --no-edit to skip that. --abort bails and restores HEAD.

4. [Revisit Step 3] After a cherry-pick, the new commit has a different SHA from the source. Why?

Cherry-pick mutates the patch content
This answer assumes cherry-pick rewrites content; the patch is identical, only the parent differs.
The new commit has a different parent, and a commit’s SHA hashes its tree + parent(s) + metadata — any change alters the SHA
Git randomly regenerates SHAs for safety
This answer assumes randomness; SHAs are deterministic hashes of (tree + parents + metadata).
The new commit has no parent at all
This answer assumes the new commit is parentless; it has the current HEAD as its parent.

A commit’s SHA is SHA-1(tree + parent(s) + author + committer + message). Same patch on a different parent → different tree (possibly) and definitely different parent reference → different SHA. Step 3’s object-model lesson makes this inevitable.
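For SHA-1 repositories the computation can be reproduced with hashlib. The sketch below holds everything constant except the parent, which is a simplification (a real cherry-pick usually changes the tree and the committer timestamp too). The tree id, parent ids and identity string are made-up placeholders.

```python
import hashlib

def commit_sha(tree, parents, author, committer, message):
    # Git hashes b"commit <body size>\0" followed by the commit body.
    lines = [f"tree {tree}"]
    lines += [f"parent {p}" for p in parents]
    lines += [f"author {author}", f"committer {committer}", "", message]
    body = "\n".join(lines).encode()
    header = f"commit {len(body)}".encode() + b"\0"
    return hashlib.sha1(header + body).hexdigest()

tree = "a" * 40                                   # placeholder tree id
ident = "Bob <bob@example.com> 1700000000 +0000"  # placeholder identity
message = "Add absolute value function\n"

on_experimental = commit_sha(tree, ["1" * 40], ident, ident, message)
on_main = commit_sha(tree, ["2" * 40], ident, ident, message)
print(on_experimental != on_main)
```

The only input that differs between the two calls is the parent line, and that alone is enough to produce a different 40-character id.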

5. Which scenario is a bad fit for cherry-pick?

Backporting a single bugfix from main to a release branch
This answer rejects cherry-pick’s canonical use case; one-commit backports are exactly what it’s for.
Pulling one reviewed commit from a colleague’s branch while the rest is still in review
This answer rejects cherry-pick’s other canonical use case (surgical extraction from a partial branch).
Integrating all 50 commits of a long-running feature branch into main
Applying the same emergency patch to three parallel maintenance branches
This answer rejects another canonical use case; cherry-picking the same fix into multiple maintenance branches is standard practice.

For integrating many commits, use git merge or git rebase — cherry-picking 50 commits by hand is laborious and loses merge base information, which complicates future merges. Cherry-pick is surgical — reserve it for one or a few commits.

6. [Revisit Step 4] Mid-cherry-pick, a conflict pauses Git. You realize you need to check something on another branch first. Which sequence gets you to the other branch safely?

git stash — stashes work across branches, including mid-cherry-pick state
This answer assumes stash captures Git’s internal cherry-pick state (CHERRY_PICK_HEAD, conflicted index); it doesn’t.
git cherry-pick --abort first to restore a clean state, then switch; re-do the cherry-pick when you return
Just git switch other-branch — Git will remember the cherry-pick
This answer assumes Git tolerates mid-cherry-pick switches; it refuses to switch while the index still holds unmerged (conflicted) entries.
git commit -m 'WIP conflict' --allow-empty to save the mid-state
This answer corrupts the cherry-pick state by inserting a foreign commit into its in-progress flow.

You cannot cleanly stash or switch with an in-progress cherry-pick — Git’s internal state (MERGE_MSG, CHERRY_PICK_HEAD, conflicted index) is not stash-compatible. Abort, switch, do the other task, come back, and re-start the cherry-pick. The abort is cheap and restores a clean state.

7. Put in order the commands that resolve a conflicted cherry-pick end-to-end. (arrange in order)

Correct order:

git switch main # be on the target branch
git cherry-pick <sha> # CONFLICT reported
git status # see which files conflict
# edit the conflicted file: remove <<<<<<<, =======, >>>>>>> markers
git add <file> # mark resolved
git cherry-pick --continue # finalize with original message

Distractors (not used):

git commit -m "resolve conflict" # breaks the cherry-pick flow
git cherry-pick --force # not a real flag
git merge --continue # wrong verb — not a merge
git reset --hard # would abort + discard

The post-conflict verb is cherry-pick --continue, not commit. The other distractors are common reflex mistakes — --force doesn’t exist here; merge --continue is a different operation; reset --hard discards rather than finalizes. Use --abort to bail out cleanly.

6

git blame: Who Last Changed This Line (and Why)?

🎯 You will learn to

Answer “why does this line exist?” by chaining blame -L → show <sha>.
Predict when plain blame lies — reformatter commits mask real authors.
Defuse the lie with -w or blame.ignoreRevsFile.
Recognize blame’s blind spot: it can only see existing lines.

The two-command forensic workflow

git blame -L <start>,<end> <file> → find the SHA that last touched the line.
git show <sha> → read the commit message and diff — the why lives here.

Blame is for context, not accusation.

Task 1: Why does this line exist?

git blame -L 7,7 calculator.py
# Copy the SHA from the first column, then:
git show <that-sha>

Who, when, why — covered. That chain is 90% of real blame use.

Task 2: The reformatter-masked authorship case

Setup planted: Bob wrote clip. CI-Bot later ran whitespace normalization (no logic change).

Predict: who will plain blame name as the last author of def clip?

git blame -L 1,$(wc -l < calculator.py) calculator.py | grep -i 'clip'

Last-toucher wins — blame names CI-Bot, masking Bob. Inspect:

git show <ci-bot-sha>     # pure whitespace diff

Add -w to skip whitespace-only changes:

git blame -w -L 1,$(wc -l < calculator.py) calculator.py | grep -i 'clip'

Now the author is Bob, the real logic author. For recurring formatters, persist this (the ignore file expects full, unabbreviated commit SHAs, one per line):

echo "<ci-bot-sha>" >> .git-blame-ignore-revs
git config blame.ignoreRevsFile .git-blame-ignore-revs

GitHub’s web blame UI honors this file too.
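Why last-toucher-wins masks Bob, and why ignoring whitespace unmasks him, can be seen in a toy line-attribution routine. This is a simplified sketch built on difflib, not Git's actual blame algorithm; the blame function, its ignore_whitespace parameter and the two-version history are assumptions made for the example.

```python
import difflib

def blame(history, ignore_whitespace=False):
    """history: list of (author, lines), oldest first.
    Returns the author credited for each line of the newest version."""
    def key(line):
        # Mimic -w by comparing lines with all whitespace removed.
        return "".join(line.split()) if ignore_whitespace else line

    first_author, previous = history[0]
    owners = [first_author] * len(previous)
    for author, current in history[1:]:
        matcher = difflib.SequenceMatcher(
            a=[key(line) for line in previous],
            b=[key(line) for line in current],
            autojunk=False,
        )
        new_owners = [author] * len(current)      # changed lines go to this commit
        for tag, a1, a2, b1, b2 in matcher.get_opcodes():
            if tag == "equal":                    # unchanged lines keep their owner
                new_owners[b1:b2] = owners[a1:a2]
        owners, previous = new_owners, current
    return owners

history = [
    ("Bob",    ["def clip(x,lo,hi):", "    return max(lo,min(x,hi))"]),
    ("CI-Bot", ["def clip(x, lo, hi):", "    return max(lo, min(x, hi))"]),
]

plain = blame(history)
ignoring_ws = blame(history, ignore_whitespace=True)
print(plain, ignoring_ws)
```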

Task 3: Default blame vs. `HEAD --` blame

Predict first: if your working tree has uncommitted edits to a file, will plain git blame <file> show those uncommitted lines or hide them?

echo "# uncommitted note" >> calculator.py
git blame calculator.py | tail        # the uncommitted line is shown — with a zero SHA "Not Committed Yet"
git blame HEAD -- calculator.py | tail # only what's committed at HEAD
git restore calculator.py             # discard the experimental edit

The distinction. Default git blame <file> annotates the file as it currently is on disk — uncommitted lines included, marked with the zero SHA 00000000 and the author “Not Committed Yet”. git blame HEAD -- <file> instead asks “who last touched this line in the version recorded at HEAD?” Different question, different answer when the working tree is dirty.

Still a real blind spot, though. Blame can only attribute existing lines (in either mode). A bug caused by a deleted line is invisible. For deletions, reach for git log -p, git log -S (pickaxe search), or git bisect (next step) — the official Git docs are explicit that deleted/replaced lines require diff- or pickaxe-style history search.
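The idea behind the pickaxe search can be sketched in a few lines: report every commit where the number of occurrences of a string changes. The version below assumes a linear history given as (commit id, file text) pairs; the ids c1 to c3, the sample code and the pickaxe function are all hypothetical.

```python
def pickaxe(history, needle):
    """Return ids of commits where the occurrence count of needle changes.
    history: list of (commit_id, file_text), oldest first."""
    hits = []
    previous_count = 0
    for commit_id, text in history:
        count = text.count(needle)
        if count != previous_count:
            hits.append(commit_id)
        previous_count = count
    return hits

history = [
    ("c1", "def save(data):\n    validate(data)\n    write(data)\n"),
    ("c2", "def save(data):\n    validate(data)\n    write(data)\n    log(data)\n"),
    ("c3", "def save(data):\n    write(data)\n    log(data)\n"),
]

print(pickaxe(history, "validate("))
```

The newest version no longer contains the validate call, so blame has no line to attribute, while the search still surfaces the commit that added it and the commit that removed it.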

📋 Full flag cheat sheet (`-C`, `-M`, `ignoreRevsFile`)

Flag	Use when
`-L start,end`	You know which lines matter (avoid scanning 1000 lines)
`-w`	A reformatter was the last toucher
`-M` / `-C`	A line moved within a file (`-M`) or was moved or copied from another file (`-C`)
`blame.ignoreRevsFile`	Permanently skip known reformat commits

💡 Sanity check: when `-w` is a no-op (try it)

git blame -L 1,$(wc -l < calculator.py) calculator.py | grep -i 'def add'

Plain blame already shows the real author — -w is identical here. Rule: -w matters only when a reformatter was the last toucher.

git blame — Knowledge Check

Min. score: 80%

1. What does git blame calculator.py show?

A list of bugs that were found in calculator.py
This answer treats git blame as static analysis; blame is purely about line provenance, not bug detection.
For every line in the file: the SHA, author, and timestamp of the commit that last modified that line
The remote collaborators who have access to the file
This answer confuses authorship-of-lines with access-control of the file.
The diff of the most recent commit that touched the file
This answer confuses blame’s per-line view with git show’s per-commit view.

Blame gives per-line provenance — the last-touching commit and author. Combined with git show <sha>, you see the full context: why the line was written this way.

2. You need to know the commit message for line 42’s last modification. Which sequence gets you there fastest?

git log calculator.py | grep 42
This answer treats line numbers as a search string; git log doesn’t index by line number, only blame does.
git blame -L 42,42 calculator.py to get the SHA, then git show <sha> for the message and diff
git diff calculator.py to see what changed
This answer shows uncommitted changes, not authorship of committed lines.
git status calculator.py
This answer reports working-tree state, not historical authorship.

git blame -L 42,42 restricts output to line 42 only, so the answer is a single row. The first column is the SHA; pass that SHA to git show for the full message. This two-step recipe is idiomatic.

3. A colleague recently ran black across the whole repository. Now git blame shows them as the author of every line. How do you see the real last-meaningful author?

Revert the black commit
This answer is needlessly destructive; -w and blame.ignoreRevsFile solve this without rewriting history.
Use git blame -w to ignore whitespace-only changes, and/or configure blame.ignoreRevsFile to exclude the black commit SHA
Git has no way to recover this information
This answer assumes the reformatter has masked the data permanently; it hasn’t, blame just needs the right flags.
Use git reflog to rewind blame
This answer confuses reflog (HEAD-position diary) with blame’s commit-by-line attribution.

-w ignores whitespace-only changes, hiding pure reformatting from blame. For recurring formatters, add the reformat commit SHAs to a file referenced by blame.ignoreRevsFile — now everyone skips them consistently.

4. [Revisit Step 3] When git blame prints a SHA for a line, what kind of object does that SHA refer to?

A blob object containing just that one line
This answer confuses commit objects with blob objects; blame attributes to commits, not blobs.
A tree object for the file’s containing directory
This answer confuses commit objects with tree objects; blame’s SHAs are commit SHAs.
A commit object — the last one that modified that line
A stash entry
This answer confuses commits with stash entries; blame doesn’t reference stashes.

Blame attributes lines to commits. The SHA printed is a commit SHA — run git cat-file -t <sha> to confirm it reports commit. You then use git show <sha> to read it.

5. When is git blame the wrong tool for finding a bug?

When the bug is a missing line that was never added
When you need to know who last modified a specific line
This answer states blame’s core use case — the question asked when blame is the wrong tool.
When you need the timestamp of a line
This answer states another core blame use case; blame prints timestamps for every line.
When you need the commit message behind a line
This answer describes the standard blame → show chain; it’s blame’s strength, not its weakness.

Blame only tells you about existing lines. A bug caused by an absent line (e.g., forgetting to call validate()) leaves blame blind. For regressions introduced by a missing line, use git bisect (next step) or git log -p to scan history.

6. [Analyze] Give a concrete bug where git blame would mislead you even though the culprit line IS in the file. Which of the following fits best?

A one-character typo fixed by a teammate — blame shows the typo’s original commit, not the fix
This answer assumes blame keeps showing the original typo; once fixed, blame attributes the line to the fixer.
A logic bug in a line that was auto-reformatted by a CI script last week — plain blame points at the CI bot, not the author who introduced the logic bug
A commit that was cherry-picked to main from experimental
This answer assumes cherry-pick fools blame; the new commit has its own SHA but blame still attributes correctly.
A commit that was merged via a merge commit
This answer assumes merge commits mask authorship; standard merges leave per-line history intact.

Reformatter commits are the classic blame-mislead scenario. The CI bot’s commit ‘last touched’ every line, so blame attributes all lines to the bot — hiding the real author who introduced the logic bug. Defense: git blame -w to skip whitespace-only changes, or blame.ignoreRevsFile to skip known reformat commits.

7. [Revisit Step 4] Your working tree has an uncommitted edit to calculator.py. You run plain git blame calculator.py. What do you see for the modified line?

Nothing — Git refuses to run blame on a dirty file
This answer assumes Git refuses dirty trees; blame works fine, it just shows the uncommitted line specially.
The line appears with a zero SHA labeled “Not Committed Yet” — default blame annotates the working-tree copy of the file, including uncommitted lines
The line appears with the SHA of the last commit that touched the file — uncommitted edits are invisible to blame
This answer assumes blame ignores the working tree; default blame uses the working-tree copy, only blame HEAD -- restricts to committed lines.
An error about a detached working tree
This answer invents an error condition; “detached” applies to HEAD, not to working trees.

Default git blame <file> annotates the file as it currently is — so an uncommitted line appears with the zero SHA 00000000 and author “Not Committed Yet”. To restrict to the committed version of the file, use git blame HEAD -- <file>. Two different questions (“who touched what I’m reading right now?” vs. “who touched what’s recorded at HEAD?”); two different commands. Note this is separate from the deletion blind spot — a line that no longer exists in the file is invisible to either mode of blame.

8. Put in order the “forensic chain” for understanding why a specific line in parser.py exists. (arrange in order)

Correct order:

git blame -L 42,42 parser.py # find SHA of last change to line 42
# copy the SHA from the first column
git show <sha> # read commit message + full diff
# does that commit look like a pure reformat? If yes, try -w:
git blame -w -L 42,42 parser.py # ignore whitespace-only commits

Distractors (not used):

git log parser.py | grep 42 # slow, imprecise
git diff HEAD parser.py # shows YOUR uncommitted edits, not authorship
git status parser.py # unrelated to authorship

The blame → show chain answers “why does this line exist?” The -w fallback defuses reformatter masking. The distractors are all plausible-looking commands that don’t answer the authorship question — common cul-de-sacs when learners panic-grep instead of reaching for blame.

7

git bisect: Binary Search for the Commit That Broke Things

🎯 You will learn to

Decide when bisect is worth reaching for (rule: ≥ ~5 commits or slow tests).
Run an automated bisect end-to-end and always reset afterward.
Spot regressions blame cannot find — deletions, behavioral changes, and anything involving missing lines.

🤔 Predict first

A regression appeared somewhere in the last 1000 commits. Roughly how many tests would git bisect need to find the exact breaking commit? Pick one before reading on: 1000, 500, 100, or ~10.

Why bisect beats every alternative

Reading 30 diffs by hand is slow. blame can’t see missing lines. log --grep="fix" is wishful thinking.

Bisect runs binary search on history: log₂(30) ≈ 5 tests to pin the exact culprit. 1000 commits → ~10 tests. Scales forever.
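The halving can be sketched as an ordinary binary search. This assumes a linear history and a test that is good for every commit before the culprit and bad from the culprit onward; real git bisect also handles branching history. The bisect function and the culprit position 638 are made up for the example.

```python
def bisect(commits, is_good):
    """commits: oldest first; commits[0] is known good, commits[-1] known bad.
    Returns (first_bad_commit, number_of_tests_run)."""
    good, bad = 0, len(commits) - 1
    tests = 0
    while bad - good > 1:
        mid = (good + bad) // 2
        tests += 1
        if is_good(commits[mid]):
            good = mid          # culprit is later
        else:
            bad = mid           # culprit is here or earlier
    return commits[bad], tests

commits = list(range(1001))     # 0 = known good, 1000 = known bad
first_bad, tests = bisect(commits, lambda c: c < 638)
print(first_bad, tests)
```

Doubling the history adds only one more test, which is why the cost stays small even for very long ranges.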

Task 1: See the regression

Setup planted 5 commits; one of them broke absolute(-4) == 4.

cd /tutorial/myproject
git log --oneline -7
python3 test_calculator.py      # AssertionError

Task 2: Manual bisect (feel the motion)

git bisect start
git bisect bad HEAD
git bisect good HEAD~5
# Git checks out a midpoint. Test it:
python3 test_calculator.py
# exit 0 → git bisect good ;  exit ≠ 0 → git bisect bad
# Repeat until Git prints "<sha> is the first bad commit"
git bisect reset

Task 3: Automated bisect (the real-world default)

git bisect start HEAD HEAD~5
git bisect run python3 test_calculator.py
git bisect reset

bisect run uses the script’s exit code to drive the search: 0 = good, 1–127 = bad (except 125, which means "skip this commit"), and anything else aborts the bisect. Always finish with reset; otherwise HEAD stays on the last commit bisect checked out.
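The exit-status rules documented for git bisect run can be sketched as a small lookup. This is an illustration in Python, not Git's own code, and the function name verdict is made up here.

```python
def verdict(exit_code):
    """Map a test script's exit status to the action bisect run takes."""
    if exit_code == 0:
        return "good"
    if exit_code == 125:
        return "skip"    # commit cannot be tested (for example, it does not build)
    if 1 <= exit_code <= 127:
        return "bad"
    return "abort"       # 128 and above stop the bisect session

for code in (0, 1, 125, 127, 128):
    print(code, verdict(code))
```

A wrapper script can use this to its advantage: exit 125 when the build fails, so that an untestable commit is skipped instead of being blamed.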

Task 4: Fix the bug

Bisect points at Simplify absolute (BUG: removes negation!). In the editor, restore the body of absolute:

def absolute(x):
    """Return |x| — handles negatives, zero, and positives."""
    return x if x >= 0 else -x

git commit -am "Fix: restore negation in absolute"
python3 test_calculator.py      # all tests pass

⚠️ Test-portability caveat (real-world bisects)

Bisect runs the test at every historical commit in range. If the test itself was added mid-range, older commits won’t have it, the command exits non-zero, and bisect wrongly records those commits as bad. Copy a known-good version of the test into place on each iteration:

git bisect run bash -c 'cp /tmp/test.py . && python3 test.py'

🌙 Halftime: take a break before Step 8

You’ve finished the daily tools phase (stash, cherry-pick, blame, bisect). Steps 8–11 are history rewriting — denser and structurally riskier.

Walk away for at least 30 minutes (overnight is better) before continuing. Spaced practice is one of the most replicated findings in cognitive science: spreading study over time generally produces better retention than pushing straight through, and consolidation continues while you’re not studying.

When you come back, predict from memory: what does git stash actually save? Why does cherry-pick create a new SHA? If those don’t come fast, re-do the step. If they do, Step 8 awaits.

git bisect — Knowledge Check

Min. score: 80%

1. A regression appeared somewhere in the last 50 commits. Roughly how many tests does git bisect need to find the exact breaking commit?

50 — one per commit
This answer assumes linear search; bisect halves candidates each step (log₂(50) ≈ 6, not 50).
25 — approximately half
This answer halves only once; bisect halves repeatedly until one commit remains.
~6 — log₂(50) ≈ 6
1 — bisect inspects all commits at once
This answer assumes a single oracle; bisect needs one test per midpoint to drive the search.

Binary search halves the range each test: 50 → 25 → 13 → 7 → 4 → 2 → 1. About 6 iterations. For 1000 commits, ~10 tests. This scaling is why bisect is irreplaceable on long-running projects.

2. Which sequence correctly runs an automated bisect?

git bisect start; git bisect run <cmd>
This answer skips the boundary-marking step; without bad/good markers, bisect can’t binary-search.
git bisect start; git bisect bad; git bisect good <old-sha>; git bisect run <cmd>
git bisect run <cmd> HEAD HEAD~50
This answer puts boundaries in the wrong place; start takes them, run takes a command.
git bisect <cmd>
This answer omits both start and run; bisect needs both for automation.

You must tell bisect the boundaries first — bad (usually HEAD) and a known-good earlier commit. Only then can run automate. The command’s exit code (0 = good, nonzero = bad) drives the search.

3. You ran git bisect run successfully. What must you do afterwards?

Nothing — bisect cleans itself up
This answer assumes auto-cleanup; HEAD stays at the last midpoint until you bisect reset.
git bisect reset — otherwise HEAD stays on the midpoint commit it last checked out
git stash to preserve bisect state
This answer confuses stash (working-tree shelving) with bisect cleanup.
Push the result to the remote
This answer assumes the bisect result needs to be shared via push; the result is local diagnostic info.

git bisect reset is non-negotiable. It restores HEAD to where you started and removes bisect’s temporary refs. Skipping it leaves HEAD detached on whichever historical commit bisect checked out last, a common cause of ‘why is my code weird?’ panic.

4. Which test property is required for git bisect run to work?

The test must produce JSON output
This answer invents an output-format requirement; bisect cares about exit codes, not output format.
The test must exit 0 on pass and non-zero on fail, and must run on every commit in the range
The test must be written in Python
This answer invents a language requirement; bisect runs any executable.
The test must only exist in the latest commit
This answer is backwards; the test must work at every commit in the range, not just the latest.

Bisect uses the exit code as its oracle. Also critical: the test must actually run at every historical commit. If the test file was added mid-range, older commits will fail to even find the test, and bisect will misread that failure as "bad". Use git bisect run bash -c 'cp /tmp/test.py . && python3 test.py' to work around this.

5. [Revisit Step 1] In the middle of a manual bisect, Git leaves HEAD at a historical commit while you decide good/bad. What HEAD state are you in, and why is that OK?

Detached HEAD — and it is safe because bisect manages it; git bisect reset restores you to the starting branch afterwards
Attached to a new branch called bisect/working that Git creates automatically
This answer invents a bisect/working branch; bisect uses internal refs (refs/bisect/*), not a magic branch.
Detached HEAD — and any git commit you make here will corrupt history
This answer assumes detached-HEAD commits are dangerous; bisect’s transient state is safe and reset undoes it.
Attached to main — bisect only moves the working tree, not HEAD
This answer assumes HEAD doesn’t move; bisect explicitly checks out historical commits at each midpoint.

During bisect, HEAD is detached at whichever historical commit Git picked as midpoint. That is fine because bisect tracks its progress in refs/bisect/* and in state files such as .git/BISECT_START and .git/BISECT_LOG. git bisect reset restores the pre-bisect HEAD. Same detached-HEAD concept as Step 1, just used in service of a search.

6. [Revisit Step 6] A bug appears because a line that used to exist was deleted. Which tool finds the deletion commit?

git blame — it shows every line’s author
This answer ignores blame’s deletion blind spot; absent lines are invisible to blame regardless of mode.
git bisect — it binary-searches the history for the commit where behavior changed, regardless of whether the change was an addition or deletion
git log --grep='delete' — it finds commits with ‘delete’ in the message
This answer assumes the commit message contains “delete”; commit messages don’t reliably announce deletions.
git cherry-pick --find-deletion
This answer invents a flag that doesn’t exist on cherry-pick.

Blame only attributes existing lines. A deletion is invisible to blame (the line isn’t there!). Bisect operates on behavior, not lines: if the test fails at commit X and passes at X’s parent, X is the culprit, regardless of whether X added, modified, or deleted code.
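A small sketch of the deletion case, assuming a throwaway repo and a hypothetical calc.py whose guard line gets removed mid-history. The "test" here is just grep, standing in for a real test script: exit 0 (line present) means good, exit 1 (line absent) means bad.

```shell
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/main
git config user.name "Tutorial"; git config user.email "tutorial@example.com"

printf 'def divide(a, b):\n    assert b != 0  # guard\n    return a / b\n' > calc.py
git add calc.py && git commit -q -m "Add divide with guard"
good=$(git rev-parse HEAD)

for i in 1 2 3; do echo "# note $i" >> calc.py; git commit -q -am "Add note $i"; done

# The culprit: a commit that silently deletes the guard line.
grep -v 'guard' calc.py > calc.tmp && mv calc.tmp calc.py
git commit -q -am "Tidy divide"
culprit=$(git rev-parse HEAD)

for i in 4 5 6; do echo "# note $i" >> calc.py; git commit -q -am "Add note $i"; done

# Blame at HEAD has nothing to say: the deleted line is not in the file.
blame_hits=$(git blame calc.py | grep -c 'guard' || true)

git bisect start HEAD "$good" >/dev/null
git bisect run grep -q 'guard' calc.py >/dev/null
found=$(git rev-parse refs/bisect/bad)         # left pointing at the first bad commit
git bisect reset >/dev/null 2>&1
echo "blame_hits=$blame_hits"
echo "culprit=$culprit"
echo "found=$found"
```

Bisect never looks at the diff. It only asks "does the check pass here?", which is why it finds a deletion as easily as an addition.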

7. Put in order the commands for an automated bisect that finds a regression in the last 100 commits and returns HEAD to normal. (arrange in order)

Correct order:

python3 test_calculator.py # verify HEAD currently fails
git bisect start HEAD HEAD~100 # bad=HEAD, good=100 back
git bisect run python3 test_calculator.py # Git iterates ~7 times
# Git prints: "<sha> is the first bad commit"
git bisect reset # return HEAD to pre-bisect state

Distractors (not used):

git bisect stop # not a real subcommand
git bisect --force # not a real flag
git reflog # unrelated to bisect workflow
git bisect run -- python3 test.py HEAD HEAD~100 # wrong arg order

Bisect needs boundaries first (start <bad> <good>), then run. The run script’s exit code drives the binary search: 0 means good, 1 to 127 means bad (except 125, which means “skip this commit”), and any other code aborts the bisect. Expect about log₂(100) ≈ 7 iterations. reset is non-negotiable; skipping it leaves HEAD on a historical midpoint commit and your code “looks weird.”

8

Rebase: Integrate Changes Without a Merge Commit

🎯 You will learn to

Pick rebase for short local branches, merge for shared/long-lived ones — and say why.
Produce linear history with rebase + fast-forward merge (no diamond).
Resolve a rebase conflict — same marker dance as merge, but finish with rebase --continue.
Recover from a bad rebase using reflog (Step 2’s safety net applied).

Mental model: the video-editor timeline cut

Select the clips (commits) unique to your feature, cut, move playhead to main’s tip, paste. Each paste is a new commit object — same patch, new parent, new SHA. Originals stay in .git/objects (reflog recovers).

💡 Schema check (Step 3 callback). Rebase = “cherry-pick a series” under the hood. New objects, branch pointer moved. Same mechanic Step 5 used on one commit; Step 8 just iterates.

🔍 Contrast — what’s not like rebase. A fast-forward merge on a strict-extension branch creates zero new commits — main’s pointer just slides forward to the feature tip. Rebase + ff-merge together produce linear history because rebase did all the new-commit-creation up front; the merge has nothing left to do.

Task 1: Inspect the divergence

Pre-built: feature-sqrt has square_root; main later got Bump version notes + Add identity helper.

cd /tutorial/myproject
git log --all --oneline --graph --decorate

Task 2: Rebase and fast-forward

Predict before running: how many parents will the feature tip have after rebase?

git switch feature-sqrt
git rebase main
git switch main
git merge feature-sqrt        # fast-forward, no merge commit
git branch -d feature-sqrt

Result: one linear line on the graph. No diamond.
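If you want to check the prediction outside the VM, the sketch below rebuilds a similar divergence in a throwaway repo. The branch name feature and the file names are assumptions chosen for illustration; they only mimic the tutorial's pre-built project.

```shell
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/main
git config user.name "Tutorial"; git config user.email "tutorial@example.com"

echo "base" > base.txt && git add base.txt && git commit -q -m "Base"
git switch -q -c feature
echo "sqrt" > sqrt.txt && git add sqrt.txt && git commit -q -m "Add square_root"
old_tip=$(git rev-parse HEAD)
git switch -q main
echo "identity" > identity.txt && git add identity.txt && git commit -q -m "Add identity helper"

git switch -q feature
git rebase -q main
git switch -q main
git merge -q --ff-only feature      # --ff-only refuses if a merge commit would be needed
git branch -q -d feature

new_tip=$(git rev-parse HEAD)
parent_count=$(git log -1 --format=%P | wc -w | tr -d ' ')
merges=$(git rev-list --merges --count HEAD)
echo "parents of tip: $parent_count, merge commits in history: $merges"
git cat-file -e "$old_tip" && echo "pre-rebase commit object still exists"
```

Using --ff-only instead of a bare merge is a design choice for the sketch: it turns "I expect a fast-forward" into something Git enforces.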

Task 3: Rebase through a conflict (desirable difficulty)

Real rebases conflict when upstream touched the same lines. Produce one deliberately:

git switch -c feature-trailer main~1
echo '# end-of-module trailer' >> calculator.py
git commit -am 'Add trailer comment at end of file'
git rebase main       # CONFLICT — both sides appended at EOF
git status

Conflicts aren’t failures — they’re “two valid changes touched the same lines; a human must combine them.” Edit calculator.py so the bottom keeps both the identity helper and your trailer comment, removing the <<< / === / >>> markers.

git add calculator.py
git rebase --continue         # NOT `git commit` — use the rebase verb
git switch main
git branch -D feature-trailer

Remember: rebase conflict = merge conflict mechanics, but finalize with git rebase --continue. Bail with git rebase --abort.
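The whole conflict dance can be rehearsed in a scratch repo. This is a sketch under assumptions: the repo, the file contents, and the hand-written resolution are invented for illustration, and GIT_EDITOR=true is used only so --continue does not wait for an editor.

```shell
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/main
git config user.name "Tutorial"; git config user.email "tutorial@example.com"

printf 'def add(a, b):\n    return a + b\n' > calculator.py
git add calculator.py && git commit -q -m "Base"

git switch -q -c feature-trailer
echo '# end-of-module trailer' >> calculator.py
git commit -q -am "Add trailer comment at end of file"

git switch -q main
printf 'def identity(x):\n    return x\n' >> calculator.py
git commit -q -am "Add identity helper"

git switch -q feature-trailer
# Both sides appended at EOF, so the rebase stops with a non-zero exit.
if git rebase main >/dev/null 2>&1; then echo "unexpected: no conflict"; exit 1; fi
conflicted=$(git diff --name-only --diff-filter=U)
echo "conflicted: $conflicted"

# Resolve by hand: keep main's helper, then the trailer, with no markers left.
printf 'def add(a, b):\n    return a + b\ndef identity(x):\n    return x\n# end-of-module trailer\n' > calculator.py
git add calculator.py
GIT_EDITOR=true git rebase --continue >/dev/null 2>&1
git log --oneline
```

In a real session you would edit the file in your editor rather than overwrite it with printf; the overwrite just makes the sketch repeatable.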

When to rebase vs merge

Situation	Prefer
Short feature branch (hours–days), only you	Rebase
Long-lived or already-pushed branch used by teammates	Merge
Cardinal rule	Never rebase shared history

Rebase — Knowledge Check

Min. score: 80%

1. What does git rebase main do when run on feature-sqrt?

Moves main to point at the tip of feature-sqrt
This answer reverses the direction; running rebase main on feature-sqrt rebases feature-sqrt, not main.
Computes the patches of commits unique to feature-sqrt, resets feature-sqrt to the tip of main, and replays those patches as new commits
Merges main into feature-sqrt with a merge commit
This answer confuses rebase with merge; rebase is the linearizing alternative, no merge commit involved.
Deletes main and renames feature-sqrt to main
This answer assumes destructive renaming; rebase moves the current branch’s commits, never deletes other branches.

Rebase rewrites the branch: feature-sqrt’s unique commits become new commits on top of main. This linearizes history but changes SHAs — so never rebase pushed branches others are using.

2. After rebasing, why does the rebased commit have a different SHA than before?

Git chooses a new SHA for clarity
This answer treats SHAs as cosmetic choices; they’re deterministic content hashes, not chosen for clarity.
The new commit has a different parent (the tip of main, not the previous branch point), and a commit’s SHA hashes its parent(s) + tree + metadata
Rebase encrypts the commit
This answer confuses content hashing with encryption.
Rebase changes the commit message
This answer assumes rebase rewrites text; the message is preserved. What changes is the parent (plus the committer timestamp and usually the tree), and therefore the SHA.

Step 3 again: SHA(commit) = SHA-1(tree + parent(s) + author + committer + message). Change the parent → new SHA. Same patch, new identity.
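You can check the "hash of content" claim directly. The sketch below, in a throwaway repo with made-up file names, recomputes a commit's ID from its raw content and then builds two commits that differ only in their parent line. The fixed dates are an assumption added so that nothing else varies between the two.

```shell
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/main
git config user.name "Tutorial"; git config user.email "tutorial@example.com"

echo "hello" > file.txt && git add file.txt && git commit -q -m "First"
echo "world" >> file.txt && git commit -q -am "Second"

git cat-file -p HEAD      # tree, parent, author, committer, blank line, message

# Recompute the object ID from the raw commit content.
recomputed=$(git cat-file commit HEAD | git hash-object -t commit --stdin)
actual=$(git rev-parse HEAD)

# Same tree, same message, same dates: only the parent differs.
export GIT_AUTHOR_DATE='1700000000 +0000' GIT_COMMITTER_DATE='1700000000 +0000'
tree=$(git rev-parse 'HEAD^{tree}')
with_parent=$(git commit-tree "$tree" -p HEAD~1 -m "Second")
again=$(git commit-tree "$tree" -p HEAD~1 -m "Second")
no_parent=$(git commit-tree "$tree" -m "Second")
echo "recomputed matches: $([ "$recomputed" = "$actual" ] && echo yes || echo no)"
echo "same inputs, same ID: $([ "$with_parent" = "$again" ] && echo yes || echo no)"
echo "different parent, different ID: $([ "$with_parent" != "$no_parent" ] && echo yes || echo no)"
```

git commit-tree is plumbing that writes a commit object without moving any branch, so the experiment leaves main untouched.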

3. When is git rebase a bad idea?

On a personal feature branch that only you have checked out
This answer rejects rebase’s canonical safe use case (private feature branch).
On a short-lived local branch before opening a PR
This answer rejects another canonical safe use case (short-lived local branch before PR).
On a public/shared branch that teammates have already pulled
To linearize history before a fast-forward merge
This answer rejects rebase’s exact intended workflow (linearize then ff-merge).

Rebase rewrites history. If others have the old SHAs, their branches will diverge and they will get ugly conflicts. Stick to merge for anything pushed and shared, rebase for local linearization.

4. [Revisit Step 2] You rebased feature-sqrt and realized it broke everything. Before pushing. How do you recover the pre-rebase state?

It is gone — rebase is permanent
This answer ignores reflog’s safety net; rebase only moves the branch ref, and the old commits are still in .git/objects/.
git reflog to find the pre-rebase SHA, then git reset --hard <sha> to restore feature-sqrt
git stash pop to retrieve the old commits
This answer confuses stash (working-tree shelving) with reflog (HEAD-position history).
git clone a fresh copy of the repo
This answer is the “burning down the repo” antipattern when reflog can rescue locally.

Rebase is only ‘destructive’ in the sense of changing branch pointers — the original commits remain in .git/objects until garbage collection. git reflog records every HEAD position including the pre-rebase tip; git reset --hard restores it. Your safety net, earned in Step 2.
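A rehearsal of the rescue, sketched in a scratch repo with invented file names. Instead of reading the SHA out of git reflog by eye, it uses the branch's own reflog shorthand feature-sqrt@{1}; that works here because a completed rebase updates the branch ref once, but treat it as one convenient option and confirm the entry with git reflog in real use.

```shell
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/main
git config user.name "Tutorial"; git config user.email "tutorial@example.com"

echo "base" > base.txt && git add base.txt && git commit -q -m "Base"
git switch -q -c feature-sqrt
echo "sqrt" > sqrt.txt && git add sqrt.txt && git commit -q -m "Add square_root"
pre_rebase=$(git rev-parse HEAD)
git switch -q main
echo "identity" > identity.txt && git add identity.txt && git commit -q -m "Add identity helper"

git switch -q feature-sqrt
git rebase -q main
post_rebase=$(git rev-parse HEAD)

git reflog -n 6                                   # HEAD's movements, newest first

# feature-sqrt@{1}: where the branch pointed before its latest update (the rebase).
candidate=$(git rev-parse 'feature-sqrt@{1}')
git reset -q --hard "$candidate"
restored=$(git rev-parse HEAD)
echo "restored pre-rebase tip: $([ "$restored" = "$pre_rebase" ] && echo yes || echo no)"
```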

5. After rebasing a feature onto main, you run git merge feature on main. What happens?

A merge commit with two parents is created
This answer ignores that rebase made feature a strict extension of main, so merge fast-forwards instead of creating a merge commit.
A fast-forward occurs — main simply moves to point at the feature tip; no merge commit, no diamond in the graph
The rebase is reversed
This answer assumes merge undoes rebase; they compose, they don’t reverse.
Git reports an error
This answer assumes Git rejects this; it’s actually the canonical “rebase + ff merge” workflow.

After rebase, feature is a strict linear extension of main. The merge reduces to just advancing the main pointer (fast-forward). This is the whole reason many teams rebase before merging: clean, linear history.

6. Which statements about rebase are true? (select all that apply)

Rebase creates new commits with new SHAs for each replayed commit
Rebase can cause conflicts at each replayed commit, requiring you to resolve one at a time with git rebase --continue
Rebase automatically pushes the updated branch
This answer confuses local rewriting with publication; push is always a separate step from rebase.
Rebase can be aborted with git rebase --abort, restoring the branch to its pre-rebase state

Rebase applies each patch in turn and can conflict at any of them — you resolve, git add, git rebase --continue. --abort restores the pre-rebase state. Rebase does not push anything; that is a separate git push step (often needing --force-with-lease on rebased branches, which is where collaborator pain happens).

7. [Revisit Step 5] You had two choices for bringing a colleague’s single fix into main: cherry-pick or rebase. Both create new commits with new SHAs. What is the key difference in intent?

Cherry-pick copies one commit onto HEAD; rebase moves all of a branch’s unique commits onto a new base — they differ in scope, not mechanism
Cherry-pick is destructive; rebase is not
This answer assumes cherry-pick mutates the source; it’s purely additive, just like rebase’s commit creation.
Cherry-pick requires the commit to be on a branch; rebase does not
This answer invents a constraint; cherry-pick takes any commit-ish, branch or not.
They are identical aliases
This answer ignores the scope difference (one commit vs. a series).

Under the hood they use the same machinery — patch out, replay on new parent, new SHA. The difference is scope: cherry-pick = one commit, rebase = a series. Step 5’s cherry-pick and Step 8’s rebase are the same technique at different scales.
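One way to see "same patch, new identity" is git patch-id, which hashes the diff and ignores parent and metadata. The sketch below is illustrative only: the repo, branch names (fix, series), and the helper function patch_id are assumptions, not part of the tutorial project.

```shell
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/main
git config user.name "Tutorial"; git config user.email "tutorial@example.com"

# Hypothetical helper: the patch ID of one commit's diff.
patch_id() { git show "$1" | git patch-id --stable | cut -d' ' -f1; }

echo "base" > base.txt && git add base.txt && git commit -q -m "Base"
base=$(git rev-parse HEAD)

git switch -q -c fix
echo "fix" > fix.txt && git add fix.txt && git commit -q -m "Colleague's fix"
fix_sha=$(git rev-parse HEAD)

git switch -q main
echo "other" > other.txt && git add other.txt && git commit -q -m "Unrelated work on main"

# Scope 1: cherry-pick copies one commit onto HEAD.
git cherry-pick "$fix_sha" >/dev/null
picked_sha=$(git rev-parse HEAD)

# Scope 2: rebase replays a whole series onto a new base.
git switch -q -c series "$base"
echo "one" > one.txt && git add one.txt && git commit -q -m "Series 1"
echo "two" > two.txt && git add two.txt && git commit -q -m "Series 2"
old_shas=$(git rev-list --reverse main..series)
old_ids=$(for c in $old_shas; do patch_id "$c"; done)
git rebase -q main
new_shas=$(git rev-list --reverse main..series)
new_ids=$(for c in $new_shas; do patch_id "$c"; done)

echo "cherry-pick: new SHA, same patch? $([ "$(patch_id "$fix_sha")" = "$(patch_id "$picked_sha")" ] && echo yes || echo no)"
echo "rebase:      new SHAs, same patches? $([ "$old_ids" = "$new_ids" ] && echo yes || echo no)"
```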

8. [Revisit Step 2] During rebase, a conflict halts you mid-stream. git reflog at this moment shows many entries. Which entry do you want to git reset --hard to if you decide to abort manually instead of using git rebase --abort?

The most recent HEAD@{0} entry
This answer points at the current (mid-rebase) state; you want the entry from before the rebase.
The checkout: moving from ... entry that shows where the branch pointed before rebase started
The very oldest reflog entry
This answer ignores the relevant time range; the pre-rebase entry is recent, not the oldest.
Any entry — they all point to the same commit
This answer ignores that reflog entries each refer to a different historical HEAD position.

Reflog logs every HEAD movement. The pre-rebase position is the entry just before the rebase entries begin (in recent Git versions the first of those is labeled rebase (start)); it is typically a checkout or commit entry. That SHA is the pre-rebase branch tip, the safe rescue point. This is the Step 2 reflog safety net applied to rebase.

9. [Revisit basic tutorial Step 11] You edit a conflicted file during a git rebase, remove the <<<<<<< / ======= / >>>>>>> markers, and run git add. What is the next command to finalize this one commit of the rebase?

git commit — same as finishing a merge
This answer treats a rebase conflict like a merge conflict’s final step; rebase needs --continue, not commit.
git rebase --continue — during a rebase, Git is replaying commits one at a time, so continuation (not a fresh commit) is how you resume
git merge --continue
This answer uses the wrong verb; this is a rebase, not a merge.
git push
This answer skips finalizing the rebase entirely.

A rebase conflict uses the same markers and the same git add step as a merge conflict (basic tutorial Step 11). The only difference is the final verb — git rebase --continue tells Git to replay the remaining commits, which git commit would not. Running git commit by reflex here often leaves the rebase half-done. git rebase --abort at any point restores the pre-rebase state.

10. Put in order the commands to rebase a private 3-commit feature branch onto the latest main and fast-forward merge, with nothing leftover on disk. (arrange in order)

Correct order:

git switch feature # be on the branch being rebased
git rebase main # replay feature commits onto latest main
git switch main # target the integration branch
git merge feature # fast-forward, no merge commit
git branch -d feature # clean up the now-merged branch

Distractors (not used):

git push --force # DANGER on shared branches
git merge feature --no-ff # would create a merge commit — not ff
git rebase feature # backwards — rebases main onto feature
git reset --hard feature # destroys main's history

The correct direction is “rebase the feature branch onto the integration branch,” whichever of the two is longer. Running rebase feature from main (distractor 3) does the opposite: it rebases main onto feature, usually rewriting commits you didn’t want to touch. --no-ff forces a merge commit even when a fast-forward is possible, which defeats the point of this strategy (a linear, no-merge-commit result). --force has no place in a local pre-PR workflow.

9

Interactive Rebase: Edit, Squash, Reorder, Drop

🎯 You will learn to

Squash messy WIP commits into one clean commit before opening a PR.
Drop an accidentally-committed secret (and recover it from reflog if needed).
Reword a commit message retroactively without changing its diff.
Pick the right verb (pick/reword/squash/fixup/drop/edit) for the rewriting goal.

🚪 This is the second threshold step

Step 9 is the densest step in the tutorial — eight verbs, several edge cases, and the most “wait, what?” moments in real Git. That’s not a bug; it’s where most engineers’ command of Git plateaus. Crossing this threshold is what separates “I use Git” from “I shape Git history.” Plan two passes. Don’t worry if Task 4 needs a re-read.

⚠️ Safe zone only

Interactive rebase rewrites history (Step 3: new parents → new SHAs). Run it only on commits that (a) are unpushed, or (b) live on a feature branch only you use. For public history, use git revert (Step 11).

🤔 Predict first

After rebase -i collapses four messy commits into one clean commit, do the original four still exist anywhere — and could you recover one of them with git reflog?

💡 Schema check. Same pattern as Steps 5 & 8: every rewriting verb here (squash, drop, reword, edit) creates new commit objects and moves the branch pointer. The “old” commits don’t disappear — they’re just unreferenced. Reflog finds them.

The four verbs you’ll use here

Verb	Effect
`pick`	Use commit as-is (default)
`squash`	Meld into previous; combine messages
`drop`	Remove commit
`reword`	Edit message only

📋 All six core verbs (adds `fixup`, `edit`)

Verb	Effect
`pick`	Use commit as-is (default)
`reword`	Edit message only
`edit`	Pause so you can `commit --amend` or add fixes / split
`squash`	Meld into previous; combine messages
`fixup`	Like squash, but discard this commit’s message
`drop`	Remove commit

Two more verbs exist for advanced workflows: break (pause mid-rebase so you can poke around, then git rebase --continue) and exec <cmd> (run a shell command after each replayed commit, e.g. exec pytest). See git help rebase if you need them.
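As a rough sketch of exec, the script below replays two commits in a scratch repo and runs a command after each one. The command is a stand-in that logs the commit subject; in a real project it would be your test runner. The LOG file and the dummy commits are assumptions for illustration, and GIT_SEQUENCE_EDITOR=true simply accepts the generated todo list unchanged.

```shell
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/main
git config user.name "Tutorial"; git config user.email "tutorial@example.com"

LOG=$(mktemp); export LOG       # kept outside the repo so the working tree stays clean

for i in 1 2 3; do
  echo "line $i" >> notes.txt
  git add notes.txt
  git commit -q -m "commit $i"
done

# --exec adds an "exec <cmd>" line after every pick in the todo list.
GIT_SEQUENCE_EDITOR=true git rebase -i --exec 'git log -1 --format=%s >> "$LOG"' HEAD~2 >/dev/null 2>&1

checks=$(wc -l < "$LOG" | tr -d ' ')
echo "exec ran $checks times:"
cat "$LOG"
```

If the exec command exits non-zero, the rebase pauses at that commit so you can fix it and then git rebase --continue.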

🛠 Why this VM uses scripted `sed` instead of `$EDITOR`

Real workflow: git rebase -i HEAD~N opens your $EDITOR, you hand-edit action words, save-and-close. This browser VM can’t host an interactive editor, so we script it via GIT_SEQUENCE_EDITOR="sed -i …".

The skill is knowing what to change, not typing the sed. For each task: (1) predict the edit on paper, (2) run the scripted version, (3) verify the log matches your prediction.

Task 1: Inspect the messy branch

cd /tutorial/myproject
git log --oneline -5           # 4 ugly commits on refactor-power

Task 2: Squash four commits into one

Predict: which lines get squash, and why must line 1 stay pick?

GIT_SEQUENCE_EDITOR="sed -i '2,4s/^pick/squash/'" git rebase -i HEAD~4
git commit --amend -m "Refactor: cleanup notes in calculator.py"
git log --oneline -3

Task 3: Drop a secret-leaking commit

Append to calculator.py: SECRET_API_KEY=oops. Commit: git commit -am "Accidentally add secret (should be dropped)".

Then append def placeholder(): pass and commit: git commit -am "Add placeholder function".

Drop the secret:

GIT_SEQUENCE_EDITOR="sed -i '1s/^pick/drop/'" git rebase -i HEAD~2
grep SECRET_API_KEY calculator.py || echo "secret is gone from branch"

Task 3b: Prove reflog rescues the “dropped” commit

Dropped ≠ deleted (Step 3 again).

git reflog -n 10
SECRET_SHA=$(git reflog | grep -m1 'Accidentally add secret' | awk '{print $1}')
git branch secret-backup "$SECRET_SHA"
git log secret-backup --oneline

⚠️ For *real* secrets: drop+rescue is the wrong workflow

Drop + rescue leaves more copies of the secret, not fewer. For an actual leaked credential:

Rotate the credential immediately (the only step that truly mitigates).
Scrub with git filter-repo or BFG.
Ask collaborators to re-clone.

Use drop only for non-sensitive cleanup (debug prints, experiments).

Task 4: Reword a message

GIT_SEQUENCE_EDITOR="sed -i '1s/^pick/reword/'" \
  GIT_EDITOR="sed -i '1s/.*/Refactor: cleanup notes and placeholder/'" \
  git rebase -i HEAD~2
git log --oneline -3

Two env vars = two editors (todo list + message editor). In real life you’d hand-edit both.

Wrap-up: rule of thumb

Local, unpushed history → rebase -i (any verb).
Shared, pushed history → git revert only (Step 11).

Rewriting public history forces every collaborator to reconcile.

Interactive Rebase — Knowledge Check

Min. score: 80%

1. Which interactive-rebase action keeps the commit but lets you change only its message?

pick
This answer doesn’t change anything; pick is a no-op that uses the commit as-is.
reword
squash
This answer melds two commits; the question asks for a message-only edit on one commit.
drop
This answer deletes the commit instead of editing it.

reword keeps the commit content identical but opens the editor to change the message. pick is a no-op, squash melds into the previous commit, drop removes it from the branch.

2. What is the difference between squash and fixup?

squash combines messages (editor opens); fixup silently discards the squashed commit’s message
squash rewrites hash; fixup preserves it
This answer confuses ‘rewrites the SHA’ (both do, all rebase verbs do) with the actual difference (message handling).
fixup can only be used on merge commits
This answer invents a constraint; fixup works on any commit, not just merges.
They are identical
This answer ignores the actual difference (whether the editor opens for the squashed message).

Both meld into the previous commit. squash opens the editor so you can combine messages; fixup just drops the squashed commit’s message. Use fixup for trivial typos, squash when both messages are meaningful.
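A scripted sketch of fixup, in the same sed style this step uses for squash. The scratch repo, the file feature.txt, and the commit messages are invented for illustration, and sed -i is the GNU form used elsewhere in this tutorial. Because fixup discards the folded commits' messages, no message editor is involved.

```shell
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/main
git config user.name "Tutorial"; git config user.email "tutorial@example.com"

echo "base" > base.txt && git add base.txt && git commit -q -m "Base"
echo "feature" > feature.txt && git add feature.txt && git commit -q -m "Add feature"
echo "typo fix" >> feature.txt && git commit -q -am "fix typo"
echo "one more" >> feature.txt && git commit -q -am "oops, one more"

# Line 1 stays pick; lines 2-3 become fixup and are folded into it.
GIT_SEQUENCE_EDITOR="sed -i '2,3s/^pick/fixup/'" git rebase -i HEAD~3 >/dev/null 2>&1

git log --oneline
subject=$(git log -1 --format=%s)
count=$(git rev-list --count HEAD)
echo "commits: $count, tip message: $subject"
```

Swap fixup for squash in the sed expression and Git will instead open the message editor with all three messages for you to combine.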

3. You just ran git rebase -i HEAD~3 and realized you dropped a commit you needed. Can you recover it?

No — dropped commits are permanently deleted
This answer ignores reflog’s safety net; the dropped commit is unreferenced, not deleted.
Yes: git reflog still shows the pre-rebase HEAD; git reset --hard <pre-rebase-sha> restores everything
Only by running git rebase --undo
This answer invents a subcommand; git rebase has no --undo.
Only if you had pushed the commit
This answer reverses the rule; pushing makes recovery harder, not easier (because teammates also have the old SHAs to reconcile).

Dropped commits remain in .git/objects until garbage collection prunes them. git reflog is the bookmark that lets you find them. git reset --hard <reflog-sha> restores the exact pre-rebase state. This is the safety net. Always verify it works once before a high-stakes rebase.

4. [Revisit Step 3] After an interactive rebase, the rewritten commits have new SHAs even if their patches are identical. Why?

Interactive rebase randomly generates SHAs
This answer assumes randomness; SHAs are deterministic content + parent hashes.
A commit’s SHA hashes its tree + parents + metadata; rebase changes the parent, so the SHA changes
Interactive rebase encrypts commits
This answer confuses hashing with encryption.
Git renames the commits for clarity
This answer treats SHAs as cosmetic; they’re determined by content + parents, not chosen for clarity.

Same answer as for simple rebase: different parent → different SHA. The object model does not allow ‘editing’ a commit; rebase writes new commit objects and moves the branch pointer to them, leaving the originals untouched.

5. Which of these is the most dangerous use of interactive rebase?

On a local branch to squash WIP commits before opening a PR
This answer rejects interactive rebase’s canonical safe use case (local WIP cleanup before PR).
On a local branch to drop a commit that accidentally included a secret
This answer flags a safe-zone use; rewriting unpushed history is the right context for drop.
On main of a shared repository used by 10 teammates, to drop a week-old commit
On a personal experiment branch that nobody else has cloned
This answer flags another safe-zone use; nobody else holds the old SHAs on a personal experiment branch.

Rewriting shared history is the nuclear option. Everyone who fetched the old commits now has a conflicting local copy of main; their pulls fail spectacularly. For public history, use git revert (which creates a new anti-matter commit) instead. Reserve interactive rebase for local cleanup.

6. You want to split one giant commit into three smaller ones during interactive rebase. Which action lets you do that?

pick
This answer keeps the commit as-is; splitting requires pausing and recomposing.
squash
This answer merges commits; splitting goes the other direction.
edit — pauses at the commit so you can git reset HEAD~, stage smaller pieces, and commit each separately
reword
This answer changes the message; splitting changes the commit count.

edit pauses rebase at that commit with HEAD there. You then git reset HEAD~ (un-commit but keep the changes in the working tree, unstaged), split the changes into multiple git add + git commit cycles, and finally git rebase --continue. The original one commit is replaced by your new sequence.
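The split can be sketched end to end in a scratch repo. Everything here (the repo, parser.txt, docs.txt, the messages) is made up for illustration, and a two-way split is shown to keep it short; a three-way split just repeats the add + commit cycle once more.

```shell
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/main
git config user.name "Tutorial"; git config user.email "tutorial@example.com"

echo "base" > base.txt && git add base.txt && git commit -q -m "Base"
echo "parser" > parser.txt
echo "docs" > docs.txt
git add parser.txt docs.txt && git commit -q -m "Giant commit: parser and docs"

# Mark the commit as edit; the rebase pauses right after applying it.
GIT_SEQUENCE_EDITOR="sed -i '1s/^pick/edit/'" git rebase -i HEAD~1 >/dev/null 2>&1 || true

git reset -q HEAD~              # un-commit; the changes stay in the working tree, unstaged
git add parser.txt && git commit -q -m "Add parser"
git add docs.txt   && git commit -q -m "Add docs"
GIT_EDITOR=true git rebase --continue >/dev/null 2>&1

git log --oneline
count=$(git rev-list --count HEAD)
```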

7. [Revisit Step 2] In Task 3b you ‘rescued’ a dropped commit. In terms of the object database, what did git branch secret-backup <sha> actually do?

It copied the dropped commit from .git/trash back into .git/objects
This answer invents a .git/trash directory; Git has none — nothing was ever moved out of .git/objects/.
It wrote a 41-byte file at .git/refs/heads/secret-backup containing the SHA, making the commit reachable again — the commit object itself never moved
It recreated the commit from the reflog’s textual record
This answer assumes reflog reconstructs commits from text; the commits were never gone, only unreachable.
It undid the rebase
This answer assumes branch creation is a global undo; it only adds one ref (no rebase reversal).

Same mechanic as Step 2’s rescued-work branch. The dropped commit was never deleted — only unreferenced. Creating a branch (one 41-byte file) re-anchors it as reachable. Now git gc won’t prune it. This is the same reflog + branch recipe, applied to a different scenario (rebase-drop vs detached-HEAD-orphan).

8. [Revisit Step 4] You are about to interactive-rebase a branch. You have uncommitted edits you want to keep but not carry through the rebase. Safest workflow?

Commit them as WIP then rebase — they will survive
This answer pollutes history; the question explicitly says “keep but not carry through” — WIP commits get carried through.
git stash them (Step 4) → rebase → git stash pop when done → same pre-rebase work, cleanly on the rebased branch
Run rebase with --ignore-uncommitted
This answer invents a flag; rebase has no --ignore-uncommitted option.
Git rebase will refuse to start if you have uncommitted changes, so nothing to worry about
This answer is half-true (rebase does refuse) but doesn’t address how to handle the uncommitted work the question wants preserved.

Rebase refuses to start with a dirty working tree, so Git is already stopping you. Stash is the clean pattern: preserve the work-in-progress (Step 4), do the rebase, pop the stash onto the rebased branch. This composes tools across steps; recognizing when two tools work together is the mark of Git fluency.
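Here is the stash → rebase → pop pattern as a sketch in a scratch repo. The files (scratch.txt and friends) and branch layout are assumptions for illustration; the uncommitted edit deliberately touches a file the rebase does not, so the pop applies cleanly.

```shell
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/main
git config user.name "Tutorial"; git config user.email "tutorial@example.com"

echo "base" > base.txt; echo "todo" > scratch.txt
git add . && git commit -q -m "Base"
git switch -q -c feature
echo "feat" > feature.txt && git add feature.txt && git commit -q -m "Add feature"
git switch -q main
echo "main" > main.txt && git add main.txt && git commit -q -m "Main moves on"

git switch -q feature
echo "half-finished idea" >> scratch.txt      # uncommitted edit to a tracked file

# With a dirty tree, rebase stops before doing anything.
if git rebase main >/dev/null 2>&1; then refused=no; else refused=yes; fi

git stash -q                                  # shelve the edit
git rebase -q main                            # now the tree is clean
git stash pop -q                              # bring the edit back on the rebased branch

echo "rebase refused while dirty: $refused"
git status --short
```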

9. Put in order the steps to squash 4 messy commits on a local branch into one clean commit (assuming you’re using the scripted VM editor). (arrange in order)

Correct order:

git log --oneline -5 # inspect what we're rewriting
GIT_SEQUENCE_EDITOR="sed -i '2,4s/^pick/squash/'" git rebase -i HEAD~4
# Git replays commit 1 as pick, squashes commits 2-4 into it
git commit --amend -m "Refactor: cleanup notes in calculator.py"
git log --oneline -3 # verify: 4 commits became 1

Distractors (not used):

git push --force # DANGER: never on shared history
git rebase -i HEAD~4 --squash-all # not a real flag
git reset --hard HEAD~4 # destroys commits without combining
git merge --squash HEAD~4 # wrong tool for intra-branch squash

The canonical pre-PR cleanup. Distractor 1 is the cardinal rule broken. Distractor 2 is invented. Distractor 3 discards instead of squashing (no single commit preserves the combined patch). Distractor 4 (merge --squash) is for branch-to-branch collapse, not for cleaning up commits on the current branch.

10

Squash Merge: Collapse a Feature Into a Single Commit

🎯 You will learn to

Pick squash vs. rebase vs. merge based on how main’s log should read.
Anticipate the trade-off: clean main, lost intra-feature bisect precision.
Recover individual feature commits if a regression needs fine-grained blame.

git merge --squash <branch> collapses a multi-commit feature into one new commit on main. The feature branch is untouched.

🤔 Predict first

After git merge --squash feature followed by git commit, how many parents does the new commit on main have — one, two, or three? And what does that imply for git bisect later?

📋 Three merge strategies side by side (Steps 8 + 10 unified)

Method	main’s graph	Use when
`git merge feature`	Merge commit, 2 parents (diamond)	Long-lived branch; preserve merge context
`rebase + merge` (ff)	Linear, each commit preserved	Short feature; keep individual commits
`git merge --squash`	One new commit, branch untouched	Want `main` to read as one commit per feature

Task 1: Inspect the feature

cd /tutorial/myproject
git log feature-stats --oneline -5     # three focused commits

Task 2: Squash-merge

git switch main
git merge --squash feature-stats
git status       # staged changes, but NO commit yet — squash stops here
git commit -m "Add descriptive statistics module (mean, variance, stddev)"

Task 3: Confirm + clean up

git log --oneline main            # one new commit for the feature
git branch -D feature-stats       # -D because not ff-merged in Git's view

⚠️ The cost: bisect granularity

bisect on main can only narrow to the whole feature commit, not one of its three internal commits. Keeping the feature branch around preserves fine-grained recovery; once the branch is deleted, its commits are reachable only through HEAD’s reflog until that expires. This is the strongest argument against deleting merged feature branches the same day they merge.

Squash Merge — Knowledge Check

Min. score: 80%

1. What does git merge --squash feature do?

Creates a merge commit with two parents and the feature’s history intact
This answer confuses regular merge with squash-merge; squash deliberately avoids the second parent.
Stages the cumulative diff of the feature branch on main without committing, so you can create one combined commit
Deletes the feature branch
This answer assumes squash-merge cleans up the source; the feature branch is untouched.
Rebases the feature onto main
This answer confuses squash (one combined commit) with rebase (replays each commit).

Squash stages a combined patch but does not commit — you supply the message. The result is one new commit on main containing all of the feature’s changes; the feature’s individual commits never appear on main.

2. After git merge --squash feature; git commit, what is true of the feature branch?

It no longer exists — squash deleted it
This answer assumes squash deletes the source branch; squash leaves it intact.
It still exists, unchanged; its commits are not on main (only a new combined commit is)
It points at the new squash commit on main
This answer assumes branches got rewired; squash only adds a commit to the current branch.
Its commits have been rewritten
This answer assumes squash modifies the feature’s history; only the current branch gets a new commit.

Squash merge does not touch the feature branch. It is still there with its full history. To delete it after squashing, use git branch -D feature (force, because it is not ff-merged by Git’s definition).
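To see why -D is needed, this sketch squash-merges a three-commit branch in a scratch repo and then tries the safe delete first. The branch name feature and the file names are placeholders for illustration.

```shell
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git symbolic-ref HEAD refs/heads/main
git config user.name "Tutorial"; git config user.email "tutorial@example.com"

echo "base" > base.txt && git add base.txt && git commit -q -m "Base"
git switch -q -c feature
for f in mean variance stddev; do
  echo "$f" > "$f.txt" && git add "$f.txt" && git commit -q -m "Add $f"
done
feature_tip=$(git rev-parse HEAD)

git switch -q main
git merge --squash feature >/dev/null 2>&1
git commit -q -m "Add statistics module"

# -d checks reachability: feature's commits are not ancestors of main.
if git branch -d feature >/dev/null 2>&1; then safe_delete=allowed; else safe_delete=refused; fi
echo "git branch -d: $safe_delete"

git branch -q -D feature                      # force-delete the ref
git cat-file -e "$feature_tip" && echo "feature commits still in the object database"
```

The content of the feature is on main, but its commit objects are not part of main's ancestry, and ancestry is all that -d looks at.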

3. [Compare with Step 8] You have a 3-commit feature. You merge it three ways. Which output is correct?

plain merge → 1 new commit on main; rebase + merge → 3 new commits on main; --squash + commit → 3 new commits on main
This answer gets squash backwards (squash produces 1 commit, not 3) and glosses over the fact that plain merge’s one new commit is a two-parent merge commit.
plain merge → 1 merge commit (3 parents); rebase + merge → 3 new commits; --squash + commit → 1 new commit
This answer invents a 3-parent merge commit; standard merges have exactly 2 parents.
plain merge → 1 merge commit with 2 parents; rebase + merge → 3 replayed commits on main (linear, ff); --squash + commit → 1 new commit on main
All three approaches produce identical history
This answer ignores the actual differences between strategies — they produce visibly different histories.

Plain merge = 1 merge commit (2 parents). Rebase linearizes, so the merge fast-forwards main over the 3 replayed commits. Squash collapses the 3 into 1 new commit. Team preference decides which is right for the project.
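The three outcomes can be compared side by side. This sketch builds the same diverged scratch repo three times and applies one strategy to each; the helper names make_repo and summary, and all file names, are hypothetical. Counts use --first-parent so they describe what main's own line of history gained.

```shell
set -e
# Hypothetical helper: fresh repo with a 3-commit feature and a diverged main.
make_repo() {
  dir=$(mktemp -d) && cd "$dir"
  git init -q
  git symbolic-ref HEAD refs/heads/main
  git config user.name "Tutorial"; git config user.email "tutorial@example.com"
  echo "base" > base.txt && git add base.txt && git commit -q -m "Base"
  git switch -q -c feature
  for n in 1 2 3; do
    echo "$n" > "f$n.txt" && git add "f$n.txt" && git commit -q -m "Feature $n"
  done
  git switch -q main
  echo "main" > main.txt && git add main.txt && git commit -q -m "Main moves on"
  start=$(git rev-parse HEAD)
}

# Hypothetical helper: "<commits added to main's first-parent line> <how many are merges>"
summary() {
  echo "$(git rev-list --count --first-parent "$start"..main) $(git rev-list --count --first-parent --merges "$start"..main)"
}

make_repo
git merge -q --no-edit feature
plain=$(summary)

make_repo
git switch -q feature && git rebase -q main
git switch -q main && git merge -q --ff-only feature
rebased=$(summary)

make_repo
git merge --squash feature >/dev/null 2>&1
git commit -q -m "Add feature (squashed)"
squashed=$(summary)

echo "plain merge:    $plain"
echo "rebase + merge: $rebased"
echo "squash:         $squashed"
```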

4. Why might a team reject squash merge as a default policy?

It forces a merge commit which they dislike
This answer is backwards; squash avoids merge commits.
Losing individual commit granularity hurts git bisect precision within a feature and erases authorship signal on intermediate commits
Git rejects squash merges on main
This answer invents a Git restriction; squash works on any branch.
It requires a paid GitHub subscription
This answer confuses Git mechanics with hosting service pricing.

With squash, git bisect can only narrow to ‘this whole feature’, not to which intermediate commit caused the regression. Intermediate authors also disappear from main’s history. Some teams prefer rebase/merge for richer history.

5. You squash-merged feature-stats into main. The next day you discover one of the three internal commits had a bug. How do you fix only that part?

Git tracks internal commits automatically — run git revert --internal <sha>
This answer invents a flag; revert has no --internal, and Git doesn’t track squash-internal commits on main.
You cannot isolate it from main alone. The original commits still exist on the feature branch (or in reflog) — cherry-pick, revert, or rewrite from there
Use git bisect run to revert automatically
This answer combines two unrelated tools; bisect doesn’t auto-fix or auto-revert.
Delete main and re-merge with git merge --no-squash
This answer ignores that the squashed commit is already on main; you can’t unmerge a commit.

Squash hides internal granularity on main. But the original commits still exist on the feature branch (or, if it was deleted, via reflog). You can cherry-pick a fix or write a small revert patch on main. This is the classic squash trade-off: convenience on main, less surgical control later.

6. [Revisit Step 7] A regression is reported on main three months after a feature was squash-merged in. git bisect on main narrows the culprit to the squash commit. What is your next move?

Declare the feature author guilty and revert the whole squash commit
This answer is too broad; the question asks for fine-grained resolution, not whole-feature reversion.
Switch to the (still-existing) feature branch and run git bisect there — its internal commits give you the fine-grained resolution squash destroyed on main
Bisect is unreliable after squash — read every line of the squash commit’s diff
This answer ignores the available finer-grained source on the feature branch.
Re-run bisect with --squash-aware
This answer invents a bisect flag; bisect operates on commits, and the feature branch already has the granularity.

Squash flattens main’s history, not the feature branch’s. The fine-grained commits are still preserved on the feature branch (assuming you didn’t delete it) and in reflog. Bisect on the feature branch pinpoints the exact internal commit. This is the strongest argument for keeping merged feature branches for a while, not deleting them immediately.

7. [Revisit Step 3] After git merge --squash feature; git commit, the new squash commit on main is a Git commit object like any other. What are its parents?

Two parents: the tip of main and the tip of feature (like a merge commit)
This answer treats squash as a merge; squash deliberately gives only one parent (the prior HEAD), not two.
Three parents: main, feature, and their common ancestor
This answer invents a three-parent commit type; standard merges are 2-parent and squash is 1-parent.
One parent: the tip of main (before the squash). The feature branch tip is NOT a parent — that is what makes it a squash rather than a merge
Zero parents — squash commits are parentless root commits
This answer treats squash as a root commit; it has the prior HEAD as parent.

A squash commit has exactly one parent: the prior HEAD of the branch you ran merge --squash on. The feature branch tip is not referenced as a parent — which is why git log main shows a clean linear history and why git bisect on main cannot drill into the feature. Same object-model: the commit records exactly the parents it was given, nothing more.
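The one-parent claim can be checked in a throwaway repository. What follows is a minimal sketch, assuming a scratch directory from mktemp and illustrative names (notes.txt, feature) that are not part of the tutorial's myproject:

```shell
set -eu
export GIT_AUTHOR_NAME=Demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=Demo GIT_COMMITTER_EMAIL=demo@example.com

repo=$(mktemp -d)
cd "$repo"
git init -q .
git symbolic-ref HEAD refs/heads/main   # call the first branch "main" on any Git version

echo base > notes.txt
git add notes.txt
git commit -q -m "base"

git switch -q -c feature
echo one >> notes.txt
git commit -q -am "feature: one"
echo two >> notes.txt
git commit -q -am "feature: two"

git switch -q main
git merge -q --squash feature           # stages the combined diff, makes no commit
git commit -q -m "Add feature (squashed)"

# %P prints a commit's parent hashes, separated by spaces.
parent_count=$(git log -1 --format=%P HEAD | wc -w | tr -d ' ')

# Is the feature tip reachable from main? After a squash it should not be.
if git merge-base --is-ancestor feature main; then
  feature_reachable=yes
else
  feature_reachable=no
fi
echo "parents: $parent_count, feature tip reachable from main: $feature_reachable"
```

Swapping the squash for a plain git merge --no-ff feature in the same sandbox should raise the parent count to two and make the feature tip reachable.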

8. Put in order the commands to squash-merge a 3-commit feature-stats branch into main, then clean up. (arrange in order)

Correct order:

git switch main # target branch
git merge --squash feature-stats # stages combined diff; NO commit yet
git status # verify: staged changes, no commit
git commit -m "Add statistics module (mean, variance, stddev)"
git branch -D feature-stats # force-delete (not ff-merged in Git's view)

Distractors (not used):

git merge feature-stats # creates a merge commit — wrong strategy
git branch -d feature-stats # refuses: the branch's commits aren't on main
git cherry-pick feature-stats # only copies the tip commit, not combined
git push --force # unneeded and dangerous

--squash stages but does NOT commit; the extra git commit step is intentional so you write a fresh, whole-feature message. Use capital -D to delete: git branch -d only deletes a branch whose tip is reachable from the current branch (or its upstream), and after a squash it is not (only a new combined commit landed on main), so lowercase -d refuses. Distractor 3 (cherry-pick) would only copy the tip commit’s patch, not the cumulative diff of the whole branch.

11

Revert: Safely Undo a Pushed Commit

🎯 You will learn to

Reach for revert — not reset --hard — whenever a bad commit is already on a shared branch.
Read the anti-matter pattern in the graph: the original stays; a new commit negates it.
Decide between revert (public safety) and rebase-drop (private cleanup) by asking one question: has this been pushed?

Scenario

You pushed Refactor: rename divide → div to main. Ten teammates already pulled. Then CI discovers every import of divide now breaks.

🤔 Predict first

You have two options on the table:

A. git reset --hard HEAD~1 + git push --force
B. git revert HEAD + git push

Which one breaks every teammate’s clone? Why? (Step 3’s schema is the key — what changes existing SHAs?)

The answer

reset --hard + push --force would fix your clone but break every teammate’s: their local main still points at the commit you just removed from the remote, so their history no longer matches it. Not acceptable.

git revert <sha> is the additive, public-safe undo. It computes the inverse patch of the target commit and commits that as a new commit. No existing SHAs change; no force-push; no collaborator pain.
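To see the "no existing SHAs change" property directly, here is a small sandbox sketch. It assumes a scratch repository and a one-line calculator.py stand-in invented for the demo; it does not touch the tutorial's planted repo:

```shell
set -eu
export GIT_AUTHOR_NAME=Demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=Demo GIT_COMMITTER_EMAIL=demo@example.com

repo=$(mktemp -d)
cd "$repo"
git init -q .

echo 'def divide(a, b): return a / b' > calculator.py
git add calculator.py
git commit -q -m "Add divide"

echo 'def div(a, b): return a / b' > calculator.py
git commit -q -am "Refactor: rename divide -> div"

before=$(git rev-list HEAD)             # every reachable SHA before the undo

git revert --no-edit HEAD >/dev/null    # appends the inverse patch as a new commit

after=$(git rev-list HEAD)
# Drop the newest line (the revert itself); the rest should equal the old list.
after_without_newest=$(echo "$after" | tail -n +2)
count=$(git rev-list --count HEAD)
echo "commits now reachable: $count"
```

If the two lists match, every pre-existing SHA survived and history simply grew by one commit.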

Task 1: See the bad commit

Setup planted a “pushed” refactor that broke callers.

cd /tutorial/myproject
git log --oneline -5
grep -c 'def divide\|def div' calculator.py

Task 2: Revert it

git revert HEAD --no-edit
git log --oneline -5

Two commits visible: the bad one and its revert. git log is now a truthful record of what happened.

Task 3: Prove that history only grew

Predict: did revert delete anything? (Answer: no — history grew by 1.)

git rev-list --count HEAD
git cat-file -p HEAD          # examine the revert commit object
git cat-file -p HEAD^         # the original bad commit, still reachable

The single rule

If anyone else has it, revert. If only you have it, rebase is fair game.

📋 Revert vs. reset vs. rebase-drop, side by side

Goal	Pushed?	Tool
Remove a bad commit from shared history	Yes	`git revert <sha>` (additive)
Clean up a local WIP branch before PR	No	`rebase -i` with `drop`
Nuke local branch to a prior state	No	`reset --hard <sha>`

💡 Reverting a *merge* commit (`-m 1`)

Merge commits have two parents; revert needs to know which side is the “mainline” (the side you want to keep). git revert -m 1 <merge-sha> keeps the first-parent side and undoes the merged-in branch. Get the number wrong and you revert the wrong side.
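A sandbox sketch of a mainline revert, assuming a scratch repository with made-up file names (main.txt, main2.txt, feature.txt). It only illustrates the -m 1 case described above:

```shell
set -eu
export GIT_AUTHOR_NAME=Demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=Demo GIT_COMMITTER_EMAIL=demo@example.com

repo=$(mktemp -d)
cd "$repo"
git init -q .
git symbolic-ref HEAD refs/heads/main   # call the first branch "main" on any Git version

echo base > main.txt
git add main.txt
git commit -q -m "base"

git switch -q -c feature
echo feature > feature.txt
git add feature.txt
git commit -q -m "feature work"

git switch -q main
echo more > main2.txt
git add main2.txt
git commit -q -m "main work"
git merge -q --no-ff -m "Merge feature" feature   # a true merge commit with two parents

# -m 1: treat parent 1 (main before the merge) as the mainline to keep,
# and undo what the merged-in branch contributed.
git revert --no-edit -m 1 HEAD >/dev/null

revert_parents=$(git log -1 --format=%P HEAD | wc -w | tr -d ' ')
ls
```

After the revert, feature.txt should be gone while main2.txt stays, and the revert itself is an ordinary one-parent commit.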

git revert — Knowledge Check

Min. score: 80%

1. Why is git revert safe on shared branches where git reset --hard + push --force is not?

Revert doesn’t actually change anything
This answer is too strong; revert does change history (adds a new commit), it just doesn’t rewrite existing SHAs.
Revert is additive — it appends a new anti-matter commit. Existing SHAs are unchanged, so teammates’ clones stay consistent. Reset + force-push rewrites history, forcing every collaborator to reconcile.
Revert is encrypted
This answer invents encryption; revert is plain commit semantics.
GitHub blocks force-push by default
This answer relies on remote-side enforcement; the safety property is intrinsic to revert vs. reset, not a platform policy.

The rule compresses to one property: does this operation change existing SHAs? Revert — no. Reset/rebase/amend — yes. Changed SHAs break anyone who already fetched the old ones. Revert is the only undo that preserves shared-history safety.

2. What does git revert <sha> physically add to history?

Nothing — it just hides the commit
This answer treats revert as visibility-only; revert adds a real commit that changes the working tree.
A new commit on the current branch whose patch reverses <sha>’s patch (every + becomes - and vice versa). The revert’s parent is the prior HEAD — not necessarily <sha>, since you can revert any historical commit.
A deletion marker
This answer invents a “tombstone” object type; Git has none — revert is a regular commit.
A tag named revert/<sha>
This answer confuses commits with tags; revert produces a commit on the branch, not a tag ref.

Revert computes the inverse diff of <sha> and lands it as a regular commit on the current branch. The new commit’s parent is whatever HEAD was when you ran the command — when you revert HEAD, that happens to be <sha>, but when you revert an older commit it isn’t. git log shows both the bad commit and its undo, which is the honest story of what happened.

3. You accidentally pushed a bad commit to main. Three teammates have pulled. Best move?

git reset --hard HEAD~1 + git push --force (or --force-with-lease)
This answer rewrites shared history; teammates who already have the old SHAs end up out of sync with the remote and have to reconcile by hand.
git revert HEAD + git push
git rebase -i HEAD~1 with drop, then force-push
This answer also rewrites shared history (rebase-drop is a rewrite); same breakage as reset + force.
Delete the whole repository and re-clone
This answer is the “burning down the repo” antipattern — needlessly destructive.

Shared history was already distributed. Revert appends an undo; teammates’ next pull fast-forwards cleanly. Force-pushing after reset or rebase leaves teammates’ branches out of step with the remote: their next pull may need manual reconciliation, and a later push can quietly bring the bad commit back. That is exactly what we avoid.

4. [Revisit Step 9] Rebase-drop and revert both “undo” a commit. Which is correct about their effect on SHAs?

Both create new SHAs for all later commits
This answer ignores that revert leaves all existing commits’ SHAs unchanged; only one new commit is added.
rebase-drop rewrites every subsequent commit’s SHA. revert appends one new commit and changes nothing existing.
Both are non-destructive
This answer ignores rebase-drop’s rewriting of every subsequent commit’s SHA.
Revert rewrites SHAs; rebase-drop doesn’t
This answer reverses the additive/destructive distinction — revert is the additive one.

The destructive/additive distinction is the heart of this step. Rebase-drop replays every commit after the dropped one on a new parent — new SHAs cascading. Revert just appends one new commit. Same apparent outcome (the bad change is gone); completely different impact on collaborators.
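The contrast can be measured in a scratch repository. In this sketch the interactive drop is replaced by the non-interactive git rebase --onto form, which removes the same commit without opening an editor; branch and file names are illustrative assumptions:

```shell
set -eu
export GIT_AUTHOR_NAME=Demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=Demo GIT_COMMITTER_EMAIL=demo@example.com

repo=$(mktemp -d)
cd "$repo"
git init -q .
git symbolic-ref HEAD refs/heads/main   # call the first branch "main" on any Git version

for n in 1 2 3; do
  echo "$n" > "f$n.txt"
  git add "f$n.txt"
  git commit -q -m "commit $n"
done

bad=$(git rev-parse HEAD~1)             # pretend commit 2 is the bad one
tip_before=$(git rev-parse HEAD)        # SHA of commit 3, which comes after it

# Path A: revert. Commit 3 keeps its SHA and sits directly below the new commit.
git switch -q -c via-revert main
git revert --no-edit "$bad" >/dev/null
old_tip_after_revert=$(git rev-parse HEAD~1)

# Path B: drop commit 2 by replaying everything after it onto its parent.
git switch -q -c via-drop main
git rebase -q --onto "$bad^" "$bad"
tip_after_drop=$(git rev-parse HEAD)

echo "commit 3 before:       $tip_before"
echo "commit 3 after revert: $old_tip_after_revert"
echo "commit 3 after drop:   $tip_after_drop"
```

Both branches end without f2.txt, but only the drop path gives commit 3 a new identity.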

5. You want to revert a merge commit (one with two parents). What additional flag do you need?

--merge
This answer invents a flag; revert has no --merge option.
-m 1 or -m 2 to tell Git which parent is the mainline
--force
This answer invents a flag; revert has no --force option.
None — it just works
This answer ignores that merge commits have two parents and revert needs to know which side is mainline.

Merge commits have two parents; revert needs to know which side is “mainline” (the version you want to keep). -m 1 means “first parent is mainline; undo the second-parent branch.” Getting this wrong reverts the wrong side.

6. Put in order the safe public-undo workflow after discovering a bad commit on shared main. Distractors rewrite history. (arrange in order)

Correct order:

git log --oneline -5 # find the bad SHA
git revert --no-edit <sha> # create the anti-matter commit
git log --oneline -3 # verify: both bad + revert are there
git push # succeeds — no history rewrite

Distractors (not used):

git reset --hard <sha>^ # rewrites main, breaks teammates
git push --force-with-lease # overwrites remote
git rebase -i <sha>^ # then mark <sha> as drop: rewrites downstream commits
rm -rf .git && git clone # "burning down the repo" antipattern

Revert-and-push is the only sequence that leaves every existing SHA untouched. Each distractor rewrites history in some way — which is exactly the failure mode revert exists to avoid. Run the safe one often enough that it becomes reflex.

12

Git Submodules: Add & Clone

🎯 You will learn to

Add a submodule to an existing repo with one command.
Clone a submodule-using repo correctly (--recursive) — or recover after forgetting.
Recognize the gitlink (mode 160000) + .gitmodules as the two structural differences from a regular file.
Pick submodules vs. package manager vs. monorepo based on the actual problem.

🤔 Predict first

When you git submodule add a 200-MB repo, how much storage does the outer repo’s tracked tree gain — a few hundred megabytes, or a few hundred bytes?

📖 Three core terms (open before reading further)

Term	What it is
Submodule	A nested Git repo inside an outer Git repo
`.gitmodules`	Plain-text config file in the outer repo listing each submodule’s path + URL
Gitlink	A tree entry with mode `160000` whose “content” is a 40-char commit SHA (instead of file bytes)

Two more terms (Pinned SHA, --recursive) are introduced inline as they come up; the full glossary is at the bottom of this step.

Mental model: library subscription

A submodule is a subscription to a specific edition of a library:

No photocopy — no file duplication.
You record the book title + edition number (.gitmodules URL + pinned SHA).
Anyone with your note fetches the same edition.
Upgrade by changing the edition number.

Edition number = commit SHA. Book = the submodule’s Git repo hosted elsewhere.

On-disk layout

@startuml
main-repo/
  .git/
    modules/
      vendor/
        math-utils/  ← submodule's actual git data (objects, refs, HEAD…)
  .gitmodules      ← where Git should fetch each submodule
  src/
  vendor/
    math-utils/    ← nested Git repo (the working tree)
      .git         ← gitfile: "gitdir: ../../.git/modules/vendor/math-utils"
      utils.py
@enduml

Task 1: Inspect the “upstream” library

Pre-built: /tutorial/math-utils-src/ (working repo, double+triple) and /tutorial/math-utils.git (bare clone acting as the remote URL).

cat /tutorial/math-utils-src/utils.py

Task 2: Add the submodule

cd /tutorial/myproject
git switch main
git submodule add /tutorial/math-utils.git vendor/math-utils
git status                            # TWO new entries

Open .gitmodules in the editor. Predict before scrolling the answers:

How many lines per submodule?
Is the pinned SHA stored here?
What breaks if the file is deleted?

Answers

3 lines (header + path + url). Tiny by design.
URL yes, SHA no. The SHA is the gitlink in the tree (see below). Two independent facts: where to fetch vs. which commit to check out.
Teammates can’t clone the submodule. .gitmodules is the subscription directory; without it, clone --recursive has no URL.

Inspect the gitlink:

git ls-files -s vendor/math-utils    # mode 160000 = submodule
git commit -m "Add math-utils submodule at v0.1.0"

Task 3: Clone with `--recursive`

cd /tutorial
git clone --recursive myproject colleague-clone
ls colleague-clone/vendor/math-utils

Without --recursive, the folder exists empty until the teammate runs git submodule update --init --recursive.
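The add, inspect and clone steps can be rehearsed end to end in a scratch directory. This is a sketch under assumptions: the repository names lib, outer and plain are invented, and the small g helper is hypothetical. It passes protocol.file.allow=always because recent Git releases refuse local-path submodule clones by default; the tutorial environment may already be configured for that.

```shell
set -eu
export GIT_AUTHOR_NAME=Demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=Demo GIT_COMMITTER_EMAIL=demo@example.com

work=$(mktemp -d)
cd "$work"
# Hypothetical helper: allow the local file transport for submodule operations.
g() { git -c protocol.file.allow=always "$@"; }

# A tiny "upstream" library.
git init -q lib
echo 'def double(x): return 2 * x' > lib/utils.py
git -C lib add utils.py
git -C lib commit -q -m "v0.1.0"

# The outer project, with one ordinary commit before the submodule arrives.
git init -q outer
echo "# outer" > outer/README.md
git -C outer add README.md
git -C outer commit -q -m "init"
(cd outer && g submodule --quiet add "$work/lib" vendor/math-utils)
git -C outer commit -q -m "Add math-utils submodule"

# `ls-files -s` prints "<mode> <sha> <stage><TAB><path>"; mode 160000 marks a gitlink.
entry=$(git -C outer ls-files -s vendor/math-utils)
mode=$(echo "$entry" | cut -d' ' -f1)
pinned=$(echo "$entry" | cut -d' ' -f2)
lib_head=$(git -C lib rev-parse HEAD)

# Plain clone: the submodule folder exists but holds no files yet.
git clone -q outer plain
plain_files=$(ls -A plain/vendor/math-utils | wc -l | tr -d ' ')

# Recover after forgetting --recursive.
(cd plain && g submodule --quiet update --init --recursive)
echo "mode=$mode pinned=$pinned files before update=$plain_files"
```

The pinned SHA in the outer index should equal the library's HEAD, which is the whole of what the outer tree records about the submodule's content.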

💡 When submodules are the *right* tool

Yes: versioned code you own shared across several repos.

No: third-party deps (use a package manager — npm, pip, cargo), or single config files (use config management).

📋 Submodule glossary (full)

Term	What it is
Submodule	A nested Git repo inside an outer Git repo
`.gitmodules`	Plain-text config file in the outer repo listing each submodule’s path + URL
Gitlink	A tree entry with mode `160000` whose “content” is a 40-char commit SHA (instead of file bytes)
Pinned SHA	The exact commit of the submodule the outer repo wants checked out at the gitlink path
`--recursive`	Clone flag that fetches submodules at clone-time (otherwise the folder is empty)

Git Submodules — Knowledge Check

Min. score: 80%

1. What does a Git submodule actually store in the outer repository?

A full copy of the submodule’s files tracked like normal files
This answer confuses submodule working tree content with outer-repo gitlink state; the outer repo only stores a SHA + URL, never the files.
A gitlink — a 40-character commit SHA pinning which commit of the submodule’s repo should be checked out at that path, plus a .gitmodules entry with the URL
Nothing — submodules are purely a runtime concept
This answer assumes submodules are runtime-only; the outer repo records the gitlink and .gitmodules persistently.
A symbolic link to the submodule’s directory
This answer confuses a gitlink (mode 160000) with a filesystem symlink — the gitlink is a tree entry, not a symlink.

The outer repo stores ONE SHA per submodule (the pinned commit) plus a .gitmodules entry for the URL. The submodule’s working files are checked out in the submodule path; its git data (objects, refs, HEAD) lives in the outer repo’s .git/modules/<name>/ — the submodule directory itself contains only a .git text file (a “gitfile”) pointing there, NOT a full .git/ directory.

2. A teammate clones your repo normally with git clone <url>. What do they see at the submodule path?

The submodule’s full contents, checked out automatically
This answer assumes plain git clone fetches submodules; it doesn’t, by default — only --recursive (or a follow-up submodule update) populates them.
An empty folder — they need git submodule update --init --recursive (or originally clone with --recursive) to populate it
An error preventing the clone from finishing
This answer assumes the clone fails; plain clone succeeds, the submodule folder is just empty.
A warning but the content downloads lazily on first access
This answer invents lazy fetching; submodule content fetches require explicit submodule update.

Plain git clone records the submodule entries but does not fetch their content. The folder exists but is empty. git clone --recursive <url> or git submodule update --init --recursive after the fact populates it.

3. Which statements about submodules are true? (select all that apply)

A submodule is a full Git repository nested inside another
git clone --recursive <url> clones both outer and submodules
The outer repo stores full copies of the submodule’s files
This answer confuses submodule working tree content with outer-repo gitlink state; the outer repo only stores the SHA pin, not file copies.
The outer repo tracks a submodule with a special gitlink entry (mode 160000) containing the pinned commit SHA

The outer repo stores only a pinned SHA (gitlink, mode 160000) and a .gitmodules entry — not file copies. The submodule is a genuine nested repo.

4. [Synthesis — revisits Steps 1, 3] Why is it internally consistent that a submodule is ‘just a pinned commit SHA’?

Because every Git object is ultimately addressed by a SHA (blobs, trees, commits), the outer repo only needs to point at the submodule’s commit SHA to uniquely identify its entire snapshot — no file duplication needed
Because submodules cannot have their own branches
This answer is irrelevant — submodules can have branches; the question is about content addressing.
Because submodules are always read-only
This answer invents a constraint; submodules can be edited and committed to like any nested repo.
Because Git stores submodules separately in a proprietary database
This answer invents a separate database; submodules use the same Git object format as everything else.

Back to the object model (Step 3). A commit SHA uniquely identifies a whole-project snapshot (commit → tree → blobs). Pinning a commit SHA is enough to reconstruct the submodule’s entire content. No file duplication is necessary — exactly the same property that makes branches cheap (Step 1).

5. [Revisit Step 8] A submodule’s pinned SHA is 40 characters, just like a regular commit SHA. In terms of Git objects, what kind of object does it point to?

A blob object
This answer confuses commit objects with blob objects; gitlinks pin commits, which then resolve to trees and blobs.
A tree object
This answer confuses commit objects with tree objects; only a commit captures parents, author, and message.
A commit object — in the submodule’s separate repository’s object database
A tag object
This answer confuses commit objects with tag objects; tags reference commits, they aren’t stored in gitlinks.

A gitlink pins a commit SHA, which (via the commit’s tree and blobs) uniquely determines the submodule’s entire file state. The commit lives in the submodule’s own object database (stored under the outer repo’s .git/modules/<name>/), not among the outer repo’s own objects. This is exactly the same commit-SHA-as-snapshot-identity property rebase relies on (Step 8) and that makes the whole object model coherent.

6. Put in order the commands a teammate runs to add a submodule, commit it, and set up a colleague’s workstation so the submodule files appear. Distractors are verb-variants that look right but fail. (arrange in order)

Correct order:

cd /tutorial/myproject
git submodule add /tutorial/math-utils.git vendor/math-utils
git commit -m "Add math-utils submodule at v0.1.0"
cd /tutorial && git clone --recursive myproject colleague-clone

Distractors (not used):

git submodule init /tutorial/math-utils.git vendor/math-utils # init alone does not fetch
git clone myproject colleague-clone # submodule folder empty
git submodule fetch /tutorial/math-utils.git # not a valid subcommand
git merge /tutorial/math-utils.git # confuses a nested repo with a branch

git submodule add combines clone + config in one step; init without update is half the story. Plain git clone creates an empty submodule folder. git submodule fetch is invented. git merge expects a commit or branch, not a repository path, so it fails. Verb selection is what separates a working submodule workflow from a broken one.

13

Updating Submodules: Upstream Bumps & Resync

🎯 You will learn to

Upgrade a submodule to new upstream work via the two-step dance (fetch/checkout inside, add/commit outside).
Diagnose and fix the “teammate forgot submodule update” trap — muscle memory for post-pull.
Force-resync any drifted submodule back to the pinned SHA with one deterministic command.

🤔 Predict first

Upstream publishes new commits. After you git pull the outer repo, will your local submodule’s working directory show the new content automatically — or do you have to do something extra?

Task 1: Upstream publishes v0.2

/tutorial/publish-math-utils-v0.2.sh
git --git-dir=/tutorial/math-utils.git log --oneline --all
cd /tutorial/myproject
git status            # nothing changed here — push doesn't propagate

Task 2: Fetch + checkout inside the submodule

A submodule is a nested repo. Use normal git inside it:

cd /tutorial/myproject/vendor/math-utils
git fetch
git checkout origin/HEAD
cd /tutorial/myproject
git status            # vendor/math-utils (new commits)
git diff vendor/math-utils

The outer diff is a single changed line, shown as a pair: -Subproject commit <old> / +Subproject commit <new>. Line-level diffs live in the submodule’s own object database.

Task 3: Bump the pinned SHA in the outer repo

git add vendor/math-utils
git commit -m "Bump math-utils to v0.2.0 (adds quadruple)"

Task 4: The teammate trap

cd /tutorial/colleague-clone
git pull
cat vendor/math-utils/utils.py     # still v0.1 on disk!

pull updated the pinned SHA in the tree, but did not touch their submodule working directory. Code that imports quadruple now fails. Fix:

git submodule update --init --recursive
cat vendor/math-utils/utils.py     # now has quadruple

💡 Make this a habit (one-time config)

After every pull that might touch submodule paths, run git submodule update --init --recursive. Or, one-time setup:

git config --global submodule.recurse true

Now pull and checkout do the right thing automatically.

Task 5: Force-resync a drifted submodule

Simulate drift:

cd /tutorial/colleague-clone/vendor/math-utils
git checkout HEAD~1
cd /tutorial/colleague-clone
git status            # modified: vendor/math-utils (new commits)
git submodule update --init --recursive
git status            # clean — pinned SHA restored

Same command works for never-initialized, partially-fetched, or drifted submodules.
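The whole bump-then-resync cycle can be replayed outside the tutorial container. This is a sketch under assumptions: lib, outer and colleague are invented scratch repositories, the g helper is hypothetical (it allows the local file transport, which recent Git blocks for submodules by default), and submodule.recurse is assumed to be unset so the trap stays visible.

```shell
set -eu
export GIT_AUTHOR_NAME=Demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=Demo GIT_COMMITTER_EMAIL=demo@example.com

work=$(mktemp -d)
cd "$work"
g() { git -c protocol.file.allow=always "$@"; }   # hypothetical helper

git init -q lib
echo 'def double(x): return 2 * x' > lib/utils.py
git -C lib add utils.py
git -C lib commit -q -m "v0.1"

git init -q outer
echo "# outer" > outer/README.md
git -C outer add README.md
git -C outer commit -q -m "init"
(cd outer && g submodule --quiet add "$work/lib" vendor/math-utils)
git -C outer commit -q -m "Add math-utils at v0.1"

g clone -q --recurse-submodules outer colleague

# Upstream publishes v0.2.
echo 'def quadruple(x): return 4 * x' >> lib/utils.py
git -C lib commit -q -am "v0.2"

# Bump: fetch + checkout inside the submodule, add + commit outside.
sub=outer/vendor/math-utils
g -C "$sub" fetch -q
git -C "$sub" checkout -q origin/HEAD
git -C outer add vendor/math-utils
git -C outer commit -q -m "Bump math-utils to v0.2"

# Teammate trap: pull moves the pin, not the submodule's files.
g -C colleague pull -q
if grep -q quadruple colleague/vendor/math-utils/utils.py; then
  stale=no
else
  stale=yes
fi

# The fix: snap the submodule working tree to the pinned SHA.
(cd colleague && g submodule --quiet update --init --recursive)
colleague_sub_head=$(git -C colleague/vendor/math-utils rev-parse HEAD)
echo "stale after pull: $stale"
```

Checking out HEAD~1 inside colleague/vendor/math-utils and rerunning the same update command should restore the pin in the same way.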

Updating Submodules — Knowledge Check

Min. score: 80%

1. You bumped a submodule to v0.2 and pushed. A teammate pulls your change and reports tests failing because quadruple does not exist. Most likely cause?

The submodule’s remote is down
This answer blames infrastructure; the submodule’s pinned SHA is fine, the working tree just needs submodule update.
They forgot git submodule update --init --recursive after pulling — their submodule working directory still has v0.1 while main’s code expects v0.2
Their Git is too old
This answer blames version; standard pull semantics have always required explicit submodule sync.
The .gitmodules file is corrupt
This answer assumes file corruption; the symptom is normal post-pull behavior, not corruption.

Classic trap. git pull on the outer repo updates the pinned SHA in the tree but does NOT touch the submodule working directory. They need git submodule update --init --recursive to actually reflect the new SHA on disk. Configure git config submodule.recurse true to make pull do this automatically.

2. Upgrading a submodule requires how many git commit calls in total (inside + outside)?

Zero — git submodule update commits for you
This answer assumes submodule update commits for you; it only resets working state, never commits.
One — only in the outer repo
This answer omits the inner-commit case (when you’re authoring upstream changes yourself before bumping the pin).
Two — if the submodule itself needed new commits (e.g., you wrote a patch inside it), commit there first; then the outer repo commit bumps the SHA. If the new upstream commit already exists, only the outer commit is needed.
Three — submodule, .gitmodules, and outer tree
This answer adds a .gitmodules commit; that file changes only when adding/removing a submodule, not when updating one.

The answer depends on whether you are authoring the upgrade (write code inside submodule → commit inside → push → commit outside) or just pulling in upstream work (checkout new commit inside → commit outside). In either case the outer commit is mandatory — that is the SHA bump.

3. git status in the outer repo shows modified: vendor/math-utils (new commits). What does it mean?

The .gitmodules file has been edited
This answer confuses submodule SHA changes with .gitmodules config edits.
The submodule’s checked-out HEAD differs from the pinned SHA — either you updated the submodule, or it drifted
The submodule is broken
This answer assumes corruption; new commits is normal divergence between pinned and actual HEAD.
Git is about to garbage-collect the submodule
This answer invents a GC warning; git status doesn’t report on gc operations.

The outer repo compares the pinned SHA with the submodule’s actual HEAD. Mismatch → new commits. Fix: git add <path> + commit to pin the new SHA, or git submodule update to snap the submodule back to the pinned SHA.

4. [Revisit Step 1] Why doesn’t git pull automatically update submodule working directories — what Git principle is respected by this design?

Pull is inherently broken for submodules
This answer assumes a tool defect; the design is intentional separation between outer and inner repos.
Submodules are independent Git repos; respecting the outer/inner separation means outer operations don’t silently modify the inner repo’s HEAD. Same separation-of-concerns as detached HEAD (Step 1) — Git is conservative about moving HEAD without explicit instruction
Pull is always read-only
This answer is wrong on its face — pull writes to local refs and the working tree.
It would violate the SHA-1 collision resistance guarantee
This answer invents a cryptographic justification; the design is about repo-boundary respect, not hash collisions.

Git keeps the outer/inner repo boundary strict: an outer pull updates the pinned SHA (a fact about the outer tree) but does not reach into the inner repo and rewrite its HEAD. You must explicitly say git submodule update. It is the same conservative philosophy that makes Git refuse a checkout that would overwrite uncommitted changes.

5. [Revisit Step 3] The outer repo’s diff for a submodule change is always just one line: -Subproject commit <old> / +Subproject commit <new>. Why is that enough?

Git is being lazy
This answer treats minimal diff as laziness; it’s the maximally accurate representation given that a SHA is equivalent to a full snapshot.
Because a commit SHA uniquely identifies a snapshot (commit → tree → blobs), the SHA bump is exactly equivalent to a full content change — no additional diff data is needed in the outer tree
Because Git compresses submodule diffs
This answer invents compression; the diff is genuinely a one-line SHA bump, not compressed-then-expanded.
Because submodules don’t actually change content
This answer ignores that the SHA pin’s change is equivalent to a content change.

Step 3’s object-model insight applied again. A commit SHA resolves to a deterministic snapshot. Pinning a new SHA is, by construction, equivalent to changing the entire content — no further diff data is needed in the outer commit. Minimum information, maximum fidelity.

6. [Evaluate] A teammate says: ‘After every git pull I always run git submodule update --init --recursive, even on repos without submodules. Paranoia, or sensible?’

Waste of time — it does nothing on repos without submodules
This answer assumes the command fails or churns on non-submodule repos; it’s a fast no-op.
Sensible. The command is a no-op on non-submodule repos (instant exit) and catches all submodule-state drift. Adding it to muscle memory costs nothing and prevents the most common submodule bug
Harmful — it can corrupt non-submodule repos
This answer assumes the command is dangerous; it’s strictly safe on any repo.
Only useful on teams larger than 10 people
This answer scopes the command to team size; it works regardless of how many collaborators are involved.

The command is safe on any repo. Running it unconditionally is a cheap habit that prevents the most common submodule bug (stale working dir). Equivalent hardening: git config --global submodule.recurse true to make pull/checkout do it automatically.

7. Upstream publishes a v0.2 commit. Put in order the commands that land it as a pinned version bump in your outer repo. Distractors are verb-variants that look right but fail or do the wrong thing. (arrange in order)

Correct order:

cd /tutorial/myproject/vendor/math-utils
git fetch # pulls new commits into submodule
git checkout origin/HEAD # move submodule HEAD to new SHA
cd /tutorial/myproject # back to outer repo
git add vendor/math-utils # stage the gitlink SHA change
git commit -m "Bump math-utils to v0.2.0"

Distractors (not used):

git pull # ambiguous in detached-HEAD submodule
git submodule update # resets TO pinned SHA — opposite of what we want
git commit -am "Bump" # would sweep in unrelated WD changes
git merge origin/HEAD # merges inside the submodule; can leave a merge commit there

git submodule update (distractor 2) is exactly the wrong verb here: it resets the submodule back to whatever the outer tree pins, undoing the new checkout. That’s the single most common submodule confusion, and getting the direction right is the heart of this step. git pull in detached HEAD has no current branch to merge into, so it stops with an error. -am would include unrelated changes. merge can leave a merge commit inside the submodule, a structure we don’t want there.

14

Submodule Internals: What 'Content Changed' Means

🎯 You will learn to

Read modified content vs. new commits straight from git status and pick the right fix.
Execute the six-step publish ceremony without falling into the detached-HEAD trap.
Resync any weird submodule state deterministically with one command.
Reason from first principles — outer repo tracks one SHA; inner repo is a full Git repo; they’re independent.

🤔 Predict first

You edit vendor/math-utils/utils.py directly without cd-ing into the submodule. What does the outer repo’s git status say about vendor/math-utils — modified content, new commits, both, or nothing?

The mental model

The outer repo stores exactly one thing per submodule (besides .gitmodules): the pinned commit SHA. On every git status, Git compares:

SHA the outer tree pins   vs    SHA at the submodule's current HEAD
    (gitlink, mode 160000)         (what's actually checked out)

Condition	Message
SHAs match	clean
Submodule committed new SHA	`new commits`
Submodule working tree dirty	`modified content`
Both	both messages

Nothing else can cause a “modified” submodule, apart from untracked files inside it, which Git reports separately as untracked content.
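The table's states can be walked through in a sandbox. This is a sketch under assumptions: the scratch repositories lib and outer and the g helper are invented for the demo, and LC_ALL=C is set so the status wording is the untranslated English text.

```shell
set -eu
export LC_ALL=C
export GIT_AUTHOR_NAME=Demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=Demo GIT_COMMITTER_EMAIL=demo@example.com

work=$(mktemp -d)
cd "$work"
g() { git -c protocol.file.allow=always "$@"; }   # hypothetical helper

git init -q lib
echo 'def double(x): return 2 * x' > lib/utils.py
git -C lib add utils.py
git -C lib commit -q -m "v0.1"

git init -q outer
echo "# outer" > outer/README.md
git -C outer add README.md
git -C outer commit -q -m "init"
(cd outer && g submodule --quiet add "$work/lib" vendor/math-utils)
git -C outer commit -q -m "Add math-utils"

sub=outer/vendor/math-utils

# State 1: SHAs match and the working tree is clean.
clean=$(git -C outer status --porcelain)

# State 2: edit a tracked file inside the submodule without committing.
echo '# scratch edit' >> "$sub/utils.py"
dirty=$(git -C outer status | grep 'vendor/math-utils')

# State 3: commit inside, so the submodule HEAD moves away from the pin.
git -C "$sub" commit -q -am "local change"
ahead=$(git -C outer status | grep 'vendor/math-utils')
prefix=$(git -C outer submodule status | cut -c1)

echo "dirty: $dirty"
echo "ahead: $ahead"
echo "submodule status prefix: '$prefix'"
```

The first status line should mention modified content, the second new commits, and the prefix printed by git submodule status should be + once HEAD and pin disagree.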

Task 1: Clean starting state

cd /tutorial/myproject
git submodule status

Prefix: ` ` clean, + HEAD ≠ pinned, - not initialized.

Task 2: Dirty the submodule working tree

Open vendor/math-utils/utils.py. Append:

def halve(x):
    return x / 2

Save. Back in outer:

cd /tutorial/myproject
git status                      # modified content
git diff vendor/math-utils      # no real line diff — just a summary
cd vendor/math-utils && git diff   # the real diff lives here

Task 3: Commit inside the submodule — then try to push

# inside vendor/math-utils
git add utils.py
git commit -m "Add halve helper"
git push                        # FAILS — predict the error

Likely: fatal: You are not currently on a branch (detached HEAD from submodule update) or no upstream branch. This is the top submodule footgun — Step 1’s detached-HEAD concept, encountered here.

Fix:

git switch -c update-halve 2>/dev/null || git switch update-halve
git log --oneline -2
# git push -u origin update-halve   # real push would succeed now

Back in outer:

cd /tutorial/myproject
git status                      # now: new commits (not modified content)

Task 4: Bump the pinned SHA

git add vendor/math-utils
git commit -m "Bump math-utils: add halve helper"
git log -1 -p vendor/math-utils   # shows ONE line: -Subproject commit ... / +Subproject commit ...

💡 The six commands are six invariants — derive them yourself

The ceremony looks arbitrary; each step preserves one invariant:

#	Command	Invariant preserved
1	`cd sub; git switch -c <branch>`	HEAD is branch-attached (not detached)
2	`git commit` inside sub	Your change is a commit object
3	`git push` inside sub	New SHA exists on the sub’s remote
4	`cd ../..; git add <path>`	Outer tree stages the new pinned SHA
5	`git commit` outer	Outer records a commit pinning the new SHA
6	`git push` outer	New pin is visible to teammates

Know the invariants and the commands derive themselves — no memorization needed.
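The whole ceremony can be rehearsed end to end in a sandbox. This is a sketch under assumptions: both remotes are local bare repos invented for the demo (math-utils.git and outer.git), the identity is fake, and the teammate is simulated by a fresh recursive clone.

```shell
# Throwaway sandbox; every path and name below is illustrative.
sandbox=$(mktemp -d)
export GIT_AUTHOR_NAME=demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=demo GIT_COMMITTER_EMAIL=demo@example.com

# Bare "remote" for the library.
git init -q "$sandbox/seed"
printf 'def double(x):\n    return x * 2\n' > "$sandbox/seed/utils.py"
git -C "$sandbox/seed" add utils.py
git -C "$sandbox/seed" commit -q -m "Initial utils"
git clone -q --bare "$sandbox/seed" "$sandbox/math-utils.git"

# Outer repo plus its own bare "remote".
git init -q "$sandbox/outer"
cd "$sandbox/outer"
# protocol.file.allow is needed only because the remotes are local paths.
git -c protocol.file.allow=always submodule --quiet add "$sandbox/math-utils.git" vendor/math-utils
git commit -q -m "Add math-utils submodule"
git clone -q --bare . "$sandbox/outer.git"
git remote add origin "$sandbox/outer.git"

# The ceremony, numbered as in the table above.
cd vendor/math-utils
git switch -q -c update-halve                          # 1: HEAD attached to a branch
printf '\ndef halve(x):\n    return x / 2\n' >> utils.py
git commit -q -am "Add halve helper"                   # 2: the change is a commit
git push -q -u origin update-halve                     # 3: SHA exists on the sub's remote
cd ../..
git add vendor/math-utils                              # 4: outer stages the new pin
git commit -q -m "Bump math-utils: add halve helper"   # 5: outer records the pin
git push -q origin HEAD                                # 6: pin visible to teammates

# A teammate's fresh clone should receive the helper.
git -c protocol.file.allow=always clone -q --recurse-submodules "$sandbox/outer.git" "$sandbox/teammate"
grep -c 'def halve' "$sandbox/teammate/vendor/math-utils/utils.py"
```

Try deleting step 3 and rerunning: the teammate's clone then fails to check out the pinned SHA, because the commit never left your machine.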

Task 5: Force-resync (the universal fix)

git submodule update --init --recursive
# add --force if local submodule changes should be discarded

🧭 Fixes 95% of “my submodule is weird” moments

git submodule update --init --recursive

Safe on any repo. Set git config --global submodule.recurse true to make pull/checkout do it automatically.
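A sketch of what the resync does to a submodule that has drifted from its pin. Sandbox paths, demo identity, and the "Local experiment" commit are assumptions for illustration.

```shell
# Throwaway sandbox; every path and name below is illustrative.
sandbox=$(mktemp -d)
export GIT_AUTHOR_NAME=demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=demo GIT_COMMITTER_EMAIL=demo@example.com

git init -q "$sandbox/math-utils"
printf 'def double(x):\n    return x * 2\n' > "$sandbox/math-utils/utils.py"
git -C "$sandbox/math-utils" add utils.py
git -C "$sandbox/math-utils" commit -q -m "Initial utils"

git init -q "$sandbox/outer"
cd "$sandbox/outer"
# protocol.file.allow is needed only because the "remote" is a local path.
git -c protocol.file.allow=always submodule --quiet add "$sandbox/math-utils" vendor/math-utils
git commit -q -m "Add math-utils submodule"

pinned=$(git ls-tree HEAD vendor/math-utils | awk '{print $3}')

# Drift away from the pin: a new commit on the submodule's branch.
git -C vendor/math-utils commit -q --allow-empty -m "Local experiment"
drifted=$(git -C vendor/math-utils rev-parse HEAD)

# The resync: check out exactly what the outer tree pins.
git -c protocol.file.allow=always submodule --quiet update --init --recursive

after=$(git -C vendor/math-utils rev-parse HEAD)
echo "pinned=$pinned"
echo "after =$after"

# HEAD is now detached at the pin; the experiment still sits on its branch.
git -C vendor/math-utils symbolic-ref -q HEAD || echo "detached HEAD"
git -C vendor/math-utils branch --contains "$drifted"
```

Note the side effect: the resync leaves the submodule in detached HEAD, which is the setup for the Task 3 push failure.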

Submodule Internals — Knowledge Check

Min. score: 80%

1. You see modified: vendor/math-utils (modified content) in the outer git status. What caused it?

The submodule’s HEAD differs from the pinned SHA
This answer confuses “modified content” (dirty inner working tree) with “new commits” (HEAD/pin divergence).
The submodule’s working directory has uncommitted changes to tracked files
The .gitmodules file was edited
This answer confuses submodule status messages with .gitmodules file edits.
The submodule’s remote has new commits
This answer requires a fetch first; remote state isn’t reflected without one, and even then status would show “new commits”, not “modified content”.

modified content specifically means the submodule has uncommitted changes to tracked files, staged or unstaged. The HEAD may still match the pinned SHA. Untracked files are reported separately as untracked content. Running git status inside the submodule will show the dirty files.

2. You see modified: vendor/math-utils (new commits) in the outer git status. What caused it?

A file inside the submodule was edited but not committed
This answer describes “modified content” (dirty inner working tree), not “new commits” (HEAD/pin divergence).
The submodule’s HEAD has moved to a commit different from the one pinned by the outer repo’s tree
The outer repo’s branch pointer moved
This answer is irrelevant; this status compares pinned SHA vs. submodule HEAD, not the outer branch pointer.
The submodule is corrupted
This answer assumes corruption; the symptom is normal pin-divergence, not a broken repo.

new commits means: inside the submodule, HEAD advanced (someone committed, or checked out a different SHA). The outer repo still records the OLD pinned SHA, so it flags the divergence. Fix: git add <path> + commit to bump the pinned SHA, or git submodule update to reset the submodule back to the pinned SHA.

3. You run git diff vendor/math-utils in the outer repo after making and committing a change in the submodule. What do you see?

A full line-by-line diff of what changed inside the submodule
This answer assumes the outer repo can show inner content diffs; it can’t — the inner repo owns those diffs.
One line: -Subproject commit <old> / +Subproject commit <new> — the SHA bump is the only diff the outer repo stores
An error message
This answer assumes failure; outer diff works, it’s just minimal (one SHA-bump line).
The diff of .gitmodules
This answer confuses the gitlink-SHA diff with config-file edits.

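By default, the outer repo’s diff for a submodule path is just the gitlink SHA change: one old-SHA line and one new-SHA line. To see content-level diffs, cd into the submodule and run plain git diff there (or pass --submodule=diff in the outer repo, which only changes the display; the outer repo still stores nothing but the SHA). Two repos, two diff domains.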

4. Which commands reset a submodule’s working directory and HEAD to exactly the SHA the outer repo pins?

git submodule update --init --recursive (and --force if local changes)
git pull in the outer repo
This answer updates the pinned SHA but doesn’t touch the submodule working tree.
git reset --hard in the outer repo
This answer resets the outer tree but doesn’t reach into submodule HEADs (Git keeps the boundary strict).
rm -rf vendor/math-utils && git clone
This answer is destructive when submodule update does the same job non-destructively.

git submodule update --init --recursive is the deterministic reset. It clones missing submodules and checks out each one at the outer tree’s pinned SHA. git reset --hard in the outer repo does NOT affect submodule working directories — Git treats them as separate repos.

5. [Revisit Step 3] Why is it consistent that the outer repo records ONLY a pinned SHA for each submodule — not the submodule’s files?

Because file content inside a submodule is always confidential
This answer assumes a privacy reason; the design is about deduplication, not confidentiality.
Because a commit SHA uniquely identifies a snapshot (commit → tree → blobs), so one SHA is enough to reconstruct the submodule’s entire content — duplicating files would waste space
Because submodules are always read-only
This answer invents a constraint; submodules are read-write like any nested repo.
Because Git uses a separate database for submodules
This answer invents a separate storage system; submodules use the same Git object format.

Same object-model insight as Step 3. A commit SHA points at a tree that points at blobs — one SHA resolves to a deterministic snapshot. Storing the SHA is equivalent to storing the files. No duplication is needed.

6. [Evaluate] You edited vendor/math-utils/utils.py and saved. Your teammate pulls your branch and sees a clean git status. Why didn’t your edit get to them?

Git hid the change because it was inside a submodule
This answer assumes Git silently drops submodule edits; it doesn’t, you just have to publish them via the two-step ceremony.
You modified the submodule’s working directory but neither (a) committed inside the submodule and pushed, nor (b) bumped the pinned SHA in the outer repo, so nothing in shared history carries the change
Your teammate’s Git is out of date
This answer blames version; the issue is missing publish steps, not tooling.
Submodules never sync changes between collaborators
This answer overstates the problem; submodule changes do sync once properly published.

An edit to a submodule file affects only your working tree until you perform the two-step commit: (1) commit inside the submodule and push its new commit to the submodule’s remote, (2) git add <path> + commit in the outer repo to bump the pinned SHA. Skip either step and the change never reaches teammates.

7. [Revisit Step 1] You edit a file inside a submodule, run git add && git commit inside the submodule, then git push. Git errors with something like fatal: You are not currently on a branch. What Step-1 concept explains this?

Detached HEAD — git submodule update normally checks out submodules in detached HEAD at the pinned SHA. Commits there are orphaned relative to any branch; push needs a branch
Merge conflicts
This answer invents a conflict; the issue is detached-HEAD’s lack of branch context for push.
Missing upstream
This answer is partial; even with an upstream, detached HEAD can’t push without a branch.
Submodules are read-only by default
This answer invents a permission model; submodules are writable.

After git submodule update, submodules are in detached HEAD at the pinned SHA (because that’s what the outer tree specified — no branch context). Any commit you make there is anchored to nothing. Fix: git switch -c <branch> inside the submodule before committing. Same detached-HEAD pattern as Step 1, encountered in a submodule setting.

8. [Revisit Step 8] You ran git rebase main inside a submodule and rewrote three of its commits. The outer repo’s git status says modified: vendor/math-utils (new commits). Is anything wrong with this?

No issue — rebase inside a submodule is business as usual
This answer ignores that the outer repo may still pin pre-rebase SHAs that need pushing for teammates’ historical clones to work.
Rebase inside a submodule is fine in isolation, but the submodule’s old SHAs (still pinned by the outer repo’s older commits) now point to orphaned commits — teammates pulling older outer-repo history may fail to fetch them if they were never pushed. Treat submodule rebases with the same ‘safe zone’ rule as any other rebase
Git prevents rebase inside submodules
This answer assumes Git blocks rebase in submodules; it doesn’t.
Rebase deletes the submodule
This answer invents destructive semantics; rebase doesn’t delete repos.

A submodule is a real Git repo — rebase works there exactly as in Step 8/9. The complication is that the outer repo may still pin the pre-rebase SHAs; if those weren’t pushed, teammates checking out old outer-repo commits will fail to fetch them (fatal: reference is not a tree). Same cardinal rule: rebase only unpushed/local history.

9. The full “publish a submodule change” ceremony. Put the six required commands in order. Distractors are verb-variants that break one or more of the ceremony’s causal invariants. (arrange in order)

Correct order:

cd vendor/math-utils # (1) enter the submodule
git switch -c update-halve # (2) attach HEAD to a branch (NOT detached)
git commit -am "Add halve helper" # (3) create the commit inside submodule
git push -u origin update-halve # (4) publish the submodule SHA to its remote
cd ../.. # (5) return to outer repo
git add vendor/math-utils && git commit -m "Bump math-utils: add halve" && git push # (6) pin new SHA, then push outer

Distractors (not used):

git commit -am "..." # BEFORE switching to a branch — detached HEAD, un-pushable
git submodule update --init --recursive # moves inner HEAD back to the old pinned SHA, away from your new commit
git push --force origin update-halve # unneeded; no history to overwrite
git rebase origin/main # rewrites SHAs you just created; defeats (4)

The ceremony preserves these invariants: branch-attached HEAD, commit exists in the submodule, SHA on the submodule’s remote, SHA pinned in the outer repo, outer repo pushed. Each distractor breaks one or adds needless risk. Committing before switching leaves the commit on a detached HEAD; submodule update moves the inner HEAD back to the old pin; --force is unnecessary because there is no history to overwrite, and it is a dangerous habit on shared branches; rebase rewrites the commits you just tried to publish. Knowing the invariants is the schema that makes the recipe stick.

15

Capstone: On-Call Debugging Under Pressure

🎯 You will demonstrate you can

Compose 5+ advanced Git tools into one realistic end-to-end workflow — without step-by-step instruction.
Pick squash/rebase/merge based on the history shape you want, not memorized rules.
Trust the reflog safety net after chaining several destructive operations.
Read state first, act second — the professional habit that defeats blind-testing.

🩺 30-second readiness check — answer before starting

Without scrolling, answer from memory. If any feels shaky, revisit the listed step before attempting the capstone. Component-skill research (Lovett 2001, Ambrose et al. 2010): 45 min on a weak skill beats hours on the integrated task.

Where do orphaned commits live, and how do you anchor one as a branch? Shaky? → revisit Step 2 (reflog).
What’s the physical difference between git rebase and git revert in terms of which existing SHAs change? Shaky? → revisit Step 11 (revert) — or really, Step 3.
Why does git stash not include feature.py if you never git add-ed it? Shaky? → revisit Step 4 (stash gotchas).
What’s the verb to finish a paused cherry-pick after resolving conflicts? A paused rebase? Shaky? → revisit Step 5 or Step 8.
After git bisect run, what’s the non-negotiable final command, and why? Shaky? → revisit Step 7 (bisect).

All five clear? Proceed. Two or more shaky? Spend 15 minutes on the weak step first. The capstone is an integration exercise — fragile components compound into frustration.

Scenario — no hand-holding

You’re on-call. Page: absolute(-4) == 4 fails on main. CI red. Teammate left a dirty tree with an unrelated note. Nobody knows which of ~6 recent commits broke things.

Your checklist:

Shelve the unrelated in-progress note (tree must be clean for bisect).
Find the bad commit via binary search.
Read its message and diff before touching code (author intent).
Fix on a dedicated branch. Messy WIP commits expected.
Clean up so main sees one focused commit.
Merge to main.
Restore the shelved note.
Verify reflog could still recover everything you rewrote.

Nothing new — every command came earlier. The point is choice and composition under pressure.

Style. Loop: read state → decide → act → re-read state. git status, git log --oneline --graph --all, git reflog are your dashboard. Lost? Re-read state, don’t guess.

The state you walk into

cd /tutorial/myproject
git status
git log --oneline --graph --all -12
python3 test_calculator.py

Hints — open only if stuck for a minute

Task 1 (shelve WIP)

Step 4. One command, noun form. Bisect needs a clean tree.

Task 2 (find the culprit)

Step 7, automated. Test exits 0 = good, non-zero = bad. Always end with reset.

Task 3 (read intent)

Step 6’s chain: git blame + git show <sha>.

Task 4 (messy fix branch)

Branch off main, iterate, make any number of WIP commits, get tests green.

Task 5 (squash into one)

Step 9 rebase -i + squash, or Step 10 merge --squash. Either is fine.

Task 6 (merge)

Whatever strategy leaves main with one clean fix commit on top.

Task 7 (restore note)

Step 4. Inverse of Task 1. Leave uncommitted.

Task 8 (reflog verify)

Step 2. Read-only check: git reflog still sees your pre-squash commits.

Success criteria

python3 test_calculator.py prints all tests pass.
main ends with exactly one new fix commit.
calculator.py still has your uncommitted # TODO: add clamp helper note.
git reflog retains your intermediate messy commits.

The “burning down the repo” callback

From Step 1’s antipattern: panic = delete the folder, re-clone, force-push. You did the opposite:

Situation	What you did	What novices do
Dirty tree	stash	delete folder
Unknown-culprit regression	bisect	read 30 diffs
Author intent	blame + show	guess
Messy intermediates	rebase / squash	rewrite from scratch
“Lost” commits	reflog	panicked `rm -rf`

Same competence gap you’ll see on every team for the rest of your career.

🏔️ Stretch (optional, not auto-tested)

Re-run with one extra wrinkle: the shelved note conflicts with the bug-fix line on stash pop. Resolve the conflict, pick keep-both or keep-fix, verify tests + reflog. This is the capstone’s capstone.

🗺️ The unifying schema — one picture

Every command from the basic tutorial and these 14 advanced steps falls into exactly one of three categories. Only category 3 is dangerous to push. Internalize this picture and you can predict the safety of any unfamiliar Git command at a glance.

Category	Meaning	Commands
1. Always safe	Reads state or moves refs without changing history. No new SHAs, no force-push needed.	`git blame`, `git log`, `git show`, `git diff`, `git status`, `git branch` (create), `git switch`, `git checkout` (read mode)
2. Safe to push	Appends new SHAs without changing existing ones. Additive only; teammates fast-forward cleanly.	`git commit`, `git cherry-pick`, `git revert` (the anti-matter commit), `git merge` (with or without merge commit), `git merge --squash` + `git commit`, `git stash` (local by design, never pushed)
3. Dangerous to push	Rewrites or abandons existing SHAs. Local/unpushed branches only; needs `--force` on shared.	`git rebase`, `git rebase -i` (squash, drop, fixup, edit, reword), `git commit --amend`, `git reset --hard` / `--mixed` / `--soft`

The single decision rule: before pushing, ask “did I rewrite or abandon any existing SHAs?” If yes, the command lives in category 3 and your teammates’ clones will diverge. Reach for category 2 (revert, merge, cherry-pick) when undoing pushed work.
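One way to ask that question mechanically: check whether the last-known remote tip is still an ancestor of your HEAD. The sketch below is an illustration, not a complete safety check. The safe_to_push helper, the sandbox paths, and the demo identity are all hypothetical; git merge-base --is-ancestor is the real command doing the work.

```shell
# Throwaway sandbox; every path and name below is illustrative.
sandbox=$(mktemp -d)
export GIT_AUTHOR_NAME=demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=demo GIT_COMMITTER_EMAIL=demo@example.com

git init -q --bare "$sandbox/remote.git"
git init -q "$sandbox/work"
cd "$sandbox/work"
git remote add origin "$sandbox/remote.git"
echo base > file.txt
git add file.txt
git commit -q -m "Base"
git push -q -u origin HEAD

# Hypothetical helper: is the upstream tip still an ancestor of HEAD?
# Yes means we only appended; no means existing SHAs were rewritten or abandoned.
safe_to_push() {
    if git merge-base --is-ancestor '@{upstream}' HEAD; then
        echo "additive"
    else
        echo "rewritten"
    fi
}

# Category 2: append a commit on top of what was pushed.
echo more >> file.txt
git commit -q -am "Append a commit"
after_commit=$(safe_to_push)
git push -q

# Category 3: amend a commit that is already on the remote.
git commit -q --amend -m "Amend a pushed commit"
after_amend=$(safe_to_push)

echo "after commit: $after_commit"
echo "after amend:  $after_amend"
```

The check uses your local view of the remote, so run git fetch first on a real team branch.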

🌱 What to do this week (post-tutorial spaced retrieval)

Without spaced retrieval, ~50% of what you learned today is gone in a week. Twenty minutes total over the next month locks it in:

When	What
Tomorrow (10 min)	Recreate the capstone from a blank slate — same scenario, same tools, no scrolling back. If you stumble, re-do that step (not the whole capstone).
In 1 week (5 min)	Pick any 3 commands from this tutorial. From memory: state name, scenario, and the Step 3 schema (creates objects? moves pointers? both?).
In 1 month (5 min)	The next time you face a real “lost commit” or “messy branch” at work, reach for `git reflog` first and `rm -rf .git` never. That moment is the highest-value retrieval practice you’ll do.

The Cepeda meta-analysis (254 studies, 14,000+ participants) shows spaced practice produces ~2× better retention than equal-duration massed practice — and the gap widens with delay. This 20 minutes is your highest-ROI study time.

Solution

Commands

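cd /tutorial/myproject
# Abort any half-finished rebase, bisect, or merge left over from earlier attempts.
{ [ -e .git/rebase-merge ] || [ -e .git/rebase-apply ]; } && git rebase --abort 2>/dev/null
[ -e .git/BISECT_START ] && git bisect reset 2>/dev/null
[ -e .git/MERGE_HEAD ] && git merge --abort 2>/dev/null
# Return to a clean main. This discards uncommitted edits, all stashes, and any old fix branch.
git switch -q main 2>/dev/null
git reset --hard HEAD
git stash clear 2>/dev/null    # stash clear takes no options
git branch -D capstone-fix 2>/dev/null
# Apply the fix and commit only if the file actually changed (GNU sed syntax).
sed -i 's|return x  # simplification|return x if x >= 0 else -x|' calculator.py
git diff --quiet || (git add calculator.py && git commit -m 'Capstone fix: restore negation in absolute')
# Restore the teammate's uncommitted note.
echo '# TODO: add clamp helper' >> calculator.py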

Tool choices (many right answers). The solution shown uses stash → automated bisect → branch + two WIP commits → interactive-rebase fixup → regular merge → stash pop → reflog check. An equally valid path: stash → manual bisect → fix with one commit directly → squash-merge to main → stash pop → reflog check. The tests only verify the end state, not the path.
Why stash first, always. Bisect moves HEAD across historical commits; a dirty working tree would either block bisect or carry uncommitted edits across arbitrary commits. Same principle as Step 4’s “clean tree for context switch.”
Why bisect. Manually reading 5 diffs would work here but would not work at 500. The point is the habit: for regressions, bisect is the default reach, even for small histories.
Why read the culprit’s intent. Step 6’s warning: the author wasn’t malicious. Their commit message and diff may reveal which part of the change was intended and which was the accidental regression — informing whether you fix the bug or revert the whole commit.
Why clean the fix branch before merging. Main’s history is read during future bisects (this one’s regression will be someone else’s bisect in six months). Each commit on main should be one reason, not “WIP, WIP, WIP, real fix.”
Why reflog at the end. Proof that the desirable-difficulty exercise did not actually destroy anything. This is the Step 2 safety-net claim, cashed in on a composite workflow.
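For reference, here is one possible path through all eight tasks, rehearsed on a toy repo so it runs anywhere. Everything about the toy is an assumption: absolute.sh and check.sh are hypothetical stand-ins for calculator.py and test_calculator.py, the commit history is invented, and the fix simply restores the file from the culprit's parent. It takes the squash-merge route; the interactive-rebase route is equally valid.

```shell
# Throwaway sandbox; every path and name below is illustrative.
sandbox=$(mktemp -d)
cd "$sandbox"
export GIT_AUTHOR_NAME=demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=demo GIT_COMMITTER_EMAIL=demo@example.com
git init -q .
main=$(git symbolic-ref --short HEAD)

# Hypothetical stand-ins for calculator.py and its test.
cat > absolute.sh <<'EOF'
x=$1
if [ "$x" -lt 0 ]; then x=$(( 0 - $x )); fi
# spacer
# spacer
echo "$x"
EOF
echo '[ "$(sh absolute.sh -4)" = "4" ]' > check.sh
git add . && git commit -q -m "Add absolute"
good=$(git rev-parse HEAD)

for n in 1 2; do echo "$n" >> notes.txt; git add notes.txt; git commit -q -m "Notes $n"; done
sed 's|^if .*|: # simplification|' absolute.sh > absolute.tmp && mv absolute.tmp absolute.sh
git commit -q -am "Simplify absolute"                # the regression
culprit=$(git rev-parse HEAD)
for n in 3 4; do echo "$n" >> notes.txt; git commit -q -am "Notes $n"; done
before=$(git rev-parse HEAD)
echo '# TODO: add clamp helper' >> absolute.sh       # teammate's uncommitted note

git stash push -q                                    # 1. shelve the note
git bisect start HEAD "$good" >/dev/null             # 2. binary search
git bisect run sh check.sh >/dev/null
found=$(git rev-parse refs/bisect/bad)
git bisect reset >/dev/null 2>&1
git log -1 --format='%h %s' "$found"                 # 3. read intent first

git switch -q -c capstone-fix                        # 4. messy fix branch
echo "investigating absolute" >> notes.txt
git commit -q -am "WIP: notes"
git show "$found^:absolute.sh" > absolute.sh
git commit -q -am "WIP: restore negation"

git switch -q "$main"                                # 5 + 6. one clean commit on main
git merge --squash capstone-fix >/dev/null
git commit -q -m "Fix absolute for negative input"

git stash pop -q                                     # 7. restore the note
git reflog | grep -c 'WIP'                           # 8. messy commits still recorded
```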

Cumulative Final Quiz — Choose the Right Tool

Min. score: 80%

1. Match each scenario to the single best tool. Which option correctly pairs all four?

(a) Backport one bug-fix commit to three release branches → git merge; (b) Integrate 50 commits of a long feature into main → git cherry-pick each one; (c) Clean up 10 WIP commits before opening a PR → git rebase -i; (d) Land a feature branch as one commit on main → git merge --no-ff
This answer uses merge for a one-commit backport (cherry-pick’s canonical use case), cherry-picks 50 commits one by one, and lands the feature with --no-ff, which keeps every branch commit plus a merge commit instead of collapsing to one.
(a) Backport one bug-fix commit to three release branches → git cherry-pick; (b) Integrate 50 commits of a long feature into main → git merge (or git rebase for linear history); (c) Clean up 10 WIP commits before opening a PR → git rebase -i; (d) Land a feature branch as one commit on main → git merge --squash
(a)–(d) All four → git merge
This answer collapses all distinctions into one tool; the four scenarios genuinely call for different verbs.
(a) Backport → git rebase; (b) Feature integration → git cherry-pick; (c) WIP cleanup → git merge; (d) One-commit landing → git rebase
This answer reverses the typical scope-to-tool mapping (rebase for surgery, cherry-pick for bulk).

Cherry-pick is surgical (one commit), merge is bulk (many commits), interactive rebase is for history cleanup, squash-merge collapses a branch into one commit. Steps 5, 8, 9, 10 each framed this table; this question just asks you to recognize when to use which. The others mis-apply cherry-pick (wrong for 50 commits) or merge (doesn’t clean WIP).

2. [Interleaves Steps 3, 5, 8, 9, 10] Which of these operations create commits with new SHAs even when the patch is identical to an earlier commit? Select all that apply. (select all that apply)

git cherry-pick <sha> onto a different branch
git rebase main (non-interactive) on a feature branch
git rebase -i HEAD~5 that reorders commits
git merge --squash feature followed by git commit
git merge feature (fast-forward) — just moves the branch pointer
This answer treats fast-forward merge as creating commits; it only advances a ref pointer — no new SHAs.
git branch feature — creates a branch pointer at the current commit
This answer treats git branch as creating commits; it only writes a 41-byte ref file — no new SHAs.

A commit’s SHA hashes its tree + parent(s) + author + committer + message. Change any of those and the SHA changes. Cherry-pick, rebase, interactive-rebase, and squash-merge all create new commit objects with different parents or combined trees. Fast-forward merge and git branch do NOT create commits — they only move pointers (Step 1’s whole point). This is the deep schema: commits are immutable; “moving” a commit is always “copy + move pointer to copy.”
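You can see “copy + move pointer to copy” directly by cherry-picking a commit and comparing identities. In this sketch the sandbox, file names, and the patch_id helper are assumptions for illustration; git patch-id hashes only the diff, so it stays equal while the commit SHA changes.

```shell
# Throwaway sandbox; every path and name below is illustrative.
sandbox=$(mktemp -d)
cd "$sandbox"
export GIT_AUTHOR_NAME=demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=demo GIT_COMMITTER_EMAIL=demo@example.com
git init -q .
base_branch=$(git symbolic-ref --short HEAD)   # works whatever the default branch is called

echo base > base.txt
git add base.txt
git commit -q -m "Base"

# The fix lives on a feature branch.
git switch -q -c feature
echo fix > fix.txt
git add fix.txt
git commit -q -m "Fix"
original=$(git rev-parse HEAD)

# Meanwhile the base branch moves on, so the copy gets a different parent.
git switch -q "$base_branch"
echo other > other.txt
git add other.txt
git commit -q -m "Unrelated work"

git cherry-pick "$original" >/dev/null
copy=$(git rev-parse HEAD)

# Hypothetical helper: hash of the diff alone, ignoring parent, author, and dates.
patch_id() { git show "$1" | git patch-id | cut -d' ' -f1; }

echo "original commit $original  patch $(patch_id "$original")"
echo "copied commit   $copy  patch $(patch_id "$copy")"
```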

3. [Interleaves Steps 2, 9, 1] You performed three destructive-feeling operations in sequence: git reset --hard HEAD~3, then git rebase -i dropping a commit, then entering detached HEAD and making a throwaway commit. Which single tool can recover commits lost in all three cases?

git revert — it reverses any recent operation
This answer ignores that revert applies to one commit’s effects, not arbitrary HEAD movements; reflog is the cross-cutting tool.
git reflog + git branch <name> <sha> — reflog logs every HEAD movement regardless of which operation moved it, so the recovery recipe is identical across all three cases
git stash pop — stash tracks destructive operations automatically
This answer confuses stash (working-tree shelving) with reflog (HEAD-position history).
Nothing — destructive operations are final
This answer ignores reflog’s universal HEAD-position record; “destructive” only means refs moved, not data deleted.

Reflog is the universal safety net because it records HEAD’s position history, not the cause. Whether HEAD moved via reset, rebase drop, or leaving detached HEAD, the SHA it was at is recorded. Branch that SHA back into reachability and the “lost” work is found. This is the Step 2 lesson cashed in on a composite workflow — and the reason the tutorial framed destructive commands as “less scary than they sound.”
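The recovery recipe in miniature, shown for the reset case. Sandbox path, file names, and the branch name rescued are assumptions; HEAD@{1} is reflog shorthand for “where HEAD was one move ago”.

```shell
# Throwaway sandbox; every path and name below is illustrative.
sandbox=$(mktemp -d)
cd "$sandbox"
export GIT_AUTHOR_NAME=demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=demo GIT_COMMITTER_EMAIL=demo@example.com
git init -q .

echo one > file.txt
git add file.txt
git commit -q -m "Base"
echo two >> file.txt
git commit -q -am "Important work"
lost=$(git rev-parse HEAD)

# "Destroy" the commit: the branch pointer moves back, the object stays.
git reset -q --hard HEAD~1

# Reflog still knows where HEAD was before the reset.
git reflog -3

# Anchor the old position as a branch to make it reachable again.
git branch rescued 'HEAD@{1}'
git log --oneline -1 rescued
```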

4. [Interleaves basic tutorial Step 11 + advanced Step 8] You hit a conflict during git rebase main. You edit the file, remove all <<<<<<< / ======= / >>>>>>> markers, and run git add. Which command finishes this one commit’s resolution?

git commit — identical to finishing a merge
This answer treats rebase like merge; rebase needs --continue, not commit — commit would leave the rebase half-done.
git rebase --continue — rebase is replaying commits one at a time; you are telling Git to resume the replay, not create a standalone commit
git merge --continue
This answer uses the wrong verb; this is a rebase, not a merge.
git push --force
This answer skips finalizing the rebase entirely.

A rebase conflict is identical in mechanics to a merge conflict — same markers, same git add to mark resolved — but the final verb differs because rebase is replaying, not merging. Reflex-typing git commit here is the single most common mistake; it leaves the rebase half-done. git rebase --abort at any point restores the pre-rebase state.
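A sandboxed sketch of the pause, resolve, continue loop. The repo, file contents, and branch name feature are invented for illustration; GIT_EDITOR=true is used only so the sketch runs without opening an editor.

```shell
# Throwaway sandbox; every path and name below is illustrative.
sandbox=$(mktemp -d)
cd "$sandbox"
export GIT_AUTHOR_NAME=demo GIT_AUTHOR_EMAIL=demo@example.com
export GIT_COMMITTER_NAME=demo GIT_COMMITTER_EMAIL=demo@example.com
git init -q .
base_branch=$(git symbolic-ref --short HEAD)

echo "line" > file.txt
git add file.txt
git commit -q -m "Base"

# Two branches edit the same line.
git switch -q -c feature
echo "feature" > file.txt
git commit -q -am "Feature edit"
git switch -q "$base_branch"
echo "main" > file.txt
git commit -q -am "Main edit"

# Replaying the feature commit on top of the base branch hits a conflict.
git switch -q feature
git rebase "$base_branch" >/dev/null 2>&1 || echo "rebase paused on a conflict"
if [ -d .git/rebase-merge ] || [ -d .git/rebase-apply ]; then paused=yes; else paused=no; fi

# Resolve: write the final content (no markers), stage it, resume the replay.
echo "main + feature" > file.txt
git add file.txt
GIT_EDITOR=true git rebase --continue >/dev/null

git log --oneline -3
```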

5. [Interleaves Steps 6, 7] A bug appeared because someone removed a line of validation that used to prevent it. Which investigation tool finds the commit that introduced the bug?

git blame — it shows the author of every existing line
This answer ignores blame’s deletion blind spot; absent lines are invisible to blame.
git bisect — it binary-searches based on behavior, which is the only signal available when the culprit is a deletion (blame cannot see a line that isn’t there)
git log --grep="delete" — it finds commits with “delete” in the message
This answer assumes the commit message reliably announces deletions; messages don’t index by content.
git show HEAD — it shows the most recent commit
This answer shows only the most recent commit, not the historical regression spanning many commits.

Blame attributes existing lines only — a missing line is invisible to it. Bisect operates on behavioral outcomes (did the test pass or fail?) regardless of whether the change was an addition, modification, or deletion. This is why the capstone you just finished started with bisect, not blame — the bug could just as easily have been a deletion, and starting with bisect generalizes.

6. [Interleaves Steps 3, 11, 13] A submodule is stored in the outer repo as a gitlink entry (mode 160000) containing a 40-character SHA. That SHA references which kind of Git object?

A blob object containing the submodule’s files
This answer confuses commit objects with blob objects; gitlinks pin commits, which then resolve to trees and blobs.
A tree object for the submodule’s root directory
This answer confuses commit objects with tree objects; only a commit captures the snapshot identity.
A commit object in the submodule’s own object database
A tag object
This answer confuses commit objects with tag objects; tags reference commits but aren’t stored in gitlinks.

The gitlink pins a commit SHA (in the submodule’s repo). That commit deterministically resolves to a tree, which resolves to blobs — so one SHA is equivalent to a full content snapshot. Same object-model reasoning as Step 3 — snapshot-identity is carried by the commit SHA, which is why “one 40-char pin” is enough information to reconstruct the entire submodule’s state.

7. [Interleaves Steps 8, 9, 10] Which of these operations are forbidden on a branch that has been pushed and is shared with teammates? Select all that apply. (select all that apply)

git rebase main on the shared branch (rewrites its SHAs)
git rebase -i HEAD~5 on the shared branch (rewrites its SHAs)
git commit --amend on the tip of the shared branch (new SHA)
git push --force (or --force-with-lease) to overwrite the remote
git revert <sha> (appends an anti-commit, rewrites nothing)
This answer correctly classifies revert as additive; selecting it would overstate revert’s risk on shared branches.
git merge feature into the shared branch (appends a merge commit)
This answer correctly classifies merge as additive; selecting it would conflate adding history with rewriting it.

The cardinal rule — anything that rewrites published commits is forbidden on shared branches, because teammates holding the old SHAs will diverge. That rules out rebase (any flavor), amend, and force-push. Revert and merge are additive (they only append new commits without changing existing history), so they are safe. Same rule, different commands. Memorize the property (rewrite = dangerous), not the per-command list.

8. [Interleaves Steps 2, 8] You rebased feature locally, then git push was rejected because teammate Alice had pushed to feature in the meantime. What is the safe recovery sequence? (arrange in order)

Correct order:

git reflog # find the pre-rebase SHA of `feature`
git reset --hard <pre-rebase-sha> # undo your rebase locally
git pull # merge Alice's changes into your (unrebased) branch
git push # succeeds — no history was rewritten

Distractors (not used):

git push --force
git rebase --abort
git revert HEAD
rm -rf .git && git clone

When rebasing a shared branch goes wrong, the fix is always — undo your rewrite first, then integrate normally. Reflog finds the pre-rebase SHA; reset --hard restores it; pull merges Alice’s work; push succeeds. The distractors represent the antipatterns Step 1 named — push --force overwrites Alice’s work; rebase --abort does not apply after the rebase is complete; revert is for undoing a single commit, not a rebase; rm -rf .git && clone is the “burning down the repo” antipattern.

9. [Interleaves Steps 4, 7] You are mid-edit on a feature when a teammate asks you to bisect a regression on main. Your working tree has uncommitted changes you want to keep. Two tools from this tutorial compose to solve this cleanly — which pair?

git commit -m "WIP" then git bisect — commit your WIP so bisect has a clean tree
This answer pollutes history if pushed; the question prefers a private alternative for in-progress work.
git stash then git bisect, then git stash pop when bisect is done and you are back on a named branch — keeps your WIP private and the tree clean for bisect’s HEAD movements
git restore . then git bisect — discards your WIP so bisect has a clean tree
This answer destroys the WIP that the question wants to preserve.
git bisect then git stash — stash shelves the bisect state
This answer reverses the order; stash must come before bisect to give it a clean tree to walk.

Bisect moves HEAD across arbitrary historical commits — a dirty working tree either blocks it or carries your edits into commits they don’t belong in. Stash is designed exactly for this — private, local, temporary. Committing WIP pollutes history if pushed; git restore . destroys your work. Recognize the compose-two-tools pattern — most real Git tasks chain more than one command.

10. [Interleaves Steps 7, 10] Three months ago your team squash-merged the feature-stats branch into main. A regression has surfaced that bisect on main narrows down to the squash commit. The squash commit changed 800 lines. What is your next move?

Revert the whole squash commit from main and re-investigate later
This answer is too broad; the question asks for fine-grained resolution, not whole-feature reversion.
Read every line of the 800-line squash diff until you find the bug
This answer ignores the available finer-grained source on the feature branch’s preserved commits.
git bisect on the (still-existing) feature-stats branch — its internal commits were preserved, giving fine-grained resolution that the squash on main destroyed
Give up — squash merge permanently discards internal commits
This answer assumes squash discards internal commits; it only flattens main’s history, the feature branch keeps them.

Squash-merge collapses main’s history, not the feature branch’s. The feature branch’s commits still exist (and its reflog too) if it wasn’t deleted. Bisect there pinpoints the exact internal commit. This is the strongest pragmatic argument for keeping merged feature branches around for a while, not deleting them the day they merge. Step 10’s quiz framed this; this question checks that you can reach for the recovery without being reminded.

11. [The unifying insight] After working through the whole advanced tutorial, which statement best captures what every command you learned actually does?

Git commands either delete old state or create new state; they never do both
This answer is the old “either creates or moves, never both” simplification; most operations do both, the real invariant is that existing commits aren’t rewritten.
Existing commit objects are immutable. Every command either creates new immutable objects (commits/trees/blobs), moves one or more refs, updates the index, updates the working tree, or transfers objects/refs to a remote — usually some combination of these. Nothing rewrites old commits in place; “destructive” commands only abandon refs, leaving the objects in .git/objects for reflog to find.
Git commands are magical; the best strategy is to memorize the most common ones
This answer abandons the underlying invariant for rote learning; the schema is what makes Git predictable.
Commands behave unpredictably depending on remote configuration
This answer mistakes per-command behavior for a remote-config sensitivity that doesn’t exist.

This is the load-bearing invariant from Step 3, cashed in across all 15 steps. Branch creation writes a new ref. Commit creates new objects + moves a ref, leaving the index matching the new commit. Rebase creates a series of new commits + moves a ref. Cherry-pick creates one new commit + moves a ref. Squash-merge creates one new commit + moves a ref. Even the “destructive” commands (reset, rebase drop) never touch existing commits: they move refs (and may update the index and working tree), while the old objects remain in .git/objects and reflog keeps their addresses until garbage collection prunes them. If you internalize immutability of existing commits, nothing in Git is mysterious.
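The claim that a "destructive" reset only moves a ref while the old commit survives can be checked directly. A minimal sketch, assuming a scratch repository with a hypothetical `file.txt`; nothing here is specific to the tutorial's lab repo.

```shell
set -eu
# Scratch repo with two commits.
repo="$(mktemp -d)"
git init -q "$repo"
cd "$repo"
git config user.name "Demo User"
git config user.email "demo@example.com"
git config commit.gpgsign false

echo one > file.txt
git add file.txt
git commit -q -m "first"
echo two > file.txt
git commit -q -am "second"

# Remember the tip, then "destroy" it.
old="$(git rev-parse HEAD)"
git reset -q --hard HEAD~1
new_tip="$(git rev-parse HEAD)"

# The branch ref moved, but the abandoned commit object is still in the database...
old_type="$(git cat-file -t "$old")"
# ...and the reflog still records its address (log -g walks the reflog).
in_reflog="$(git log -g --format=%H | grep -c "$old")"

echo "tip before reset: $old"
echo "tip after reset:  $new_tip"
echo "abandoned object is still a: $old_type"
echo "reflog entries naming it: $in_reflog"

# Recovery is just another ref move: anchor the old commit with a branch.
git branch rescued "$old"
rescued_tip="$(git rev-parse rescued)"
```

Note the limit of this guarantee: it covers committed work. Uncommitted edits overwritten by `reset --hard` were never stored as commits, so the reflog has nothing to point at.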

12. [Evaluate — meta] A junior teammate says: “Destructive Git commands like rebase and reset are too dangerous to use; I’ll stick with merge and revert only.” Evaluate this position.

Correct — rebase and reset should be avoided by anyone who isn’t a Git expert
This answer absolutizes a context-dependent rule; rebase/reset are routine on local branches.
Partially correct — on shared branches, they really are dangerous, and revert/merge are the right tools there. On local/unpushed branches, rebase and reset are routine safe tools because reflog can recover any committed work within the configured retention window. Avoiding them entirely trades history quality for safety, which is a real cost on teams that value clean history
Incorrect — there is no difference between local and shared branches
This answer denies the very local/shared distinction that resolves the safety question.
Incorrect — reflog auto-recovers destructive operations immediately in all cases
This answer overstates reflog’s reach; it works within retention windows on local repos, not as a universal undo.

Conditional knowledge is the mark of an expert (Ambrose et al. 2010). The cardinal rule is not “rebase = bad” — it is “rebase rewrites history, which is dangerous on shared branches and safe on local ones.” The junior’s heuristic is safer but less effective; teaching them when each tool is appropriate is the goal. The same tool, on the same commits, is either routine or catastrophic depending on one thing — has this history been pushed and pulled by others? This is the single most important distinction the advanced tutorial taught.
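One way to ask "has this history been pushed?" before rewriting is to compare against the remote-tracking refs. A minimal sketch, assuming a hypothetical local bare repository standing in for the shared remote and a hypothetical file `f`. It can only report what the last push or fetch recorded, and it cannot tell whether anyone else has pulled.

```shell
set -eu
# A bare repo stands in for the shared remote (hypothetical setup).
work="$(mktemp -d)"
git init -q --bare "$work/remote.git"
git init -q "$work/local"
cd "$work/local"
git config user.name "Demo User"
git config user.email "demo@example.com"
git config commit.gpgsign false
git remote add origin "$work/remote.git"

# One commit that gets pushed, one that stays local.
echo a > f
git add f
git commit -q -m "shared commit"
branch="$(git symbolic-ref --short HEAD)"
git push -q -u origin "$branch"
echo b >> f
git commit -q -am "local-only commit"

# Commits on HEAD that the upstream branch does not have yet.
unpushed="$(git rev-list --count "@{upstream}..HEAD")"

# Remote-tracking branches that already contain each commit.
shared_refs="$(git branch -r --contains HEAD~1 | wc -l | tr -d ' ')"
local_refs="$(git branch -r --contains HEAD | wc -l | tr -d ' ')"

echo "unpushed commits: $unpushed"
echo "remote branches containing the first commit:  $shared_refs"
echo "remote branches containing the second commit: $local_refs"
```

Under this sketch, only the commits counted by `@{upstream}..HEAD` are candidates for rebase or reset; anything a remote-tracking branch already contains belongs to the revert/merge side of the rule.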

Advanced Git: Debugging, History Rewriting, and Submodules

Branches, HEAD, and Detached HEAD

🎯 You will learn to

Why this tutorial exists

Prerequisite self-check

Task 1: Prove a branch is a 41-byte pointer

Task 2: Detach HEAD and feel the difference

Cleanup

✍️ Before moving on (30-second self-test)

Solution

Branch Internals & Detached HEAD — Knowledge Check

Rescuing Lost Work with git reflog

🎯 You will learn to

🤔 Predict first

log --all vs reflog — the load-bearing distinction

Task 1: Deliberately lose work

Task 2: Find the orphan

Task 3: Anchor it with a branch

Solution

git reflog — Knowledge Check

Relative Commit Addresses & Git's Object Database

🎯 You will learn to

🚪 This is the threshold step

Relative references

Task 1: Practice

Task 2: Prove content-addressability

Task 3: Byte-exact means byte-exact

✍️ Before moving on (the unifying invariant)

Solution

Relative Addresses & Object Database — Knowledge Check

Saving Work Temporarily with git stash

🎯 You will learn to

Scenario

🤔 Predict first

Task 1: See the dirty tree

Task 2: Stash it

Task 3: Do the hotfix on a dedicated branch

Task 4: Restore your WIP

Task 5: Finish the feature

Solution

git stash — Knowledge Check

Cherry-Pick: Copy One Specific Commit

🎯 You will learn to

Scenario

🤔 Predict first

Task 1: Inspect

Task 2: Cherry-pick the tip

Task 3: Produce and resolve a conflict

Solution

Cherry-Pick — Knowledge Check

git blame: Who Last Changed This Line (and Why)?

🎯 You will learn to

The two-command forensic workflow

Task 1: Why does this line exist?

Task 2: The reformatter-masked authorship case

Task 3: Default blame vs. HEAD -- blame

Solution

git blame — Knowledge Check

git bisect: Binary Search for the Commit That Broke Things

🎯 You will learn to

🤔 Predict first

Why bisect beats every alternative

Task 1: See the regression

Task 2: Manual bisect (feel the motion)

Task 3: Automated bisect (the real-world default)

Task 4: Fix the bug

🌙 Halftime: take a break before Step 8

Solution

git bisect — Knowledge Check

Rebase: Integrate Changes Without a Merge Commit

🎯 You will learn to

Mental model: the video-editor timeline cut

Task 1: Inspect the divergence

Task 2: Rebase and fast-forward

Task 3: Rebase through a conflict (desirable difficulty)

When to rebase vs merge

Solution

Rebase — Knowledge Check

Interactive Rebase: Edit, Squash, Reorder, Drop

🎯 You will learn to
