September 29th, 2022

I did that merge-as-cherry-pick thing, but my change still didn’t merge correctly

A colleague used the patch-branch technique of merging a single commit to another branch, but found that the subsequent merge of the complete branch produced the wrong results. They asked me to investigate what went wrong.

After untangling the changes to the main and feature branches, I was able to reconstruct a simplified version of what happened.

We start with this:

 
    apple   apple    
    M1 M2   main
apple ↙︎          
A            
  ↖︎          
    F1 F2   feature
    apple   berry    

They created a feature branch and did some work: Commit F1 didn’t affect the file, but commit F2 changed “apple” to “berry”.

They then realized that this “berry” change was something they wanted in the main branch early, so they could do some preliminary integration work before the rest of the feature work was done. They created a second berries-only branch that contained just the “berry” change and merged it into both the main branch (which delivers the payload to the main branch) and into the feature branch (which has no net code effect, but records that the change as have already been incorporated so it doesn’t get counted as payload when the feature branch merges back up to the main branch).

    apple   apple   berry
    M1 M2 M3   main
apple ↙︎     berry ↙︎
A ← ← ← P       patch
  ↖︎       ↖︎
    F1   F2   F3   feature
    apple   berry   berry

The integration validation didn’t turn out so great, so they reverted the change in both the main and feature branches.

    apple   apple   berry   apple
    M1 M2 M3 M4   main
apple ↙︎     berry ↙︎    
A ← ← ← P           patch
  ↖︎       ↖︎    
    F1 F2 F3 F4   feature
    apple   berry   berry   apple

The team continued working on the feature, and this time they felt that they got the “berry” thing right, so they made a commit in their feature branch to change “apple” back to “berry”, this time with feeling.

    apple   apple   berry   apple
    M1 M2 M3 M4       main
apple ↙︎     berry ↙︎        
A ← ← ← P               patch
  ↖︎       ↖︎        
    F1 F2 F3 F4 F5   feature
    apple   berry   berry   apple   berry

All the tests were passing, so they got the green light to merge the feature into the main branch. And that’s where something strange happened.

    apple   apple   berry   apple       apple
    M1 M2 M3 M4 ← ← ← M5   main
apple ↙︎     berry ↙︎          

A ← ← ← P                 patch
  ↖︎       ↖︎          
    F1 F2 F3 F4 F5       feature
    apple   berry   berry   apple   berry    

The result of the merge into the main branch didn’t carry the final “berry” change. The file remained “apple” in the main branch.

As a result, the main branch was broken.

What happened?

The merge of the “only berries” branch into both the main and feature branches established “berry” as the baseline for the next merge. In the main branch, “berry” changed to “apple”. In the feature branch, “berry” underwent some turmoil but emerged unchanged. The three-way merge algorithm therefore saw that the main branch changed “berry” to “apple”, and the feature branch made no (net) change. Therefore, the result of the merge is “apple”. (apple + 0 = apple).

The fatal error was the dual revert.

These reverts were independent and therefore git did not consider them to be related to each other. But really, you wanted them to be considered the same revert, so that the feature branch could un-revert it.

Basically, once you merged the “only berries” branch, the diagram (from the apple/berry’s point of view) simplified to this:

    berry   apple       apple
    M3 M4 ← ← ← M5   main
berry ↙︎          

P                  
  ↖︎              
    F3 F4 F5       feature
    berry   apple   berry    

It’s as if the feature branch was created from the “only-berries” branch, and the main branch continued by reverting “berries” back to “apple”, whereas the feature branch underwent some soul-searching and ultimately made no change to “berries”. Naturally, the result of this merge is that “berries” is reverted to “apple”.

What my colleague should have done was to perform the revert in a separate branch, possibly just extending the “only-berries” branch so it is now a “changed-my-mind-about-berries” branch. Merge that branch into both the main and feature branches, thereby advancing the baseline forward to the shared revert.

    apple   apple   berry   apple       berry
    M1 M2 M3 M4 ← ← ← M5   main
apple ↙︎     berry ↙︎ apple ↙︎      

A ← ← ← P P2             patch
  ↖︎       ↖︎   ↖︎      
    F1 F2 F3 F4 F5       feature
    apple   berry   berry   apple   berry    

By making the revert on the patch branch and merging it into the main and feature branches (instead of reverting separately on the main and feature branches), git now understands that the revert is part of the shared history of the two branches. This time, when the final merge occurs, git sees that the main branch made no changes, and the feature branch did an unrevert, so the result of the merge is the unrevert, and “berry” makes it into the main branch.

Topics
Other

Author

Raymond has been involved in the evolution of Windows for more than 30 years. In 2003, he began a Web site known as The Old New Thing which has grown in popularity far beyond his wildest imagination, a development which still gives him the heebie-jeebies. The Web site spawned a book, coincidentally also titled The Old New Thing (Addison Wesley 2007). He occasionally appears on the Windows Dev Docs Twitter account to tell stories which convey no useful information.

1 comment

Discussion is closed. Login to edit/delete existing comments.

Newest
Newest
Popular
Oldest
  • John McPhersonMicrosoft employee

    Or “reverts are commits too”.

Feedback