DaggyFixes

All software has bugs, and not all changes that you commit to a source tree are entirely good. Therefore, some commits can be considered "development" (new features), and others can be considered "bugfixes" (redoing or sometimes undoing previous changes). It can often be advantageous to separate the two: it is common practice to try and avoid mixing new code and bugfixes together in the same commit, often as a matter of project policy. This is because the fix can be important on its own, such as for applying critical bugfixes to stable releases without carrying along other unrelated changes.

Monotone, and other similar version control tools that utilise a DAG-structured revision history, offer developers an additional option to handle fixes in a way not possible (or at least, not convenient) in other tools with more linear history structures.

This page describes a BestPractices workflow pattern that uses the DAG structure, and the capability to easily diverge and re-merge from arbitrary past revisions, for advantage in handling fixes. It's an application of the same principles used by the ZipperMerge pattern, but for the scenario where you want to make the same change to two or more branches that must remain separate otherwise, rather than merge two branches together.

Scenario

You have a project with a number of revisions and an existing graph that looks like this:

   A
   |
   B
  / \
 C   D
  \ /
   E

It is later discovered that revision B introduced a bug, and there's a series of children B->..->E that have inherited the bug from B. (There could, of course, be many many intermediate revisions between B and E).

In monotone, we have two useful choices for how (or, more specifically, where) to commit the fix:

Both produce a new head with the bug fixed, but they leave behind a different ancestry graph, and that difference can be important and useful.

Fix F1: the old way

The bug is fixed close to the point where development happened to be up to when it was found:

   A
   |
   B
  / \
 C   D
  \ /
   E
   |
   F1

This is the common established practice in traditional sequential VCSs, in part because in those systems, branching and merging aren't lightweight enough to be worthwhile using for a small fix. You might annotate the fix with some mention of B in the changelog as the bad revision being backed out or fixed, but that's usually as far as it goes.

There's nothing really wrong with this (after all, it has worked for most software developers for a long time), but it represents a lost opportunity to use the power of monotone to full advantage.

Fix F2: the daggy way

The bug is fixed close to the point where it was introduced, and then merged back:

   A
   |
   B --+
  / \  |
 C   D F2
  \ /  |
   E   |
    \ /
     M

There are good reasons to prefer this practice. This different graph structure gives you a strong and direct representation of the logical (rather than chronological) relationship between bug, fix, and intermediate development:

This reverses the common practice of "backporting" fixes to older code (more on this below). Instead, the fix is developed against that older code, and then ported forward as part of the merge to the head. This may also make the root cause of the error more apparent and thus easier to learn from, to avoid similar mistakes in future.

There are some variations on this theme that may be useful to consider:

Daggy release management

Many projects create release branches to track critical fixes to stable code while more active development goes on elsewhere. The important thing about a release branch, in this context, is that you want to separate fixes and development:

Lets suppose, further to our example, that you had started a new release branch from D, midway along the span containing the bug:

   A
   |
   B -----+
  / \     |
 C   D    F2
  \ / \   :
   E   R1
   :     \
          R2

You can immediately see that your release branch D->R1->R2 has inherited the bug, and is therefore also going to need the fix.

Even better, because F2 and R2 have a common ancestor B (which introduced the bug), and F2 contains only the fix for the bug, approving F2 to that branch (and merging, as before) does exactly the right thing, producing a fixed revision RF on the branch:

   A
   |
   B ------+
  / \      |
 C   D     F2
  \ / \    : \
   E   R1     |
   :     \    |
          R2  |
            \ |
             RF

Remember that monotone branch memberships are determined by certs, and branches need not be fully connected. After the approve of F2 but before the merge that produced RF, the release branch contained disconnected revisions: D->R1->R2 and F2. These are still multiple heads of the branch, and a merge can succeed because of the common ancestor B even though B is not a member of the branch.

This represents the true place of the fix in the graph and in both branches, without introducing any of the other changes added in C, E, or any other point past D where the branch diverged. If we had committed the fix the old way as F1 instead, and tried to use approve like this, merging the release branch would pull in all the changes from B to E as well, just like a full propagate would.

Although not drawn here, you can just as readily see that any other branches which diverged above B don't need a fix, because the bug was introduced later. If some of those were development branches that had propagated from mainline to them, and some of those propagates included B, then those branches would also need the fix. This knowledge is the benefit of identifying B as the source of the bug, which you might have needed to do anyway regardless of where the fix was applied. Making sure to apply the daggy fix at F2 means that you automatically get the same benefit for knowledge of what has also inherited the fix.

In general, any revision with B as an ancestor and without F2 as an ancestor, has inherited the bug but not the fix.

Remember that in distributed development with monotone, we might not yet have learned about revisions descended from B; in fact we might never learn of someone else's private branch derived from somewhere below B. We still want to publish the fix close to B so that they can then approve F2 for their private branch and inherit the fix as well as the bug.

If you were to add a new test and testresult cert along with the fix in F2, it becomes even clearer where your fix has gone - especially if your tool can colour the graph according to this ancestry or the testresult.

Plucking and CherryPicking

In many projects, the need to apply fixes to older revisions (especially releases) is addressed by applying patches containing the fix, sometimes adjusting or backporting for the older code. These fixes are taken from individual changes mixed with development along the mainline, and need to be separated out. This is often called CherryPicking, and is a frequent feature request for monotone. It's clear that many such requests are motivated by these kinds of concerns, especially from developers accustomed to these practices and familiar with support for similar features in other tools.

Monotone now has a new command, pluck, that can be used to pull the set of changes between two (arbitrary) revisions "over there" into the current workspace, taking into consideration other changes (like file renames) that have happened between "there" and "here". This allows isolated changes to jump across parts of the revision graph, without making full ancestry connections in the same way that propagate and similar commands would.

Doing Daggy fixes, as outlined above, will minimise the need to cherry-pick fixes instead. Daggy fixes mean using rather than losing the true origin and relationship between bugs and fixes in the ancestry graph. However, it's worth looking at some valid and recommended uses for this functionality, even in a project using Daggy Fixes.

Plucking and backporting fixes

Doing daggy fixes all the time isn't for everyone. It's not always so easy to develop a fix directly against the revision where the bug was introduced. Perhaps the bug wasn't discovered until some other more recent code used it in ways that exposed the bug; it would be hard to debug and find the fix without this other code around. Or perhaps the importance or scope of the fix simply hadn't been realised at the time.

So, continuing our scenario, assume the fix had been committed directly as a new head, E->F1, as in the first example.

Now, we've found that the bug originated in B, and realised that the fix will be needed on the release branch as well. Or perhaps the bug was also found on the release branch, and traced back to the common ancestor B from there. Either way, we can use the new pluck command to help us. Once again, we have a couple of useful choices about where to commit the plucked change:

Creating F2 is easy with pluck:

 $ mtn co ... -r B .
 $ mtn pluck -r E -r F1
 $ mtn commit

For simple fixes, this new F2 will merge cleanly with F1 on mainline, and with R2 on the release branch as before.

For more intricate fixes, this step corresponds to backporting the fix from mainline to the point of origin of the bug. This makes it easier to approve and then merge that base fix forward to other branches (which may have diverged since) as a separate change, rather than attempting to adjust the change for both development paths in the one step.

Plucking development

Attractive new features are developed on the mainline, and sometimes it's desirable to add these new features to a stable branch, once it's clear they are good. However, we only want to bring the specific changes related to this feature onto the release branch, and not all of the other unrelated changes that would be carried along with it if we did a propagate. If you can identify the list of changes that correspond to the feature, you can checkout a workspace on the stable branch, and pluck each of these changes into it before committing. However, doing so carries with it the responsibility to ensure that you find all of the relevant changes -- including future changes that fix bugs in the code you just plucked. Because you've bypassed recording the full origin of the changes in the DAG history, daggy fixes won't work: you have to pluck fixes for plucked features. Even then, daggy fixes are preferred: at least they're easier to see because they're kept near the source revisions the features were originally plucked from.

Another example is where you've done a whole pile of development in a private branch, but for one reason or another you really can't or don't see the need to publish the whole gory history and internal working details for a push into a more public repository. You pluck the entire span of the private development branch onto the mainline, producing a single summarised change that rolls up all the intermediate work and rework and rerework, showing only the consolidated result.

last edited 2006-12-08 06:49:21 by LarryHastings