Tree-walking algorithms: Incrementally enumerating leaf nodes of an N-ary tree

Suppose you have an N-ary tree in which the node operations are:

Get first child.
Get next sibling.
Get parent.

For example, this type of tree structure may represent a window hierarchy. You also see it in a TreeView control.

Enumerating the nodes of this tree with a recursive algorithm is relatively straightforward. Doing it incrementally is trickier.
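For comparison, here is a minimal sketch of the recursive approach, restricted to leaf nodes. The `TreeNode` type and its `Parent`, `FirstChild`, and `NextSibling` fields are assumptions made for illustration, as are the `RecursiveWalk` and `VisitLeaves` names; the article specifies only the three operations, not a concrete node type.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical node type: the field names are assumptions, chosen to
// mirror the three operations (first child, next sibling, parent).
class TreeNode
{
  public readonly string Name;
  public TreeNode Parent, FirstChild, NextSibling;

  public TreeNode(string name, params TreeNode[] children)
  {
    Name = name;
    TreeNode previous = null;
    foreach (var child in children)
    {
      // Link each child to its parent and to the sibling before it.
      child.Parent = this;
      if (previous == null) FirstChild = child;
      else previous.NextSibling = child;
      previous = child;
    }
  }
}

static class RecursiveWalk
{
  // Calls visit once per leaf, left to right. The call stack remembers
  // our position in the tree, so no explicit state is needed.
  public static void VisitLeaves(TreeNode node, Action<TreeNode> visit)
  {
    if (node.FirstChild == null)
    {
      visit(node);
      return;
    }
    for (var child = node.FirstChild; child != null; child = child.NextSibling)
    {
      VisitLeaves(child, visit);
    }
  }
}
```

The catch is that the caller cannot pause between leaves: the walk runs to completion inside `VisitLeaves`, with its position held on the call stack. An incremental walk has to keep that position somewhere else.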

The idea is that we want to walk through the tree following the arrows in the diagram, as if we are walking along the outside of the tree with our left hand touching it.

			↙︎	A	↖︎
		↙︎	╱	│	╲	↖︎
	↙︎	B	↷	C	↷	D	↑
↙︎	╱	↷	╲↖︎	→	⤴︎↓︎	│	↑
↓︎	E	↑↓	F	↖︎	↓	G	↑
⤷	→	⤴︎⤷︎	→	⤴︎	⤷	→	⤴︎

The various types of tree walks differ primarily in where you stop to rest, and therefore in the state that must be preserved at each stopping point so that we know how to resume the walk.

Let’s assume that we have a cursor class that can move through the tree.

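// Declaration sketch: method bodies are omitted.
// Each TryMoveTo... method returns true if the cursor moved and false if
// there is no such node. The walker below relies on a failed move leaving
// the cursor where it was.
class TreeCursor
{
  public TreeCursor(TreeNode node);
  public bool TryMoveToFirstChild();
  public bool TryMoveToNextSibling();
  public bool TryMoveToParent();
  public TreeNode Current { get; }
}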
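To make the cursor concrete, here is one possible implementation. It is a sketch under stated assumptions: the `TreeNode` type with `Parent`, `FirstChild`, and `NextSibling` fields is hypothetical, as is the private `TryMoveTo` helper. It also assumes that a failed move leaves the cursor where it was, which the article does not state outright but which the `LeafWalker` later in the article depends on.

```csharp
// Hypothetical node type: the field names are assumptions for illustration.
class TreeNode
{
  public TreeNode Parent, FirstChild, NextSibling;
}

class TreeCursor
{
  public TreeCursor(TreeNode node) { Current = node; }

  public TreeNode Current { get; private set; }

  public bool TryMoveToFirstChild() => TryMoveTo(Current.FirstChild);
  public bool TryMoveToNextSibling() => TryMoveTo(Current.NextSibling);
  public bool TryMoveToParent() => TryMoveTo(Current.Parent);

  // Moves only if the target exists; otherwise the cursor stays put
  // and the caller is told that the move failed.
  private bool TryMoveTo(TreeNode target)
  {
    if (target == null) return false;
    Current = target;
    return true;
  }
}
```

A cursor over a real hierarchy would make the same three moves by calling that hierarchy's own navigation functions instead of reading fields.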

Our first algorithm is to walk through the tree, stopping at each leaf node. A leaf node is a node with no children.

I choose this as our first algorithm because it has only one state: You’re at a leaf node and need to find the next leaf node. Being at a leaf node means that you’re at the “curve around the bottom of a node ⭯” part of the path around the tree.

To find the first leaf node, we keep moving to the first child until we find a node with no children.

To find the next leaf node, we first realize that since we are at a leaf node, we have no children of our own. So we move up to the parent, and then back down to the next child. That next child is our starting node’s next sibling. Once there, we keep moving to the first child until we find a node with no children.

The last case is where we are at the last sibling. In that case, we move up to the parent node and try again. If we are at the root (no parent node), then we’re done.

Capturing this algorithm results in the following:

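class LeafWalker
{
  private TreeCursor cursor;

  // Start at the given node and descend to the first leaf.
  public LeafWalker(TreeNode node)
  {
    cursor = new TreeCursor(node);
    GoDeep();
  }

  // Advance to the next leaf. Returns false when there are no more leaves.
  public bool MoveNext()
  {
    do {
      // If there is a next sibling, the next leaf is the first leaf beneath it.
      if (cursor.TryMoveToNextSibling()) {
        GoDeep();
        return true;
      }
      // No more siblings: climb to the parent and try again from there.
    } while (cursor.TryMoveToParent());
    // We reached the root with nowhere left to go.
    return false;
  }

  public TreeNode Current => cursor.Current;

  // Keep moving to the first child until we reach a node with no children.
  private void GoDeep()
  {
    while (cursor.TryMoveToFirstChild()) { }
  }
}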
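As a usage sketch, the following runs the walker over the tree from the diagram (A has children B, C, and D; B has children E and F; D has child G). The `TreeNode` type, the `TreeCursor` implementation, and the `Demo` helpers are assumptions made for illustration; only `LeafWalker` comes from the article.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical node type: the field names are assumptions for illustration.
class TreeNode
{
  public readonly string Name;
  public TreeNode Parent, FirstChild, NextSibling;

  public TreeNode(string name, params TreeNode[] children)
  {
    Name = name;
    TreeNode previous = null;
    foreach (var child in children)
    {
      child.Parent = this;
      if (previous == null) FirstChild = child;
      else previous.NextSibling = child;
      previous = child;
    }
  }
}

// One possible cursor; a failed move leaves the cursor where it was.
class TreeCursor
{
  public TreeCursor(TreeNode node) { Current = node; }
  public TreeNode Current { get; private set; }
  public bool TryMoveToFirstChild() => TryMoveTo(Current.FirstChild);
  public bool TryMoveToNextSibling() => TryMoveTo(Current.NextSibling);
  public bool TryMoveToParent() => TryMoveTo(Current.Parent);

  private bool TryMoveTo(TreeNode target)
  {
    if (target == null) return false;
    Current = target;
    return true;
  }
}

// The walker as given in the article.
class LeafWalker
{
  private TreeCursor cursor;

  public LeafWalker(TreeNode node)
  {
    cursor = new TreeCursor(node);
    GoDeep();
  }

  public bool MoveNext()
  {
    do {
      if (cursor.TryMoveToNextSibling()) {
        GoDeep();
        return true;
      }
    } while (cursor.TryMoveToParent());
    return false;
  }

  public TreeNode Current => cursor.Current;

  private void GoDeep()
  {
    while (cursor.TryMoveToFirstChild()) { }
  }
}

static class Demo
{
  // Builds the tree shown in the diagram.
  public static TreeNode BuildDiagramTree() =>
    new TreeNode("A",
      new TreeNode("B", new TreeNode("E"), new TreeNode("F")),
      new TreeNode("C"),
      new TreeNode("D", new TreeNode("G")));

  // Collects leaf names in the order the walker visits them.
  public static List<string> LeafNames(TreeNode root)
  {
    var names = new List<string>();
    var walker = new LeafWalker(root);
    // The constructor already rests on the first leaf, so read Current
    // before the first call to MoveNext.
    do {
      names.Add(walker.Current.Name);
    } while (walker.MoveNext());
    return names;
  }
}
```

With these definitions, `Demo.LeafNames(Demo.BuildDiagramTree())` yields E, F, C, G. Two things to note. First, unlike the usual enumerator pattern, the walker is already positioned on a leaf after construction, hence the do/while loop. Second, `MoveNext` stops only when `TryMoveToParent` fails, so starting the walker at a node that has a parent causes it to continue into the rest of the tree rather than stopping at the edge of that node's subtree.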

That was a nice warm-up. We’ll try something a little harder next time.

Author

Raymond Chen

Raymond has been involved in the evolution of Windows for more than 30 years. In 2003, he began a Web site known as The Old New Thing which has grown in popularity far beyond his wildest imagination, a development which still gives him the heebie-jeebies. The Web site spawned a book, coincidentally also titled The Old New Thing (Addison Wesley 2007). He occasionally appears on the Windows Dev Docs Twitter account to tell stories which convey no useful information.

4 comments

Dmitry Vozzhaev January 6, 2020

Implementing such iterator requires either something like a stack in the iterator or pointers to siblings in nodes. Former approach is essentially hand made recursion, and latter one will end up with insane amount of bookkeeping in tree operations. So let me guess, you’re going to implement it through pointer arithmetic magic?

Raymond Chen Author January 6, 2020

The requirements of the TreeCursor class are given in the article. There are many cursors that satisfy the requirements, such as DOM Node, XmlNode, GetWindow, and IUIAutomationTreeWalker.

Alex Martin January 6, 2020

That tree diagram is so fragile to site styling or structure changes that it kind of scares me.

Neil Rashbrook January 10, 2020

Worse, something is causing my feed reader to display images rather than three of those Unicode arrows…
