February 9th, 2024

10 reactions

On the virtues of the trailing comma

Raymond Chen

Many programming languages allow trailing commas in lists.

C, C++, C# (and probably other languages) permit a trailing comma after the last enumerator:

enum Color
{
    Red,
    Blue,
    Green,
    //   ^ trailing comma
};

They also allow a trailing comma in list initializers.

// C, C++
Thing a[] = {
    { 1, 2 },
    { 3, 4 },
    { 5, 6 },
    //      ^ trailing comma
};

// C#
Thing[] a = new[] {
    new Thing {
        Name = "Bob",
        Id = 31415,
        //        ^ trailing comma
    },
    new Thing {
        Name = "Alice",
        Id = 2718,
        //       ^ trailing comma
    },
//   ^ trailing comma
};

Dictionary d = new Dictionary<string, Thing>() {
    ["Bob"] = new Thing("Bob") { Id = 31415 },
    ["Alice"] = new Thing("Alice", 2718),
    //                                  ^ trailing comma
};

These trailing commas are convenient when you arrange for each element to appear on its own line, like we did in the examples above. It lets you rearrange the items by moving lines around without having to worry about having to add a comma to an element when it moves out of the final position, or removing a comma from the element that moved into the final position.

It also reduces merge risk when people modify the list. For example, if somebody adds a new color “Black” to the end, they won’t have to touch any of the other lines, which means that a change from “Blue” to “LightBlue” won’t result in a merge conflict.

And even when there is a merge conflict due to two simultaneous adds, you can easily resolve it by accepting both.

enum Color
{
    Red,
    Blue,
    Green,
<<< VERSION 1
    Black,
|||
    White,
<<< VERSION 2
};

To resolve this, you can just delete all the conflict markers.

enum Color
{
    Red,
    Blue,
    Green,
    Black,
    White,
};

If your code didn’t use trailing commas, the merge would be messier:

enum Color
{
    Red,
    Blue,
<<< VERSION 1
    Green,
    Black
|||
    Green,
    White
<<< VERSION 2
};

And if you have a lot of these merges to deal with, you might forget to insert a comma after “Black”:

enum Color
{
    Red,
    Blue,
    Green,
    Black // ⇐ oops, forgot a comma
    White
};

Since the trailing comma reduces the number of lines of code that have to be modified when the list is extended, it also makes git blame more accurate. Without the trailing comma, a git blame on enum Color would blame the person who added “Black” for also being the last person to modify the “Green” line. If you’re investigating a problem with “Green”, you might ask that person for help, and they’ll say, “Oh no, I didn’t add ‘Green’. I added ‘Black’. You’ll have to dig further back into the history to figure out who added ‘Green’.”

Thank

C++

you

for

Java

supporting

JSON

Not you

JavaScript

trailing

Rust

commas

Python

lists

Bonus chatter: The trailing comma also makes it easier for code generators, since they can just emit a comma after each element and not have to worry about suppressing the final comma.

Bonus bonus chatter: But why not go all the way and allow a trailing comma in parameter lists?

SomeFunction(1, 2, );
//               ^ trailing comma not allowed

I suspect the primary reason is “nobody asked for it.” Variadic functions are relatively uncommon, so this is not something that code generators stumble across. Also, that extra comma just plain looks weird.

Overloaded functions could pose a parsing problem. If there are 2-parameter and 3-parameter overloads of SomeFunction, is this a call to the two-parameter overload, or is it a call to the three-parameter overload with some sort of default?

Bonus bonus bonus chatter: JavaScript, Rust, and Ruby allow a trailing comma in parameter lists.

Bonus bonus bonus bonus chatter: In the Pascal programming language, the semicolon is a statement separator, not a statement terminator, so you can write

begin
  i := 1;
  j := 2  (* no trailing semicolon *)
end

In practice, everybody puts a semicolon just before the end. Imaging rearranging two lines of code and having to adjust semicolons.

Author

Raymond Chen

Raymond has been involved in the evolution of Windows for more than 30 years. In 2003, he began a Web site known as The Old New Thing which has grown in popularity far beyond his wildest imagination, a development which still gives him the heebie-jeebies. The Web site spawned a book, coincidentally also titled The Old New Thing (Addison Wesley 2007). He occasionally appears on the Windows Dev Docs Twitter account to tell stories which convey no useful information.

25 comments

Discussion is closed. Login to edit/delete existing comments.

Johan Sköld February 15, 2024
Zig also supports trailing comma in parameter lists. Which may appear useless at first glance, but there is a neat side-effect you get from it: If you pass a file into `zig fmt` it will format that file for you. For parameter lists it puts them all on one line. But if the parameter list has a trailing comma, it will instead put them one on each line. So without trailing comma:

<code>

With trailing comma:

<code>

Read more
Zig also supports trailing comma in parameter lists. Which may appear useless at first glance, but there is a neat side-effect you get from it: If you pass a file into `zig fmt` it will format that file for you. For parameter lists it puts them all on one line. But if the parameter list has a trailing comma, it will instead put them one on each line. So without trailing comma:
```
fn add(a: i32, b: i32) i32 {
    return a + b;
}
```
With trailing comma:
```
fn add(
    a: i32,
    b: i32,
) i32 {
    return a + b;
}
```
Read less
Frédéric B. February 12, 2024 · Edited

This is one of the three reasons I’m baffled that .NET Core switched from XML to JSON for config files (the other two being retro-compatibility and vanilla JSON disallowing comments).
George Byrkit February 10, 2024

IIRC, PL/1 was also a ‘semi-colon is statement separator’ language. Made porting from C to PL/1 slightly harder, as you had semicolons to eliminate.
Nathan Mates February 10, 2024

JSON5 allows the trailing commas, etc. It’s available as a library/package/etc for most of your favorite programming languages.
Neil Rashbrook February 10, 2024

Note that Pascal doesn’t allow a semicolon before else. The nearest equivalent in C would be, say, trying to write do { break; }; while (1);
- Dmitry February 10, 2024 · Edited
  Which in fact is entirely in the logic that semicolon is a delimiter, not end-of-statemeny mark, since
  
  <code>
  
  Nothing to separate here. Just like in your do…while example, before while part, where there’s nothing to mark end of, yes.
  
  Although for C it’s better to treat {} as a language construct separate from statements like if, while or for, since the rules for semicolon are not as consistent as in Pascal.
  
  P.S. Thanks, blog engine, for breaking EBNF.
  
  Read more
  Which in fact is entirely in the logic that semicolon is a delimiter, not end-of-statemeny mark, since
  
  <If_statement> ::= "if" <logical_expr> "then" <statement_1> ["else" <statement_2>].
  
  Nothing to separate here. Just like in your do…while example, before while part, where there’s nothing to mark end of, yes.
  
  Although for C it’s better to treat {} as a language construct separate from statements like if, while or for, since the rules for semicolon are not as consistent as in Pascal.
  
  P.S. Thanks, blog engine, for breaking EBNF.
  Read less
  - 紅樓鍮 February 11, 2024
    The do-while loop in C is also consistent with if-else and other control statements:
    <code>
    the can be any statement, including a simple statement that ends in semicolon:
    <code>
    the only places I can think of where braces are required are function bodies and switch statements. In those cases, you can consider the tokens and to directly belong to the grammars for and respectively, and then becomes completely consistent with .
    
    Read more
    The do-while loop in C is also consistent with if-else and other control statements:
    
    <do-while> ::= "do" <stmt> "while" "(" <expr> ")" ";"
    
    the <stmt> can be any statement, including a simple statement that ends in semicolon:
    
    do stmt; while (expr);
    
    the only places I can think of where braces are required are function bodies and switch statements. In those cases, you can consider the tokens "{" and "}" to directly belong to the grammars for <func-def> and <switch-stmt> respectively, and then <compound-stmt> becomes completely consistent with <stmt>.
    
    Read less
  - Dmitry February 11, 2024
    Well, not really. I mean, if we say that semicolon is a statement terminator (which it mostly is) and call {} a compound statement (but still statement, just like if or while) then there’s a problem:
    
    <code>
    
    If braces are compound statement and statements are terminated with semicolon, then we should put semicolons after both closing braces. That would work for the else-branch (although for completely different reason called ”empty statement”) but something goes wrong with the first branch (what do you call it, guys? we call it then(zen)-branch).
    
    To make it at least feel more consistent, it helps thinking that closing brace...
    Read more
    Well, not really. I mean, if we say that semicolon is a statement terminator (which it mostly is) and call {} a compound statement (but still statement, just like if or while) then there’s a problem:
    
    if (some_expr) { ... } else { ... }
    
    If braces are compound statement and statements are terminated with semicolon, then we should put semicolons after both closing braces. That would work for the else-branch (although for completely different reason called ”empty statement”) but something goes wrong with the first branch (what do you call it, guys? we call it then(zen)-branch).
    
    To make it at least feel more consistent, it helps thinking that closing brace implicitly contains the semicolon. Still not as consistent as in Pascal though.
    
    Read less
  - 紅樓鍮 February 12, 2024 · Edited
    
    Yes, the real inconsistency is in that some statements end in a semicolon while others don’t (most notably { ... }, but things like if (...) ... else ... also themselves don’t have the semicolon for that matters 🙂
紅樓鍮 February 9, 2024
And the chad F# allows you to write lists and argument lists with no commas at all!
```
[
    "Red"
    "Blue"
    "Green"
]

SomeFunction
    1
    2
```
GL February 9, 2024

In C++ you can use trailing commas in brace initialization, and the extra comma does not change how overload resolution works. In C# if you have a normal variadic (i.e., params array, not vaargs) method and you find yourself constantly changing the number of arguments or shuffling them, then it’s better to call it with an explicitly initialized array, for which you can use trailing comma.
Andy Janata February 9, 2024

Not only does Go support the trailing comma, it _requires_ it in the situations shown here.
Rohitab Batra February 9, 2024

JSON – Not you 🙂
Swap Swap February 9, 2024 · Edited
The worst consequence of not using a trailing comma is that you can make a mistake when merging two versions or rearranging the lines, and the compiler won’t warn you. For example:
```
"Red",
"Green"
"Blue"
```
Here, Green and Blue were added in two different branches. They will be concatenated together by the preprocessor, so the effective code will be:
```
"Red",
"GreenBlue"
```
which is a bug.

On the virtues of the trailing comma

Author

25 comments

Read next

How can I get the Windows Runtime HttpClient to display a basic authentication prompt?

Functions that return the size of a required buffer generally return upper bounds, not tight bounds

Author

25 comments

Read next

How can I get the Windows Runtime HttpClient to display a basic authentication prompt?

Functions that return the size of a required buffer generally return upper bounds, not tight bounds

Stay informed