A simplified overview of ways to add or update elements in a std::map

Some time ago, I mentioned how the std::map subscript operator is a dangerous convenience. In that article, I linked to an overview of the insertion emplacement methods, but I’m going to recapture the essential points in a table.¹

In the table below, the discussion of “consumed” or “not consumed” refers to the case that v is an rvalue reference like std::move(something).

Statement	If key present	If key not present
`m.insert({ k, v });`	No effect, `v` is consumed	Added, `v` is consumed
`m.emplace(k, v);`	No effect, `v` is consumed	Added, `v` is consumed
`m.try_emplace(k, v);`	No effect, `v` is not consumed	Added, `v` is consumed
`m.insert_or_assign(k, v);`	Updated, `v` is consumed	Added, `v` is consumed
`m.at(k) = v;`	Updated, `v` is consumed	Throws, `v` is not consumed

We can reorganize the table by effect.

		If key present
		No effect `v` not consumed	No effect `v` consumed	Updated `v` consumed
If key not present	Added	`m.try_emplace(k, v);`	`m.insert({ k, v });` `m.emplace(k, v);`	`m.insert_or_assign(k, v);`
If key not present	Throws			`m.at(k) = v;`

Exercise: Why are the bottom left two boxes blank?

Sidebar: I intentionally omit m[k] = v; as a possibility because behaves the same as insert_or_assign, but with worse performance, and works in fewer circumstances: If the key does not exist, then m[k] = v first creates a default-constructed value and inserts it into the map, and then overwrites that default-constructed value with v. This creates an empty value unnecessarily, and it requires that the value type be default-constructible. End sidebar

Often overlooked is that the “no effect if key is present” methods also tell you whether it did anything. This means that you can avoid double-probing the map.

// inefficient version
if (map.contains(k)) {
    already_present();
} else {
    map[k] = v;
    newly_added();
}

// more efficient alternative
// (assuming v is easy to calculate)
if (map.try_emplace(k, v).second) {
    newly_added();
} else {
    already_present();
}

Instead of checking whether the map contains a key before adding the element, just try to add it and deal with the collision.

Here’s another simplification:

// inefficient version
do {
    k = gen_random_key();
} while (map.contains(k));
map[k] = v;

// more efficient alternative
do {
    k = gen_random_key();
while (!map.try_emplace(k, v).second);

Again, instead of checking whether the map contains a key before adding the element, just try to add it and deal with the collision.

For completeness, here’s a table for reading from a map.

Statement	If key present	If key not present
`auto& v = m[k];`	Reference to existing value	Reference to newly-created value
`auto& v = m.at(k);`	Reference to existing value	Throws

Another mistake I see is code where each line makes sense on its own, but they perform duplicate work that could be combined.

// inefficient version
if (m.find(k) != m.end()) {
    m[k].SomeMethod();
}

// more efficient alternative
if (auto found = m.find(k); found != m.end()) {
    found->second.SomeMethod();
}

The two pieces make sense separately. The m.find(k) != m.end() is the standard way to detect whether a key is present in the map. And the m[k].SomeMethod() is a standard way to invoke a method on a value stored in the map (though really, should be m.at(k).SomeMethod()). But the second statement repeats work done by the first statement, because m.at(k) and m[k] are just a m.find(k)->second, with an exception if the item is not found (for m.at()) or creating the entry if the item is not found (for m[]). If you’ve already done the m.find(k) in the if statement, you can use that result instead of making m.at() and m[] search for it a second time. (On top of that, m[] has to create an empty entry if the key is missing.)

I’ve also seen code that doesn’t realize that the [] operator creates an empty entry if the key is not present. This manifests in two ways:

std::map<Key, std::vector<Value>> m;

// problem 1: Creating unnecessary empty entries
m[k] = new_value;

// problem 2: Manually creating empty entries
std::vector<Value> strings;
if (auto found = m.find(k); found != m.end())
{
    strings = found->second;
}
strings.push_back(new_value);
m[k] = strings;

// better version of 1: Use insert_or_assign
m.insert_or_assign(k, new_value);

// better version of 2: Create the empty entry on demand
m[k].push_back(new_value);

Bonus chatter: In retrospect, there probably shouldn’t have been a [] operator for std::map, since it takes the most convenient syntax for a rarely-needed operation. It should have been called something like m.create_or_get(k).

Answer to exercise: The bottom left two boxes correspond to hypothetical operations that throw if the key is missing, and do nothing if the key is present. This is not very useful, so the standard provides no method to perform it. I guess you could use (void)m.at(k).

¹ More accurately, in another table.

Author

Raymond Chen

Raymond has been involved in the evolution of Windows for more than 30 years. In 2003, he began a Web site known as The Old New Thing which has grown in popularity far beyond his wildest imagination, a development which still gives him the heebie-jeebies. The Web site spawned a book, coincidentally also titled The Old New Thing (Addison Wesley 2007). He occasionally appears on the Windows Dev Docs Twitter account to tell stories which convey no useful information.

4 comments

Igor Levicki January 20, 2025

Raymond,

I am looking at this new if syntax, it’s mind boggling that no one thought of adding a new keyword (e.g. when) instead of using a semicolon:

if (auto found = m.find(k) when found != m.end()) {
    // do something with it
}

It helps differentiate new if syntax from existing loop (for / do / while) syntax, better visually separates initialization from conditional expression all while clearly labelling said conditional expression. It would’ve been so much more readable.

Georg Rottensteiner January 13, 2025

I wonder how much compiler optimization comes into play here.
I was under the impression, that yes, the m[k] = v would cause a construction plus assignment, but the compiler would optimize that away?
One of the reasons there’s such a performance difference between debug and release builds with STL container usage.

Kevin Norris January 13, 2025

Frankly, the compiler is just not very good at optimizing out things like this. In order to remove the default construction, the compiler needs to prove that the constructor has no side effects that escape. If LTO is disabled, then this never gets off the ground, unless the constructor happens to be in the same TU as the callsite. But even when LTO is enabled, the default constructor will usually recursively construct all fields of the class or struct, and proving that none of those constructors go on to have any side effects may be too hard for big or...
Read more
Frankly, the compiler is just not very good at optimizing out things like this. In order to remove the default construction, the compiler needs to prove that the constructor has no side effects that escape. If LTO is disabled, then this never gets off the ground, unless the constructor happens to be in the same TU as the callsite. But even when LTO is enabled, the default constructor will usually recursively construct all fields of the class or struct, and proving that none of those constructors go on to have any side effects may be too hard for big or complicated classes.

This is the same reason that copy elision was invented – just like it’s hard to prove that default constructors don’t have side effects, it is also hard to prove that copy and move constructors don’t have side effects.

Read less
- Igor Levicki January 20, 2025
  
  The MSVC compiler isn’t good at optimizing pretty much anything. It still can’t vectorize almost any loops apart from the simplest memset or SAXPY types.

Discussion is closed. Login to edit/delete existing comments.

Igor Levicki January 20, 2025
Raymond,

I am looking at this new if syntax, it’s mind boggling that no one thought of adding a new keyword (e.g. when) instead of using a semicolon:
```
if (auto found = m.find(k) when found != m.end()) {
    // do something with it
}
```
It helps differentiate new if syntax from existing loop (for / do / while) syntax, better visually separates initialization from conditional expression all while clearly labelling said conditional expression. It would’ve been so much more readable.
Georg Rottensteiner January 13, 2025

I wonder how much compiler optimization comes into play here.
I was under the impression, that yes, the m[k] = v would cause a construction plus assignment, but the compiler would optimize that away?
One of the reasons there’s such a performance difference between debug and release builds with STL container usage.
- Kevin Norris January 13, 2025
  
  Frankly, the compiler is just not very good at optimizing out things like this. In order to remove the default construction, the compiler needs to prove that the constructor has no side effects that escape. If LTO is disabled, then this never gets off the ground, unless the constructor happens to be in the same TU as the callsite. But even when LTO is enabled, the default constructor will usually recursively construct all fields of the class or struct, and proving that none of those constructors go on to have any side effects may be too hard for big or...
  Read more
  Frankly, the compiler is just not very good at optimizing out things like this. In order to remove the default construction, the compiler needs to prove that the constructor has no side effects that escape. If LTO is disabled, then this never gets off the ground, unless the constructor happens to be in the same TU as the callsite. But even when LTO is enabled, the default constructor will usually recursively construct all fields of the class or struct, and proving that none of those constructors go on to have any side effects may be too hard for big or complicated classes.
  
  This is the same reason that copy elision was invented – just like it’s hard to prove that default constructors don’t have side effects, it is also hard to prove that copy and move constructors don’t have side effects.
  
  Read less
  - Igor Levicki January 20, 2025
    
    The MSVC compiler isn’t good at optimizing pretty much anything. It still can’t vectorize almost any loops apart from the simplest memset or SAXPY types.

A simplified overview of ways to add or update elements in a `std::map`

Author

4 comments

Read next

How does `GetUserDefaultGeoName` choose between ISO 3166-1 and UN M.49 codes?

In a C++ class template specialization, how can I call the unspecialized version of a method?

Author

4 comments

Read next

How does Get­User­Default­Geo­Name choose between ISO 3166-1 and UN M.49 codes?

In a C++ class template specialization, how can I call the unspecialized version of a method?

Stay informed

How does `GetUserDefaultGeoName` choose between ISO 3166-1 and UN M.49 codes?