May 7th, 2026

When you upgrade your resource strings to Unicode, don’t forget to specify the L prefix

Raymond Chen

Some time ago, I discussed how the Resource Compiler defaults to CP_ACP, even in the face of subtle hints that the file is UTF-8.

After yet another incident of Visual Studio secretly changing the file encoding from 1252 to UTF-8 and breaking all non-ASCII strings, combined with Azure DevOps and Visual Studio simply ignoring encoding changes when showing diffs, a colleague decided to solve the problem once and for all by using explicit Unicode escapes \x#### to represent non-ASCII characters. That way, it doesn’t matter whether the file encoding is 1252 or UTF-8 because the two code pages agree on the common ASCII subset.

What used to be

IDS_AWESOME "That’s great!"

was changed to

IDS_AWESOME "That\x2019s great!"

Unfortunately, the resulting string that appeared on screen was

That 19s great!

What went wrong?

If you are encoding Unicode into your string, you have to put an L prefix on the quoted string. Otherwise, the \xABCD sequence is interpreted as an 8-bit \xAB escape sequence, followed by two literal characters CD. In this case, the \x2019 was interpreted as \x20 (which encodes a space) followed by the literal characters 19, resulting in the string That␣19s great!.

The correct conversion includes the L prefix.

IDS_AWESOME L"That\x2019s great!"
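
To see why the prefix matters, here is a toy model in C of the rule described above. It is only a sketch: the function name `decode_hex_escapes` and its `max_digits` parameter are made up for illustration, and this is not how the Resource Compiler is actually implemented. It assumes that a narrow string accepts at most two hex digits after `\x` and a wide string at most four.

```c
#include <ctype.h>
#include <stddef.h>

/* Toy model of the escape handling described in the article.
   NOT the Resource Compiler's real parser; names are hypothetical.
   max_digits: 2 models a narrow "..." string, 4 models a wide L"..." string.
   Only \x escapes are decoded; every other character is copied as-is.
   Returns the number of code units written to out (at most cap). */
static size_t decode_hex_escapes(const char *src, int max_digits,
                                 unsigned out[], size_t cap)
{
    size_t n = 0;
    while (*src != '\0' && n < cap) {
        if (src[0] == '\\' && src[1] == 'x') {
            unsigned value = 0;
            int digits = 0;
            src += 2; /* skip the backslash and the x */
            /* Consume hex digits, but never more than max_digits. */
            while (digits < max_digits && isxdigit((unsigned char)*src)) {
                int c = tolower((unsigned char)*src);
                value = value * 16 +
                        (unsigned)(isdigit(c) ? c - '0' : c - 'a' + 10);
                src++;
                digits++;
            }
            out[n++] = value;
        } else {
            out[n++] = (unsigned char)*src++;
        }
    }
    return n;
}
```

Under those assumptions, decoding `That\x2019s great!` with `max_digits` set to 2 yields a space followed by the literal characters `1` and `9`, whereas `max_digits` set to 4 yields the single code unit U+2019.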

Topics

Code

Author

Raymond Chen

Raymond has been involved in the evolution of Windows for more than 30 years. In 2003, he began a Web site known as The Old New Thing which has grown in popularity far beyond his wildest imagination, a development which still gives him the heebie-jeebies. The Web site spawned a book, coincidentally also titled The Old New Thing (Addison Wesley 2007). He occasionally appears on the Windows Dev Docs Twitter account to tell stories which convey no useful information.

3 comments

Petteri Aimonen May 18, 2026
Interestingly this differs from how C parses hex escapes, which is confusing in a different way.
For example
```
"\x20are you sure?"
```
gets printed as a newline and “re you sure?”, because a C hex escape consumes every hex digit that follows it, so the compiler sees `\x20a` rather than `\x20` followed by `a`.
Sometimes compilers are smart enough to give a warning when this happens.
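
One common way to stop a C hex escape from swallowing the following characters is to split the literal in two, since adjacent string literals are concatenated only after escapes have been processed. A minimal sketch (the names `prompt` and `prompt_text` are just for illustration):

```c
#include <stddef.h>

/* Written as one literal, "\x20are you sure?" would be parsed as the
   escape \x20a followed by "re you sure?".
   Splitting the literal ends the escape after two digits; the compiler
   then joins the two adjacent literals into one string. */
static const char prompt[] = "\x20" "are you sure?";

/* Returns the prompt, which begins with a space (0x20). */
static const char *prompt_text(void)
{
    return prompt;
}
```
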
Simon Geard May 8, 2026

This is how Java ended up with a “native2ascii” command. The actual localisation framework required properties files to be in 8859-1 — which is stupid for a localisation framework, since people actually wanted their localisations (especially non-roman ones like CJK) to be readable/maintainable in source code — so they gave us a command line tool for converting properties files in arbitrary encodings to 8859-1, encoding everything with \uxxxx sequences that the Java runtime could read. Meanwhile, various frameworks added their own replacements for the built-in APIs, ones which supported UTF-8, etc.

Fortunately, they eventually agreed that this was stupid, and changed Java to assume UTF-8 by default instead of ISO-8859-1. This probably broke some applications, but very much worth it.

- Me Gusta May 8, 2026
  
  It isn’t that bad. As was mentioned, the biggest issue was that, for whatever reason, the resources needed to use the code page to interpret the strings correctly. This would obviously mean that if Windows is set to UTF-8 as the code page, then the resource compiler would interpret strings as UTF-8. Also, as noted, the resource compiler is able to just use UTF-16 strings anyway.
  
  There is another interesting thing too: if you open a Visual C++ command prompt that references a fairly recent version of the Windows SDK, rc.exe /? will show a /8 command line option. The description of this is “Enable UTF-8-only mode”. This isn’t currently documented in the resource compiler documentation, but I think it is safe to assume what it does. It is easy to see why this would be an opt-in option too. Hopefully it will be properly documented soon.
  
