C++ Guide: `wstring` – Wide Character String Type

1. What is `wstring`?

std::wstring is the wide-character string type provided by the C++ standard library: a std::basic_string whose elements are wchar_t. Where std::string stores char elements, std::wstring stores wide characters, which makes it a common choice for Unicode text on platforms with wide-character APIs and for applications that need multilingual support, such as Chinese, Japanese, Korean, and other non-ASCII text.

2. Underlying Data Type of `wstring`

📌 `std::string` vs. `std::wstring`

String Type	Stored Character Type	Common Encoding
`std::string`	`char` (single-byte)	ASCII / UTF-8
`std::wstring`	`wchar_t` (wide character)	UTF-16 / UTF-32

Note:

std::wstring uses wchar_t to store characters, which may vary in size across different operating systems:

Windows: wchar_t is typically 2 bytes (UTF-16)
Linux/macOS: wchar_t is typically 4 bytes (UTF-32)

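wstring is suitable for programs that work with wide character sets (WCS), for example code that calls wide-character platform APIs. Text in a multi-byte character set (MBCS) such as UTF-8 is normally held in std::string and converted when needed (see section 4).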
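The size difference has a practical consequence: a character outside the Basic Multilingual Plane occupies two wchar_t elements where wchar_t is 2 bytes, and one element where it is 4 bytes. The following is a minimal sketch of how that can be checked; `wideCharBytes` and `unitsForEmoji` are hypothetical helper names introduced only for this illustration.

```cpp
#include <cstddef>
#include <string>

// Number of bytes in one wchar_t on the current platform
// (typically 2 on Windows, 4 on Linux/macOS).
constexpr std::size_t wideCharBytes() {
    return sizeof(wchar_t);
}

// Number of wchar_t elements needed to store U+1F600, a character
// outside the Basic Multilingual Plane. With a 2-byte wchar_t the
// compiler encodes it as a UTF-16 surrogate pair (2 elements);
// with a 4-byte wchar_t it fits in a single element.
std::size_t unitsForEmoji() {
    std::wstring emoji = L"\U0001F600";
    return emoji.size();
}
```

Calling `unitsForEmoji()` from `main` and printing the result next to `wideCharBytes()` shows why `length()` should be read as "number of wchar_t elements" rather than "number of characters".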

3. Basic Usage of `wstring`

📌 1️⃣ Creating `wstring`

#include <iostream>
#include <string>

int main() {
    std::wstring ws1 = L"你好，世界！"; // L prefix indicates wide string
    std::wstring ws2 = L"Hello, Wide World!";
    
    std::wcout << L"Wide character string: " << ws1 << std::endl;
    std::wcout << L"English string: " << ws2 << std::endl;

    return 0;
}

💡 Note:

Use the L prefix to indicate a wide character string, for example L"你好".
std::wcout is used to output wstring (std::cout cannot directly output wstring).
It is necessary to set the locale to correctly display wstring (see section 5 below).

📌 2️⃣ Common Operations on `wstring`

#include <iostream>
#include <string>

int main() {
    std::wstring ws = L"宽字符字符串";

    // Length
    std::wcout << L"String length: " << ws.length() << std::endl;

    // Concatenate strings
    ws += L" - additional content";
    std::wcout << L"After concatenation: " << ws << std::endl;

    // Access character
    std::wcout << L"First character: " << ws[0] << std::endl;

    // Find substring (search for text that was appended above)
    std::size_t pos = ws.find(L"additional");
    if (pos != std::wstring::npos) {
        std::wcout << L"Found 'additional', position: " << pos << std::endl;
    }

    return 0;
}

🛠 Common Functions:

Function	Purpose
`length()`	Get string length (number of wchar_t elements)
`append()` or `+=`	Concatenate strings
`find(L"substring")`	Find substring
`substr(start, length)`	Get substring
`compare()`	Compare strings
`empty()`	Check if empty
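The example program above does not exercise substr(), compare(), empty(), or append(). The sketch below shows them in small helper functions; the names `firstChars`, `sameText`, and `joinWithSeparator` are hypothetical and exist only for illustration.

```cpp
#include <cstddef>
#include <string>

// Return at most `count` wide characters from the start of `text`.
// substr() clamps the length, so asking for more than is available is safe.
std::wstring firstChars(const std::wstring& text, std::size_t count) {
    return text.substr(0, count);
}

// compare() returns 0 when both strings hold the same sequence of wchar_t.
bool sameText(const std::wstring& a, const std::wstring& b) {
    return a.compare(b) == 0;
}

// Join two strings with " - ", skipping the separator if either side is empty.
std::wstring joinWithSeparator(const std::wstring& left, const std::wstring& right) {
    if (left.empty()) return right;
    if (right.empty()) return left;
    std::wstring result = left;
    result.append(L" - ").append(right);
    return result;
}
```

For example, `firstChars(L"Hello, Wide World!", 5)` yields `L"Hello"`, and `joinWithSeparator(L"a", L"b")` yields `L"a - b"`.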

4. Converting between `string` and `wstring`

📌 1️⃣ `wstring` → `string` (narrow character)

#include <iostream>
#include <string>
#include <locale>
#include <codecvt>

std::string wstringToString(const std::wstring& wstr) {
    std::wstring_convert<std::codecvt_utf8<wchar_t>> converter;
    return converter.to_bytes(wstr);
}

int main() {
    std::wstring ws = L"你好，世界！";
    std::string s = wstringToString(ws);
    std::cout << "Converted string: " << s << std::endl;
    return 0;
}

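📝 std::wstring_convert is an encoding conversion tool introduced in C++11; combined with std::codecvt_utf8<wchar_t>, it converts a wstring to a UTF-8 encoded string. Both it and the <codecvt> header are deprecated since C++17, so compilers may warn about this code. On Windows, where wchar_t holds UTF-16, std::codecvt_utf8_utf16<wchar_t> is the facet that handles characters outside the Basic Multilingual Plane.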
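Because std::wstring_convert is deprecated since C++17, some projects write the conversion by hand. The following is a minimal sketch of that approach, not a replacement for a tested library: `appendUtf8` and `wideToUtf8` are hypothetical names, and the code assumes the wstring holds UTF-16 where wchar_t is 2 bytes and UTF-32 otherwise. It does not reject invalid code points.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>

// Append one Unicode code point to `out` as UTF-8 (1 to 4 bytes).
void appendUtf8(std::string& out, std::uint32_t cp) {
    if (cp < 0x80) {
        out += static_cast<char>(cp);
    } else if (cp < 0x800) {
        out += static_cast<char>(0xC0 | (cp >> 6));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else if (cp < 0x10000) {
        out += static_cast<char>(0xE0 | (cp >> 12));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    } else {
        out += static_cast<char>(0xF0 | (cp >> 18));
        out += static_cast<char>(0x80 | ((cp >> 12) & 0x3F));
        out += static_cast<char>(0x80 | ((cp >> 6) & 0x3F));
        out += static_cast<char>(0x80 | (cp & 0x3F));
    }
}

// Convert a wstring to UTF-8. On platforms with a 2-byte wchar_t,
// a UTF-16 surrogate pair is first combined into one code point.
std::string wideToUtf8(const std::wstring& ws) {
    std::string out;
    for (std::size_t i = 0; i < ws.size(); ++i) {
        std::uint32_t cp = static_cast<std::uint32_t>(ws[i]);
        if (sizeof(wchar_t) == 2 && cp >= 0xD800 && cp <= 0xDBFF && i + 1 < ws.size()) {
            std::uint32_t low = static_cast<std::uint32_t>(ws[i + 1]);
            if (low >= 0xDC00 && low <= 0xDFFF) {
                cp = 0x10000 + ((cp - 0xD800) << 10) + (low - 0xDC00);
                ++i;  // the low surrogate has been consumed
            }
        }
        appendUtf8(out, cp);
    }
    return out;
}
```

With this sketch, `wideToUtf8(L"\u4F60\u597D")` produces the six bytes E4 BD A0 E5 A5 BD, the UTF-8 encoding of 你好.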

📌 2️⃣ `string` → `wstring` (wide character)

#include <iostream>
#include <string>
#include <locale>
#include <codecvt>

std::wstring stringToWstring(const std::string& str) {
    std::wstring_convert<std::codecvt_utf8<wchar_t>> converter;
    return converter.from_bytes(str);
}

int main() {
    std::string s = "Hello, 世界！"; // assumes this narrow literal is encoded as UTF-8
    std::wstring ws = stringToWstring(s);
    std::wcout << L"Converted wstring: " << ws << std::endl;
    return 0;
}
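For the reverse direction, a hand-written decoder is one way to avoid the deprecated <codecvt> facilities. The sketch below is illustrative only: `utf8ToWide` is a hypothetical name, the function assumes mostly well-formed UTF-8, and it substitutes U+FFFD for an unexpected lead byte or a truncated sequence instead of validating every continuation byte.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>

// Decode UTF-8 into a wstring (UTF-16 when wchar_t is 2 bytes, UTF-32 otherwise).
std::wstring utf8ToWide(const std::string& s) {
    std::wstring out;
    std::size_t i = 0;
    while (i < s.size()) {
        unsigned char b = static_cast<unsigned char>(s[i]);
        std::uint32_t cp = 0xFFFD;  // replacement character by default
        std::size_t len = 1;        // bytes consumed by this sequence

        // The lead byte determines the sequence length and the first payload bits.
        if (b < 0x80)                { cp = b; }
        else if ((b & 0xE0) == 0xC0) { cp = b & 0x1F; len = 2; }
        else if ((b & 0xF0) == 0xE0) { cp = b & 0x0F; len = 3; }
        else if ((b & 0xF8) == 0xF0) { cp = b & 0x07; len = 4; }

        // A sequence cut off by the end of the input becomes U+FFFD.
        if (i + len > s.size()) { cp = 0xFFFD; len = 1; }

        // Each continuation byte contributes its low 6 bits.
        for (std::size_t k = 1; k < len; ++k) {
            cp = (cp << 6) | (static_cast<unsigned char>(s[i + k]) & 0x3F);
        }
        i += len;

        if (sizeof(wchar_t) == 2 && cp >= 0x10000) {
            // Split into a UTF-16 surrogate pair.
            cp -= 0x10000;
            out += static_cast<wchar_t>(0xD800 + (cp >> 10));
            out += static_cast<wchar_t>(0xDC00 + (cp & 0x3FF));
        } else {
            out += static_cast<wchar_t>(cp);
        }
    }
    return out;
}
```

For example, `utf8ToWide("\xE4\xB8\x96\xE7\x95\x8C")` yields the two wide characters 世界.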

5. Resolving the Issue of `std::wcout` Not Displaying Chinese Characters Correctly

In the Windows cmd console or a Linux terminal, using std::wcout directly may not display a wstring correctly. Solutions:

📌 1️⃣ Windows (UTF-16)

#include <clocale>
#include <iostream>
#include <string>

int main() {
    std::setlocale(LC_ALL, ""); // Use the system's default locale so Chinese output is converted correctly

    std::wstring ws = L"你好，世界！";
    std::wcout << L"Correctly displaying wide character: " << ws << std::endl;

    return 0;
}

💡 setlocale(LC_ALL, "") selects the user's default locale, which allows std::wcout to correctly display the Unicode characters that the console's encoding can represent.

📌 2️⃣ Linux/macOS (UTF-32)

The Linux terminal typically uses UTF-8, so it is recommended to use wstring_convert for conversion:

#include <iostream>
#include <string>
#include <locale>
#include <codecvt>

int main() {
    std::wstring ws = L"你好，世界！";

    // Convert wstring → string
    std::wstring_convert<std::codecvt_utf8<wchar_t>> converter;
    std::string utf8_str = converter.to_bytes(ws);

    std::cout << "UTF-8 display: " << utf8_str << std::endl;
    return 0;
}
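An alternative that avoids the deprecated std::wstring_convert is the C library function std::wcstombs, which converts to the multibyte encoding of the current C locale. The sketch below assumes the program has already called std::setlocale(LC_ALL, "") and that the terminal's locale is UTF-8; `toLocaleBytes` is a hypothetical helper name. Conversion stops at the first embedded null character.

```cpp
#include <clocale>
#include <cstddef>
#include <cstdlib>
#include <string>
#include <vector>

// Convert a wstring to the multibyte encoding of the current C locale.
// Returns an empty string if some character cannot be represented
// in that locale (for example, Chinese text in the default "C" locale).
std::string toLocaleBytes(const std::wstring& ws) {
    // MB_CUR_MAX is the maximum number of bytes per character in this locale;
    // one extra byte leaves room for the terminating null.
    std::vector<char> buffer(ws.size() * MB_CUR_MAX + 1);
    std::size_t written = std::wcstombs(buffer.data(), ws.c_str(), buffer.size());
    if (written == static_cast<std::size_t>(-1)) {
        return std::string();  // conversion failed in the current locale
    }
    return std::string(buffer.data(), written);
}
```

The result can be written with std::cout, for instance `std::cout << toLocaleBytes(ws) << std::endl;` after the locale has been set.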

6. Summary

Operation	Method
Create `wstring`	`std::wstring ws = L"你好";`
Output `wstring`	`std::wcout << ws;` (requires `setlocale(LC_ALL, "")`)
Concatenate strings	`ws += L"追加";`
Find substring	`ws.find(L"substring")`
`wstring` to `string`	`std::wstring_convert<std::codecvt_utf8<wchar_t>>` with `to_bytes()`
`string` to `wstring`	`std::wstring_convert<std::codecvt_utf8<wchar_t>>` with `from_bytes()`

🚀 wstring is suitable for multilingual support and international applications, but be aware of the character encoding of the platform!
