c++ - fgetws 无法从 FILE* 获取确切的宽字符字符串

Question

我正在使用 fgetws 从文件中逐行获取一些字符串。我拥有的文件来自一个 popen 命令。这是代码片段：

    FILE* pInstalledApps = popen( command.c_str(), "r" );
    if( NULL != pInstalledApps )
    {
        wchar_t currentAppPath [kMaximumAppPathLength];

        // Reading app paths one line at a time.
        while ( ! feof (pInstalledApps) )
        {
            if ( fgetws  ( currentAppPath, kMaximumAppPathLength, pInstalledApps) == NULL )
            {
                break;
            }
            wchar_t *pCharPos = NULL;
            if ( ( pCharPos = wcschr( currentAppPath, L'\n' ) ) != NULL )
            {
                *pCharPos = L'\0';
            }
            std::wstring appPath( currentAppPath );

                            //Do something with the wstring
        }
        pclose( pInstalledApps );
    }

当我得到的字符串 currentAppPath 具有宽字符字符串时，我得到的 appPath 没有预期的字符串。例如，如果我从 FILE 获得的字符串是10teciêênks我的 appPath 变量，则将具有10tecieÌeÌnks.

score 1 · Accepted Answer

这看起来像是一个编码（或更具体地说，解码）问题。宽字符 API 函数不会自动检测数据的字符编码。您需要在应用程序中进行设置，例如：

#include <locale.h>

setlocale(LC_ALL, "en.UTF-8");

状态的手册页fgetws：

The behavior of fgetws() depends on the LC_CTYPE category of the current locale.

所以使用：

setlocale(LC_CTYPE, "en.UTF-8");

也应该工作。

注意：以上假设数据是 UTF-8 编码的。

更新：可以通过执行以下操作来保留当前的语言环境：

char *prev_locale = strdup(setlocale(LC_CTYPE, NULL));
setlocale(LC_CTYPE, "en.UTF-8");
// ...
setlocale(LC_CTYPE, prev_locale);
free(prev_locale);

c++ - fgetws 无法从 FILE* 获取确切的宽字符字符串

1 回答 1

Related

Reference