php - PHP utf-8 按字符读取文件

Question

我在处理带有 utf-8 编码的 php 文件时遇到了一些严重的问题。我需要逐个字符或逐行读取文件，并带有固定的小块。我尝试使用fgetc和fgets但以 utf8 编码的字符被分成几个“字符”而不是一个。我正在尝试找到一个解决方案，它甚至可能是一个仅包含 Unicode 字符的小“字符串”，但必须从文件中正确读取。

我想使用的伪代码：

while ( ($c == read_utf8_char($file)) != false ) {
    if ($c == 'ó') do_something();
}

我会很感激你的帮助。

score 0 · Accepted Answer

function mb_str_split($string) { 
    return preg_split('/(?<!^)(?!$)/u', $string ); 
} 

$chars = mb_str_split($file);

foreach ($chars as $c) {
    if ($c == 'ó') do_something();
}

~ 来源：http ://www.php.net/manual/en/function.mb-split.php#99851

php - PHP utf-8 按字符读取文件

1 回答 1

Related

Reference