1

几周前我意识到我们网站上的一些旧列表中有一些奇怪的字符

我有一些像这样的字符  等于双引号 (") 我有一些字符像 â??

我想修复它们,但我不知道如何。

自从我使用 preg_replace 时,如果我制作此代码

$text = 'this is a listingTitle  with an encoding problem';
$test = preg_replace("/ÂÂ/",'"',$text);
echo$test;

$test 在字符串中仍然会有那个编码错误,所以就像 preg_replace 看不到编码问题一样。

有没有人知道我可以做些什么来修复这些编码错误?或者有什么办法我应该继续

谢谢

在此编辑------------------------------------------------ --

好的,我现在要做的是将每个字符转换回 utf8 并且由于某种原因,我的字符串与一些编码混合在一起。

所以这里是一个字符串和构成该字符串的字符的细分。

Milwaukee 2415-21 M12 无绳锂离子 3/8Â 直角钻头/螺丝刀套件 wi

这就是崩溃

atChar[0] = 'M' encoding is = ASCII
atChar[1] = 'i' encoding is = ASCII
atChar[2] = 'l' encoding is = ASCII
atChar[3] = 'w' encoding is = ASCII
atChar[4] = 'a' encoding is = ASCII
atChar[5] = 'u' encoding is = ASCII
atChar[6] = 'k' encoding is = ASCII
atChar[7] = 'e' encoding is = ASCII
atChar[8] = 'e' encoding is = ASCII
atChar[9] = ' ' encoding is = ASCII
atChar[10] = '2' encoding is = ASCII
atChar[11] = '4' encoding is = ASCII
atChar[12] = '1' encoding is = ASCII
atChar[13] = '5' encoding is = ASCII
atChar[14] = '-' encoding is = ASCII
atChar[15] = '2' encoding is = ASCII
atChar[16] = '1' encoding is = ASCII
atChar[17] = ' ' encoding is = ASCII
atChar[18] = 'M' encoding is = ASCII
atChar[19] = '1' encoding is = ASCII
atChar[20] = '2' encoding is = ASCII
atChar[21] = ' ' encoding is = ASCII
atChar[22] = 'C' encoding is = ASCII
atChar[23] = 'o' encoding is = ASCII
atChar[24] = 'r' encoding is = ASCII
atChar[25] = 'd' encoding is = ASCII
atChar[26] = 'l' encoding is = ASCII
atChar[27] = 'e' encoding is = ASCII
atChar[28] = 's' encoding is = ASCII
atChar[29] = 's' encoding is = ASCII
atChar[30] = ' ' encoding is = ASCII
atChar[31] = 'L' encoding is = ASCII
atChar[32] = 'i' encoding is = ASCII
atChar[33] = 't' encoding is = ASCII
atChar[34] = 'h' encoding is = ASCII
atChar[35] = 'i' encoding is = ASCII
atChar[36] = 'u' encoding is = ASCII
atChar[37] = 'm' encoding is = ASCII
atChar[38] = '-' encoding is = ASCII
atChar[39] = 'I' encoding is = ASCII
atChar[40] = 'o' encoding is = ASCII
atChar[41] = 'n' encoding is = ASCII
atChar[42] = ' ' encoding is = ASCII
atChar[43] = '3' encoding is = ASCII
atChar[44] = '/' encoding is = ASCII
atChar[45] = '8' encoding is = ASCII
atChar[46] = 'Â' encoding is = UTF-8
atChar[47] = '' encoding is = 
atChar[48] = ' ' encoding is = ASCII
atChar[49] = 'R' encoding is = ASCII
atChar[50] = 'i' encoding is = ASCII
atChar[51] = 'g' encoding is = ASCII
atChar[52] = 'h' encoding is = ASCII
atChar[53] = 't' encoding is = ASCII
atChar[54] = ' ' encoding is = ASCII
atChar[55] = 'A' encoding is = ASCII
atChar[56] = 'n' encoding is = ASCII
atChar[57] = 'g' encoding is = ASCII
atChar[58] = 'l' encoding is = ASCII
atChar[59] = 'e' encoding is = ASCII
atChar[60] = ' ' encoding is = ASCII
atChar[61] = 'D' encoding is = ASCII
atChar[62] = 'r' encoding is = ASCII
atChar[63] = 'i' encoding is = ASCII
atChar[64] = 'l' encoding is = ASCII
atChar[65] = 'l' encoding is = ASCII
atChar[66] = '/' encoding is = ASCII
atChar[67] = 'D' encoding is = ASCII
atChar[68] = 'r' encoding is = ASCII
atChar[69] = 'i' encoding is = ASCII
atChar[70] = 'v' encoding is = ASCII
atChar[71] = 'e' encoding is = ASCII
atChar[72] = 'r' encoding is = ASCII
atChar[73] = ' ' encoding is = ASCII
atChar[74] = 'K' encoding is = ASCII
atChar[75] = 'i' encoding is = ASCII
atChar[76] = 't' encoding is = ASCII
atChar[77] = ' ' encoding is = ASCII
atChar[78] = 'w' encoding is = ASCII

我现在能做什么?

4

2 回答 2

11

检查这个 PHP 类编码,在github中具有流行的编码

用法:

$utf8_string = Encoding::toUTF8($utf8_or_latin1_or_mixed_string);

$latin1_string = Encoding::toLatin1($utf8_or_latin1_or_mixed_string);

还:

$utf8_string = Encoding::fixUTF8($garbled_utf8_string);

例子:

echo Encoding::fixUTF8("Fédération Camerounaise de Football");
echo Encoding::fixUTF8("FÃédÃération Camerounaise de Football");
echo Encoding::fixUTF8("FÃÃédÃÃération Camerounaise de Football");
echo Encoding::fixUTF8("FÃÃÃédÃÃÃération Camerounaise de Football");

将输出:

Fédération Camerounaise de Football
Fédération Camerounaise de Football
Fédération Camerounaise de Football
Fédération Camerounaise de Football

更新:

检查这个:(我检查了这个工作)

$output = 'this is a listingTitle  with an encoding problem';
$output = preg_replace('/[^(\x20-\x7F)]*/','', $output);
echo($output);

输出:

this is a listingTitle with an encoding problem.

输出图片:

在此处输入图像描述

于 2013-09-11T15:20:22.623 回答
0

xml早些时候,我在从多种语言的提要中获取数据时也遇到了同样的问题。

经过努力,我通过在utf8编码中创建所有内容找到了解决方案。

1) 数据库联盟和字符集。

2) 数据库连接。

3)html页面类型。

于 2013-09-11T15:13:32.843 回答