encoding - “â€™” 显示在页面上，而不是“'”

Question

â€™显示在我的页面上，而不是'.

我的标签和 HTTP 标头中都有Content-Type设置：UTF-8<head>

<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />

在此处输入图像描述

另外，我的浏览器设置为Unicode (UTF-8)：

在此处输入图像描述

那么问题是什么，我该如何解决呢？

score 249 · Accepted Answer

249

于 2010-03-19T13:08:44.857 回答

score 59 · Accepted Answer

确保浏览器和编辑器使用 UTF-8 编码而不是 ISO-8859-1/Windows-1252。

或使用’.

score 17 · Accepted Answer

17

于 2015-06-19T00:02:25.360 回答

score 16 · Accepted Answer

16

于 2013-10-24T18:16:47.483 回答

score 14 · Accepted Answer

This sometimes happens when a string is converted from Windows-1252 to UTF-8 twice.

We had this in a Zend/PHP/MySQL application where characters like that were appearing in the database, probably due to the MySQL connection not specifying the correct character set. We had to:

Ensure Zend and PHP were communicating with the database in UTF-8 (was not by default)

Repair the broken characters with several SQL queries like this...

UPDATE MyTable SET 
MyField1 = CONVERT(CAST(CONVERT(MyField1 USING latin1) AS BINARY) USING utf8),
MyField2 = CONVERT(CAST(CONVERT(MyField2 USING latin1) AS BINARY) USING utf8);

Do this for as many tables/columns as necessary.

You can also fix some of these strings in PHP if necessary. Note that because characters have been encoded twice, we actually need to do a reverse conversion from UTF-8 back to Windows-1252, which confused me at first.

mb_convert_encoding('â€™', 'Windows-1252', 'UTF-8');    // returns ’

score 10 · Accepted Answer

You have a mismatch in your character encoding; your string is encoded in one encoding (UTF-8) and whatever is interpreting this page is using another (say ASCII).

Always specify your encoding in your http headers and make sure this matches your framework's definition of encoding.

Sample http header:

Content-Type    text/html; charset=utf-8

Setting encoding in asp.net

<configuration>
  <system.web>
    <globalization
      fileEncoding="utf-8"
      requestEncoding="utf-8"
      responseEncoding="utf-8"
      culture="en-US"
      uiCulture="de-DE"
    />
  </system.web>
</configuration>

Setting encoding in jsp

score 7 · Accepted Answer

如果您的内容类型已经是 UTF8 ，那么很可能数据已经以错误的编码到达。如果您从数据库中获取数据，请确保数据库连接使用 UTF-8。

如果这是来自文件的数据，请确保文件正确编码为 UTF-8。您通常可以在您选择的编辑器的“另存为...”对话框中进行设置。

如果当您在源文件中查看数据时数据已经损坏，则很可能它曾经是一个 UTF-8 文件，但在此过程中以错误的编码保存。

score 5 · Accepted Answer

If someone gets this error on WordPress website, you need to change wp-config db charset:

define('DB_CHARSET', 'utf8mb4_unicode_ci');

instead of:

define('DB_CHARSET', 'utf8mb4');

score 3 · Accepted Answer

If the other answers haven't helped, you might want to check whether your database is actually storing the mojibake characters. I was viewing the text in utf-8, but I was still seeing the mojibake and it turned out that, due to a database upgrade, the text had been permanently "mojibaked".

In this case, one option is to "fix" the text with Python's ftfy package (or JavaScript verion here).

score 0 · Accepted Answer

You must have copy/paste text from Word Document. Word document use Smart Quotes. You can replace it with Special Character (’) or simply type in your HTML editor (').

I'm sure this will solve your problem.

score 0 · Accepted Answer

0

于 2019-10-23T00:09:09.357 回答

score -3 · Accepted Answer

The same thing happened to me with the '–' character (long minus sign).
I used this simple replace so resolve it:

htmlText = htmlText.Replace('–', '-');

encoding - “â€™” 显示在页面上，而不是“'”

12 回答 12

Related

Reference