5

I am trying to install Cloudera Manager (Standard edition) on Ubuntu 12.04.1 LTS. When I try to add a new host, I get the following error:

Installation failed. Failed to receive heartbeat from agent.
Ensure that the host's hostname is configured properly.
Ensure that port 7182 is accessible on the Cloudera Manager server (check firewall rules).
Ensure that ports 9000 and 9001 are free on the host being added.
Check agent logs in /var/log/cloudera-scm-agent/ on the host being added (some of the logs can be found in the installation details).

My /etc/hosts file is configured as follows:

127.0.0.1 localhost
127.0.0.1 hadoop-ubuntu
192.168.5.xyz hadoop-ubuntu.dana.local hadoop-ubuntu
192.168.3.xyz ro-m81.dana.local ro-m81
192.168.3.abc ro-m41.dana.local ro-m41

# The following lines are desirable for IPv6 capable hosts

::1 ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters

**/var/log/cloudera-scm-agent/cloudera-scm-agent.log** shows the following error:

[09/Oct/2013 16:04:23 +0000] 4532 MainThread agent ERROR Heartbeating to 192.168.5.xyz:7182 failed.
Traceback (most recent call last):
  File "/usr/lib64/cmf/agent/src/cmf/agent.py", line 747, in send_heartbeat
    response = self.requestor.request('heartbeat', dict(request=heartbeat))
  File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 145, in request
    return self.issue_request(call_request, message_name, request_datum)
  File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 256, in issue_request
    call_response = self.transceiver.transceive(call_request)
  File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 485, in transceive
    result = self.read_framed_message()
  File "/usr/lib64/cmf/agent/build/env/lib/python2.6/site-packages/avro-1.6.3-py2.6.egg/avro/ipc.py", line 489, in read_framed_message
    response = self.conn.getresponse()
  File "/usr/lib64/python2.6/httplib.py", line 990, in getresponse
    response.begin()
  File "/usr/lib64/python2.6/httplib.py", line 391, in begin
    version, status, reason = self._read_status()
  File "/usr/lib64/python2.6/httplib.py", line 349, in _read_status
    line = self.fp.readline()
  File "/usr/lib64/python2.6/socket.py", line 433, in readline
    data = recv(1)
error: [Errno 104] Connection reset by peer

Please help me figure out why this error occurs or what I am missing.


5 Answers

1

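I ran into the same problem. Here is what did the trick for me.

Run ifconfig and find your IP address (not 127.0.0.1).

Run hostname and find your hostname.

Edit the /etc/hosts file.

Add an entry for your IP address there, something like: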

192.168.8.xxx   hostname.test.com   hostname

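Restart the Cloudera service, go to sonic.test.com:7180, and try again. It should work. Even if it does not, go to http://hostname.test.com:7180/cmf/home and check the status of the host.

It turned out that even though I got the heartbeat error, the host was actually up and running.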
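
The manual checks above can also be scripted. The following is a minimal sketch, not part of the original answer: it uses only Python's standard socket and ipaddress modules, and the helper names is_loopback and describe_local_name are made up for illustration. It reports the short hostname, the FQDN, and the address the hostname resolves to, and flags the case where that address is a loopback one.

```python
import ipaddress
import socket


def is_loopback(address):
    """Return True for loopback addresses such as 127.0.0.1 or ::1."""
    return ipaddress.ip_address(address).is_loopback


def describe_local_name():
    """Return (hostname, fqdn, address) for the machine this runs on.

    address is None when the hostname does not resolve at all.
    """
    hostname = socket.gethostname()
    fqdn = socket.getfqdn()
    try:
        address = socket.gethostbyname(hostname)
    except OSError:
        address = None
    return hostname, fqdn, address


if __name__ == "__main__":
    hostname, fqdn, address = describe_local_name()
    print("hostname:", hostname)
    print("fqdn:    ", fqdn)
    print("resolves:", address)
    if address is None:
        print("The hostname does not resolve; an /etc/hosts entry may be missing.")
    elif is_loopback(address):
        print("The hostname resolves to a loopback address, not the LAN IP.")
```

If the last message is printed, the hostname is probably mapped to 127.0.0.1 (or another 127.x address) in /etc/hosts, which is the situation this answer works around.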

answered 2013-11-21T03:10:00.827
1

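I had the same problem and then found a solution.

I used two machines, one as the master and the other as the slave, with cloudera-scm-server.

I configured /etc/hosts on both machines, and the error finally went away.

Master IP: 192.168.1.10

On the master machine, /etc/hosts: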

127.0.0.1       localhost

192.168.1.10     <hostname>

Slave IP: 192.168.1.8

On the slave machine, /etc/hosts:

127.0.0.1       localhost

192.168.1.8     <hostname>
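
To compare hosts files across machines, a small parser can help. The sketch below is an illustration only, assuming the usual "address name alias..." layout of /etc/hosts; the function names and the sample entries (node1, example.local) are hypothetical. It lists names that are mapped to a loopback address, skipping names that start with localhost or ip6-, which is an assumption about what counts as a legitimate loopback name.

```python
import ipaddress
from collections import defaultdict


def parse_hosts(text):
    """Map each hostname or alias to the list of addresses assigned to it."""
    names = defaultdict(list)
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and blank lines
        if not line:
            continue
        address, *aliases = line.split()
        for alias in aliases:
            names[alias].append(address)
    return dict(names)


def _is_loopback(address):
    """Return True for loopback addresses; unparsable addresses count as False."""
    try:
        return ipaddress.ip_address(address).is_loopback
    except ValueError:
        return False


def loopback_conflicts(names):
    """Return the sorted non-localhost names that map to a loopback address."""
    conflicts = []
    for name, addresses in names.items():
        if name.startswith(("localhost", "ip6-")):
            continue
        if any(_is_loopback(address) for address in addresses):
            conflicts.append(name)
    return sorted(conflicts)


# Hypothetical sample: node1 is listed under both 127.0.0.1 and its LAN address.
SAMPLE = """\
127.0.0.1       localhost
127.0.0.1       node1
192.168.1.10    node1.example.local node1
"""

if __name__ == "__main__":
    print(loopback_conflicts(parse_hosts(SAMPLE)))
```

To check a real machine, pass the contents of its own /etc/hosts to parse_hosts instead of SAMPLE. An empty result corresponds to the layout shown in this answer, where only localhost sits on 127.0.0.1.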
answered 2018-10-10T07:52:51.517
0

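After checking the hosts file on every node in the cluster, make sure that ports 7180 and 7182 are open on the installer and that port 9000 is open on the cluster nodes (other than the installer).

I kept getting an "Inspector failed. IO exception thrown" error from the Cloudera installation until I looked at the installer (server) logs and saw that the clients could not communicate on port 9000.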
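
Whether those ports are reachable can be tested from another machine with a plain TCP connect. This is a minimal sketch using only the standard socket module; can_connect and check_ports are illustrative names, not part of any Cloudera tooling. A successful connect only shows that something is listening and that no firewall blocks the path; it does not prove the listener is Cloudera Manager.

```python
import socket


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to (host, port) succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # refused, timed out, unreachable, or name not resolvable
        return False


def check_ports(targets):
    """Check (host, port) pairs and return a dict {(host, port): reachable}."""
    return {(host, port): can_connect(host, port) for host, port in targets}
```

For example, run check_ports([(server, 7180), (server, 7182)]) from a cluster node and check_ports([(node, 9000)]) from the server, where server and node stand for your own hostnames.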

answered 2014-08-19T15:18:02.827
0
1. First, check whether the Cloudera SCM agent is running with "sudo service cloudera-scm-agent status".

2. Check the agent log files in the /var/log/cloudera-scm-agent/ directory.
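
Step 2 can be narrowed down by filtering the log for ERROR entries. The sketch below is an assumption-laden illustration: the line layout ([timestamp] pid thread module LEVEL message) is inferred from the single log line quoted in the question and may differ between agent versions, and the names LINE_RE, error_entries, and DEFAULT_LOG are made up here.

```python
import os
import re

# Layout inferred from the log line quoted in the question:
# [09/Oct/2013 16:04:23 +0000] 4532 MainThread agent ERROR <message>
LINE_RE = re.compile(
    r"^\[(?P<timestamp>[^\]]+)\]\s+(?P<pid>\d+)\s+(?P<thread>\S+)\s+"
    r"(?P<module>\S+)\s+(?P<level>[A-Z]+)\s+(?P<message>.*)$"
)

# Path taken from the question; adjust it if your agent logs elsewhere.
DEFAULT_LOG = "/var/log/cloudera-scm-agent/cloudera-scm-agent.log"


def error_entries(lines):
    """Yield (timestamp, message) for every ERROR-level entry in the given lines."""
    for line in lines:
        match = LINE_RE.match(line.rstrip("\n"))
        if match and match.group("level") == "ERROR":
            yield match.group("timestamp"), match.group("message")


if __name__ == "__main__":
    if os.path.exists(DEFAULT_LOG):
        with open(DEFAULT_LOG, errors="replace") as handle:
            for timestamp, message in error_entries(handle):
                print(timestamp, message)
    else:
        print("log file not found:", DEFAULT_LOG)
```

Traceback lines do not match the pattern and are skipped, so the output is one line per logged error, which makes repeated heartbeat failures easy to spot.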

Resolution reference: http://commandstech.com/what-is-heartbeat-in-hadoop-how-to-resolve-heartbeat-lost-in-cloudera-and-hortonworks/

answered 2019-09-22T10:27:38.113
0

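I had the same problem as you, and I finally solved it.

My problem was that the version of the agent (cloudera-scm-agent) was not the same as the version of the server (cloudera-scm-server). You can check this yourself with dpkg or yum.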
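
The version check can be scripted as well. This is a rough sketch, not the answerer's method: it calls dpkg-query on Debian/Ubuntu and, as a stand-in for yum, queries rpm directly on RPM-based systems. The function name installed_version is hypothetical. Because the agent and the server usually live on different machines, run it on each machine and compare the printed values by hand.

```python
import subprocess


def installed_version(package, tool="dpkg"):
    """Return the installed version of a package, or None if it cannot be determined."""
    if tool == "dpkg":
        command = ["dpkg-query", "-W", "-f=${Version}", package]
    elif tool == "rpm":
        command = ["rpm", "-q", "--queryformat", "%{VERSION}", package]
    else:
        raise ValueError("tool must be 'dpkg' or 'rpm'")
    try:
        result = subprocess.run(command, capture_output=True, text=True, check=True)
    except (OSError, subprocess.CalledProcessError):
        return None  # tool not available or package not installed
    return result.stdout.strip() or None


if __name__ == "__main__":
    for package in ("cloudera-scm-agent", "cloudera-scm-server"):
        print(package, installed_version(package))
```

A value of None simply means the package is not installed on that machine (or the chosen tool is missing), which is expected for cloudera-scm-server on a host that only runs the agent.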

answered 2015-09-24T05:11:39.157