0

我将 Cloudify 2.7 与 OpenStack Icehouse 一起使用。特别是,我已将云驱动程序配置为引导 2 个管理 VM (numberOfManagementMachines 2)。

有时,当我引导虚拟机时,我会收到以下错误:

cloudify@default> bootstrap-cloud --verbose openstack-icehouse-<project_name>
...
Starting agent and management processes:
[VM_Floating_IP] nohup gs-agent.sh gsa.global.lus 0 gsa.lus 1 gsa.gsc 0 gsa.global.gsm 0 gsa.gsm 1 gsa.global.esm 1 >/dev/null 2>&1
[VM_Floating_IP] STARTING CLOUDIFY MANAGEMENT
[VM_Floating_IP] .
[VM_Floating_IP] Discovered agent nic-address=177.86.0.3 lookup-groups=gigaspaces-Cloudify-2.7.1-ga.
[VM_Floating_IP] Detected LUS management process started by agent null  expected agent a0eec4e5-7fb0-4428-80e1-ec13a8b1c744
[VM_Floating_IP] Detected LUS management process started by agent a0eec4e5-7fb0-4428-80e1-ec13a8b1c744
[VM_Floating_IP] Detected GSM management process started by agent a0eec4e5-7fb0-4428-80e1-ec13a8b1c744
[VM_Floating_IP] Waiting for Management processes to start.
[VM_Floating_IP] Waiting for Elastic Service Manager
[VM_Floating_IP] Waiting for Management processes to start.
[VM_Floating_IP] .
[VM_Floating_IP] Waiting for Elastic Service Manager
[VM_Floating_IP] Waiting for Management processes to start.
[VM_Floating_IP] .
[VM_Floating_IP] Waiting for Elastic Service Manager
[VM_Floating_IP] Waiting for Management processes to start.
[VM_Floating_IP] .
[VM_Floating_IP] Waiting for Elastic Service Manager
[VM_Floating_IP] Waiting for Management processes to start.
[VM_Floating_IP] .failure occurred while renewing an event lease: Operation failed. net.jini.core.lease.UnknownLeaseException: Unknown event id: 3
[VM_Floating_IP]        at com.sun.jini.reggie.GigaRegistrar.renewEventLeaseInt(GigaRegistrar.java:5494)
[VM_Floating_IP]        at com.sun.jini.reggie.GigaRegistrar.renewEventLeaseDo(GigaRegistrar.java:5475)
[VM_Floating_IP]        at com.sun.jini.reggie.GigaRegistrar.renewEventLease(GigaRegistrar.java:2836)
[VM_Floating_IP]        at com.sun.jini.reggie.RegistrarGigaspacesMethodinternalInvoke16.internalInvoke(Unknown Source)
[VM_Floating_IP]        at com.gigaspaces.internal.reflection.fast.AbstractMethod.invoke(AbstractMethod.java:41)
[VM_Floating_IP]        at com.gigaspaces.lrmi.LRMIRuntime.invoked(LRMIRuntime.java:464)
[VM_Floating_IP]        at com.gigaspaces.lrmi.nio.Pivot.consumeAndHandleRequest(Pivot.java:561)
[VM_Floating_IP]        at com.gigaspaces.lrmi.nio.Pivot.handleRequest(Pivot.java:662)
[VM_Floating_IP]        at com.gigaspaces.lrmi.nio.Pivot$ChannelEntryTask.run(Pivot.java:196)
[VM_Floating_IP]        at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
[VM_Floating_IP]        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
[VM_Floating_IP]        at java.lang.Thread.run(Thread.java:662)
[VM_Floating_IP]
[VM_Floating_IP]
[VM_Floating_IP] Waiting for Elastic Service Manager
[VM_Floating_IP] Waiting for Management processes to start.
....
[VM_Floating_IP] ....Failed to add [Processing Unit Instance] with uid [8038e956-1ae2-4378-8bb1-e2055202c160]: Operation failed. java.rmi.ConnectException: Connect Failed to [NIO://177.86.0.3:7011/pid[4390]/164914896032_3_8060218823096628119_details[class org.openspaces.pu.container.servicegrid.PUServiceBeanImpl]]; nested exception is: 
[VM_Floating_IP]        java.net.SocketTimeoutException
...
[VM_Floating_IP] Failed to add [GSM] with uid [3c0e20e9-bf85-4d22-8ed6-3b387e690878]: Operation failed. java.rmi.ConnectException: Connect Failed to [NIO://177.86.0.3:7000/pid[4229]/154704895271_2_2245795805687723285_details[class com.gigaspaces.grid.gsm.GSMImpl]]; nested exception is:
[VM_Floating_IP]        java.net.SocketTimeoutException
...
[VM_Floating_IP] Failed to add GSC with uid [8070dabb-d80d-43c7-bd9c-1d2478f95710]: Operation failed. java.rmi.ConnectException: Connect Failed to [NIO://177.86.0.3:7011/pid[4390]/164914896020_2_8060218823096628119_details[class com.gigaspaces.grid.gsc.GSCImpl]]; nested exception is:
[VM_Floating_IP]        java.net.SocketTimeoutException
...
[VM_Floating_IP] Failed to add [GSA] with uid [a0eec4e5-7fb0-4428-80e1-ec13a8b1c744]: Operation failed. java.rmi.ConnectException: Connect Failed to [NIO://177.86.0.3:7002/pid[4086]/153569177936_2_8701370873164361474_details[class com.gigaspaces.grid.gsa.GSAImpl]]; nested exception is:
[VM_Floating_IP]        java.net.SocketTimeoutException
...
[VM_Floating_IP] Waiting for Management processes to start.
[VM_Floating_IP] Failed to connect to LUS on 177.86.0.3:4174, retry in 73096ms: Operation failed. java.net.ConnectException: Connection timed out
...
[VM_Floating_IP] .Failed to add [ESM] with uid [996c8898-897c-4416-a877-82efb22c7ea6]: Operation failed. java.rmi.ConnectException: Connect Failed to [NIO://177.86.0.3:7003/pid[4504]/172954418920_2_5475350805758957057_details[class org.openspaces.grid.esm.ESMImpl]]; nested exception is:
[VM_Floating_IP]        java.net.SocketTimeoutException

有人可以向我建议任何解决方案吗?我应该配置任何超时值吗?

谢谢。

------------------------------------编辑-------

我会补充一些信息。

每个管理器实例有 4 个 VCPU、8GB RAM、20GB 磁盘。

每个管理器实例都有 Cloudify 创建的安全组,即:

cloudify-manager-cluster    

Egress  IPv4    Any -       0.0.0.0/0 (CIDR)    
Egress  IPv6    Any -       ::/0 (CIDR)

cloudify-manager-management

Egress  IPv4    Any -       0.0.0.0/0 (CIDR)    
Egress  IPv6    Any -       ::/0 (CIDR) 
Ingress IPv4    TCP 22      0.0.0.0/0 (CIDR)    
Ingress IPv4    TCP 4174    cfy-mngt-cluster    
Ingress IPv4    TCP 6666    cfy-mngt-cluster    
Ingress IPv4    TCP 7000    cfy-mngt-cluster    
Ingress IPv4    TCP 7001    cfy-mngt-cluster    
Ingress IPv4    TCP 7002    cfy-mngt-cluster    
Ingress IPv4    TCP 7003    cfy-mngt-cluster    
Ingress IPv4    TCP 7010 - 7110 cfy-mngt-cluster    
Ingress IPv4    TCP 8099    0.0.0.0/0 (CIDR)    
Ingress IPv4    TCP 8100    0.0.0.0/0 (CIDR)

此外,Cloudify 创建了一个私有网络“cloudify-manager-Cloudify-Management-Network”,子网为 177.86.0.0/24,并为每个 VM 请求一个浮动 IP。

4

1 回答 1

0

ESM 是 Cloudify 的 Orchestrator。任何时候都应该只运行它的一个实例。该错误表明 boostrap 进程期望找到一个正在运行的 ESM,但没有找到。这似乎与管理器实例之间的通信错误有关 - 为管理器定义的安全组是否可能没有打开管理器之间的所有端口?

安全组/防火墙配置是常见的问题。管理器 VM 也可能太小 - 它应该至少有 4 GB 内存和 2 个 vCPU。

请记住,Cloudify 2.X 已结束生命周期,不再受支持。您可能想查看 Cloudify 3。

于 2015-05-20T12:49:02.397 回答