Installation failed. Failed to receive heartbeat from agent
I am receiving an error on two of the three machines:
Ensure that the host's hostname is configured properly.
Ensure that port 7182 is accessible on the Cloudera Manager server (check firewall rules).
Ensure that ports 9000 and 9001 are free on the host being added.
Check agent logs in /var/log/cloudera-scm-agent/ on the host being added (some of the logs can be found in the installation details).
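The port 7182 item in the message can be tested from the failing host. This is only a rough sketch: it assumes bash and the coreutils `timeout` command are installed, `can_connect` is a hypothetical helper name, and `cm-server.example.com` stands in for your own Cloudera Manager server.

```shell
# Hypothetical helper: succeeds if a TCP connection to host:port can be opened
# within 3 seconds. Relies on bash's /dev/tcp pseudo-device.
can_connect() {
  host="$1"
  port="$2"
  timeout 3 bash -c 'exec 3<>"/dev/tcp/$0/$1"' "$host" "$port" 2>/dev/null
}

# Usage (placeholder server name, replace with your own):
#   can_connect cm-server.example.com 7182 && echo "7182 reachable" || echo "7182 blocked"
```

If the connection fails, the firewall rules on the Cloudera Manager server are the first thing to look at.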
Solution:
1. First, check the /etc/hosts file.
Your hostname needs to be tied to an actual IP address on your local network, one that can send and receive traffic between all the hosts. (A common error is to associate the hostname with the loopback address, which cannot route network traffic off the host.)
Use "ifconfig -a" to see a listing of your network interfaces and choose one that has an actual IP address.
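The hostname check can be scripted roughly as follows. This is a sketch, assuming a Linux host where `hostname -f` and `getent` are available; the function names are hypothetical and not part of Cloudera Manager.

```shell
# Hypothetical helper: succeeds if the given address is a loopback address.
is_loopback_addr() {
  case "$1" in
    127.*|::1) return 0 ;;
    *)         return 1 ;;
  esac
}

# Look up this host's own name the way /etc/hosts and DNS resolve it,
# and warn if it does not resolve or maps to a loopback address.
check_hostname_resolution() {
  name="$(hostname -f)"
  addr="$(getent hosts "$name" | awk '{print $1; exit}')"
  if [ -z "$addr" ]; then
    echo "WARN: $name does not resolve"
    return 1
  fi
  if is_loopback_addr "$addr"; then
    echo "WARN: $name resolves to loopback address $addr"
    return 1
  fi
  echo "OK: $name resolves to $addr"
}

# Usage (run on the host being added):
#   check_hostname_resolution
```

If the check warns about loopback, edit /etc/hosts so that the hostname sits on the line with the host's real network address rather than on the 127.0.0.1 line.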
2. Check whether another process is already using port 9000 or 9001:
(In my case the SCM agent actually uses port 19001. Some configuration file specifies that, but I can't remember where it resides now.) Find the process that uses the port, kill it, and retry the installation on the failed machine. Note that grepping for 9001 also matches 19001, as in the output below.
[root@hd-1] netstat -apn | grep 9001
tcp 0 0 127.0.0.1:35582 127.0.0.1:19001 ESTABLISHED 6330/python
[root@hd-1] kill -9 6330
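Because a plain grep for 9001 also matches 19001 and established connections, a stricter filter can help. The sketch below assumes `netstat -tlnp` style output (local address in column 4, state in column 6, PID/program in column 7); `listeners_on_port` is a hypothetical helper name.

```shell
# Hypothetical helper: reads netstat-style output on stdin and prints the
# PID/program of TCP sockets in LISTEN state on exactly the given port.
listeners_on_port() {
  awk -v port="$1" '
    $1 ~ /^tcp/ && $6 == "LISTEN" {
      n = split($4, parts, ":")          # port is the last ":"-separated part
      if (parts[n] == port) print $7     # exact match, so 19001 is not 9001
    }'
}

# Usage (as root, so that the PID column is filled in):
#   netstat -tlnp | listeners_on_port 9000
#   netstat -tlnp | listeners_on_port 9001
```

Once the PID is known, trying a plain `kill <pid>` first and falling back to `kill -9` only if the process does not exit is usually the gentler option.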