I am trying to configure a node to access a Hadoop cluster that is located behind a firewall. The cluster is accessible via SSH, so I have set up a socks proxy and instructed Hadoop to use it. The setup works fine, except that the node is registered to the cluster using the IP of the proxy and not its own (which makes it inaccessible to the cluster).
To illustrate the situation here is the output of hdfs dfsadmin -report.
# This is the node I am trying to connect to the cluster. Host name is correct, but the name and the IP are of the proxy, The node is run on a different port. Otherwise it fails to register altogether.
Name: 192.168.254.9:51010 (narsil.xx.xx)
# This is the node running namenode, datanode and also acts as a proxy. This is correct.
Name: 192.168.254.9:50010 (narsil.xx.xx)
Any pointers on how to fix this situation? Many thanks in advance.