Sorry about the late response. Distcp would be a feature to use, there are integration tools such as talend. As far as your specific use case where the clusters are distant from each other physically, theoretically it should work. The hortonworks services team may have done something similar in the past. Please feel free to reach out to them if you are still need assistance in designing this use case.