Thanks for your reply. The item 3 is still happening. The cluster has lot of disk space. These files that I am loading are few Gb long while each machine on the cluster has over 1.5 TB available. Any idea what configuration param should I look at? This installation is done by non-operations people and it is our first installation on multiple clusters, so it is possible we didn’t set something properly.
Another question is about the error message. It says ;could only be replicated to 0 nodes, instead of 1′. If Hive spawns MR on all the nodes and if I am using HDFS, then should the hive table not be distributed on all the nodes rather than just one?