The HCatlog interfaces can support PIG and MR (Hive) for unstructured and structured data. Since its XML I recommend using the HCatlog and PIG interfaces called “load and store”. Here is more info about the interfaces:
You may also structure the XML as a table and use the Hive interfaces (Hcatinputformat, Hcatoutoutformat). Hope this helps.