Hive / HCatalog Forum

Pig to load HCatalog table with ORC File storage

  • #54089

    Hi,
    I am trying to figure out a way to load Hive tables using Pig. I have the following questions:
    1. Do I need to create an HCatalog table for Pig to be able to load data, or can it load directly into a Hive table?
    2. How do I create an HCatalog table with ORC storage? I checked the documentation, and HDP 1.3 mentions that HCatalog can support future Hive storage formats like ORC, but I am not able to find any references for HDP 2.1.

  • #54094

    I have found a link which mentions that HCatalog works with ORC files, but I could not find confirmation anywhere else.
    https://issues.apache.org/jira/browse/HCATALOG-632

  • #54121
    Eugene Koifman
    Moderator

    Just create a Hive table and specify its storage format, then use HCatStorer to have Pig write to it.
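
    The suggestion above can be sketched roughly as follows. This is only an illustration, not a tested recipe: the table name `web_logs_orc`, its columns, and the input path `/tmp/web_logs.tsv` are all made up, and the HCatStorer package name shown is the one used once HCatalog ships inside Hive (`org.apache.hive.hcatalog`); older standalone HCatalog releases used `org.apache.hcatalog.pig.HCatStorer` instead. The script assumes Pig is started with `pig -useHCatalog` so the HCatalog jars are on the classpath.

```pig
/*
 * Step 1 (run in Hive, not in Pig): create the target table with ORC storage.
 * Table and column names are hypothetical.
 *
 *   CREATE TABLE web_logs_orc (
 *     user_id STRING,
 *     url     STRING,
 *     hits    INT
 *   )
 *   STORED AS ORC;
 */

-- Step 2 (run with: pig -useHCatalog this_script.pig)
-- Read the raw tab-separated input; the path is a placeholder.
raw = LOAD '/tmp/web_logs.tsv'
      USING PigStorage('\t')
      AS (user_id:chararray, url:chararray, hits:int);

-- Write into the existing Hive table through HCatalog.
-- The Pig field names and types should line up with the table's columns;
-- the table's own definition determines that the files are written as ORC.
STORE raw INTO 'default.web_logs_orc'
      USING org.apache.hive.hcatalog.pig.HCatStorer();
```

    Because the storage format is declared on the Hive table, the Pig script itself does not mention ORC anywhere; the same STORE statement would apply to a table declared with a different format.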
