Hortonworks Sandbox Forum

Any HBASE success with Pig & Hue

  • #58652
    Scott Saufferer

    I’ve been trying to run various tests/tutorials, etc of using Pig via Hue to read and write to HBASE in the freshly downloaded and installed sandbox environment. Every trial fails with some java error or another. Many of which are class not found exceptions (do I need to register every jar myself in the pig script?)

    Is anyone else having luck with using HBASE via Pig in the sandbox?

    Here’s an example of the many different errors I’m getting:

    1) java.lang.ClassNotFoundException: Class org.apache.hadoop.hbase.mapreduce.TableSplit not found
    2) 2014-08-12 16:38:41,737 [main] ERROR org.apache.pig.tools.grunt.GruntParser – ERROR 2998: Unhandled internal error. org.apache.hadoop.hbase.protobuf.generated.ClientProtos$MutationProto$MutationType

to create new topics or reply. | New User Registration

  • Author
  • #58884
    Christopher Even

    I have had no luck either. Went through the same process of copying around jars till the class not founds were gone.
    Then I was getting a reverse DNS error on tablesplit which I fixed by changing the DNS server to my own and adding a PTR.
    Now i am getting:
    ERROR org.apache.pig.tools.grunt.Grunt – ERROR 1066: Unable to open iterator for alias raw. Backend error : org.apache.hadoop.hbase.TableName

    Seems i am almost there…
    I am running latest download on Hyper-V.

    I am also VERY interested in whether anyone has this actually working!

    Christopher Even

    I got it working!
    log in as root:
    sudo cp /usr/lib/hbase/lib/hbase-*hadoop2.jar /usr/lib/Hadoop/lib
    sudo cp /usr/lib/hbase/lib/htrace*.jar /usr/lib/Hadoop/lib
    sudo cp /usr/lib/hbase/lib/protobuf*.jar /usr/lib/Hadoop/lib

    + i have the reverse DNS set up on my own DNS server. Changed resolv.conf to point to my own server.. NOTE: it changes back each reboot.

You must be to reply to this topic. | Create Account

Support from the Experts

A HDP Support Subscription connects you experts with deep experience running Apache Hadoop in production, at-scale on the most demanding workloads.

Enterprise Support »

Become HDP Certified

Real world training designed by the core architects of Hadoop. Scenario-based training courses are available in-classroom or online from anywhere in the world

Training »

Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.