Hue Forum

Fail to import data

  • #49600
    Stanley Nguyen
    Participant

    Hi,

    I have the latest of everything running on 3 separate machines. There’s no configuration issue when running the check from HUE and everything looks green from Ambari. When I try to import the data from HUE, the table schema is created but it does not import any data. I look @ Job Browser and there’s no error. Can anyone help point me to where I need to check?

    Thanks

    Stan

to create new topics or reply. | New User Registration

  • Author
    Replies
  • #49731
    Stanley Nguyen
    Participant

    I found out that it works if I use Beeswax to create/import data but it fails with HCat. The weird thing is it is able to create the table schema but not to import the data. Not sure where to look for

    #49774
    Stanley Nguyen
    Participant

    Resolved!

    #49843
    Dave
    Moderator

    Hi Stanley,

    Could you let me know what the root cause for this was?
    Was it permissions or configuration?

    Thanks

    Dave

    #49846
    Stanley Nguyen
    Participant

    Hi Dave,

    It was the permission. The folder was owned by hdfs:hdfs but I logged in as hue. Once I created another account “hdfs” in hue, it works fine. It took me a while to figure out since the log I think was somewhere else. I think probably since I’m new with HDP, debugging the issue is a bit challenge. Like right now, I have an issue with Pig script got stuck in HUE but running successfully from grunt. Can’t find the error anywhere. From your professional experience, what’s the best way to debug in HDP?

    Thanks,

    Stan

You must be to reply to this topic. | Create Account

Support from the Experts

A HDP Support Subscription connects you experts with deep experience running Apache Hadoop in production, at-scale on the most demanding workloads.

Enterprise Support »

Become HDP Certified

Real world training designed by the core architects of Hadoop. Scenario-based training courses are available in-classroom or online from anywhere in the world

Training »

Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.