Problems with tutorial "Refining and Visualizing Sentiment Data"

to create new topics or reply. | New User Registration

Tagged: ,

This topic contains 2 replies, has 2 voices, and was last updated by  Juan Giraldo 1 year, 7 months ago.

  • Creator
    Topic
  • #43901

    Juan Giraldo
    Member

    Hi,
    I’m having troubles with the tutorial “Refining and Visualizing Sentiment Data”. I’m trying to run the script hiveddl.sql but I’m getting some weird log results when the scripts get completed. When I try to browse the data from the table tweets_raw by HCat, I got the following error:

    Unknown exception.

    Log trace:
    13/11/17 11:38:56 WARN lazybinary.LazyBinaryStruct: Extra bytes detected at the end of the row! Ignoring similar problems.
    13/11/17 11:38:56 INFO lazybinary.LazyBinaryStruct: Missing fields! Expected 3 fields but only got 1! Ignoring similar problems.
    13/11/17 11:38:56 ERROR security.UserGroupInformation: PriviledgedActionException as:hue (auth:SIMPLE) cause:BeeswaxException(message:java.io.IOException: java.lang.ArrayIndexOutOfBoundsException: 1949199507, log_context:e6b6e37d-cedc-4e82-9c3d-00889a9463fa, handle:QueryHandle(id:e6b6e37d-cedc-4e82-9c3d-00889a9463fa, log_context:e6b6e37d-cedc-4e82-9c3d-00889a9463fa), SQLState: )
    13/11/17 11:38:56 ERROR beeswax.BeeswaxServiceImpl: Caught unexpected exception.
    java.lang.reflect.UndeclaredThrowableException
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1504)
    at com.cloudera.beeswax.BeeswaxServiceImpl.doWithState(BeeswaxServiceImpl.java:772)
    at com.cloudera.beeswax.BeeswaxServiceImpl.fetch(BeeswaxServiceImpl.java:980)
    at com.cloudera.beeswax.api.BeeswaxService$Processor$fetch.getResult(BeeswaxService.java:987)
    at com.cloudera.beeswax.api.BeeswaxService$Processor$fetch.getResult(BeeswaxService.java:971)
    at org.apache.thrift.ProcessFunction.process(ProcessFunction.java:39)
    at org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:39)
    at org.apache.thrift.server.TThreadPoolServer$WorkerProcess.run(TThreadPoolServer.java:206)
    at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
    at java.lang.Thread.run(Thread.java:662)
    Caused by: BeeswaxException(message:java.io.IOException: java.lang.ArrayIndexOutOfBoundsException: 1949199507, log_context:e6b6e37d-cedc-4e82-9c3d-00889a9463fa, handle:QueryHandle(id:e6b6e37d-cedc-4e82-9c3d-00889a9463fa, log_context:e6b6e37d-cedc-4e82-9c3d-00889a9463fa), SQLState: )
    at com.cloudera.beeswax.BeeswaxServiceImpl$RunningQueryState.fetch(BeeswaxServiceImpl.java:545)
    at com.cloudera.beeswax.BeeswaxServiceImpl$5.run(BeeswaxServiceImpl.java:986)
    at com.cloudera.beeswax.BeeswaxServiceImpl$5.run(BeeswaxServiceImpl.java:981)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:396)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1491)
    … 10 more

    I don’t know what i’m doing wrong but I got stuck on this problem

    I appreciate your help.

    Thanks!!

Viewing 2 replies - 1 through 2 (of 2 total)

You must be to reply to this topic. | Create Account

  • Author
    Replies
  • #43919

    Juan Giraldo
    Member

    Hi Cheryle,
    Yes, i’m working in sandbox version 2.0.

    I’m going to be aware for any update.

    Thank you!!

    Collapse
    #43915

    Cheryle Custer
    Moderator

    Hi,

    Are you running the Sentiment tutorial in Sandbox version 1.3 or 2.0? There were some changes in the Sandbox 2.0 that prevent the tutorial from being run in 2.0. We’re working on updating it and will get it republished soon.

    Cheryle

    Collapse
Viewing 2 replies - 1 through 2 (of 2 total)
Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.