Hortonworks Sandbox Forum

Hive and ORDER BY on Sandbox

  • #31095
    Mark Robinson
    Participant

    I would very much appreciate some help working out what is wrong with the HQL I am running in Hive on the Sandbox.

    The following two queries work and return the correct results:

    select taccode, count(*)
    from tac_code
    group by taccode
    order by taccode;

    select substring(taccode, 1, 4), count(*)
    from tac_code
    group by substring(taccode, 1, 4);

    The following three queries do not work; each fails with the error: Error occurred executing hive query: Unknown exception.

    select substring(taccode, 1, 4), count(*)
    from tac_code
    group by taccode
    order by taccode;

    select substring(taccode, 1, 4), count(*)
    from tac_code
    group by substring(taccode, 1, 4)
    order by taccode;

    select substring(taccode, 1, 4), count(*)
    from tac_code
    group by substring(taccode, 1, 4)
    order by substring(taccode, 1, 4);

    As you can see, there is not much difference between the queries, so I can only assume the problem is related to combining substring with order by.

    Can anybody please help with this? Thank you!
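
One thing that may be worth trying, as a sketch rather than a confirmed fix: older Hive releases only accepted ORDER BY on columns or aliases that appear in the SELECT list, and this assumes the Hive version in the Sandbox behaves that way. Under that assumption, giving the substring expression an alias and sorting on the alias may avoid the error. The alias names tac_prefix and cnt are made up for illustration.

```sql
-- Sketch only: assumes ORDER BY must reference a SELECT-list alias
-- in the Hive version shipped with the Sandbox.
-- tac_prefix and cnt are hypothetical alias names, not from the original queries.
select substring(taccode, 1, 4) as tac_prefix, count(*) as cnt
from tac_code
group by substring(taccode, 1, 4)
order by tac_prefix;
```

Note that the query that groups by substring(taccode, 1, 4) and orders by taccode would probably be rejected in any case, because taccode itself is neither grouped nor aggregated at that point.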
