The Hortonworks Community Connection is now live. A completely rebuilt Q&A forum, Knowledge Base, Code Hub and more, backed by the experts in the industry.

You will be redirected here in 10 seconds. If your are not redirected, click here to visit the new site.

The legacy Hortonworks Forum is now closed. You can view a read-only version of the former site by clicking here. The site will be taken offline on January 31,2016

Hortonworks Sandbox Forum

Tutorials have no output

  • #51499
    Edward Cheadle

    The first tutorial listing NYSE stocks runs, says it succeeded, but there is no output. The Second is the same. I noticed that when running the program the interface is slightly different than the tutorial, the pig helper button was moved to the top, and somewhere in the information it said the api changed. So I was wondering if the pig statements are correct for the version of software I have. I am running version 2.0 of the sandbox. I checked the code and made sure it was exactly the same as in the tutorial, but still after I ran the pig script I did not get any output. I was wondering if the scripts work or has something changed?

  • Author
  • #51624

    Hi Edward,
    I just verified that they do still work You are correct that the UI may have slightly changed but there should be no effect from that. The PIG statements are correct and it should look like this:
    a = LOAD ‘nyse_stocks’ USING org.apache.hcatalog.pig.HCatLoader();
    b = filter a by stock_symbol == ‘IBM’;
    c = group b all;
    d = foreach c generate AVG(b.stock_volume);
    dump d;

    Please ensure the data loaded properly. I hope this helps.


    Edward Cheadle

    You are absolutely right. I tried using the dump command after each statement and found the errors, rookie mistakes. This is all very new to me and I was making simple mistakes.

    However, in the tutorial HCatalog, Basic Pig & Hive Commands, it says it will use two files:

    I tried using the Batting.csv file as it states and nothing worked. It said change the column labled r to Runs and I could not find the column in the Batting.csv file. I changed to the BattingPost.csv file.found the r column changed it to Runs and all the subsequent commands worked. I am off to learn more about pig.

    I am trying to get some groups at our company who are looking into Big Data, to review your website. I am a systems administrator, so i am somewhat removed from the decision making but it seems as though you have great way of introducing Hadoop concepts so I am trying to learn more about it so I am better versed in telling people about what you do. The tutorials are interesting and I have learned a lot about Hadoop. Thank you.

The forum ‘Hortonworks Sandbox’ is closed to new topics and replies.

Support from the Experts

A HDP Support Subscription connects you experts with deep experience running Apache Hadoop in production, at-scale on the most demanding workloads.

Enterprise Support »

Become HDP Certified

Real world training designed by the core architects of Hadoop. Scenario-based training courses are available in-classroom or online from anywhere in the world

Training »

Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.