Get fresh updates from Hortonworks by email

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.


Sign up for the Developers Newsletter

Once a month, receive latest insights, trends, analytics information and knowledge of Big Data.


Get Started


Ready to Get Started?

Download sandbox

How can we help you?

* I understand I can unsubscribe at any time. I also acknowledge the additional information found in Hortonworks Privacy Policy.
closeClose button
August 22, 2016
prev slideNext slide

Top Articles on Apache Hadoop — From HCC

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week.

Top Articles from HCC

  1. Pig Doing Yoga: How to Build Superflexible Pig Scripts by:gkeys We know that parameter passing is valuable for pig script reuse. One lesser known understanding is that parameters do not simply pass variables to pig scripts but rather (and more fundamentally) they pass text that replaces placeholders in the script. This is a subtle but powerful difference.. Read more on HCC.
  2. Incremental Fetch in Apache NiFi with QueryDatabaseTable by:mburgess
    NiFi is most effectively used as an “always-on” system, meaning that the data flows are often always operational (running). Doing batch processing is a more difficult task and usually requires some user intervention (such as stopping a source processor)….Read more on HCC.
  3. Teragen, Terasort, and Teravalidate Performance testing on Bigstep by:smanjee
    Faster & cheaper data processing — IaaS. Read about REAL WORLD experience with the typically IaaS providers has been generally slow on performance. Not to say hadoop/hbase/spark/etc jobs will not perform; however, you need to be familiar with what you’re getting into and set realistic expectations. Read more on HCC.
  4. How do I login to Apache Zeppelin when Security is enabled using HDP 2.5 Tech Preview Sandbox by:phargis
    Apache Zeppelin (version 0.6.0) includes the ability to securely authenticate users and require logins. It uses the Apache Shiro security framework to accomplish this objective. Read more on HCC.
  5. Python Script in Apache NiFi by:vvagias
    In NiFi the data being passed between operators is referred to as a FlowFile and can be accessed via various scripting languages in the ExecuteScript operator. In order to access the data in the FlowFile you need to understand a few requirements first.Read more on HCC.

Top 5 Questions — last week

  1. How do you fix login issues after restarting Cloudbreak deployer instance on Amazon?
  2. I am facing issue of huge data in mysql table which is increasing very fast , so to scale what is the other alternative?
  3. can i change log location in HDP installation
  4. Nifi JsonSplit Doesn’t Work
  5. HortonWorks License Vs HortonWorks Free Comparison


Come over to HCC and participate.


Himani Bansal says:

“Pig Doing Yoga”:).. Awesomely written and justified the topic. I would also like to contribute here to the audience a nicely structured series of Hadoop tutorials for learning hadoop from the beginning – All the Best.

Leave a Reply

Your email address will not be published. Required fields are marked *

If you have specific technical questions, please post them in the Forums