cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button
August 22, 2016
prev slideNext slide

Top Articles on Apache Hadoop — From HCC

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week.

Top Articles from HCC

  1. Pig Doing Yoga: How to Build Superflexible Pig Scripts by:gkeys We know that parameter passing is valuable for pig script reuse. One lesser known understanding is that parameters do not simply pass variables to pig scripts but rather (and more fundamentally) they pass text that replaces placeholders in the script. This is a subtle but powerful difference.. Read more on HCC.
  2. Incremental Fetch in Apache NiFi with QueryDatabaseTable by:mburgess
    NiFi is most effectively used as an “always-on” system, meaning that the data flows are often always operational (running). Doing batch processing is a more difficult task and usually requires some user intervention (such as stopping a source processor)….Read more on HCC.
  3. Teragen, Terasort, and Teravalidate Performance testing on Bigstep by:smanjee
    Faster & cheaper data processing — IaaS. Read about REAL WORLD experience with the typically IaaS providers has been generally slow on performance. Not to say hadoop/hbase/spark/etc jobs will not perform; however, you need to be familiar with what you’re getting into and set realistic expectations. Read more on HCC.
  4. How do I login to Apache Zeppelin when Security is enabled using HDP 2.5 Tech Preview Sandbox by:phargis
    Apache Zeppelin (version 0.6.0) includes the ability to securely authenticate users and require logins. It uses the Apache Shiro security framework to accomplish this objective. Read more on HCC.
  5. Python Script in Apache NiFi by:vvagias
    In NiFi the data being passed between operators is referred to as a FlowFile and can be accessed via various scripting languages in the ExecuteScript operator. In order to access the data in the FlowFile you need to understand a few requirements first.Read more on HCC.

Top 5 Questions — last week

  1. How do you fix login issues after restarting Cloudbreak deployer instance on Amazon?
  2. I am facing issue of huge data in mysql table which is increasing very fast , so to scale what is the other alternative?
  3. can i change log location in HDP installation
  4. Nifi JsonSplit Doesn’t Work
  5. HortonWorks License Vs HortonWorks Free Comparison

 

Come over to HCC and participate.

Leave a Reply

Your email address will not be published. Required fields are marked *

If you have specific technical questions, please post them in the Forums

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>