cta

Get Started

cloud

Ready to Get Started?

Download sandbox

How can we help you?

closeClose button

From the Dev Team

Originally posted in HCC 1. Introduction NiFi is a powerful and easy to use technology to build dataflows from diverse sources to diverse targets while transforming and dynamically routing in between. NiFi is packaged in HDF 2.0 which (in addition to bundling Kafka and Storm for a complete data movement platform) pushes NiFi to enterprise […]

Apache Spark has been Open Source’s new kid on the block. Companies are using Spark to develop sophisticated models that would enable them to discover new opportunities or avoid risk. But what does the future or at least the near future hold for Spark? In this blog we have outlined five trends we see in […]

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. Top Articles from HCC One Way Trust – MIT KDC to Active Directory by:emaxwell One Way Trust – MIT KDC to Active Directory Many security environments have strict policies on […]

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. Top Articles from HCC Supporting Custom Properties for Expression Language in Apache NiFi by:ydavis NiFi has previously supported the ability to refer to flow file attributes, system properties and environment […]

With the release of Hortonworks 2.5 Sandbox several new exciting features have been added to Apache Spark and Apache Zeppelin. Apache Spark Updates One of the most powerful new Hortonworks 2.5 Sandbox features is the ability to run two versions of Spark alongside in the same environment: a Generally Available (GA) Spark 1.6.2 and a […]

It’s never been easier to get started with Apache Hadoop. The Hortonworks Sandbox combines 100% open-source Apache Hadoop and its data access engines (Apache Spark, Apache Hive, Apache HBase, Apache Solr, Apache Pig) with enterprise-grade Operations (Apache Ambari), Security (Apache Ranger and Apache Knox) and Governance (Apache Atlas).  The Sandbox also provides tools for devOps, […]

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. How to pull data from Twitter and push data to Elasticsearch using Apache NiFi. by:myoung Short tutorial that walk you through the process of using NiFi to pull data from Twitter and […]

Originally posted in HCC – Hortonworks Community Connection Prerequisites Download HDP Sandbox MySQL database (Should already be present in the sandbox) Nifi 0.6 or later ( Download and install a new version of NIFI or use Ambari to install NIFI in the sandbox) MySQL setup (Source Database) In this setup we will create a table in […]

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. Top Articles from HCC Apache NiFi 1.0.0 – Zero-Master Clustering by:mpayne One of the most highly anticipated features of Apache NiFi 1.0.0 is the introduction of Zero-Master Clustering. Read more… […]

Originally posted in HCC. Ambari Views Server is the Standalone Ambari Server used for hosting Views and Ambari Server is the Operational Ambari Server which manages a Hadoop Cluster Before Ambari 2.4, when Ambari Views Servers are setup, the only way to configure views was to use ‘Custom Configuration’. In this method details had to […]

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. Top Articles from HCC An introduction to Ambari Views 2.4 new feature- Remote cluster configuration by:abilgi This article discusses this new feature. Ambari Views Server is the Standalone Ambari Server […]

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. Top Articles from HCC Pig Doing Yoga: How to Build Superflexible Pig Scripts by:gkeys We know that parameter passing is valuable for pig script reuse. One lesser known understanding is […]

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. Top Articles from HCC Top Articles from HCC Hive on Tez vs PySpark for weblogs parsing by:bmathew Synopsis Both Pig and Spark (PySpark) excel at iterative data processing against weblogs […]

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. Top Articles from HCC Nifi 1.0.0 Beta UI Introduction by:hsowell The Apache Nifi community recently released the beta version of Apache Nifi 1.0.0. This version comes with significant updates, which […]

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. Top Articles from HCC Implementing a real-time Hive Streaming example by:mjohnson The Hive Streaming API enables the near real-time data ingestion into Hive. This two part posting reviews some of […]