The Hortonworks Blog

This is the second of a three-part series on the evolution of the Hortonworks and Microsoft relationship. Ask around and you’ll hear a variety of different drivers for moving IT resources to the cloud, from cost to business agility. In fact, as adoption has progressed and companies have learned more about what the cloud makes […]

One of the most enjoyable parts of my job is working with customers and partners who have innovated on the Hortonworks Connected Data Platform. Companies like Servient. Here’s a great real-world example of a recent use case we worked on together for a customer in the energy vertical. I’ve removed the actual name for obvious reasons. […]

Hadoop’s ability to work with Amazon S3 storage goes back to 2006 and the issue HADOOP-574, “FileSystem implementation for Amazon S3”. This filesystem client, “s3://”, implemented an inode-style filesystem atop S3: it could support bigger files than S3 could then support, and some of its operations (directory rename and delete) were fast. The s3 filesystem allowed Hadoop […]

THE MONEY LAUNDERING CHALLENGE CONTINUES UNABATED… As this blog has repeatedly catalogued over the last year here[1], here[2] and here[3], money laundering is a massive global headache and one of the biggest crimes against humanity. Not a month goes by when we do not hear of billions of dollars in ill-gotten funds being stolen from […]

This is the first of a three-part series on the evolution of the Hortonworks and Microsoft relationship. Microsoft has led one tech industry revolution after another, from the dawn of personal computing to the cloud. Hortonworks is defining a new generation of innovation and impact with its pioneering work in Big Data. You already […]

Hortonworks Big-Data Maturity Scorecard v2.0 The Fourth Industrial Revolution is here, and competing to succeed in the 4.0 ‘digital’ world entails making the right decisions based on data-driven pointers to successfully implement your strategy. As we work with the entire stack of Fortune 100 organizations, we often see companies—particularly those operating across business lines […]

Cloud computing is one of the big three trends impacting IT architectures today. What some may not realize is that an underlying connected data architecture is not only essential for cloud, but sits at the confluence of all three trends. Here’s why. The first big trend is IoT. According to BI Intelligence, we can now […]

The Hadoop community is gathering this week to hear from data scientists, innovators and thought leaders on the state of the data industry. A wide range of topics will be covered, ranging from Hadoop use cases to data visualization and user experience. Customers looking for comprehensive solutions to manage all of their data needs rely […]

In the US fast food industry, this is a common question when you order a burger. ‘You want fries with that?’ It’s in the American psyche at this point, and has become common parlance. I recently heard this exchange: ‘Hey, can I get a copy of your targeted promos report?’ ‘Sure! You want […]

It has been another exciting week on Hortonworks Community Connection (HCC). We continue to see great activity and recommend the following assets from last week. Top Articles from HCC: Supporting Custom Properties for Expression Language in Apache NiFi by: ydavis. NiFi has previously supported the ability to refer to flow file attributes, system properties and environment […]

We recently concluded this webinar series, with 7 webinars and 77 questions answered. All webinars, slides, Q&A and related info are available below. Should you have any more questions, anytime, we encourage you to check out the Data Ingestion & Streaming track of Hortonworks Community Connection where an entire community of folks are monitoring and […]

With the release of Hortonworks 2.5 Sandbox, several exciting new features have been added to Apache Spark and Apache Zeppelin. Apache Spark Updates: One of the most powerful new Hortonworks 2.5 Sandbox features is the ability to run two versions of Spark side by side in the same environment: a Generally Available (GA) Spark 1.6.2 and a […]

It’s never been easier to get started with Apache Hadoop. The Hortonworks Sandbox combines 100% open-source Apache Hadoop and its data access engines (Apache Spark, Apache Hive, Apache HBase, Apache Solr, Apache Pig) with enterprise-grade Operations (Apache Ambari), Security (Apache Ranger and Apache Knox) and Governance (Apache Atlas). The Sandbox also provides tools for DevOps, […]

How Hortonworks can help the hotel industry capture value through Insights Aggregation and Predictive Analytics. Big Data has transformed every industry, including the hospitality vertical. Through customer analytics, targeted segmentation, and campaigning, hotels would like to focus on delivering personalized promotions and on cross-selling and up-selling travel services. Our objective is to address these challenges through an open-source […]

Show us what you can do! Here at Hortonworks, we’ve been showing people how fast and easy it is to use Hortonworks DataFlow, powered by Apache NiFi, to securely move data to where you need it. So we thought we’d test it out – and we are offering a speed test challenge! […]