Hadoop Ecosystem

Industry news, partner stories, buzz and happenings

In this post, we’ll walk through the process of deploying an Apache Hadoop 2 cluster on the EC2 cloud service offered by Amazon Web Services (AWS), using Hortonworks Data Platform.

Both EC2 and HDP offer many knobs and buttons to cater to your specific performance, security, cost, data size, data protection and other requirements. I will not discuss most of these options in this blog as the goal is to walk through one particular path of deployment to get started.…

Xplenty is a Hortonworks Technology Partner offering Hadoop as a service. We invited Yaniv Mor, Co-founder and CEO of Xplenty, to be our guest blogger today and share his views on HDP and Hadoop.

We founded Xplenty to make Apache Hadoop easier. A lot easier. We believe Hadoop’s big data revolution should be available for companies of all sizes and intuitive for everyone to use. Whether it’s designing dataflows, setting up clusters, or managing and monitoring them, our platform as a service makes it happen, code-free.…

Apache Accumulo is gaining momentum in markets such as government, financial services and health care for its enhanced security and performance. Hortonworks has a long history with this technology and has multiple committers to the Accumulo project on staff – at least one of whom literally helped to write the book on Accumulo. This has enabled Hortonworks to provide enterprise support for Accumulo within the Hortonworks Data Platform for some time now.…

On Feb 8th and 9th, Hortonworks, Microsoft and Elastacloud will be hosting a hackathon at the Microsoft Campus in Mountain View, CA. Whether you’re a newbie or ninja, developer or scientist, we’d love to see you there. Register here.

The focus of the hackathon will be city datasets. For instance, we’ll be drawing on datasets from San Francisco that will measure things like:

  • Pedestrian safety: where accidents occur, how they occur and who has caused them.

We’re kicking off 2014 with an evolution to our Modern Data Architecture webinar series. Last year we focused on how your existing technologies integrate with Apache Hadoop. This year we will focus on use cases for how Hadoop and your existing technologies are being used to get real value in the enterprise. Join Hortonworks, along with Microsoft, Actian, Splunk and others as we continue our journey on delivering Apache Hadoop as an Enterprise Data Platform.…

This guest blog post is from Syncsort, a Hortonworks Technology Partner and certified on HDP 2.0, by Keith Kohl, Director, Product Management, Syncsort (@keithkohl)

Several years ago, Syncsort set out on a journey to contribute to the Apache Hadoop projects to open and extend Hadoop, and specifically the MapReduce processing framework. One of the contributions was to open the sort, both map side and reduce side, and to make it pluggable. …

This guest post is from Simon Elliston Ball, Head of Big Data at Red Gate and all-round top bloke.

Hadoop is a great place to keep a lot of data. The data lake, the data hub and the data platform: it’s all about the data. So how do you manage that data? How do you get data in? How do you get results out? How do you get at the logs buried somewhere deep in HDFS?…

Today, we are excited to announce the agenda for Hadoop Summit Europe 2014. We welcome you to check it out and hopefully start planning your trip to Amsterdam now!

The call for abstracts for Hadoop Summit Europe was open for just over two months and we received an unbelievable 354 submissions. Wow! Further, as we read through them, the quality was amazing. We quickly surmised that the show was going to be great, but the selection process was going to be rough.…

Microsoft and Hortonworks have been working together for over two years now with the goal of bringing the power of Big Data to a billion people. As a result of that work, today we announced the General Availability of HDP 2.0 for Windows with the full power of YARN.

There are already over half a billion Excel users on this planet.

So, we have put together a short tutorial on the Hortonworks Sandbox where we walk through the end-to-end data pipeline using HDP and Microsoft Excel in the shoes of a data analyst at a financial services firm where she:

  • Cleans and aggregates 10 years of raw stock tick data from NYSE
  • Enriches the data model by looking up additional attributes from Wikipedia
  • Creates an interactive visualization on the model

You can find the tutorial here.…

This guest post is from Eric Hanson, Principal Software Development Engineer on Microsoft HDInsight, and Apache Hive committer.

Hive has a substantial community of developers behind it, including a few from the Microsoft HDInsight team. We’ve been contributing to the Stinger initiative since it was started early in 2013, and have been contributing to Hadoop since October of 2011. It’s a good time to step back and see the progress that’s been made on Apache Hive since fall of 2012, and ponder what’s ahead.…

Congrats to our partner, Revolution Analytics, on the general availability of Revolution R Enterprise 7 (RRE7). With this release, you can now run R natively in Hortonworks Data Platform by simply moving your R-powered analytics to Hadoop. Users will be able to run the high-performance distributed R functions in Revolution R Enterprise without having to move the data out of Hadoop, using the Hadoop nodes as a parallel computation grid.…

Step one in the development of the Hadoop Summit Europe content tracks is complete!  Thank you to everyone who participated in the Hadoop Summit Community Choice voting process. We counted over 14,000 votes, setting a new record for participation in this program. The turnout far exceeded our expectations and it is terrific that the momentum behind Apache Hadoop continues to go from strength to strength… especially in Europe!

Before we announce the winners…

It’s been an amazing year of expansion for the Hadoop ecosystem. If you’re looking to use Hadoop in your infrastructure, see how these hundreds of amazing partners can help. If you would like to become a partner, come talk to us – we’d love to have you on-board.

The Hortonworks partner program has had a terrific year across many different measures, not the least of which is the fact that the Hortonworks partner community grew by more than 240 percent.…

This is a guest post from our partner, Revelytix, who recently created a step-by-step tutorial on using Loom with the Hortonworks Sandbox.

Enterprises are excited about the Hortonworks Data Platform (HDP) for many reasons, such as low cost, scalability, and flexibility. The latter in particular holds out new possibilities for data science. The Hadoop Distributed File System (HDFS) accepts files of any type and format, unlike traditional data warehouses which require a schema up front.…

With businesses demanding faster and easier access to information in order to make reliable and smart decisions, in-memory processing is an emerging technology that is gaining the attention of businesses of all sizes and across industries. Kognitio, a Hortonworks Technology Partner, uses an in-memory technology solution to provide scalable compute power for rapid execution of complex analytical queries.

Join us for the webinar on December 10 at 10am PT / 1pm ET: “The Modern Data Architecture: In-Memory and Hadoop – The New BI”

What is In-Memory Processing?…