The Hortonworks Blog

Posts categorized by: Hadoop Ecosystem

Microsoft and Hortonworks have been working together for over two years now with the goal of bringing the power of Big Data to a billion people. As a result of that work, today we announced the General Availability of HDP 2.0 for Windows with the full power of YARN.

There are already over half a billion Excel users on this planet.

So we have put together a short tutorial on the Hortonworks Sandbox that walks through an end-to-end data pipeline using HDP and Microsoft Excel, from the perspective of a data analyst at a financial services firm who:

  • Cleans and aggregates 10 years of raw stock tick data from NYSE
  • Enriches the data model by looking up additional attributes from Wikipedia
  • Creates an interactive visualization on the model

You can find the tutorial here.…

In God we trust, all others must bring data.
Dr. W. Edwards Deming
Dr. W. Edwards Deming was a statistician and manufacturing consultant who worked on Japanese reconstruction after WWII. His quality control methods influenced innovative Japanese manufacturing processes that simultaneously increased volume, reduced cost, and improved quality. Near the end of his career, Deming taught the same lessons to U.S. automakers.

To this day, the “Deming Prize” is one of the highest awards for Total Quality Management in the world.…

It’s been an amazing year of expansion for the Hadoop ecosystem. If you’re looking to use Hadoop in your infrastructure, see how these hundreds of amazing partners can help. If you would like to become a partner, come talk to us – we’d love to have you on-board.

The Hortonworks partner program has had a terrific year across many different measures, not the least of which is that the Hortonworks partner community grew by more than 240 percent.…

2013 was certainly a revealing year for the Enterprise Hadoop market. We witnessed the emergence of the YARN-based architecture of Hadoop 2 and a strong ecosystem embracement that will fuel its next big wave of innovation. The analyst community accurately predicted Hadoop’s market momentum would greatly accelerate, but none predicted a pure play vendor would publicly declare its intent to pivot away from the Enterprise Hadoop market. Interesting times indeed!

Join us on Tuesday January 21st where we’ll be covering the Enterprise Hadoop State of the Union in more detail.…

This is a guest post from our partner, Revelytix who recently created a step-by-step tutorial on using Loom with the Hortonworks Sandbox. 

Enterprises are excited about the Hortonworks Data Platform (HDP) for many reasons, such as low cost, scalability, and flexibility. The latter in particular holds out new possibilities for data science. The Hadoop Distributed File System (HDFS) accepts files of any type and format, unlike traditional data warehouses which require a schema up front.…

With businesses demanding faster and easier access to information in order to make reliable and smart decisions, in-memory processing is an emerging technology that is gaining the attention of businesses of all sizes and across industries. Kognitio, a Hortonworks Technology Partner, uses an in-memory technology solution to provide scalable compute power for rapid execution of complex analytical queries.

Join us for the webinar on December 10 at 10am PT / 1pm ET “The Modern Data Architecture: In-Memory and Hadoop – The New BI”

What is In-Memory Processing?

Recently, SAP and Hortonworks announced the next step in their relationship, in which SAP resells and provides enterprise support for the Hortonworks Data Platform. Since then, we’ve been working together to showcase how SAP HANA + Hortonworks Data Platform provide “Instant Insight and Infinite Scale”. The combination of HANA and the Hortonworks Data Platform is a perfect match. SAP HANA uniformly amplifies the value of Big Data across this data fabric including large data sets that are stored in Hadoop.…

Hortonworks customers can now enhance their Hadoop applications with Elasticsearch real-time data exploration, analytics, logging and search features, all designed to help businesses ask better questions, get clearer answers and better analyze their business metrics in real-time.

Hortonworks Data Platform and Elasticsearch make for a powerful combination of technologies that are extremely useful to anyone handling large volumes of data on a day-to-day basis. With the ability of YARN to support multiple workloads, customers with current investments in flexible batch processing can also add real-time search applications from Elasticsearch.…

Join Hortonworks and Pactera for a webinar on Unlocking Big Data’s Potential in Financial Services on Thursday, November 21st at 12:00 EST.

Have you ever had your debit or credit card declined for seemingly no reason? Turns out, the rejections are not so random. Banks are increasingly turning to analytics to predict and prevent fraud in real-time. That can sometimes be an inconvenience for customers who are traveling or making large purchases, but it’s a necessary inconvenience today in order for banks to reduce billions in losses due to fraud.…

We had a lot of fun in NYC and hope you did too. Thanks to the hundreds of you who dropped by the booth, attended dinners, parties, meetups and sessions.

As we have known for some time, Hortonworks customers are already building a modern data architecture with Hadoop as the technology of choice for handling the data they have streaming in from all directions. They care that it matches their needs, integrates with their existing infrastructure and solves real problems with flexibility.…

One of the great things about working in open source development is working with other experts around the world on big projects – and then having the results of that work in the hands of users within a short period of time.

This is why I’m really excited about the Rackspace announcement of their HDP-based Big Data offerings, both “on-prem” and in cloud. Not just because it’s partners of ours offering a service based on Hadoop, but because it shows how Hadoop integration with OpenStack has reached a point where it’s ready for production use.…

You’re a Java developer, you use Spring and you’re just itching to get your arms around some big data. Well, now you can do that even more easily than before, as we announced this morning that Spring is now certified for Hortonworks Data Platform.

To celebrate this development, we have a community tutorial for Sandbox (1.3 currently) that shows you how to use Spring XD to collect data streamed from Twitter, load it into HDFS and then run simple sentiment analysis with Apache Hive.…

Today we announced the expansion of our strategic relationship with HP, enabling HP to resell Hortonworks Data Platform (HDP). As data volumes grow and new data sources emerge, it is important for enterprises to have access to production-ready enterprise Apache Hadoop to meet their big data needs.

With HDP, HP customers can now seamlessly incorporate Hadoop into their modern data architectures to power a variety of new applications and to support existing ones with additional data sources.…

Today our partner Rackspace announced their Big Data solution for dedicated and cloud environments, powered by Hortonworks Data Platform. This collaboration between Hortonworks and Rackspace provides customers a flexible choice of deployment offerings of Apache Hadoop from one of the most trusted vendors in the cloud computing market.

Enterprise adoption of Apache Hadoop

This expanded collaboration is a strong indicator of the ecosystem rallying around Hortonworks Data Platform and our goal at Hortonworks of making Apache Hadoop a core component of the modern data architecture, whether on premise, in a VM, as an appliance, or in the cloud.…

Behind all the Big Data hype, there is one common thread: Apache Hadoop and its associated components ARE the technology platform of choice. And here at Hortonworks, that’s what we do: Hadoop.

That is also why we are so excited about the incredible growth in customers who have chosen to work with us to ensure their implementation of Hadoop and realize their vision of a modern data architecture.

Here are the key reasons we believe that we can best help your enterprise with Apache Hadoop.…
