The Hortonworks Blog

We are very excited to be bringing you DataWorks Summit/Hadoop Summit this year. It’s the industry’s premier event focusing on next-generation big data solutions. We hope that you’ll be able to attend this year and learn from your peers and industry experts about how open source technologies like Apache Hadoop, Apache Spark, and Apache NiFi […]

As we kick off the new year I wanted to thank our customers, partners, Apache community members, and of course the amazing Hortonworks team, for an amazing 2016. Let’s take a step back and look at some of the Hortonworks highlights from last year… IN THE ECOSYSTEM there was tremendous acceleration. At the beginning of […]

By Bob Glithero, Analytics Product Marketing Manager, Pivotal

Over the last five years, mobile network operators (MNOs) realized 15% lower compound revenue growth on average than other types of communication service providers. With few exceptions, MNOs globally have seen a long-term decline in average revenue per user (ARPU). To reinvigorate growth, innovative MNOs are searching for […]

As you may have read yesterday, we are expanding Hadoop Summit to address the growing ecosystem and bring you DataWorks Summit. DataWorks Summit and Hadoop Summit are global events for business and technical audiences who want to learn how data is transforming business and about the underlying technologies that are driving that change. We are happy […]

Wow, I really can’t believe it has only been one year since we launched Hortonworks Community Connection — HCC. What started as a project to make communication between our technical teams more transparent has blossomed into a fantastic and engaging website. Here are just some of the interesting numbers: There are now over 40,000 assets […]

We were really excited to welcome a sold-out crowd at the first Hadoop Summit in Tokyo last week. This was a fantastic response, based on the huge interest around a technology that is transforming industries across Asia and Pacific. We could not put this kind of conference on without the help of our sponsors […]

I started my journey at Hortonworks a little over five years ago and have been to many Hadoop Summits in the US, Europe and now Asia. I just kicked off a two-day, sold-out show in Tokyo and shared my thoughts and stories on architectures and business transformation. John Kreisa’s post here has […]

Last week we announced third quarter results, and it was a milestone quarter. Customers in 60 countries chose Hortonworks to help with their on-premises, hybrid and public cloud data management strategies. A big thanks to the Hortonworks team. I wanted to stop for a second and note the significance of the fact that we just […]

“The most damaging phrase in the language is, ‘It’s always been done that way.’” – Rear Admiral Grace Hopper, United States Navy

This year Hortonworks was a Gold Sponsor at the Grace Hopper Celebration of Women in Computing Conference in Houston, Texas. This conference is named in honor of Grace Hopper, a pioneer in the […]

It’s no secret that there is a data explosion. A recent IDC analyst report from April 2014 indicated the volume of data, known as the digital universe, is doubling in size every two years. And by 2020, there will be as many digital bits as there are stars in the universe. There are many reasons […]

Provenance, Lineage & Chain of Custody

The models of Provenance, Lineage and Chain of Custody are used in fine art to determine when a piece was created, the sequence of locations where it was held, how it was touched along the way, and who has owned it since creation, all with the purpose of authenticating the piece. […]

People often think about cloud architecture in simplistic terms: you’re either public, private, or hybrid. (In fact, there’s even confusion about the meaning of the term “hybrid” itself.) In the real world, of course, virtually every implementation is hybrid: no company puts 100% of its IT environment into one single cloud. […]

The 100% open source and community-driven innovation of Apache Hive 2.0 and LLAP (Live Long and Process) truly brings agile analytics to the next level. It enables customers to perform sub-second interactive queries without the need for additional SQL-based analytical tools, enabling rapid analytical iterations and providing significant time-to-value. TRY HIVE LLAP TODAY Read about […]

Financial Regulators Are Driving a Data Evolution

Traditionally, technology moves fast and regulators react slowly. When technology leaps forward, it enables financial firms to change the nature of their business, often into unregulated territory; regulators then pass regulation to catch up. This model can work in slow-moving markets, but in today’s interconnected […]

Hadoop’s ability to work with Amazon S3 storage goes back to 2006 and the issue HADOOP-574, “FileSystem implementation for Amazon S3”. This filesystem client, “s3://”, implemented an inode-style filesystem atop S3: it could support bigger files than S3 could then support, and some of its operations (directory rename and delete) were fast. The s3 filesystem allowed Hadoop […]