The Hortonworks Blog

It’s no secret that there is a data explosion. A recent IDC analyst report from April 2014 indicated the volume of data, known as the digital universe, is doubling in size every two years. And by 2020, there will be as many digital bits as there are stars in the universe. There are many reasons […]

Provenance, Lineage & Chain of Custody: The models of Provenance, Lineage and Chain of Custody are used in fine art to determine when a piece was created, the sequence of locations where it was held, how it was touched along the way, and who has owned it since creation, all with the purpose of authenticating the piece. […]

People often think about cloud architecture in simplistic terms: you’re either public, private, or hybrid. (In fact, there’s even confusion about the meaning of the term “hybrid” itself.) In the real world, of course, virtually every implementation is hybrid: no company puts 100% of its IT environment into one single cloud. […]

The 100% open source and community-driven innovation of Apache Hive 2.0 and LLAP (Live Long and Process) truly brings agile analytics to the next level. It enables customers to perform sub-second interactive queries without the need for additional SQL-based analytical tools, enabling rapid analytical iterations and providing significant time-to-value. Read about […]

Financial regulators are driving a Data Evolution: Traditionally, technology moves fast and regulators react slowly. When technology leaps forward, it enables financial firms to change the nature of their business, often into unregulated territory; regulators then pass regulation to catch up. This model can work in slow-moving markets, but in today’s interconnected […]

Hadoop’s ability to work with Amazon S3 storage goes back to 2006 and the issue HADOOP-574, “FileSystem implementation for Amazon S3”. This filesystem client, “s3://”, implemented an inode-style filesystem atop S3: it could support bigger files than S3 could then support, and some of its operations (directory rename and delete) were fast. The s3 filesystem allowed Hadoop […]

This is the first of a three-part series on the evolution of the Hortonworks and Microsoft relationship. Microsoft has led one tech industry revolution after another, from the dawn of personal computing to the cloud. Hortonworks is defining a new generation of innovation and impact with its pioneering work in Big Data. You already […]

Cloud Computing is one of the big three trends impacting IT architectures today. What some may not realize is that an underlying connected data architecture is not only essential for cloud, but sits at the confluence of all three trends. Here’s why. The first big trend is IoT. According to BI Intelligence, we can now […]

Hortonworks DataFlow (HDF) 2.0 is now available! HDF is powered by Apache NiFi 1.0.0, which recently underwent a major redesign. Whether you’re a current user or just now planning to try it out, this is exciting news. A lot of new feature content went into this release such as multi-tenancy and zero-master clustering. The purpose […]

I just left a sold-out Melbourne Hadoop Summit 2016 in Australia. This was the first Summit in Asia Pacific and I was excited by the tremendous response from the global and local community, and from regional organizations and businesses. The buzz was everywhere. We’re proud to be the host and the organizer. We couldn’t pull […]

I cannot believe we are less than one week out from Hadoop Summit Melbourne! Following on from our first guest blog from Rikki-Lee Brandon, she’s been kind enough to pull together a list of her top restaurant and bar spots in the City. If you visit any of these recommendations during Hadoop Summit week, please […]

With only three weeks to go until Hadoop Summit Melbourne, anticipation is building for what looks set to be a fantastic event for the Hadoop community. Whether it’s pre-event training, our keynotes, sponsor sessions or the 3-4 business or technical sessions you have to choose from every hour, your agenda is undoubtedly going to be […]

It has been another exciting week on Hortonworks Community Connection (HCC). We continue to see great activity and recommend the following assets from last week. Top Articles from HCC: Implementing a real-time Hive Streaming example, by mjohnson. The Hive Streaming API enables near real-time data ingestion into Hive. This two-part posting reviews some of […]

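Hello everyone, this is Kitase, and I handle marketing at Hortonworks. It has been a month since I joined Hortonworks, so I thought I would try writing a blog post. That said, this one is an introduction to “Hadoop Summit 2016 Tokyo”. The recent “Hadoop Summit 2016 San Jose”, held June 28-30 in San Jose, USA, is covered here, and that excitement is now coming to Japan! On October 26 and 27, the global Apache Hadoop event “Hadoop Summit” will be held in Japan for the first time. The Hadoop Summit organizing committee is currently looking for speakers. If you are interested, please apply. Coverage of Hadoop Summit 2016 San Jose: “Hadoop turns 10, and the main battleground for data analytics shifts to integrating cloud and data center: Hadoop Summit 2016 San Jose report”; “Order and division in the big data ecosystem, as seen at Hadoop Summit”; “The technologies and players are all in place! Kimihiko Kitase’s Hadoop Summit 2016 report”. Hadoop Summit 2016 Tokyo session categories: Business (real-world cases that affected the business); Technical (presentations by Apache committers; application development, analytics, and data science; governance, security, and operations; modern data applications, IoT, and streaming). How to apply: please apply via the link below. Presentations may be given in Japanese, but please submit your application in English. Hadoop Summit 2016 Tokyo Call For Abstracts. Deadline: Friday, August 12. If you have any questions, Melissa […]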

Following the success of our sold-out 2015 Roadshow, we are pleased to announce our worldwide Future of Data Roadshow 2016! The Roadshow brings the innovators driving the future of data to you and offers insightful content for both business and technical attendees. This is an invaluable opportunity to network with leaders who are transforming their business […]