Get Started


Ready to Get Started?

Download sandbox

How can we help you?

closeClose button

The Hortonworks Blog

It’s no secret that there is a data explosion. A recent IDC analyst report from April 2014 indicated the volume of data, known as the digital universe, is doubling in size every two years. And by 2020, there will be as many digital bits as there are stars in the universe. There are many reasons […]

Provenance, Lineage & Chain of Custody The models of Provenance, Lineage and Chain of Custody are used in fine art to determine when a piece was created, the sequence of locations where it was held, how it was touched along the way, and who has owned it since creation, all with the purpose of authenticating the piece. […]

The 100% open source and community driven innovation of Apache Hive 2.0 and LLAP (Long Last and Process) truly brings agile analytics to the next level. It enables customers to perform sub-second interactive queries without the need for additional SQL-based analytical tools, enabling rapid analytical iterations and providing significant time-to-value. TRY HIVE LLAP TODAY Read about […]

The Financial regulators are driving a Data Evolution Traditionally technology moves fast, regulators react slow. When technology leaps forward, it enables financial firms to change the nature of their business – often into un-regulated territory; Regulators react to pass regulation to catch up. This model can work in slow moving markets, but in todays interconnected […]

Hadoop’s ability to work with Amazon S3 storage goes back to 2006 and the issue HADOOP-574, “FileSystem implementation for Amazon S3”. This filesystem client, “s3://” implemented an inode-style filesystem atop S3: it could support bigger files than S3 could then support, some its operations (directory rename and delete) were fast. The s3 filesystem allowed Hadoop […]

Hortonworks DataFlow (HDF) 2.0 is now available! HDF is powered by Apache NiFi 1.0.0, which recently underwent a major redesign. Whether you’re a current user or just now planning to try it out, this is exciting news. A lot of new feature content went into this release such as multi-tenancy and zero-master clustering. The purpose […]

It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. Top Articles from HCC Implementing a real-time Hive Streaming example by:mjohnson The Hive Streaming API enables the near real-time data ingestion into Hive. This two part posting reviews some of […]

みなさま Hortonworksでマーケティングを担当している北瀬です。Hortonworksに入社して1か月がたちましたので、ブログなど書いてみようかと思います。と言っても今回は「Hadoop Summit 2016 Tokyo」の紹介になります。 先日、6月28日〜30日、アメリカ、サンノゼで行われていた「Hadoop Summit 2016 San Jose」の様子はこちらで紹介されていますが、その熱気が日本にもやってきます!10月26日、27日に日本で初めてApache Hadoopのグローバルイベント「Hadoop Summit」が開催されます。只今、Hadoop Summit 実行委員会ではスピーカーを募集しています。ご興味ある方は、是非ご応募ください。 Hadoop Summit 2016 San Joseの様子 10周年を迎えたHadoop,データ分析の主戦場はクラウドとデータセンターの連携に ―「Hadoop Summit 2016 San Jose」レポート Hadoop Summitに見る、ビッグデータエコシステムの秩序と分断 テクノロジとプレーヤーが出揃った!北瀬公彦の「Hadoop Summit 2016」レポート Hadoop Summit 2016 Tokyo セッション募集カテゴリー ビジネス ビジネスに影響を与えた実際の事例 テクニカル Apache コミッターによる発表 アプリケーション開発、分析、データサイエンス ガバナンス、セキュリティ、運用管理 モダンデータアプリケーション、IoT、ストリーミング 応募に関して 応募方法: 下記より応募ください。発表は日本語でも問題ありませんが、応募に関しては英語お願いいたします。 Hadoop Summit 2016 Tokyo Call For Abstracts 締め切り: 8月12日(金) 何かご質問などありましたら、Melissa […]

It has been another exciting week on Hortonworks Community Connection HCC. We have lots of great technical content and are continuing to see great activity. We recommend the following assets from last week: Top Articles from HCC Adding KDC Administrator Credentials to the Ambari Credential Store by:rlevas Rack Awareness by:rbiswas Spark+Pycharm+Pybuilder on Docker by:smanjee YARN […]

Early this year, we announced our partnership with Pivotal and Syncsort,  incorporating key technologies from the ecosystem to optimize the value from Hortonworks Connected Data Platforms. Today, I am very excited to announce an addition with our partnership to provide global access to and resell AtScale. Customers are constantly asking us to find simpler, faster […]

About the author: Paul Boal is the big data practice lead at Amitech Solutions. At StampedeCon in St. Louis on July 26-28, 2016 he will be presenting more details on the use of NiFi and Hadoop to manage and analyze data from wearable fitness devices in a population health management solution with Big Cloud Analytics. […]

The previous two posts have covered the business & strategic need for Wealth Management IT applications to reimagine themselves to support their clients. How is this to be accomplished and what does a candidate architectural design pattern look like? What are the key enterprise wide IT concerns? This third & final post (3/3) tackles these questions. […]

There were a lot of great activities and sessions at the recent Apache: Big Data North America in Vancouver, B.C. I enjoyed the technical level of the sessions and meeting others who contribute to projects in the Apache Software Foundation (ASF). The sessions I went to had a high level of interesting technical content, with […]

  At Hortonworks, we work with hundreds of enterprises to ensure they get the most out of Apache Hadoop and the Hortonworks Data Platform. A critical part of making that possible is ensuring operators can quickly identify the root cause if something goes wrong. A few weeks ago, we presented our vision for Streamlining Apache […]

by Tendu Yogurtcu, PhD – General Manager of Big Data, Syncsort This week, Hortonworks announced an exciting expansion of our long-standing partnership. Hortonworks will now resell Syncsort’s leading Hadoop data integration software, DMX-h for onboarding ETL processing in Hadoop. DMX-h will enable our joint customers to easily access and collect data from a diverse set […]