The Hortonworks Blog

Are your business users able to quickly access and report on the massive amount of data flowing into Hadoop? Learn how leading companies are already accelerating the speed of innovation using the combination of Hadoop and the Actian Analytics Platform. In this webinar, Hortonworks and Actian will describe how you can:

  • Deploy modern data architecture accessible to business users
  • Combine flows from any data source and Hadoop to deliver transformative business value
  • Unleash business analysts on Hadoop data without coding
  • Deliver analytic results right at the point of business: the right offer, at the right price, to the right prospect, in a timeframe of relevance

Register now to accelerate Big Data 2 with Hortonworks and Actian!…

There is no doubt that Hadoop has proven its value for many companies, whether through more efficient use of resources or through new business value derived from new sets of data. However, the limited availability of trained personnel who have the skills needed to develop and integrate with Hadoop has proven a difficult obstacle for many organizations to overcome.

Please join Talend and Hortonworks for this webinar, where we present an end-to-end use case across data load, processing and delivery of results for analysis of machine/sensor data without writing a line of code.…

Today we’re delighted to announce our acquisition of XA Secure to provide comprehensive security capabilities for Enterprise Hadoop. Please join us in welcoming XA Secure to the Hortonworks family.

Register for the Webinar

Hortonworks Data Platform has seen phenomenal adoption across an ever-growing number of organizations. As part of that adoption, and thanks to Apache Hadoop YARN, businesses are moving from single-purpose Hadoop clusters to a versatile, integrated data platform hosting multiple business applications – combining data sets with diverse processing needs in one place.…

Last week Vinay Shukla and Kevin Minder hosted the first of our seven Discover HDP 2.1 webinars. Vinay and Kevin covered three important topics related to new Apache Hadoop security features in HDP 2.1:

  • REST API security with Apache Knox Gateway
  • HDFS security with Access Control Lists (ACLs)
  • SQL security and next-generation Hive authorization

Here is the complete recording of the webinar.

Here are the presentation slides: http://www.slideshare.net/hortonworks/discoverhdp21security

Attend our next Discover HDP 2.1 webinar tomorrow, Thursday, May 15 at 10am Pacific Time: Interactive SQL Query in Hadoop with Apache Hive

We’re grateful to the many participants who joined and asked excellent questions.…

For years, experts in the healthcare industry have been searching for ways to detect (and possibly cure) Alzheimer’s disease, the most common form of dementia. Current estimates indicate that 35.6 million people are living with dementia, projected to jump to 135 million by 2050, according to the Global CEO Initiative on Alzheimer’s Disease. At a projected cost of over $600 billion each year, it’s a looming global health and fiscal crisis.…

RainStor is a Hortonworks Certified Technology Partner and provides an efficient database that reduces the cost, complexity and compliance risk of managing enterprise data. RainStor’s patented technology enables customers to cut infrastructure costs and scales anywhere: on-premise, in the cloud, and natively on Hadoop. RainStor’s customers include 20 of the world’s largest communications providers and 10 of the biggest banks and financial services organizations.

RainStor’s Mark Cusack, Chief Architect, writes about the benefits of certification on HDP 2.1.…

Join us for an update on Hortonworks strategy and market perspective, successes and key learnings, exclusively for Hortonworks partners and prospective partners. Hortonworks’ executives are hosting this informative afternoon with the goal of helping grow the Hadoop ecosystem and your big data business. The event will take place the day before Summit begins, on June 2nd from 2-5pm at the San Jose Convention Center. Register here; space is limited.

Note: This is not part of the Hadoop Summit.…

Syncsort is a Hortonworks Certified Technology Partner and has over 40 years of experience helping organizations integrate big data…smarter. Keith Kohl, Director of Product Management, Syncsort, is our guest blogger. Below he talks about the importance of certification and how it benefits Syncsort’s customers and prospects interested in Hadoop.

Back in January, Syncsort announced our partnership with Hortonworks and the certification of DMX-h on HDP 2.0. I was also given the opportunity to write a guest blog on the Hortonworks site about HDP 2 and the GA of YARN (thanks Hortonworks!).…

There’s no denying that the information collected by Big Data architectures such as Hortonworks Data Platform (HDP) is revolutionizing how enterprises view and understand their business. The data contains deep insights into many aspects of the business such as sales, customer trends and buying patterns.

The problem has been not only how to extract those insights from the data but how to get them quickly and easily into the hands of the people who need them the most.…

Fino Consulting is a new Consulting and Systems Integration Partner of Hortonworks serving Fortune 1000 companies with winning business solutions through data science. Fino is an early mover in cloud computing, challenging clients to “Re-think what they know about cloud-computing” to build high-performance sustainable applications and stretch the boundaries of enterprise data. Fino uses HDInsight from Microsoft for client solutions because of its versatile, cloud-based data platform that manages data of any type, while leveraging all the features and functionality of Microsoft’s resources.…

I’m a pretty heavy Unix user and I tend to prefer doing things the Unix Way™, which is to say, composing many small command-line utilities. With composability comes power, and with specialization comes simplicity. Still, when two utilities are used together all the time, it sometimes makes sense to have either:

  • A utility that specializes in a very common use-case
  • One utility to provide basic functionality from another utility

For example, one thing that I find myself doing a lot of is searching a directory recursively for files that contain an expression:

find /path/to/root -exec grep -l "search phrase" {} \;

Despite the fact that you can do this, specialized utilities such as ack have emerged to simplify this style of querying.…
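As a rough sketch of the two styles side by side, the snippet below runs the composed find-plus-grep form and then a single-utility form, grep with its own -r (recursive) flag, against a throwaway directory. The temporary directory, the file names and the variable names are all invented for this demo, and the -type f test is an addition here so that grep is only handed regular files.

```shell
# Hypothetical demo tree: the directory layout and file contents are made up.
root=$(mktemp -d)
mkdir -p "$root/sub"
printf 'nothing to see here\n' > "$root/a.txt"
printf 'contains the search phrase\n' > "$root/sub/b.txt"

# Composed form: find walks the tree, grep -l prints the names of matching files.
found_by_find=$(find "$root" -type f -exec grep -l "search phrase" {} \;)

# Single-utility form: grep recurses by itself with -r.
found_by_grep=$(grep -rl "search phrase" "$root")

echo "$found_by_find"
echo "$found_by_grep"

# Clean up the demo tree.
rm -rf "$root"
```

Both commands should list the same single file; the second simply folds the directory walk into grep.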

Hadoop 2 and its YARN-based architecture have increased interest in running new engines on Hadoop, and one such workload is in-memory computing for machine learning and data science use cases. Apache Spark has emerged as an attractive option for this type of processing, and today we announce the availability of our HDP 2.1 Tech Preview Component of Apache Spark. This is a key addition to the platform and brings another workload supported by YARN on HDP.…

The term BoF session was first used at the Digital Equipment Computer Users’ Society (DECUS) conference in the 1960s. Its essence was to bring together like minds and thought leaders, just as birds of a feather flock together, to share and exchange computing ideas in an informal yet spirited way. Since then, the organizers and sponsors of most computing conferences have been loyal to its essence and spirit.

For ideas and innovation happen in collaboration—not in isolation. …

This is the second post in our series on the motivations and architecture for improvements to Apache Hadoop YARN’s Resource Manager Restart resiliency. Others in the series are:

Introduction: Phase I – Preserve Application-queues

In the introductory blog, we previewed what RM Restart Phase I entails. In essence, we preserve the application-queue state into a persistent store and reread it upon RM restart, eliminating the need for users to resubmit their applications.…

This is the first post in our series on the motivations and architecture for improvements to Apache Hadoop YARN’s Resource Manager Restart resiliency. Others in the series are:

The Resource Manager (RM) is the central authority of Apache Hadoop YARN for resource management and scheduling. It is responsible for allocating resources to applications such as Hadoop MapReduce jobs, Apache Tez DAGs, and other applications running atop YARN.…
