The Hortonworks Blog

Communication service providers aim to enhance customer experience and build strong and long-lasting relationships with their customers. This has become increasingly difficult as customer interactions now occur across many channels. Hence, it’s important to understand customer behavior across all channels to create the best experience for each individual. Join us on August 5 for a webinar with Hortonworks and Apigee to learn more.

Register Now

In today’s guest blog post, Sanjay Kumar, General Manager, Telecommunications at Hortonworks, and Sanjeev Srivastav, Vice President, Data Strategy at Apigee, discuss how service providers can capture and visualize customer behavior as a graph connecting the interaction points such as IVR, chat and call events, and combine it with network data to predict future call or chat patterns.…

On August 4th at 10:00 am PST, Eric Thorsen, General Manager Retail/CP at Hortonworks and Krishnan Parasuraman, VP Business Development at Splice Machine, will be talking about how Hadoop can be leveraged as a scale-out relational database to be the System of Record and power mission critical applications.

In this blog, they provide answers to some of the most frequently asked questions they have heard on the topic.

Register Now

  • Hadoop is primarily known for running batch based, analytic workloads.
  • On July 22nd, we introduced the general availability of HDP 2.3. In part 2 of this blog series, we explore notable improvements and features related to Data Access.

    We are especially excited about what these data access improvements mean for our Hortonworks subscribers.

    Russell Foltz-Smith, Vice President of Data Platform, at TrueCar summed up the data access impact to his business using earlier versions of HDP, and his enthusiasm for the innovation in this latest release:

    TrueCar is in the business of providing truth and transparency to all the parties in the car-buying process,” said Foltz-Smith.…

    A recent article in PropertyCasualty 360, The Internet of Things: Insurers must prepare for disruption, customer impact, highlights the imperative for insurer strategies that address the emergence of the Internet of Things (IoT) and how it changes customer behaviors and their views of risk. The article predicts that as consumers have more access to data through their connected devices:

    The IoT will fundamentally change what consumers know, when they know it, and how they interact with businesses that serve them.…

    For Hortonworks, working with and enabling the Hadoop ecosystem is one of our core tenants, and we’re proud of the 1,100+ partners that have joined us in the journey to ensure that Open Enterprise Hadoop interoperates with your existing data center technologies. Today, we’re delighted that we have been named to The Channel Company’s exclusive 2015 CRN® Emerging Vendors List. The annual list features technology vendors that have introduced innovative new products, creating opportunities for channel partners in North America to create solutions for customers.…

    We are very pleased to announce that Hortonworks Data Platform (HDP) Version 2.3 is now generally available for download. HDP 2.3 brings numerous enhancements across all elements of the platform spanning data access to security to governance. This version delivers a compelling new user experience, making it easier than ever before to “do Hadoop” and deliver transformational business outcomes with Open Enterprise Hadoop.

    As we announced at Hadoop Summit in San Jose, there are a number of significant innovations as part of this release including:

    HDP 2.3 represents the very latest innovation from across the Hadoop ecosystem.…

    In version 1.2.0, Apache Spark introduced a Data Source API (SPARK-3247) to enable deep platform integration with a larger number of data sources and sinks. We are proud to announce that support for the Apache Optimized Row Columnar (ORC) file format is included in Spark 1.4 as a new data source. This support was added through a collaboration between Hortonworks and Databricks, tracked by SPARK-2883.

    The Apache ORC file format and associated libraries recently became a top level project at the Apache Software Foundation.…

    Apache Spark has garnered a lot of developer attention and is often the top of agenda in my customer interactions. Since we announced support for Spark in HDP, we have seen broad customer adoption of our Spark offering. Our customers love Spark for the simplicity of its API, speed of development and the runtime performance. Spark is also democratizing Machine Learning and making it easier and approachable to more developers.

    Today Microsoft announced support for Spark in HDInsight – this is a big step towards driving customer adoption for Spark workloads on Hadoop clusters in Azure.…

    Drink from Elephant’s Well Of Knowledge

    Developer success starts with open and reusable code, and a community that allows for both consumption of code and contribution of updates to the code base. This success engenders a thriving and evolving community.

    To that end, today we are announcing the Hortonworks Gallery for developers. Located on GitHub, the Gallery brings together the Hortonworks’ Apache Hadoop code, Apache Ambari Views and extensions, as well as related resources into a single view for developers to use within the familiar context of Git and open source software.…

    Early this year, ApacheTM FalconTM became a Top Level Project (TLP) in the Apache Software Foundation.

    The project continues to mature as a framework for simplifying and orchestrating data lifecycle management in Hadoop by offering out-of-the-box data management policies. The Apache Falcon 0.6.1 release builds on this foundation by providing simplified mirroring functionality and a new user interface (UI).

    The community worked very diligently to offer more than 150 product enhancements, and over 30 new features and improvements.…

    Hortonworks is always pleased to see new contributions come into the open-source community. We worked with our customer, Hotels.com, to help them develop libraries and utilities around Apache Hive, the Apache ORC format and Cascading. It’s great to see the results released for the community. In this guest blog, Adrian Woodhead, Big Data Engineering Team Lead at Hotels.com, discusses the CORC project.

    Hotels.com is pleased to announce the open source release of Corc, a library for reading and writing files in the Apache ORC file format using Cascading.…

    The Apache Lucene/Solr community is continuing its rapid release cycles to meet community and customer requirements. In this guest blog, we have invited Sarath Jarugula from Lucidworks to share with us the many improvements in the Apache Solr 5.2 release.

    The Apache Solr community has announced its Solr 5.2 release. Solr 5.2 is a follow-up release to Solr 5.0, a significant major release in February 2015. The community has delivered 25 new features, 5 optimizations, and 38 bug fixes in this release.…

    As YARN drives Hadoop’s emergence as a business-critical data platform, the enterprise requires more stringent data security capabilities. The Apache Ranger delivers a comprehensive approach to security for a Hadoop cluster. It provides a platform for centralized security policy administration across the core enterprise security requirements of authorization, audit and data protection.

    On June 10th, the community announced the release of Apache Ranger 0.5.0. With this release, the community took major steps to extend security coverage for Hadoop platform and deepen its existing security capabilities.…

    Earlier this month, Hortonworks had the pleasure of joining Yahoo! in hosting the 8th Annual Hadoop Summit, the leading conference for the Apache Hadoop community. Summit is always an important and exciting experience, bringing together thought leaders, technologists, and data specialists from throughout the community to explore and advance the art and science of Big Data.

    This year’s event came at a pivotal time for Hadoop and Hortonworks, with news about Open Enterprise Hadoop and the launch of the newest version of Hortonworks Data Platform (HDP 2.3™) poised to transform the way large organizations in every industry process data.…

    In his blog, Tim Hall wrote, “Enterprises are embracing Apache Hadoop to enable their modern data architectures and power new analytic applications. The freedom to choose the on-premises or cloud environments for Hadoop that best meets the business needs is a critical requirement.”

    One of the choices in deploying Hadoop in the cloud environment is with Microsoft Azure using Cloudbreak. Other choices include Google Cloud Platform, Openstack, and AWS.

    But in this blog, I’ll show how you can deploy Hadoop in Azure with few clicks by running HDP multimode cluster in Azure’s Linux VM using Cloudbreak.…