The Hortonworks Blog

Are you still learning about the Data Lake? Wondering how it can help your organization manage and leverage massive amounts of data? On September 8th, VHA, the largest member-owned health care company delivering supply chain management services and clinical services to its members, will share their experience and explain how they simplified data management and enabled faster data discovery with Hadoop and data virtualization.

Register Now

At VHA, product, supplier and member information, among other data, was siloed across multiple sources.…

This is a guest blog post from Jerry Megaro, Merck’s Director of Innovation and Manufacturing Analytics. Jerry established the practice of Data Excellence and Data Sciences within the Merck Manufacturing Division and now leads initiatives to transform Merck Manufacturing into a data-driven organization that enhances the company’s performance across the supply chain.

Hortonworks experience working with top pharma manufacturers indicates an exciting opportunity to improve manufacturing performance by proactively managing process variability.…

All segments of the oil and gas industry are adopting Hadoop, from exploration through to drilling, production, transportation, refining, and retail.

The Hortonworks Oil and Gas team will be demonstrating some of the Hadoop-based advanced analytics applications for the upstream oil and gas industry at PNEC Houston (the International Conference on Petroleum Data Integration, Information, and Data Management) running from May 19-21.

A Transformation in O&G

On a daily basis, the geological and geophysical discipline in upstream oil and gas must deal with a significant number of disparate datasets.…

Next week, in Las Vegas, thousands of attendees will join Informatica World to explore just how far data can take them. Many companies already rely on massive volumes of internal and external data to create new insights and build innovative and profitable business models. Where are you on your journey?

To learn more about how Hortonworks and Informatica partner to optimize the entire big data supply chain on Hadoop and can help you turn data into actionable information to drive business value, join the following sessions:

  • On Tuesday, May 12, during the Big Data Ready Summit, John Kreisa, VP Strategic Marketing at Hortonworks, will be part of the Succeeding with Big Data and Avoiding the Pitfalls panel.

The connected and collected vehicle data, emitted through embedded smart sensors, are transforming the automotive industry. Is this hype or reality?

To discuss the reality of this transformation, to tackle management of streams of data from connected cars, and to share new data architectures that process, manage and analyze volumes of data, automakers and key industry innovators will gather in Berlin for Telematics Berlin 2015 on May 11-12th.

Data Deluge

Because legacy architectures have limited capacity to store streams of unstructured and varied data at petabyte scale, lack the ability to analyze data in real-time and offer value and insights, automakers are looking to next generation data platforms.…

This week we are participating in the Microsoft Ignite conference in Chicago. Microsoft Ignite focuses on all Microsoft technologies and professionals and we are excited to demonstrate all of the ways we’ve been working with Microsoft to Do Hadoop together. As a long time Microsoft partner we are glad to be participating in this event for the 3rd year in a row showing of a history of joint engineering and commitment to the Microsoft platforms and users.…

It’s going to be a big week at EMC World! We’ll be exhibiting at the event and there are a number of opportunities to meet with us and hear about the partnership between EMC and Hortonworks. We look forward to seeing you there!

Booth

Hortonworks will be in booth #132, right next to the EMC Open@EMC booth. We’d love to meet with you to discuss how EMC Isilon and the Hortonworks Data Platform deliver a Modern Data Architecture.…

In today’s healthcare industry, shifting reimbursement models and increasing costs for supplies and labor come at the same time as mandates to improve care delivery while lowering costs. Improved healthcare outcomes should usually come at a higher cost, but more and better data can drive insights and efficiencies that help with both of those opposing pressures—helping care providers create new ways to both practice medicine and do business.

In her blog post entitled “Top 5 Health Care Trends to Watch in 2015,” Susan DeVore described this year’s top healthcare challenges and how the industry is addressing them.…

Having just returned from our Hadoop Summit Europe event, I was struck by the number of sessions that involved large scale businesses outlining the impact of their advanced analytic applications (built on Hadoop) and how those analytics are empowering better business decisions.

The story of business value is significant. Session after session, representatives from various industries talked about how their modern data architectures with Hadoop led to increased agility, new innovative customer experiences, and lower cost structures.…

On April 30, learn from experts at Hortonworks, Cisco, and Red Hat about accelerating the implementation of a scalable, cost-efficient and robust Big Data solution. Here is a sneak preview of what you’ll hear from our speakers:

  • Ali Bajawa, Senior Partner Solution Engineer, Hortonworks
  • Ron Graham, System Engineer for Big Data Analytics, Cisco
  • Irshad Raihan, Senior Principal, Big Data Product Marketing, Red Hat

Register Now

1. What should a company consider when looking for a big data solution?…

Waterline Data is a Hortonworks Technology Partner and recently earned HDP Certification and YARN Ready with their solution that automates the inventory of data assets in the data lake, enables data governance, and provides self-service to data engineers and data scientists to find and understand their data. Learn more by joining the upcoming webinar on May 6, download the Sandbox tutorial or joint whitepaper. Our guest blogger is Oliver Claude, CMO at Waterline Data.…

Can you identify the unused data in your data warehouse? Are you using your “big data” efficiently? Are your data migration projects cost effective? Is your data in compliance with industry regulations? If you answered “no” to any or all of these questions, then you may want to learn more about how to optimize your data warehouse.

On April 23rd at 11:00 am PST, Adis Cesir, Big Data Solution Engineer at Hortonworks, Ramu Kalvakuntla, Principal at RCG Global Services Big Data Practice, and Santosh Chitakki, Director of Product Management at Attunity, will be telling us more about rebalancing data warehouses and integrating your current enterprise data warehouse with a Modern Data Architecture.…

Hortonworks is pleased to announce the general availability of Apache Spark in Hortonworks Data Platform (HDP)— now available on our downloads page. With HDP 2.2.4 Hortonworks now offers support for your developers and data scientists using Apache Spark 1.2.1.

HDP’s YARN-based architecture enables multiple applications to share a common cluster and dataset while ensuring consistent levels of service and response. Now Spark is one of the many data access engines that works with YARN and that is supported in an HDP enterprise data lake.…

Today EMC is launching their EMC® Business Data Lake solution, the first fully-engineered, enterprise-grade solution for a Data Lake running on EMC infrastructure. At Hortonworks, we’ve been assisting customers on their journey to a data lake via a Modern Data Architecture (MDA) and our vision and EMC’s vision are highly complementary and so we’re delighted to be part of the EMC Business Data Lake.

The Data Lake enabled by a Modern Data Architecture allows enterprises to be a Data-First Enterprise.…

Forrester recently called Apache Hadoop adoption “mandatory” for the enterprise. For most organizations, moving forward with Hadoop is no longer a question of if, but when. Hadoop-powered insight into big data is enabling market disruption in every industry and the market winners are those who handle that data most effectively and at the lowest cost.

As with any new platform, making decisions on how best to implement and for what purpose can be challenging.…