The Hortonworks Blog

With the San Jose DataWorks Summit (June 13-15) just two months away, we’re busy finalizing the lineup of an impressive array of speakers and business use cases. This year our Enterprise Adoption Track will include Nick Evans and Kevin Brown from ExxonMobil with Wade Salazar from Hortonworks. Big Data is driving major advances in the oil […]

Hive View 2.0 is New in Apache Ambari 2.5
Ambari’s Hive View gives analysts and DBAs a convenient web interface to Apache Hive for SQL analytics, data management and performance diagnostics. Ambari 2.5 introduces Hive View 2.0 with a brand new user experience plus a slew of great new tools to help DBAs run […]

You may have read yesterday’s blog post that summarizes how Yahoo! Japan, the largest web portal in Japan, scaled its business analytics with access to over 75 petabytes of data in Hortonworks Data Platform (HDP). You can read the full English translation of that Japanese case study here. This post is about SoftBank Corp, another […]

Andrew Ng, the renowned data scientist, has said that artificial intelligence (AI) needs to be a company-wide strategic decision. Companies that don’t strategically invest in AI will slowly lose market share to companies whose core businesses are built around AI. AI enables the prediction, planning and automation of a variety of tasks, and for enterprises, […]

This is a guest blog post by Charles Boicey, Chief Innovation Officer at Clearsense. Clearsense was born out of a passion for helping healthcare organizations realize the promise of their data and its ability to help them make better, faster clinical decisions—to meet the challenges of value-based care, drive research, improve patient care, and ultimately […]

This week I attended the 2017 Automotive Cyber Security Summit in Detroit with my colleague Mike Schiebel (General Manager, Cyber Security, Hortonworks). Together, we spoke in a session titled “Securing the Connected Car in a Connected World”. Here are highlights of what we presented:
How Did We Get Here? A Historical Perspective
As the […]

OPEN SOURCE HADOOP NOW RUNS ON AN OPEN COMPUTE PLATFORM
The software market is undergoing a major transition, moving away from proprietary software that leads to customer lock-in. Open source software offers freedom, more flexibility, and faster innovation, all at a lower cost. With the release of HDP 2.6 now available on IBM Power Systems, […]

HDP 2.6 takes a huge step forward toward true data management by introducing SQL-standard ACID Merge to Apache Hive. As scalable as Apache Hadoop is, many workloads don’t work well in the Hadoop environment because they need frequent or unpredictable updates. Updates using hand-written Apache Hive or Apache Spark jobs are extremely complex. Not only […]

Now Generally Available in HDP 2.6
Hive LLAP (Low Latency Analytical Processing) is Hive’s new architecture that delivers MPP performance at Hadoop scale through a combination of optimized in-memory caching and persistent query executors that scale elastically within YARN clusters.
Hive LLAP: MPP Performance at Hadoop Scale
Since Hive LLAP was introduced as […]

Human Assisted AI
Another common trend is pairing humans with Artificial Intelligence (AI) to evaluate its results. As great and sensational as AI has been made out to be recently, it is still a long way from having human-like abilities of comprehension, reasoning and intuition. For instance, in radiology, given lymph node cells, AI alone had 7.5 percent […]

Large-scale Machine Learning
Machine Learning, the ability to learn without being explicitly programmed, has been around for a long time and is well understood. What is different is the relatively recent emergence of general purpose tools, such as Apache Spark, that enable processing of very large datasets. Additionally, data scientists can now collaborate and rapidly […]

The automotive industry is going through profound change driven by a myriad of new data sources and the new business models that they enable. Just last week, it was reported that Intel paid $15B for Mobileye, a technology company that allows autonomous vehicles to “see” through cameras and other sensors. Mobileye also allows crowdsourcing data […]

Hortonworks continues to expand its list of customers in the Asia Pacific region, as well as in the housing and building industry. We recently completed a case study to showcase how LIXIL Corporation uses HDP to be first in manufacturing for the Japanese Smart Home Market.
READ THE FULL LIXIL CASE STUDY HERE
LIXIL is a […]

Did you know every Hortonworks HDP support subscription comes with SmartSense?
Advanced Analytics of Diagnostic Data Prevents Issues
SmartSense uses advanced analytics to make suggestions and recommendations based on the deep knowledge of our Hortonworks engineers and committers to prevent issues and improve performance of your HDP cluster. Based on the diagnostic data collected from […]

In this blog, we will be discussing SAS® Grid Manager for Hadoop. There are some very compelling reasons to modernize data architectures with Hadoop. Anyone responsible for administering SAS workloads on Hadoop or considering this path should know about SAS Grid Manager for Hadoop.
What is SAS Grid Computing?
SAS Grid Computing has been offering […]