<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Hortonworks</title>
	<atom:link href="http://hortonworks.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://hortonworks.com</link>
	<description>Develops, Distributes and Supports Enterprise Apache Hadoop.</description>
	<lastBuildDate>Fri, 24 May 2013 22:09:20 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
		<item>
		<title>Get Started with Hadoop on Hortonworks Data Platform 1.1 for Windows</title>
		<link>http://hortonworks.com/blog/get-started-with-hadoop-on-hortonworks-data-platform-1-1-for-windows/</link>
		<comments>http://hortonworks.com/blog/get-started-with-hadoop-on-hortonworks-data-platform-1-1-for-windows/#comments</comments>
		<pubDate>Fri, 24 May 2013 15:55:09 +0000</pubDate>
		<dc:creator>Rohit Bakhshi</dc:creator>
				<category><![CDATA[Apache Hadoop]]></category>

		<guid isPermaLink="false">http://hortonworks.com/?p=26238</guid>
		<description><![CDATA[<p><p>We are excited to <a href="http://hortonworks.com/blog/hadoop-hadoop-hurrah-hdp-for-windows-is-now-ga/">release the Hortonworks Data Platform 1.1</a> for Windows as a Generally Available product. In this blog post, I&#8217;m going to outline how to get started with HDP 1.1 for Windows.</p>
<p><a href="http://hortonworks.com/downloads"></a>With HDP for Windows, you can deploy Apache Hadoop and the HDP stack of components natively on a Windows Server cluster. The <a href="http://hortonworks.com/thankyou-hdp11-win/">HDP for Windows download</a> includes an MSI and remote installation scripts. With these artifacts, you can setup a multi-node Hadoop cluster in either a Workgroup or Active Directory Domain networking configuration. This enables HDP for Windows to be deployed for production use in Windows Data centers.</p>
<p>The best way to get started and evaluate HDP is to set up a single node cluster. We&#8217;ve written a <a href="http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-Win-1.1.0/bk_installing_hdp_for_windows/content/win-chap2-singlenode.html">quick start guide</a> that walks you through all the pre-requisites and install steps needed to get going. With a single node cluster, you can experience the full functionality of the product &#8211; load data into HDFS, execute Hive, Pig and MapReduce jobs, schedule processing workflows through Oozie.&#8230;</p></p><p>The post <a href="http://hortonworks.com/blog/get-started-with-hadoop-on-hortonworks-data-platform-1-1-for-windows/">Get Started with Hadoop on Hortonworks Data Platform 1.1 for Windows</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>We are excited to <a href="http://hortonworks.com/blog/hadoop-hadoop-hurrah-hdp-for-windows-is-now-ga/">release the Hortonworks Data Platform 1.1</a> for Windows as a Generally Available product. In this blog post, I&#8217;m going to outline how to get started with HDP 1.1 for Windows.</p>
<p><a href="http://hortonworks.com/downloads"><img class="size-medium wp-image-26243 alignright" alt="HDP for Windows" src="http://hortonworks.com/wp-content/uploads/2013/05/hdpwin-300x224.png" width="300" height="224" /></a>With HDP for Windows, you can deploy Apache Hadoop and the HDP stack of components natively on a Windows Server cluster. The <a href="http://hortonworks.com/thankyou-hdp11-win/">HDP for Windows download</a> includes an MSI and remote installation scripts. With these artifacts, you can setup a multi-node Hadoop cluster in either a Workgroup or Active Directory Domain networking configuration. This enables HDP for Windows to be deployed for production use in Windows Data centers.</p>
<p>The best way to get started and evaluate HDP is to set up a single node cluster. We&#8217;ve written a <a href="http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-Win-1.1.0/bk_installing_hdp_for_windows/content/win-chap2-singlenode.html">quick start guide</a> that walks you through all the pre-requisites and install steps needed to get going. With a single node cluster, you can experience the full functionality of the product &#8211; load data into HDFS, execute Hive, Pig and MapReduce jobs, schedule processing workflows through Oozie.</p>
<p>HDP enables seamless integration with the Microsoft BI tool ecosystem. You can explore data in HDFS through the <a href="http://office.microsoft.com/en-us/excel/download-data-explorer-for-excel-FX104018616.aspx">Data Explorer  in Excel</a>. You can query and analyze Hive data in Excel by using the <a href="http://hortonworks.com/thankyou-hdp12-hive-odbc-driver/?mdl=13577&amp;ao=3&amp;lnk=1">ODBC driver</a> to connect to Hive Server 2. You can import/export data from and to SQL Server through Apache Sqoop.</p>
<p>These integrations enable HDP to become an integral part of your Enterprise Data Architecture, and allow you to utilize the same tools that you are familiar with to interact with HDP.</p>
<p><b>Learn More. </b>Please take a look at the <a href="http://docs.hortonworks.com/HDPDocuments/HDP1/HDP-Win-1.1.0/index.html">Hortonworks Documentation</a> to learn more about installing and using HDP 1.1 for Windows.</p>
<div><b>Tell Us About It. </b>Please visit the <a href="http://hortonworks.com/community/forums/forum/hdp-for-windows/">HDP 1.1 for Windows Forum</a> to ask questions, get help, provide feedback and hear what others are doing with HDP.</div>
<p>The post <a href="http://hortonworks.com/blog/get-started-with-hadoop-on-hortonworks-data-platform-1-1-for-windows/">Get Started with Hadoop on Hortonworks Data Platform 1.1 for Windows</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://hortonworks.com/blog/get-started-with-hadoop-on-hortonworks-data-platform-1-1-for-windows/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Mobile Telco Dials In and Harnesses Big Data with Hadoop</title>
		<link>http://hortonworks.com/blog/mobile-telco-dials-in-harnesses-big-data-with-hadoop/</link>
		<comments>http://hortonworks.com/blog/mobile-telco-dials-in-harnesses-big-data-with-hadoop/#comments</comments>
		<pubDate>Thu, 23 May 2013 19:30:17 +0000</pubDate>
		<dc:creator>Lisa Sensmeier</dc:creator>
				<category><![CDATA[Apache Hadoop]]></category>
		<category><![CDATA[Hadoop Ecosystem]]></category>

		<guid isPermaLink="false">http://hortonworks.com/?p=26206</guid>
		<description><![CDATA[<p><p>Smartphones have transformed our daily lives. A key indicator of this trend is our increased spend on data plans versus voice. We are a new generation of people who are in a constant state of activity, communication, and community building wherever we go ─ including the couch in front of the television where we can multi-screen and multi-task!</p>
<p>What does this mean for the Mobile Telecom industry?  For one of the top five mobile phone service providers in the world, responsible for developing and managing advanced data services for European countries with data services including mobile internet access for various devices, mobile email, instant messaging, news, weather updates and traffic reports ─ it means as mobile data services grow in revenue, so does the need to monitor that contribution easily and accurately. While that sounds obvious, the mobile telecom growth rate has expanded so rapidly, the company’s existing systems could not keep up.&#8230;</p></p><p>The post <a href="http://hortonworks.com/blog/mobile-telco-dials-in-harnesses-big-data-with-hadoop/">Mobile Telco Dials In and Harnesses Big Data with Hadoop</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img class="size-full wp-image-26235 alignright" alt="actuate" src="http://hortonworks.com/wp-content/uploads/2013/05/actuate.jpg" width="266" height="86" />Smartphones have transformed our daily lives. A key indicator of this trend is our increased spend on data plans versus voice. We are a new generation of people who are in a constant state of activity, communication, and community building wherever we go ─ including the couch in front of the television where we can multi-screen and multi-task!</p>
<p>What does this mean for the Mobile Telecom industry?  For one of the top five mobile phone service providers in the world, responsible for developing and managing advanced data services for European countries with data services including mobile internet access for various devices, mobile email, instant messaging, news, weather updates and traffic reports ─ it means as mobile data services grow in revenue, so does the need to monitor that contribution easily and accurately. While that sounds obvious, the mobile telecom growth rate has expanded so rapidly, the company’s existing systems could not keep up. And once the business leaders had the data – they wouldn’t trust it. Making accurate business decisions at the right time would be essential for their success and growth.</p>
<h3>Big Data Challenge</h3>
<p>The customer – a Mobile Telecom giant – had an existing method for determining business performance was ad hoc and decentralized. There was no single system to extract the information in a reliable and consistent manner. “<i>We had a mix of systems and information which needed lots of cross-checking – if indeed this was even possible. Getting access to data took a long time and, even then, the business users in marketing had no real confidence in the information they were getting.</i>” This in turn compromised their ability to develop and manage these services.</p>
<p>In order to gain market share and stay competitive, the customer had to be able to:</p>
<ul>
<li>Leverage the data from mobile usage to get accurate information about real customer activity to provide improved levels of customer satisfaction.</li>
<li>Spot upcoming trends in mobile use to drive intelligent marketing.</li>
<li>Improve the information on customer usage, which drives the changes needed to their service offerings, such as the ability to offer the latest mobile phone technologies.</li>
<li>Handle large volumes of data, be easily configurable by in-house business users, and provide graphical representations of the results</li>
</ul>
<h3>Hadoop Solution</h3>
<p>The strategy included harnessing Hadoop to handle the large volumes of data – 36 terabytes- that had to be consolidated into a single environment. Our Mobile Telecom customer decided to use <a href="http://www.actuate.com">Actuate</a> – a <a href="http://hortonworks.com/hw-partners/partner-cat/business-tools/">Hortonworks partner</a> in open source based Business Intelligence and Reporting Tools (BIRT) technology that connects analytics capabilities directly to Hadoop. Actuate’s ability to report directly against the Hadoop big data source, meanwhile, allows business users to generate on-demand analytics and reports consisting of thousands of pages in a matter of seconds through an easy-to-use web portal, with negligible training.</p>
<p>The Mobile Telecom giant now has a single source of clean data they can stand behind with absolute confidence in making the right decisions to stay competitive, and keep customer satisfaction levels high.  In addition, the consumer data services division is now in a position where it can replace several of its older systems, dropping extra licenses and hardware, because of the ability to do all of its business analytics in one place.</p>
<p>A Business Intelligence Analyst at the company stated; “<i>It’s all automatic. Before, business users would be sending emails and calls to chase the data. Anyone across the whole business can have access to the information they need, and find it on their own. I particularly like the ability to drill down into the figures. You can now see at a glance what’s happening right across our activities.</i>”</p>
<p>Customers’ want accurate and fast analytics reporting without a lot of training so a partnership between Hortonworks and Actuate, just makes big data sense.</p>
<p><b>Thank you to our partner <a href="http://www.actuate.com" target="_blank">Actuate</a> for this Hadoop use case. <a href="http://hortonworks.com/partners/certified-technology-program/">Find more partners here</a>.</b></p>
<p><b> </b><b>Actuate founded and co-leads the BIRT (Business Intelligence and Reporting Tools) open source project with the Eclipse Foundation, the home of the open source Eclipse Development Framework, the leading IDE worldwide. The BIRT project’s goal was to bring the web design metaphor to creating visualizations of data. </b></p>
<p>The post <a href="http://hortonworks.com/blog/mobile-telco-dials-in-harnesses-big-data-with-hadoop/">Mobile Telco Dials In and Harnesses Big Data with Hadoop</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://hortonworks.com/blog/mobile-telco-dials-in-harnesses-big-data-with-hadoop/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Boosting Big Data and the Hadoop Ecosystem with Splunk Alliance</title>
		<link>http://hortonworks.com/blog/boosting-big-data-and-the-hadoop-ecosystem-with-splunk-alliance/</link>
		<comments>http://hortonworks.com/blog/boosting-big-data-and-the-hadoop-ecosystem-with-splunk-alliance/#comments</comments>
		<pubDate>Thu, 23 May 2013 16:08:15 +0000</pubDate>
		<dc:creator>John Kreisa</dc:creator>
				<category><![CDATA[Apache Hadoop]]></category>
		<category><![CDATA[Hadoop Ecosystem]]></category>

		<guid isPermaLink="false">http://hortonworks.com/?p=26168</guid>
		<description><![CDATA[<p><p>Today we announced a strategic alliance with operational intelligence leader <a href="http://www.splunk.com/" target="_blank">Splunk</a>. We are excited to be strengthening our relationship with Splunk and <a href="http://hortonworks.com/partners/certified-technology-program/">expanding the Apache Hadoop ecosystem </a>and we expect this to further drive open source innovation. Additionally this alliance is further proof of Hadoop’s maturation as a key component of the <a href="http://hortonworks.com/hadoop-modern-data-architecture/">next generation enterprise architecture</a>.</p>
<p>One of the key benefits of the partnership is that it enables organizations to easily take advantage of the massive scale out storage and processing capabilities of Apache Hadoop with Splunk Enterprise via Splunk Hadoop Connect, which easily and reliably moves data between Splunk Enterprise and Hadoop.</p>
<p>This capability means the enterprise can easily use Splunk Enterprise to collect machine data from across the enterprise and deliver it to Hadoop for batch analytics. Likewise, the output of Hadoop jobs can be imported into Splunk Enterprise for rapid analysis and visualization.&#8230;</p></p><p>The post <a href="http://hortonworks.com/blog/boosting-big-data-and-the-hadoop-ecosystem-with-splunk-alliance/">Boosting Big Data and the Hadoop Ecosystem with Splunk Alliance</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img class=" wp-image-11188 alignright" alt="SplunkLogo" src="http://hortonworks.com/wp-content/uploads/2012/10/SplunkLogo.jpg" width="331" height="133" />Today we announced a strategic alliance with operational intelligence leader <a href="http://www.splunk.com/" target="_blank">Splunk</a>. We are excited to be strengthening our relationship with Splunk and <a href="http://hortonworks.com/partners/certified-technology-program/">expanding the Apache Hadoop ecosystem </a>and we expect this to further drive open source innovation. Additionally this alliance is further proof of Hadoop’s maturation as a key component of the <a href="http://hortonworks.com/hadoop-modern-data-architecture/">next generation enterprise architecture</a>.</p>
<p>One of the key benefits of the partnership is that it enables organizations to easily take advantage of the massive scale out storage and processing capabilities of Apache Hadoop with Splunk Enterprise via Splunk Hadoop Connect, which easily and reliably moves data between Splunk Enterprise and Hadoop.</p>
<p>This capability means the enterprise can easily use Splunk Enterprise to collect machine data from across the enterprise and deliver it to Hadoop for batch analytics. Likewise, the output of Hadoop jobs can be imported into Splunk Enterprise for rapid analysis and visualization.</p>
<p><a href="http://www.splunk.com/" target="_blank">Visit the Splunk website </a>to learn more about Splunk Enterprise and Splunk Hadoop Connect.</p>
<p>Find out more about how Hadoop and the <a href="http://hortonworks.com/products/hortonworksdataplatform/">Hortonworks Data Platform</a> enables <a href="http://hortonworks.com/hadoop-modern-data-architecture/">next-generation data architecture</a>.</p>
<p>The post <a href="http://hortonworks.com/blog/boosting-big-data-and-the-hadoop-ecosystem-with-splunk-alliance/">Boosting Big Data and the Hadoop Ecosystem with Splunk Alliance</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://hortonworks.com/blog/boosting-big-data-and-the-hadoop-ecosystem-with-splunk-alliance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hadoop, Hadoop, Hurrah! HDP for Windows is Now GA!</title>
		<link>http://hortonworks.com/blog/hadoop-hadoop-hurrah-hdp-for-windows-is-now-ga/</link>
		<comments>http://hortonworks.com/blog/hadoop-hadoop-hurrah-hdp-for-windows-is-now-ga/#comments</comments>
		<pubDate>Tue, 21 May 2013 16:28:02 +0000</pubDate>
		<dc:creator>John Kreisa</dc:creator>
				<category><![CDATA[Apache Hadoop]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[Windows]]></category>

		<guid isPermaLink="false">http://hortonworks.com/?p=26076</guid>
		<description><![CDATA[<p><p>Today we are very excited to announce that <a href="http://hortonworks.com/hadoop-training/hadoop-on-windows-for-developers/">Hortonworks Data Platform for Windows (HDP for Windows)</a> is now generally available and ready to support the most demanding production workloads.</p>
<p>We have been blown away with the number and size of organizations who have downloaded the beta bits of this 100% open source, and native to Windows distribution of Hadoop and engaged Hortonworks and Microsoft around evolving their data architecture to respond to the challenges of enterprise big data.</p>
<p>With this key milestone HDP for Windows offers the millions of customers running their business on Microsoft technologies an ecosystem-friendly Hadoop-based solution that is built for the enterprise and purpose built for Windows. This release cements Apache Hadoop’s role as a key component of the next generation enterprise data architecture, across the broadest set of datacenter configurations as HDP becomes the first production-ready Apache Hadoop distribution to run on both Windows and Linux.&#8230;</p></p><p>The post <a href="http://hortonworks.com/blog/hadoop-hadoop-hurrah-hdp-for-windows-is-now-ga/">Hadoop, Hadoop, Hurrah! HDP for Windows is Now GA!</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></description>
				<content:encoded><![CDATA[<p><img class="wp-image-26078 alignleft" alt="HDP for Windows" src="http://hortonworks.com/wp-content/uploads/2013/05/HDP-for-Windows.png" width="320" height="310" />Today we are very excited to announce that <a href="http://hortonworks.com/hadoop-training/hadoop-on-windows-for-developers/">Hortonworks Data Platform for Windows (HDP for Windows)</a> is now generally available and ready to support the most demanding production workloads.</p>
<p>We have been blown away with the number and size of organizations who have downloaded the beta bits of this 100% open source, and native to Windows distribution of Hadoop and engaged Hortonworks and Microsoft around evolving their data architecture to respond to the challenges of enterprise big data.</p>
<p>With this key milestone HDP for Windows offers the millions of customers running their business on Microsoft technologies an ecosystem-friendly Hadoop-based solution that is built for the enterprise and purpose built for Windows. This release cements Apache Hadoop’s role as a key component of the next generation enterprise data architecture, across the broadest set of datacenter configurations as HDP becomes the first production-ready Apache Hadoop distribution to run on both Windows and Linux.</p>
<p>Additionally, customers now also have complete portability of their Hadoop applications between on-premise and cloud deployments via HDP for Windows and Microsofts’s HDInsight Service.</p>
<h3>Enterprise Hadoop Momentum</h3>
<p><b></b>Since its beta availability, we’ve been working with customers across a wide range of industries including automotive, manufacturing, financial services, retail and government. Here are just a few examples of the tremendous opportunity those customers are seeing:</p>
<ul>
<li><b>Automotive</b> – a major automotive company wants to use HDP on Windows to create a centralized repository for all of the sensor data collected from their cars. The refinement and exploration of the data trends and patterns found through driving habits, maintenance and repair data and myriad other signals will be used to further improve the quality of their cars.</li>
<li><b>Healthcare</b> – a major healthcare applications provider is looking to build the next generation of healthcare apps that integrate patient health record data with clinical study and FDA data so that the customer experience is enriched and provides a higher level of health care services at a lower cost.</li>
<li><b>Financial services</b> – multiple major financial services organizations are looking to create centralized repositories across different divisions enabling them to explore and gain deeper insight into customer risk patterns.</li>
<li><b>Manufacturing</b> – a major manufacturer of electronics will create a centralized repository of machine generated data coming from the production lines and compare and analyze that data with part failure and return data enabling them to identify and predict problems in production and increasing the quality of their products.</li>
</ul>
<p>This is just a small sample of the emerging use cases for HDP on Windows. <a href="http://hortonworks.com/hadoop-modern-data-architecture/">You can explore how Hadoop fits into your data architecture here</a>.</p>
<h3>Availability &amp; Training</h3>
<p>Hortonworks Data Platform for Windows is now available for download at: <a href="http://hortonworks.com/download/">http://hortonworks.com/download/</a>.</p>
<p>We also have training specifically designed for HDP on Windows, you can get more information here: <a href="http://hortonworks.com/hadoop-training/hadoop-on-windows-for-developers/">http://hortonworks.com/hadoop-training/hadoop-on-windows-for-developers/</a></p>
<p>The post <a href="http://hortonworks.com/blog/hadoop-hadoop-hurrah-hdp-for-windows-is-now-ga/">Hadoop, Hadoop, Hurrah! HDP for Windows is Now GA!</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://hortonworks.com/blog/hadoop-hadoop-hurrah-hdp-for-windows-is-now-ga/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Getting the most out of business analytics</title>
		<link>http://hortonworks.com/blog/getting-the-most-out-of-business-analytics/</link>
		<comments>http://hortonworks.com/blog/getting-the-most-out-of-business-analytics/#comments</comments>
		<pubDate>Mon, 20 May 2013 18:54:12 +0000</pubDate>
		<dc:creator>Kim Rose</dc:creator>
				<category><![CDATA[Big Data Insights]]></category>
		<category><![CDATA[Business Analytics]]></category>

		<guid isPermaLink="false">http://hortonworks.com/blog/getting-the-most-out-of-business-analytics/</guid>
		<description><![CDATA[<p><p>One of the most prevalent uses of Hadoop architecture by enterprises is to create business intelligence and analytics tools that can be leveraged to identify areas&#160;that could be improved to foster greater efficiency and productivity. According to a study jointly conducted by Gartner and the&#160;Financial Executives Research Foundation, business intelligence and analytics were <a href="http://www.gartner.com/newsroom/id/2488616" target="_blank">top areas of focus</a> among surveyed CFOs. Overall, 15 of the top 19 processes that were identified as needing improvement by the study&#039;s participants&#160;could be addressed through the use of&#160;these resources. In addition, 59 percent of the survey&#039;s respondents cited the ability to facilitate operational decision making processes as an area that required more technological advancement. Furthermore, half of all participants stated that the capacity to effectively monitor business performance was an&#160;investment need as well.</p>
<p>The report indicated that enterprises could improve their business analytics deployment by facilitating communication between data scientists and the C-suite executives who make operational decisions.&#8230;</p></p><p>The post <a href="http://hortonworks.com/blog/getting-the-most-out-of-business-analytics/">Getting the most out of business analytics</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>One of the most prevalent uses of Hadoop architecture by enterprises is to create business intelligence and analytics tools that can be leveraged to identify areas&nbsp;that could be improved to foster greater efficiency and productivity. According to a study jointly conducted by Gartner and the&nbsp;Financial Executives Research Foundation, business intelligence and analytics were <a href="http://www.gartner.com/newsroom/id/2488616" target="_blank">top areas of focus</a> among surveyed CFOs. Overall, 15 of the top 19 processes that were identified as needing improvement by the study&#039;s participants&nbsp;could be addressed through the use of&nbsp;these resources. In addition, 59 percent of the survey&#039;s respondents cited the ability to facilitate operational decision making processes as an area that required more technological advancement. Furthermore, half of all participants stated that the capacity to effectively monitor business performance was an&nbsp;investment need as well.</p>
<p>The report indicated that enterprises could improve their business analytics deployment by facilitating communication between data scientists and the C-suite executives who make operational decisions. Specifically, executives should be&nbsp;aware of how these tools work and how to best utilize them to maximize their effectiveness.</p>
<p>An IT executive recently presented several steps companies can take to avoid common analytics pitfalls and <a href="http://www.informationweek.com/global-cio/interviews/5-bi-requests-it-groups-must-challenge/240154743" target="_blank">optimize their business intelligence initiatives</a>, including:</p>
<ul>
<li>Enterprises should broaden the focus of a business analytics program to the entire enterprise to find new connections and relationships. This includes expanding data collection efforts and taking a holistic view of analytics projects.</li>
<li>Companies should look inward to find their data analytics leader. Many executives may be tempted to find a high profile hire who will jump-start operations, but an existing employee will already be familiar enough with the company&#039;s needs and culture to foster a successful data-driven culture.</li>
</ul>
<p>Enterprises can attain significant benefits from their Hadoop analytics and business intelligence programs. However, getting the most out of these processes requires a broad vision, internal communication and a strong business culture dedicated to the pursuit of data analytics.</p>
<p>The post <a href="http://hortonworks.com/blog/getting-the-most-out-of-business-analytics/">Getting the most out of business analytics</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://hortonworks.com/blog/getting-the-most-out-of-business-analytics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hadoop big data gets personal</title>
		<link>http://hortonworks.com/blog/hadoop-big-data-gets-personal/</link>
		<comments>http://hortonworks.com/blog/hadoop-big-data-gets-personal/#comments</comments>
		<pubDate>Mon, 20 May 2013 18:53:23 +0000</pubDate>
		<dc:creator>Kim Rose</dc:creator>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Big Data Insights]]></category>
		<category><![CDATA[Hadoop]]></category>

		<guid isPermaLink="false">http://hortonworks.com/blog/hadoop-big-data-gets-personal/</guid>
		<description><![CDATA[<p><p>One of the reasons Hadoop big data analysis is particularly valuable&#160;is that much of it is fundamentally about people &#8211; what they do, what they buy, what they think about or what they want, etc. Analysis gleaned from big data offers a clearer picture&#160;of human conditions&#160;and enables organizations to anticipate needs and respond to wants. Many big data projects offer a more objective view of the way humanity functions and can trigger insight into how to make improvements.</p>
<p>The newest trend in big data insight is analysis directly targeted at people, reported Bloomberg Businessweek.&#160;In particular, &#039;People Analytics&#039; is an approach through which <a href="http://www.businessweek.com/articles/2013-05-16/the-next-big-thing-in-big-data-people-analytics">organizations turn the focus inward</a>&#160;and use big data comprised of external research and intercompany observations to make operational improvements and interact better with employees.</p>
<p>&#34;Because most communication and collaboration happens face to face, the data are critical for people analytics to take that next leap forward and become a transformative organizational tool,&#34; wrote Businessweek contributor Ben&#160;Waber.&#8230;</p></p><p>The post <a href="http://hortonworks.com/blog/hadoop-big-data-gets-personal/">Hadoop big data gets personal</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>One of the reasons Hadoop big data analysis is particularly valuable&nbsp;is that much of it is fundamentally about people &#8211; what they do, what they buy, what they think about or what they want, etc. Analysis gleaned from big data offers a clearer picture&nbsp;of human conditions&nbsp;and enables organizations to anticipate needs and respond to wants. Many big data projects offer a more objective view of the way humanity functions and can trigger insight into how to make improvements.</p>
<p>The newest trend in big data insight is analysis directly targeted at people, reported Bloomberg Businessweek.&nbsp;In particular, &#039;People Analytics&#039; is an approach through which <a href="http://www.businessweek.com/articles/2013-05-16/the-next-big-thing-in-big-data-people-analytics">organizations turn the focus inward</a>&nbsp;and use big data comprised of external research and intercompany observations to make operational improvements and interact better with employees.</p>
<p>&quot;Because most communication and collaboration happens face to face, the data are critical for people analytics to take that next leap forward and become a transformative organizational tool,&quot; wrote Businessweek contributor Ben&nbsp;Waber. &quot;By combining precise data from both real and virtual worlds, we can understand behavior at a previously unimaginable scale.&quot;</p>
<p>According to Businessweek, examples of company solutions that resulted from such a combination of data and observation&nbsp;include changing workday structure to improve morale and reduce attrition, as well as&nbsp;reducing the number of coffee stations to facilitate impromptu interactions among personnel.</p>
<p><strong>Hyperpersonal&nbsp;analytics focus&nbsp;Hadoop big data inward</strong><br />
As&nbsp;Hadoop data becomes increasingly critical for facilitating measured objectivity to personal pursuits , more will start to apply it to their own activities. Harvard Business Review correspondent H. James Wilson wrote about &#039;auto-analytics,&#039; a semantic category for the process of <a href="http://blogs.hbr.org/cs/2013/05/six_numbers_reveal_the_booming.html">applying big data insights</a> and visualization to daily actions.For example, according to a study by&nbsp;the Pew Researcher Center, <a href="http://www.pewinternet.org/~/media/Files/Reports/2013/PIP_TrackingforHealth%20with%20appendix.pdf">69 percent of people</a> use a self-tracking mechanism when they exercise. Twenty-one percent of these people use smartphone apps or other data-based functions to make self-tracking more objective and precise.</p>
<p>Wilson wrote that people are seizing upon data-based technological resources to make their own lives and practices more efficient and informative. Technology consultants ABI Research <a href="http://www.abiresearch.com/press/wearable-computing-devices-like-apples-iwatch-will">projected that by 2018</a>, more than 485 million wearable devices, from smart watches to various articles of smart clothing, will have been shipped worldwide. The trend toward data-based self-analysis parallels one of the important advances in Apache Hadoop, which offers its users a highly customizable, personalized approach to big data filtration and analysis.</p>
<p>The post <a href="http://hortonworks.com/blog/hadoop-big-data-gets-personal/">Hadoop big data gets personal</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://hortonworks.com/blog/hadoop-big-data-gets-personal/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hive 0.11, Stinger and SQL-Compatibility</title>
		<link>http://hortonworks.com/blog/hive-0-11-stinger-and-sql-compatibility/</link>
		<comments>http://hortonworks.com/blog/hive-0-11-stinger-and-sql-compatibility/#comments</comments>
		<pubDate>Mon, 20 May 2013 16:15:44 +0000</pubDate>
		<dc:creator>Alan Gates</dc:creator>
				<category><![CDATA[Hive]]></category>

		<guid isPermaLink="false">http://hortonworks.com/?p=25997</guid>
		<description><![CDATA[<p><p>The release of Hive 0.11 is exciting and represents a big step forward to delivery of <a href="http://hortonworks.com/stinger">Project Stinger </a> and SQL-IN-Hadoop.  There is still some work to be done however.  We look forward to delivery of Hadoop 2 with YARN and the Apache Tez project as being huge increases to Hive performance, but this is not the only goal of Stinger.</p>
SQL-In-Hadoop simply can’t be SQL without SQL compatibility
<p>Today, HiveQL provides a fairly good set of SQL data types and semantics and while this (or a subset thereof) may be good enough for some of the “on” Hadoop solutions, we feel there needs to be more, especially if Hadoop and Hive are to meet the stringent requirements of enterprise class business analytics. To this end, we have set a goal of compatibility with most of <a href="http://en.wikipedia.org/wiki/SQL-92" target="_blank">SQL-92</a> and beyond with some SQL-2003 extensions.&#8230;</p></p><p>The post <a href="http://hortonworks.com/blog/hive-0-11-stinger-and-sql-compatibility/">Hive 0.11, Stinger and SQL-Compatibility</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>The release of Hive 0.11 is exciting and represents a big step forward to delivery of <a href="http://hortonworks.com/stinger">Project Stinger </a> and SQL-IN-Hadoop.  There is still some work to be done however.  We look forward to delivery of Hadoop 2 with YARN and the Apache Tez project as being huge increases to Hive performance, but this is not the only goal of Stinger.</p>
<h3>SQL-In-Hadoop simply can’t be SQL without SQL compatibility</h3>
<p><b></b>Today, HiveQL provides a fairly good set of SQL data types and semantics and while this (or a subset thereof) may be good enough for some of the “on” Hadoop solutions, we feel there needs to be more, especially if Hadoop and Hive are to meet the stringent requirements of enterprise class business analytics. To this end, we have set a goal of compatibility with most of <a href="http://en.wikipedia.org/wiki/SQL-92" target="_blank">SQL-92</a> and beyond with some SQL-2003 extensions.</p>
<p><a href="http://hortonworks.com/blog/apache-hive-0-11-stinger-phase-1-delivered/">The release of Apache Hive 0.11</a> pushes us further towards SQL-compatibility with the decimal data type becoming more usable (<a href="https://issues.apache.org/jira/browse/HIVE-4271" target="_blank">JIRA HIVE-4271</a>) and the addition of analytic functions for windowing and aggregates.  It also vastly improves joins and all the while improves performance.  Awesome.</p>
<h3>What else?</h3>
<p>There is a lot more work to be done however and well work with the community to get it done.  Hive 0.11 had contributions from over 50 community members to close over 380 Jira tickets.  That is astounding and a huge proof point of the open community and its unrivaled capability to innovate faster than any proprietary solution.</p>
<p>We will reach our goal soon.  Here is what’s left to be done:</p>
<p style="text-align: center;"><img class=" wp-image-26013 aligncenter" alt="sqlcompat" src="http://hortonworks.com/wp-content/uploads/2013/05/sqlcompat2.png" width="518" height="325" /></p>
<p>We look forward to providing updates to Hive all summer long!</p>
<p>The post <a href="http://hortonworks.com/blog/hive-0-11-stinger-and-sql-compatibility/">Hive 0.11, Stinger and SQL-Compatibility</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://hortonworks.com/blog/hive-0-11-stinger-and-sql-compatibility/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Week in Review: SQL IN Hadoop and Hive, Beyond Batch with YARN, NFS access to HDFS and HBase MTTR</title>
		<link>http://hortonworks.com/blog/week-in-review-sql-in-hadoop-and-hive-beyond-batch-with-yarn-nfs-access-to-hdfs-and-hbase-mttr/</link>
		<comments>http://hortonworks.com/blog/week-in-review-sql-in-hadoop-and-hive-beyond-batch-with-yarn-nfs-access-to-hdfs-and-hbase-mttr/#comments</comments>
		<pubDate>Fri, 17 May 2013 23:22:35 +0000</pubDate>
		<dc:creator>Marc Holmes</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://hortonworks.com/?p=25908</guid>
		<description><![CDATA[<p><p>Or as it&#8217;s more commonly being called: Week-ish in Review. Let&#8217;s recap on the latest &#8211; there&#8217;s some juicy technology goodness here.</p>
<p>Delivering on Stinger: Phase 1. <a href="http://hortonworks.com/blog/apache-hive-0-11-stinger-phase-1-delivered/">Just this week, Hive 0.11 has been released</a>. Owen (<a href="http://twitter.com/owen_omalley">@owen_omalley</a>) brought us the news that 55 &#8211; yes, fifty-five &#8211; developers from across the community have <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12323587&#38;styleName=Html&#38;projectId=12310843">addressed 386 JIRA tickets</a> and have delivered significant improvements to Hive along with an awesome demonstration of the power of community open-source development. Thanks to everyone! This release of Hive means that we&#8217;ve delivered on the first phase of <a href="http://hortonworks.com/stinger">the Stinger Initiative</a> too &#8211; aiming to deliver 100x performance increases to Hive.</p>
<p>Taking Hadoop Beyond Batch with YARN. All of which means we step closer to <a href="http://hortonworks.com/stinger">delivering SQL-in-Hadoop</a> and respond to the needs of enterprises for multi-application operating systems for their big data. Arun (<a href="http://twitter.com/acmurthy">@arunmurthy</a>) gives a <a href="http://hortonworks.com/blog/moving-hadoop-beyond-batch-with-apache-yarn/">terrific update on Hadoop 2.0 and YARN</a> and how that development will move Hadoop Beyond Batch.&#8230;</p></p><p>The post <a href="http://hortonworks.com/blog/week-in-review-sql-in-hadoop-and-hive-beyond-batch-with-yarn-nfs-access-to-hdfs-and-hbase-mttr/">Week in Review: SQL IN Hadoop and Hive, Beyond Batch with YARN, NFS access to HDFS and HBase MTTR</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Or as it&#8217;s more commonly being called: Week-ish in Review. Let&#8217;s recap on the latest &#8211; there&#8217;s some juicy technology goodness here.</p>
<p><b>Delivering on Stinger: Phase 1</b>. <a href="http://hortonworks.com/blog/apache-hive-0-11-stinger-phase-1-delivered/">Just this week, Hive 0.11 has been released</a>. Owen (<a href="http://twitter.com/owen_omalley">@owen_omalley</a>) brought us the news that 55 &#8211; yes, fifty-five &#8211; developers from across the community have <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12323587&amp;styleName=Html&amp;projectId=12310843">addressed 386 JIRA tickets</a> and have delivered significant improvements to Hive along with an awesome demonstration of the power of community open-source development. Thanks to everyone! This release of Hive means that we&#8217;ve delivered on the first phase of <a href="http://hortonworks.com/stinger">the Stinger Initiative</a> too &#8211; aiming to deliver 100x performance increases to Hive.</p>
<p><b>Taking Hadoop Beyond Batch with YARN</b>. All of which means we step closer to <a href="http://hortonworks.com/stinger">delivering SQL-in-Hadoop</a> and respond to the needs of enterprises for multi-application operating systems for their big data. Arun (<a href="http://twitter.com/acmurthy">@arunmurthy</a>) gives a <a href="http://hortonworks.com/blog/moving-hadoop-beyond-batch-with-apache-yarn/">terrific update on Hadoop 2.0 and YARN</a> and how that development will move Hadoop Beyond Batch. Stay tuned!</p>
<p><b>Delivering Enterprise Hadoop through MTTR for HBase and NFS access to HDFS</b>. Meanwhile, Nicolas Liochon (<a href="http://twitter.com/nkeywal">@nkeywal</a>) and Devaraj Das (<a href="http://twitter.com/ddraj">@ddraj</a>) provide an introduction on how HBase availability is being improved through work on <a href="http://hortonworks.com/blog/introduction-to-hbase-mean-time-to-recover-mttr/">Mean Time To Recover (MTTR) capabilities</a>. And then Brandon Li (<a href="http://twitter.com/brandonli11">@brandonli11</a>) and Suresh Srinivas (<a href="http://twitter.com/suresh_m_s">@suresh_m_s</a>) updated us on progress to <a href="http://hortonworks.com/blog/simplifying-data-management-nfs-access-to-hdfs">simplify data management through NFS access to HDFS</a>. All critical stuff for the enterprise, and all driven through the community.</p>
<p><b>Microsoft love for .NET Hadoop fans.</b> If you&#8217;re a .NET developer and have been missing out on a little Hadoop fun, then <a href="http://hortonworks.com/blog/hadoop-sdk-and-tutorials-for-microsoft-net-developers/">Microsoft has started pushing out SDKs and tutorials</a> for its Hadoop-in-the-Cloud service &#8211; HDInsight &#8211; so you can fire up Visual Studio and get rocking on that big data.</p>
<p><b>Hadoop Summit Meetups. </b>We only announced them this week, and they&#8217;re nearly full already. <a href="http://hortonworks.com/blog/meetups-at-hadoop-summit/">Still time to try and squeeze into one of the Meetups</a>: Hive, Pig, HBase, YARN, Accumulo, Ambari, Oozie, Data Science and Architecture or maybe attend Big Data Camp or Machine Learning Evening on 25th June as part of Hadoop Summit.</p>
<p><a href="http://hortonworks.com/sandbox">Now it&#8217;s time to go play</a>. Have a great weekend.</p>
<p>The post <a href="http://hortonworks.com/blog/week-in-review-sql-in-hadoop-and-hive-beyond-batch-with-yarn-nfs-access-to-hdfs-and-hbase-mttr/">Week in Review: SQL IN Hadoop and Hive, Beyond Batch with YARN, NFS access to HDFS and HBase MTTR</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://hortonworks.com/blog/week-in-review-sql-in-hadoop-and-hive-beyond-batch-with-yarn-nfs-access-to-hdfs-and-hbase-mttr/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The importance of data accuracy for Hadoop banking analytics</title>
		<link>http://hortonworks.com/blog/the-importance-of-data-accuracy-for-hadoop-banking-analytics/</link>
		<comments>http://hortonworks.com/blog/the-importance-of-data-accuracy-for-hadoop-banking-analytics/#comments</comments>
		<pubDate>Fri, 17 May 2013 18:24:15 +0000</pubDate>
		<dc:creator>Kim Rose</dc:creator>
				<category><![CDATA[Big Data Insights]]></category>
		<category><![CDATA[Business Analytics]]></category>

		<guid isPermaLink="false">http://hortonworks.com/blog/the-importance-of-data-accuracy-for-hadoop-banking-analytics/</guid>
		<description><![CDATA[<p><p>Business analytics solutions, such as those built upon Hadoop architecture,&#160;can be a major resource for members of the financial industry. With these tools at their disposal, banks leadership could gain major insights into their operations and market places as well as improving their efforts to effectively engage potential clients.</p>
<p>However, these organizations need access to accurate customer data to gain the full benefits of business analytics solutions. According to a recent&#160;Experian QAS survey, many financial institutions have struggled to <a href="http://www.cujournal.com/issues/17_14/all-those-new-channels-affecting-accuracy-of-data-1018612-1.html" target="_blank">ensure the accuracy of the data they gather</a>, Credit Union Journal reported. Ninety-one percent of the organizations that participated in the survey suspected that the information they collected was inaccurate in some fashion. While&#160;respondents reported that as much as 18 percent of their data could not be ensured for accuracy on average, 27 percent of the total number of participating enterprises could not say how much of their information was compromised.&#8230;</p></p><p>The post <a href="http://hortonworks.com/blog/the-importance-of-data-accuracy-for-hadoop-banking-analytics/">The importance of data accuracy for Hadoop banking analytics</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>Business analytics solutions, such as those built upon Hadoop architecture,&nbsp;can be a major resource for members of the financial industry. With these tools at their disposal, banks leadership could gain major insights into their operations and market places as well as improving their efforts to effectively engage potential clients.</p>
<p>However, these organizations need access to accurate customer data to gain the full benefits of business analytics solutions. According to a recent&nbsp;Experian QAS survey, many financial institutions have struggled to <a href="http://www.cujournal.com/issues/17_14/all-those-new-channels-affecting-accuracy-of-data-1018612-1.html" target="_blank">ensure the accuracy of the data they gather</a>, Credit Union Journal reported. Ninety-one percent of the organizations that participated in the survey suspected that the information they collected was inaccurate in some fashion. While&nbsp;respondents reported that as much as 18 percent of their data could not be ensured for accuracy on average, 27 percent of the total number of participating enterprises could not say how much of their information was compromised.</p>
<p>There are several steps that financial institutions can take to increase the accuracy of their data:</p>
<ul>
<li>Establish regular database maintenance tasks to manage files</li>
<li>Integrate automated verification tools to ensure that client and prospect data is up to date</li>
<li>Create a full data workflow to prioritize high-volume entry points</li>
</ul>
<p>Information services expert Thomas Schutz noted in Bank Systems &amp; Technology that banks and other financial institutions could improve the accuracy of their collected information by <a href="http://www.banktech.com/business-intelligence/improving-business-analytics-through-a-s/240154752" target="_blank">condensing the number of databases</a> they maintained. This will prevent duplicate entries&nbsp;from being entered into multiple systems. One of the problems with operating multiple databases is that updated information may not be spread to each system, leaving some with inaccurate consumer data. In addition, Schutz recommended that banks place more emphasis on training personnel to enter and access data in a streamlined and uniform process. This will eliminate entry errors and inconsistencies, maintaining the accuracy of gathered data across the enterprise and ensuring&nbsp;that it maximizes the effectiveness of their Hadoop initiatives.</p>
<p>The post <a href="http://hortonworks.com/blog/the-importance-of-data-accuracy-for-hadoop-banking-analytics/">The importance of data accuracy for Hadoop banking analytics</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://hortonworks.com/blog/the-importance-of-data-accuracy-for-hadoop-banking-analytics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Apache Hive 0.11: Stinger Phase 1 Delivered</title>
		<link>http://hortonworks.com/blog/apache-hive-0-11-stinger-phase-1-delivered/</link>
		<comments>http://hortonworks.com/blog/apache-hive-0-11-stinger-phase-1-delivered/#comments</comments>
		<pubDate>Fri, 17 May 2013 18:18:36 +0000</pubDate>
		<dc:creator>Owen O'Malley</dc:creator>
				<category><![CDATA[Apache Hadoop]]></category>
		<category><![CDATA[Hive]]></category>
		<category><![CDATA[Tez]]></category>

		<guid isPermaLink="false">http://hortonworks.com/?p=25860</guid>
		<description><![CDATA[<p><p>In February, we <a href="http://hortonworks.com/stinger">announced the Stinger Initiative</a>, which outlined an approach to bring interactive SQL-query into Hadoop.  Simply put, our choice was to double down on Hive to extend it so that it could address human-time use cases (i.e. queries in the 5-30 second range). So, with input and participation from the broader community we established a fairly audacious goal of 100X performance improvement and SQL compatibility.</p>
Introducing Apache Hive 0.11 – 386 JIRA tickets closed
<p>As representatives of this open, community led effort we are very proud to announce the first release of the new and improved Apache Hive, version 0.11.  This substantial release embodies the work of a wide group of people from Microsoft, Facebook , Yahoo, SAP and others.  <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12323587&#38;styleName=Html&#38;projectId=12310843">Together we have addressed 386 JIRA tickets</a>, of which there were 28 new features and 276 bug fixes.&#8230;</p></p><p>The post <a href="http://hortonworks.com/blog/apache-hive-0-11-stinger-phase-1-delivered/">Apache Hive 0.11: Stinger Phase 1 Delivered</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></description>
				<content:encoded><![CDATA[<p>In February, we <a href="http://hortonworks.com/stinger">announced the Stinger Initiative</a>, which outlined an approach to bring interactive SQL-query into Hadoop.  Simply put, our choice was to double down on Hive to extend it so that it could address human-time use cases (i.e. queries in the 5-30 second range). So, with input and participation from the broader community we established a fairly audacious goal of 100X performance improvement and SQL compatibility.</p>
<h3>Introducing Apache Hive 0.11 – 386 JIRA tickets closed</h3>
<p>As representatives of this open, community led effort we are <b>very proud</b> to announce the first release of the new and improved Apache Hive, version 0.11.  This substantial release embodies the work of a wide group of people from Microsoft, Facebook , Yahoo, SAP and others.  <a href="https://issues.apache.org/jira/secure/ReleaseNote.jspa?version=12323587&amp;styleName=Html&amp;projectId=12310843">Together we have addressed 386 JIRA tickets</a>, of which there were 28 new features and 276 bug fixes. There were <strong>FIFTY-FIVE developers</strong> involved in this and I would like to thank every one of them.  <a href="#contrib">See below for a full list.</a></p>
<h3>Delivering on the promise of Stinger Phase 1</h3>
<p>As promised we have delivered phase 1 of the <a href="http://hortonworks.com/stinger">Stinger Initiative</a> in late spring.  This release is another proof point that that the open community can innovate at a rate unequaled by any proprietary vendor.  As part of phase 1 we promised windowing, new data types, the optimized RC (ORC) file and base optimizations to the Hive Query engine and the community has delivered these key features.</p>
<p style="text-align: center;"><a href="http://hortonworks.com/stinger"><img class="wp-image-25879 aligncenter" alt="Untitled" src="http://hortonworks.com/wp-content/uploads/2013/05/Untitled-1024x639.png" width="614" height="383" /></a></p>
<h3>Key features in Hive 0.11</h3>
<ul>
<li><b>ORCFile.  It’s Optimized.<br />
</b>The ORC File (Optimized RC File) presents key new features that speed access of data Apache Hive as it adds meta information at the file and block data level so that queries can be more intelligent and use meta data to optimize access.  Further, with the ORC file, only the bytes from the required columns are read from HDFS which minimizes I/O and speeds the query chain.  These are major advances for improved performance in Hive.<b></b></li>
<li><b>Improved Data Types<br />
</b>As Apache Hive marches towards full SQL-compatibility, an update to the decimal data type was made more usable.</li>
<li><b>Analytic Functions<br />
</b>Hive 0.11 introduces windowing functions for RANK, LEAD/LAG, ROW_NUMBER, FIRST_VALUE, LAST_VALUE and more. It also introduces aggregate OVER functions with PARTITION BY and ORDER BY</li>
<li><b>Joins improved in Hive 0.11<br />
</b>Both the broadcast join and the SMB join were improved considerably in Hive 0.11.  Both joins work without user hints, so that the Hive optimizer now picks the correct join rather than depending on the user to do so. More broadcast joins are now packed into a single MapReduce job, making star join queries much more efficient.</li>
</ul>
<h3>Towards YARN and the Power of SQL-IN-Hadoop</h3>
<p>Hadoop 2.0 and explicitly YARN turns Hadoop from a single application system to a <a href="http://hortonworks.com/blog/moving-hadoop-beyond-batch-with-apache-yarn/">multi-application operating system</a>.  The next generation of Apache Hive, built on YARN, becomes part of the platform itself and can be managed by YARN to ensure that multiple use cases can be addressed beyond interactive query.  It is the delivery of a multi-application data system.  In this new world, Hive is a first class citizen along with a variety of workloads within a cluster and resources can be managed more discreetly.</p>
<p>Ultimately, this leads to further performance enhancements for Hive and with the inclusion of Tez, we will be able to demonstrate even more significant improvements as service startup times are removed a newly optimized execution chain within core Hadoop is delivered.  The near future is exciting!</p>
<h3>Apache Hive is empowering an ecosystem of SQL Based Applications</h3>
<p>This release represents significant enhancements to Hive that will improve direct SQL interaction with Hive and light up the hundreds of applications that already rely on Hive as the defacto SQL interface for Hadoop.  If you are one of the hundreds of software companies using Hive already, we hope you test out this new release and are happy with the results.  We look forward to supporting it in HDP 1.3 in the very near future.  <img src='http://hortonworks.com/wp-includes/images/smilies/icon_wink.gif' alt=';)' class='wp-smiley' /> </p>
<h3>Thank You to the Community</h3>
<p><a id="contrib"></a>Thanks to 55 developers who contributed time and effort on this release: Alan Gates, Amareshwari Sriramadasu, Andrew Chalfant, Arup Malakar, Ashish Singh, Ashish Vaidya, Ashutosh Chauhan, Bennie Schut, Bhushan Mandhani, Billie Rinaldi, Brock Noland, Carl Steinbach, Chen Chun, Chris Drome, Dilip Joseph, Edward Capriolo, Gang Tim Liu, Gopal V, Gunther Hagleitner, Harish Butani, Ivan Gorbachev, Jarek Jarcec Cecho, Jean Xu, Jingwei Lu, Johnny Zhang, Jonathan Chang, Kevin Wilfong, Lars Francke, Li Yang, Mark Grover, Mayank Garg, Mikhail Bautin, Namit Jain, Navis, Nick Collins, Owen O&#8217;Malley, Pamela Vagata, Prajakta Kalmegh, Prasad Mujumdar, Roshan Naik, Sam Tunnicliffe, Samuel Yuan, Sean Busbey, Shreepadma Venugopalan, Sushanth Sowmyan, Teddy Choi, Thejas M Nair, Thiruvel Thirumoolan, Travis Crawford, Vikram Dixit K, Vinod Kumar Vavilapalli, Wonho Kim, Xiao Jiang, Zhenxiao Luo</p>
<p>The post <a href="http://hortonworks.com/blog/apache-hive-0-11-stinger-phase-1-delivered/">Apache Hive 0.11: Stinger Phase 1 Delivered</a> appeared first on <a href="http://hortonworks.com">Hortonworks</a>.</p>]]></content:encoded>
			<wfw:commentRss>http://hortonworks.com/blog/apache-hive-0-11-stinger-phase-1-delivered/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
