May 14, 2013Distilling Hadoop Patterns of Use and How You Can Use Them to Get Started Today More Info ▼There certainly is no shortage of hype when it comes to the term “Big Data” as vendors and enterprises alike highlight the transformative effect of building actionable insight from the deluge of data that is now available to us all. But amongst the hype, practical guidance is often lacking: why is Apache Hadoop most often the technology underpinning “Big Data”? How does it fit into the current landscape of databases and data warehouses that are already in use? And are there typical usage patterns that can be used to distill some of the inherent complexity for us all to speak a common language? And if there are common patterns, what are some ways that I can apply them to my unique situation?
Agenda:
- Learn what types of data are being captured to build “Big Data” applications
- Discover where Hadoop most often fits into the data landscape for the typical enterprise
- Hear how common patterns of use can simplify your approach and help you to find a usage that makes sense for your business
- See how you other organizations have used the usage patterns to get started on their Big Data journey
| To view replays, Register or Login |
May 9, 2013Introduction to Hortonworks Data Platform More Info ▼Hortonworks Data platform is the first 100% open source data management software to package the essential Apache Hadoop projects into a comprehensive and integrated platform. Join us in this 30-minute webinar to gain a better understanding of the essential Apache Hadoop projects (Pig, Hive, Oozie, HBase), as well as other components, such as monitoring management and data integration. Register for the webcast today and find out how HDP makes Hadoop easier to install, integrate, manage and use for enterprises and solution providers.
In this webinar we will outline, discuss and demo the key features of the Hortonworks Data Platform, including:
- Rapid Installation: Thanks to a wizard that makes it easy to install and provision Hadoop across clusters of machines.
- Data Integration Services: Including Talend Open Studio for Big Data, a visual development environment that allows you to connect to hundreds of data sources without writing code.
- Management and Monitoring Services: Including Hortonworks Management Center, which is an open source extensible tool that provides intuitive web-based dashboards for monitoring your clusters and creating alerts.
- Centralized Metadata Services: Including HCatalog, which greatly simplifies data sharing between Hadoop and other enterprise data systems.
- High Availability with Red Hat Enterprise Linux and the High Availability Add-On—improves reliability and stability while broadening the enterprise readiness of Apache Hadoop. High availability support enables organizations already using Red Hat Enterprise Linux to take Hadoop from proof of concept to production and allows them to satisfy SLA reliability requirements.
- Flume Integration—enables expanded streaming data capture for analysis within the Hortonworks Data Platform. Organizations can now easily and reliably collect and analyze real-time data streams, such as high-volume web logs, in Apache Hadoop, driving additional insights from data that was previously too bulky to capture and process.
- Improved Performance—faster read and write in HDFS speeds data capture and delivery within the platform. Improved MapReduce execution performance means that jobs process data more quickly.
- Open Management Console for Monitoring API Enhancements—delivers easier and deeper integration into third-party management tools and systems to gain insight into performance and assure availability, provision nodes in a cluster and perform ongoing maintenance of the Hadoop platform.
| To view replays, Register or Login |
May 1, 2013Best Practices for Virtualizing HadoopMore Info ▼Join this webinar to discuss best practices for designing and building a solid, robust and flexible Hadoop platform on an enterprise virtual infrastructure. Attendees will learn the flexibility and operational advantages of Virtual Machines such as fast provisioning, cloning, high levels of standardization, hybrid storage, vMotioning, increased stabilization of the entire software stack, High Availability and Fault Tolerance. This is a can`t miss presentation for anyone wanting to understand design, configuration and deployment of Hadoop in virtual infrastructures.
| To view replays, Register or Login |
Apr 30, 2013Hadoop & the Enterprise Data Warehouse: When to Use Which More Info ▼For this webinar, two of the most trusted experts in their fields to examine how big data technologies are being used today by practical big data practitioners.
Eric Baldeschwieler (aka E14, @eric14), CTO and Founder of Hortonworks, and Hadoop luminary will provide perspective on the role of Massively Parallel Processing (MPP) Relational Databases in the modern data platform architecture.
Stephen Brobst, CTO of Teradata, and leading expert in parallel computing architectures will share insights on how large enterprises can effectively complement their existing data warehouses with Hadoop to drive optimal value.
What you will learn in the webinar:
- How to determine when to use Hadoop, and when to use an MPP Relational Database
- Real world customer cases on big data applications from the Silicon Valley
- What to expect from Hadoop in the future, and what not to expect
| To view replays, Register or Login |
Apr 17, 20132013 Future of Open Source Survey Results RevealedMore Info ▼Don’t miss this live panel discussion on the industry’s hottest trends and the future of open source!
Date: Wednesday, April 17th
The annual Future of Open Source Survey provides a report on the state of the open source industry and analysis of future trends. Now in its seventh year, this annual survey was supported by over 25 open source software industry leaders and collaborating organizations, and compiles results from hundreds of respondents from the open source community.
For the first time ever, the Future of Open Source Survey sponsors, Black Duck Software and North Bridge Venture Partners, along with collaborator Forrester Research, will host a live panel discussion revealing this year’s survey results. Our expert panel will include:
- Tim Yeaton, CEO and President, Black Duck
- Michael Skok, General Partner at North Bridge Venture Partners
- Jeffrey Hammond, VP, Principal Analyst Serving Application Development & Delivery Professionals, Forrester Research
- Rob Bearden, CEO at Hortonworks
- Ed Tilford, Head of Open Source Governance at Thomson Reuters
- Tom Erickson, CEO at Acquia
Follow the 2013 Future of Open Source discussion on Twitter with the hashtag #FutureOSS.
| To view replays, Register or Login |
Apr 9, 2013Integrating Hadoop into Business Intelligence and Data WarehousingMore Info ▼Integrating Hadoop into Business Intelligence and Data Warehousing
A TDWI Webinar featuring Philip Russom, TDWI Research Director for Data Management
Broadcast Date: April 9, 2013 Noon ET / 9am PT
TDWI recognizes that Hadoop usage is a minority practice today, but assumes that mainstream usage of Hadoop within business intelligence (BI) and data warehousing (DW) applications will become common across many industries within a few years. This Webinar provides an overview of Hadoop products and best practices in the context of BI/DW applications, so that user organizations can prepare to integrate Hadoop into their BI/DW technology stacks and software portfolios successfully.
The content of the Webinar is based on the research findings of a new Best Practices Report by TDWI’s Philip Russomcalled “Integrating Hadoop into Business Intelligence and Data Warehousing.” That report was sponsored by vendor firms Cloudera, EMC Greenplum, Hortonworks, ParAccel, SAP, SAS, Tableau Software, and Teradata.
What You Will Learn:
- What Hadoop technologies are and can do for BI/DW
- Common types of analytic applications that Hadoop technologies enable
- Adjustments that Hadoop-based analytics with big data requires of practices in data integration, metadata management, query optimization, data warehouse architecture, etc.
| To view replays, Register or Login |
Mar 13, 2013Hadoop Reporting and Analysis: What Architecture is Best for Me? More Info ▼Hadoop is deployed for a variety of uses, including web analytics, fraud detection, security monitoring, healthcare, environmental analysis, social media monitoring, and other purposes.
Deriving meaningful insights from all this data can be a challenge, and the architectural approach you choose will make a difference in what you can and cannot achieve with reporting and analysis on your Hadoop data; there is no single correct approach. Rather, use case requirements influence selection from the several architectural choices in Hadoop reporting and analysis; each offers its own strengths and drawbacks.
Topics Covered:
- An overview of Hadoop architecture
- The Hortonworks Data Platform and Hadoop patterns of use
- The Jaspersoft BI Suite
- Jaspersoft’s capabilities and approaches for reporting, analytics, and dashboarding for various Hadoop use cases
| To view replays, Register or Login |
Mar 12, 2013Introduction to Hortonworks Data Platform for WindowsMore Info ▼According to IDC, Windows Servers run more than 50% of the servers in the Enterprise Data Center. Hortonworks has worked closely with Microsoft to port Apache Hadoop to Windows to enable organizations to take advantage of this emerging Big Data technology. Join us in this informative webinar to hear about the new Hortonworks Data Platform for Windows.
In less than an hour, you’ll learn:
- Key capabilities available in Hortonworks Data Platform for Windows
- How HDP for Windows integrates with Microsoft tools
- Key workloads and use cases for driving Hadoop today
| To view replays, Register or Login |
Mar 5, 2013Process and Visualize Your Data with Revolution R, Hadoop and GoogleVisMore Info ▼In this session, attendees will learn how to use R in the distributed environment of Hadoop using the rmr package. Additionally, the R package googleVis will be used to show how application development teams can incorporate the power of R and the power of Google Chart Tools into their applications quickly and easily. The result is a rich custom data visualization with far less coding than what would otherwise be required. The session will begin by discussing R basics and then moving to concrete examples of statistical analysis on data sets. This will be accompanied by an application development example showing custom visualization of the analysis using googleVis. The application development example will show a browser based app both kicking off the data set analysis using R as well as the visualization of the result. Visualization examples will use both googleVis as well as basic Google Chart Tools. Attendees will leave the session with a concrete example of how to incorporate R into their existing application development practices and how to use Hadoop and its ecosystem to build custom visualizations.
| To view replays, Register or Login |
Feb 13, 2013Bigger Data On Your Budget - Hortonworks and AppnovationMore Info ▼Hortonworks and Appnovation will help you get better understanding of what Big Data is, what all is involved for companies that are quickly accumulating exceedingly large amounts of complex data, what the options are to handle this information and most importantly, what this data can do for the company once translated into a usable format.
During this webinar we are going to cover several key concepts related to Big Data:
- What Big Data is
- Challenges associated with Big Data
- Options to overcome the challenges of Big Data
- Uses of Big Data
- How Hadoop enables Bigger Data for your budget
| To view replays, Register or Login |
Feb 12, 2013Break Through the Traditional Advertisement Services with Big Data and Apache HadoopMore Info ▼Entravision Communications Corporation (NYSE: EVC) is a diversified Spanish-language media company with a unique group of media assets including television stations, radio stations and digital platforms. In 2011, they made the strategic decision to build a data analytics, modeling and insights division to expand the value of its traditional advertisement services. Join us in this session with Franklin Rios, President of Luminar (an Entravision company), Oscar Padilla, VP of Strategy, Luminar, along with Impetus and Hortonworks as we discusses key implementations, results and lessons learned from their big data services operations.
| To view replays, Register or Login |
Feb 5, 2013Go From Zero to Big Data in 15 Minutes with The Hortonworks SandboxMore Info ▼Hortonworks recently unveiled the Hortonworks Sandbox, a free, comprehensive, easy-to-use, hands-on learning environment that provides the fastest onramp for anyone interested in learning, evaluating or using Apache Hadoop™ in an enterprise.
Join us in this interactive webinar as we discuss and demo features of the Hortonworks Sandbox, including:
- How to download and use the Sandbox tutorials.
- How to upload your own datasets to test and validate the use of Apache Hadoop.
- Demos of features and use cases for your very own Hortonworks Sandbox.
| To view replays, Register or Login |
Jan 31, 2013Hadoop Operations, Innovations and Enterprise Readiness with Hortonworks Data Platform v1.2 More Info ▼Hortonworks continues to innovate throughout all Hadoop related projects, packaging the most enterprise-ready components, such as Ambari, into the Hortonworks Data Platform (HDP). Please join us in this interactive webinar as we present real-world use cases of Enterprise customers that are finding success with HDP and their Big Data initiatives. We will also introduce new features from version 1.2 of the Hortonworks Data Platform and how it has become the leading 100% open source distribution choice for the Enterprise.
In this webinar we will outline how enterprise customers are successfult with HDP and also review some of the newest features in version1.2 including:
- How to provision a cluster
- How to manage and monitor a cluster using completely open source tools
- How to perform diagnostics to identify issues in a cluster
| To view replays, Register or Login |
Jan 29, 2013Big Data Analytics - Is Your Elephant Enterprise Ready?More Info ▼Hadoop’s cost effective scalability and flexibility to analyze all data types is driving organizations everywhere to embrace big data analytics. From proof of concept to deployment across the enterprise, join Datameer and Hortonworks as we answer the ‘now what?’ when rolling out your Hadoop big data analytics project. This webinar will address critical project components such as data security, data privacy, high availability, user training and use case development.
Leave this webinar with:
• A checklist for successful enterprise analytics deployments
• How best to take your project into production
• Use cases that optimize Hadoop technology
• Common pitfalls to avoid
| To view replays, Register or Login |
Jan 22, 2013Hortonworks State of the Union and Vision for Apache Hadoop in 2013More Info ▼In 2012, we released Hortonworks Data Platform powered by Apache Hadoop and established partnerships with major enterprise software vendors including Microsoft and Teradata that are making enterprise ready Hadoop easier and faster to consume. As we start 2013, we invite you to join us for this live webinar where Shaun Connolly, VP of Strategy at Hortonworks, will cover the highlights of 2012 and the road ahead in 2013 for Hortonworks and Apache Hadoop.
Join this webinar to learn:
- How our focus results in innovation within the Apache open source community while addressing enterprise requirements and ecosystem interoperability.
- About the latest releases in the Hortonworks product offering.
- About our roadmap and major areas of investment across core platform services, data services, and operational services for productive operations and management.
| To view replays, Register or Login |
Dec 18, 2012Big Data, Hadoop, Hortonworks and Microsoft HDInsightMore Info ▼Big Data is everywhere. And at the center of the big data discussion is Apache Hadoop, a next-generation enterprise data platform that allows you to capture, process and share the enormous amounts of new, multi-structured data that doesn’t fit into transitional systems.
With Microsoft HDInsight, powered by Hortonworks Data Platform, you can bridge this new world of unstructured content with the structured data we manage today. Together, we bring Hadoop to the masses as an addition to your current enterprise data architectures so that you can amass net new insight without net new headache.
Attend this webinar where we will:
- Distill the hype over reality of Hadoop
- Outline how Hadoop is being used today
- Demonstrate the simplicity of Hadoop with Microsoft HDInsight in action
| To view replays, Register or Login |
Dec 12, 2012 Do You Believe in Santa? How About Data Scientists?More Info ▼Join Teradata, Hortonworks and other Big Data experts in this LIVE panel discussion from the Big Analytics 2012, NY event as they debate the following and help you sort hype from reality:
- Is Data Scientist position really new? How so?
- Can your current team of analysts and in-house skill set leverage new technologies to fill this gap…without breaking your budget?
- What are some real-life examples where Data Scientist improved the business?
| To view replays, Register or Login |
Nov 27, 2012The Future of Apache Hadoop: High Availability and HadoopMore Info ▼Join Rohit Bakshi, Product Management, as he guides you through the current work on HA and Hadoop. Rohit will have a live demo of High Availability options on HDP 1.1 as well as answer any questions during this session.
| To view replays, Register or Login |
Oct 31, 2012The Future of Apache Hadoop_YARNMore Info ▼YARN: The Future of Data Processing with Apache Hadoop
Speaker: Arun C. Murthy, Hortonworks co-founder, VP of Apache Hadoop at Apache Software Foundation. The lead for the MapReduce project and YARN.
Apache Hadoop MapReduce has been overhauled to emerge as Apache Hadoop YARN, a generic distributed application framework to support MapReduce and other application paradigms. This change recasts Hadoop as a much more powerful data-processing system making it very different from itself 12 months ago. Attend this webinar to discuss the YARN design and architecture and how it improves Apache Hadoop to process data better.
| To view replays, Register or Login |
Oct 17, 2012Webinar Series: Future of Apache Hadoop_ZookeeperMore Info ▼Join us in this 4-part series with the core committers of the Apache Hadoop projects (Pig, Zookeeper and YARN) and Hadoop experts to gain insight into current advances in Apache Hadoop, obtain use-cases and best practices on how to get started with Hadoop and live Q&A with the people at the center of the Hadoop movement.
Webinar 3: Scaling Apache Zookeeper to the Next Generation Applications
Speaker: Mahadev Konar, Hortonworks co-founder and core contributor and PMC member of Apache Hadoop and ZooKeeper.
Apache ZooKeeper has become a de facto standard for distributed coordination. _Its design has proven to be flexible enough that it can be applied to a variety of needs of distributed applications. It has been used for leader election, service discovery, status monitoring, dynamic configuration etc. Recently new use cases have come up where ZooKeeper is being used as a discovery service with thousands of clients. Couple of examples include Hadoop Namenode HA and Yarn HA. This has led to a new set of requirements that need to be addressed. There is a need for session-less read-only client creation to address startup latency issues of thousands of clients . Also, such scale creates a need for reducing memory footprint of watch management in ZooKeeper. In this talk we will discuss the various new use cases that are coming up in Apache ZooKeeper and the work that is being done in the community to address these issues. We will also discuss the future roadmap for ZooKeeper.
| To view replays, Register or Login |
Sep 26, 2012Webinar Series: Future of Apache Hadoop_AmbariMore Info ▼Join us in this 4-part series with the core committers of the Apache Hadoop projects (Pig, Zookeeper and YARN) and Hadoop experts to gain insight into current advances in Apache Hadoop, obtain use-cases and best practices on how to get started with Hadoop and live Q&A with the people at the center of the Hadoop movement.
Webinar 2: Deployment and Management of Hadoop Clusters with Ambari
Speaker: Matt Foley, Committer and PMC Member of the Apache Hadoop Project and Member of the Technical Staff @ Hortonworks
Deploying, configuring, and managing large Hadoop and HBase clusters can be quite complex. Just upgrading one Hadoop component on a 2000-node cluster can take a lot of time and expertise, and there have been few tools specialized for Hadoop cluster administrators. AMBARI is an Apache incubator project to deliver Monitoring and Management functionality for Hadoop clusters. This paper presents the AMBARI tools for cluster management, specifically: Cluster pre-configuration and validation; Hadoop software deployment, installation, and smoketest; Hadoop configuration and re-config; and a basic set of management ops including start/stop service, add/remove node, etc. In providing these capabilities, AMBARI seeks to integrate with (rather than replace) existing open-source packaging and deployment technology available in most data centers, such as Puppet and Chef, Yum, Apt, and Zypper.
| To view replays, Register or Login |
Sep 12, 2012Webinar Series: The Future of Apache Hadoop_Pig Out to HadoopMore Info ▼Join us in this 4-part series with the core committers of the Apache Hadoop projects (Pig, Zookeeper and YARN) and Hadoop experts to gain insight into current advances in Apache Hadoop, obtain use-cases and best practices on how to get started with Hadoop and live Q&A with the people at the center of the Hadoop movement.
Webinar 1: Pig Out to Hadoop with Alan Gates
Pig has added some exciting new features in 0.10, including a boolean type, UDFs in JRuby, load and store functions for JSON, bloom filters, and performance improvements. Join Alan Gates, Hortonworks co-founder and long-time contributor to the Apache Pig and HCatalog projects, to discuss these new features, as well as talk about work the project is planning to do in the near future. In particular, we will cover how Pig can take advantage of changes in Hadoop 0.23.
| To view replays, Register or Login |
Sep 5, 2012Hadoop for Systems IntegratorsMore Info ▼Are you a Systems Integrator or consultant working on Hadoop implementations?
Working with Systems Integrators is a foundational aspect of Hortonworks business model. We see a unique and massive opportunity to leverage Hortonworks unequalled Hadoop expertise with our SI partners complementary technology and domain expertise, to enable high-value and repeatable Big Data solutions.
Join us for this 60-minute webinar and learn:
- How Hortonworks and the SI business models are 100% complementary
- How the Hortonworks Data Platform is making Hadoop easier to install, integrate, manage and use.
- What distinguishes the Hortonworks Data Platform from other Hadoop distribution
- Benefits of collaborating with Hortonworks to expedite the implementation of high-value big data solutions for your customers
| To view replays, Register or Login |
Aug 22, 2012Next Generation of Analytics with Hortonworks and Teradata AsterMore Info ▼Hortonworks and Teradata Aster have partnered to deliver advanced, powerful analytics of big data using Hadoop. Many have embrace this combined architecture that uses Hadoop and Teradata Aster analytics solutions as key ingredients to maximize value from ALL data.
In this webinar where we will leave you with key takeaways to accelerate your Big Analytics projects:
- Understand when to use Hadoop and Teradata Aster to improve analytics.
- See a demonstration of Hadoop and Teradata Aster in action.
- Hear customer stories of value.
Join us in this 60-minute webinar to gain strategic insight on how to incorporate Apache Hadoop and Teradata Aster into your Big Data Analytics strategy.
| To view replays, Register or Login |
Aug 8, 2012Talend Open Studio & Hortonworks Data PlatformMore Info ▼Data Integration is a key step in a Hadoop solution architecture. It is the first obstacle encountered once your cluster is up and running. OK, I have a cluster…now what? Complex scripts? For wide scale adoption of Apache Hadoop, an intuitive set of tools that abstract away the complexity of integration is necessary.
Enter Talend Open Studio for Big Data.
In this 60-minute webinar, Jim Walker, Director of Product Marketing at Hortonworks and Ciaran Dynes, Head of Product Marketing at Talend will discuss the different approaches organizations can take to avoid the complexity of uploading and extracting data from Hadoop.
In this session, we’ll cover:
- How to load a cluster in seconds.
- Create a pig script without writing a line of code.
| To view replays, Register or Login |
Jul 25, 2012Hortonworks and Microsoft Bring Apache Hadoop to WindowsMore Info ▼Microsoft and Hortonworks announced a strategic relationship earlier this year to accelerate and extend the delivery of Apache Hadoop-based distributions for Windows Server and Windows Azure.
Join us in this 60-minute webcast with Rohit Bakashi, Product Manager at Hortonworks and Mike Flasko, Sr. Program Manager at Microsoft to discuss the work that’s being done since the announcement.
In this session, we’ll cover:
- Hortonworks Data Platform and Microsoft’s Big Data solutions.
- A demo of HDP on both Windows Server and Windows Azure.
- Real-world use cases that leverage Microsoft Big Data solutions to unlock business insights from structured and unstructured data.
| To view replays, Register or Login |
Jun 26, 2012Introduction to Hortonworks Data Platform More Info ▼Hortonworks recently unveiled the Hortonworks Data Platform (HDP), which is 100% open source data management software powered by Apache Hadoop. HDP makes Hadoop easier to install, integrate, manage and use for enterprises and solution providers.
Join us for this webinar as we outline and demo the key features of the Hortonworks Data Platform, including:
- Rapid Installation: Thanks to a wizard that makes it easy to install and provision Hadoop across clusters of machines
- Data Integration Services: Including Talend Open Studio for Big Data, a visual development environment that allows you to connect to hundreds of data sources without writing code.
- Management and Monitoring Services: Including Hortonworks Management Center, which is an open source extensible tool that provides intuitive web-based dashboards for monitoring your clusters and creating alerts.
- Centralized Metadata Services: Including HCatalog, which greatly simplifies data sharing between Hadoop and other enterprise data systems.
| To view replays, Register or Login |
May 2, 2012Improving Hive and HBase Integration More Info ▼Apache Hive provides SQL-like access to your stored data in Apache Hadoop. Apache HBase stores tabular data in Hadoop and supports update operations. The combination of these two capabilities is often desired, however, the current integration show limitations such as performance issues. In this talk, Hortonworks co-founder, Owen O’Malley, will present an overview of Hive and HBase and discuss new updates/improvements from the community on the integration of these two projects. Various techniques used to reduce data exchange and improve efficiency will also be provided.
| To view replays, Register or Login |
Apr 18, 2012HDFS Futures: NameNode Federation for Improved Efficiency and ScalabilityMore Info ▼Scalability of the NameNode has been a key issue for HDFS clusters. Because the entire file system metadata is stored in memory on a single NameNode, and all metadata operations are processed on this single system, the NameNode both limits the growth in size of the cluster and makes the NameService a bottleneck for the MapReduce framework as demand increases. HDFS Federation horizontally scales the NameService using multiple federated NameNodes/namespaces. The federated NameNodes share the DataNodes in the cluster as a common storage layer. HDFS Federation also adds client-side namespaces to provide a unified view of the file system. In this talk, Hortonworks co-founder and key architect, Suresh Srinivas will discuss the benefits, features and best practices for implementing HDFS Federation.
| To view replays, Register or Login |
Apr 4, 2012Simplifying the Process of Uploading and Extracting Data from HadoopMore Info ▼As the volume of data continues to grow, organizations worldwide are quickly adopting Apache Hadoop to store, manage and process Big Data. However, integrating multiple data sources is still one of the more time-consuming and challenging aspects of storing and analyzing data with Hadoop.
Join us for this free informative webinar to learn how the power of open source technologies address these data integration challenges. Hear from Rohit Bakhshi, Solution Architect at Hortonworks and Jim Walker, Director of Product Marketing at Talend, on Apache Hadoop best practices that data enthusiast of any skill-levels can leverage. Gain insights to different approaches organizations can take to avoid the complexity of uploading or extracting data from Hadoop. Also, see a live demonstration on how to load HDFS in less than five minutes without writing a line of code and how to create and run a pig script.
| To view replays, Register or Login |
Mar 7, 2012Extending Hadoop beyond MapReduceMore Info ▼Hortonworks has been developing the next generation of Apache Hadoop MapReduce that factors the framework into a generic resource management fabric to support MapReduce and other application paradigms such as Graph Processing, MPI etc. High-availability is built-in from the beginning; as are security and multi-tenancy to support multiple users and organizations on large, shared clusters. The new architecture will also increase innovation, agility and hardware utilization. NextGen MapReduce is already available in Hadoop 0.23. Join us for this webcast as we discuss the main architectural highlights of MapReduce and its utility to users and administrators.
| To view replays, Register or Login |
Feb 22, 2012HCatalog, Table Management for HadoopMore Info ▼HCatalog is a metadata and table management system for Hadoop. It allows users to share data and metadata across Hive, Pig, and MapReduce. It also allows users to write their applications without being concerned how or where the data is stored, and insulates users from schema and storage format changes. In this talk, Hortonworks founder Alan Gates will introduce HCatalog, discuss its current features, and give an overview of the short term roadmap for HCatalog. Alan is an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. He also also designed HCatalog and guided its adoption as an Apache Incubator project.
| To view replays, Register or Login |
Dec 22, 2011Understanding the Hortonworks RoadmapMore Info ▼Join Hortonworks founder Eric Baldeschwieler as he guides you through Hortonworks’ planned releases for the upcoming year. Eric has led the evolution of Apache Hadoop from a 20-node prototype to a 42,000-node service behind every click at Yahoo! In this webcast, Eric will guide you through the planned enhancements to the major Hadoop components in 2012.
| To view replays, Register or Login |
Dec 15, 2011Reference Architecture for Hadoop in Banking - Industry PerspectiveMore Info ▼UBS has been an early adopter of Hadoop and continues to test a number of data processing & analytics use cases. In this webcast, Executive Director at UBS, Dave Casper will discuss how Hadoop fits the overall Data & Architecture strategy at UBS. Joining Dave in this discussion is Abhishek Mehta, (Founder, Tresata) and Arun Murthy (Hortonworks) who will outline a template to design, build and implement a Hadoop powered Data Processing & Analytics Platform within the confines of an Enterprise Data Stack. Get expert advice on how financial organizations can introduce Hadoop into a traditional Banking data environment while symbiotically integrating with existing tools and technologies.
You’ll learn:
- How to integrate Hadoop within enterprise data stack
- What to build vs buy
- What problems to use Hadoop for and what not to
| To view replays, Register or Login |
Dec 8, 2011Hadoop HDFS High Availability: HA NameNodeMore Info ▼The HDFS NameNode is a robust and reliable service as seen in practice in production at Yahoo, Facebook and other enterprises. However, the NameNode does not have automatic failover. A hot failover solution called HA NameNode is under active development (HDFS-1623) and making excellent progress. Join Hortonworks founder Sanjay Radia, as he outlines the approach and current status. Sanjay is an Apache Hadoop Committer and original architect of the Hadoop HDFS project at Yahoo!
| To view replays, Register or Login |
Dec 1, 2011What's in Store for Hadoop.NextMore Info ▼Apache Hadoop is the de-facto Big Data platform for data storage and processing. The current stable, production release of Hadoop is hadoop-0.20. The Apache Hadoop community is preparing to release hadoop-0.23 with several major improvements including HDFS Federation and NextGen MapReduce. In this webcast, Arun Murthy, the Apache Hadoop Release Master for hadoop.next, will discuss the details of the major improvements in hadoop-0.23.
| To view replays, Register or Login |
Nov 17, 2011Apache Hadoop: Now, Next and BeyondMore Info ▼This webcast will outline Hortonworks’ vision of the future of Apache Hadoop, from the team that has contributed the majority of code to Apache Hadoop and that is building the next generation. Among the topics covered will be:
- HadoopNow – an overview of the current stable release of Hadoop (0.20.205),which was recently made available in a technology preview as a fully-integrated and tested offering from Hortonworks as the Hortonworks Data Platform version 1.
- HadoopNext – the next major version built upon the NextGen MapReduce framework, which will be made available again in a technology preview from Hortonworks as the Hortonworks Data Platform version 2 in the coming weeks.
- HadoopBeyond – an overview of the major developments for future versions of the Hortonworks Data Platform being addressed or planned by the Hortonworks development team.
In addition to describing the roadmap of the Hortonworks Data Platform, this webcast will provide details on the components included in Hortonworks Data Platform version 1, including several important technologies that make this distribution more manageable, open and extensible, including:
- Apache Ambari – an open source installation and management system
- HCatalog – a metadata management service for simplifying data sharing between Hadoop and other enterprise information systems
- Open APIs – including WebHDFS and APIs for Ambari and HCatalog
| To view replays, Register or Login |
Nov 11, 2011Transactions & Interaction: The Correlation of Structured and Unstructured DataMore Info ▼The analytics world has expanded from simple transaction analytics to the correlation of transactions and interactions.Join Hortonworks and Datameer as they discuss their partnership around Apache Hadoop and how big data analytics is changing the face of BI. This webinar will include a demonstration of customer focused use cases.
You will learn:
- How Hadoop-based analytics answers new questions
- Real-world examples of successful use cases
- How sentiment analysis, consumer loan scoring and stock market analysis provide compelling insights
- Key milestones and practical first steps on the road to big data analytics
| To view replays, Register or Login |
Oct 25, 2011Apache Hadoop and the Big Data Opportunity in BankingMore Info ▼This webcast focused on how big data is transforming the banking industry. Among the topics discussed were:
- What is big data and what does it mean for banking?
- What are the challenges and opportunities that come with unstructured data?
- What is Apache Hadoop and what role should it play in your data architecture?
- What are some popular banking uses cases for how Apache Hadoop is being used today?
- How can banks leverage Apache Hadoop without having to write their own code/applications?
- Are there banking applications available that leverage the power of big data and Apache Hadoop?
- What is the future of Apache Hadoop?
- How should I get started?
Presenters:
- Arun C. Murthy – lead of the Next Generation MapReduce project in Apache Hadoop and co-founder of Hortonworks
- Abhishek Mehta – co-founder of Tresata, creator of the first Hadoop-powered big data & analytics platform for financial industry data
| To view replays, Register or Login |