HDP on Linux – Installation Forum

Use Case Scenario for Hadoop

  • #58519
    Abhi Abhi


    I would like to have some expert view on the use of a Big Data platform like Hadoop in one of my project scenarios.

    My apologies in advance as I am a complete novice in this technology although I understand databases like MySQL well.

    We are creating a product which would be used to analyse data from social media. So the input data would be a large volume of tweets, facebook posts, user profiles, youtube data
    and data from blogs etc. On top of this I would be having a web application to help me view and analyse this data. As the requirement makes it clear, I would be needing a sort of real time system. So if I have a tweet coming in, I would like to have it available to my web app readily for processing. Batch data processing may not be a suitable choice for my application.

    My question is, that is a Hadoop engine good choice for me?
    What are the parameter I should base my decision on?
    Is it also a good option to use a Multi Cluster MySQL engine as opposed to Hadoop?
    Is there any benchmarking in terms of Size and velocity of data in which Hadoop becomes a good choice?

    Sorry for posting so many questions but hoping to get some good insights.



to create new topics or reply. | New User Registration

You must be to reply to this topic. | Create Account

Support from the Experts

A HDP Support Subscription connects you experts with deep experience running Apache Hadoop in production, at-scale on the most demanding workloads.

Enterprise Support »

Become HDP Certified

Real world training designed by the core architects of Hadoop. Scenario-based training courses are available in-classroom or online from anywhere in the world

Training »

Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.