Migration from HDInsight to EMR

This topic contains 0 replies, has 1 voice, and was last updated by Gautam Gupta 7 months ago.


    Gautam Gupta


    Hope this is the right forum for this topic, and apologies in advance for the length of the post.

    We have recently started building a product with Big Data requirements, for which we have chosen Hadoop. We currently don’t have a lot of experience with Big Data.

    For our cloud platform and Hadoop service, we are trying to choose between Azure HDInsight and AWS EMR. Our product will be built using .NET, and we are already using Azure for another existing product. We also have experience with AWS, though not yet with Hadoop.

    Now, we know that Azure is not as mature as EMR, and that AWS would be a better bet, at least for the next couple of years. However, it would probably be easier to develop on Azure with .NET, and we would also save some upfront costs as we are already using it.

    So, we are thinking of building the beta version of the product on Azure, then validating and benchmarking its performance. As a backup plan, we would move to AWS and EMR for the final product if required.

    My main question is: how difficult would it be to migrate from HDInsight to EMR? How much of our code would we have to change for this migration? What does HDInsight offer that EMR doesn’t?


