About Hive

This topic contains 1 reply, has 2 voices, and was last updated by Yi Zhang 1 year, 6 months ago.

  • Creator
  • #46638

    Prashant Kumar

    As I understand it, all the files holding unstructured data are stored in HDFS across the Hadoop cluster, and we use Hive when we need to analyse that data. My question is this: when we extract data from a specific file, we provide the file name and load the data into Hive tables. How do Hive tables support such a large volume of data, and what database does Hive use underneath?

Viewing 1 replies (of 1 total)

  • Author
  • #46754

    Yi Zhang

    Hi Prashant,

    Could you clarify your question? As a simplified analogy, Hive is the ‘database’ and HDFS is its storage. A file can be loaded from the local filesystem into HDFS, where it then serves as the underlying data for a Hive table.
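
To make the reply concrete, here is a minimal HiveQL sketch of that flow: define a table, load a local file into it, then query it. The table name `web_logs`, its columns, and the path `/tmp/web_logs.csv` are hypothetical and chosen only for illustration. It assumes a comma-delimited text file. Hive keeps the table definition in its metastore, while the rows themselves stay as files in HDFS, which is why the table can grow with the cluster.

```sql
-- Hypothetical table and columns, for illustration only.
-- The definition is recorded in the Hive metastore; no data is stored yet.
CREATE TABLE IF NOT EXISTS web_logs (
  ip  STRING,
  ts  STRING,
  url STRING
)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ','
STORED AS TEXTFILE;

-- LOCAL means the path is on the client's local filesystem.
-- Hive copies the file into the table's directory in HDFS.
-- The path below is an assumed example.
LOAD DATA LOCAL INPATH '/tmp/web_logs.csv' INTO TABLE web_logs;

-- Queries read the files in HDFS that back the table.
SELECT url, COUNT(*) AS hits
FROM web_logs
GROUP BY url;
```

Without the `LOCAL` keyword, `LOAD DATA INPATH` expects a path that is already in HDFS and moves that file into the table's directory rather than copying it.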

