Hive / HCatalog Forum

Hive parquet problem

  • #54264
    Kamil Malachowski

    Hi guys,
    I have a problem reading Hive tables stored in Parquet format. It fails with the following error:

    Caused by: can not read class parquet.format.PageHeader: null
    at parquet.format.Util.readPageHeader(
    at parquet.hadoop.ParquetFileReader$Chunk.readAllPages(
    at parquet.hadoop.ParquetFileReader.readNextRowGroup(
    at parquet.hadoop.InternalParquetRecordReader.checkRead(
    at parquet.hadoop.InternalParquetRecordReader.nextKeyValue(
    at parquet.hadoop.ParquetRecordReader.nextKeyValue(

    The tables were created with Parquet 1.2.5 and copied with distcp to a Hortonworks 2.1 cluster running Hive 0.13 and, I guess, Parquet 1.3.5.
    I found that my issue may be related to

    Is there a quick workaround, e.g. some setting, that will resolve my problem?

    Best Regards
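
    Since the files were copied between clusters, one thing that may be worth ruling out before suspecting a version mismatch is a truncated or damaged copy. The sketch below is only a sanity check, not a fix: it assumes one of the table's data files has first been pulled to local disk (for example with hdfs dfs -get), and the function name check_parquet_framing is made up for illustration. It relies only on the Parquet file layout: the 4-byte magic PAR1 at the start and end of the file, and a 4-byte little-endian footer length just before the trailing magic. It does not decode page headers, so a file can pass this check and still trigger the error above.

```python
import struct

MAGIC = b"PAR1"  # 4-byte marker at the start and end of every Parquet file


def check_parquet_framing(path):
    """Return (ok, detail) for basic framing checks on a local Parquet file.

    Hypothetical helper: it only inspects the leading magic, the trailing
    magic, and the footer length field. It does not parse any metadata.
    """
    with open(path, "rb") as f:
        f.seek(0, 2)  # jump to end of file to learn its size
        size = f.tell()
        # Smallest possible layout: magic + footer length + magic = 12 bytes
        if size < 12:
            return False, "file too small to be Parquet"
        f.seek(0)
        head = f.read(4)
        f.seek(size - 8)
        footer_len = struct.unpack("<I", f.read(4))[0]
        tail = f.read(4)

    if head != MAGIC or tail != MAGIC:
        return False, "missing PAR1 magic"
    if footer_len > size - 12:
        return False, "footer length exceeds file size"
    return True, "framing looks intact (footer %d bytes)" % footer_len
```

    If this reports a missing magic or an impossible footer length, the copy itself is suspect and comparing checksums between source and destination would be the next step. If it passes, the file is at least complete, and the reader/writer version difference remains the more likely cause.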
