Hbase zookeeper connection exception

to create new topics or reply. | New User Registration


This topic contains 5 replies, has 5 voices, and was last updated by  Sasha J 1 year, 8 months ago.

  • Creator
  • #16990

    Kadir Sert

    When i’m trying to integrate nutch 2.1 with hbase in hortonworks 1.2.1 distribution, i’m getting connection exception. What can i do to resolve this problem?

    Zookeeper, version = 3.4.5
    HBase, version =
    Nutch, version = 2.1

    Nutch command output is here:
    # bin/nutch inject urls
    InjectorJob: starting
    InjectorJob: urlDir: urls
    InjectorJob: org.apache.gora.util.GoraException: org.apache.hadoop.hbase.ZooKeeperConnectionException: HBase is able to connect to ZooKeeper but the connection closes immediately. This could be a sign that the server has too many connections (30 is the default). Consider inspecting your ZK server logs for that error and then make sure you are reusing HBaseConfiguration as often as you can. See HTable’s javadoc for more information.
    at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:167)
    at org.apache.gora.store.DataStoreFactory.createDataStore(DataStoreFactory.java:135)
    at org.apache.nutch.storage.StorageUtils.createWebStore(StorageUtils.java:75)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:214)
    at org.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:228)
    at org.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:248)
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
    at org.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:258)

    Zookeeper server logs contains warning:
    200 – WARN [NIOServerCxn.Factory:] – caught end of stream exception
    EndOfStreamException: Unable to read additional data from client sessionid 0x0, likely client has closed socket
    at org.apache.zookeeper.server.NIOServerCnxn.doIO(NIOServerCnxn.java:220)
    at org.apache.zookeeper.server.NIOServerCnxnFactory.run(NIOServerCnxnFactory.java:208)
    at java.lang.Thread.run(Thread.java:662)

Viewing 5 replies - 1 through 5 (of 5 total)

You must be to reply to this topic. | Create Account

  • Author
  • #29088

    Sasha J

    NO, we do not have any tutorial on this.


    Jason Wu

    Hi all: I also want to run Nutch on HDP. Do you have any tutorials for that?


    Larry Liu

    Hi, Kadir,

    Thanks for trying HDP.

    HDP recommends increasing the maximum number of file handles to more than 10,000. Note that increasing the file handles for the user who is running the HBase process is an operating system configuration, not an HBase configuration.
    If you are using ulimit, you must make the following configuration changes:

    In the /etc/security/limits.conf file, add the following lines:
    hdfs – nofile 32768
    hbase – nofile 32768

    After the changes made, please restart zookeeper and hbase.

    Hope this helps resolve the issue



    Kadir Sert

    the scripts you say in not working!

    # ./HMC-check.sh
    Error: unable to open database “/var/db/hmc/data/data.db”: unable to open database file
    Error: unable to open database “/var/db/hmc/data/data.db”: unable to open database file
    Error: unable to open database “/var/db/hmc/data/data.db”: unable to open database file
    Error: unable to open database “/var/db/hmc/data/data.db”: unable to open database file
    Error: unable to open database “/var/db/hmc/data/data.db”: unable to open database file
    grep: /var/log/hmc/hmc.log: No such file or directory
    Resulting file is: /tmp/…out
    Please, upload it to Hortonworks Support FTP site.

    and ambari-check.sh asking me password!



    Hi Kadir,

    Thanks for trying out Hortonworks Data Platform.

    Could you send us the complete log files? Follow the link in this post http://hortonworks.com/community/forums/topic/hmc-installation-support-help-us-help-you
    for instructions on how to get them to us.


Viewing 5 replies - 1 through 5 (of 5 total)
Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.