Hortonworks Sandbox Forum

Python script not generating log 12: How to Refine and Visualize Server Log Data

  • #49145
    Mayank Rathi

    Hello ,

    Python script generate_logs.py is not generating log files. When I tried to run the script command prompt did not return to normal. Any clues ?

    reduce/hadoop-streaming-’ -Djava.library.path=::/usr/lib/hadoop/lib/native/Linux-amd64-64:/usr/lib/hadoop/lib/native::/usr/lib/hadoop/lib/native/Linux-amd64-64:/usr/lib/hadoop/lib/native org.apache.flume.node.Application -f /etc/flume/conf/flume.conf -n sandbox
    python generate_logs.py

to create new topics or reply. | New User Registration

  • Author
  • #50039
    Sriraman S

    Same here !! Python script generate_logs.py is not generating the log files!! Please help asap!!!

    Santosh Aditham

    Exact same issue. Tutorial says ” When the log file has been generated, a timestamp will appear, and the command prompt will return to normal ([root@Sandbox \~]\#). It may take several seconds to generate the log file.” but I do not see command prompt returning to NORMAL! Any help at all?

    Santosh Aditham

    Update: Nothing wrong with the tutorial, just very very slow. If I stop the python script in 5 mins, I get approx 100 rows with HCatalog. If I leave the python script running for almost an hour, I get 500 rows with HCatalog. Sadly, the eventlog has 250,000+ rows!!! So either leave the python script for a day or two or try this: http://kzhendev.wordpress.com/2014/04/06/apache-flume-get-logs-out-of-rabbitmq-and-into-hdfs/

You must be to reply to this topic. | Create Account

Support from the Experts

A HDP Support Subscription connects you experts with deep experience running Apache Hadoop in production, at-scale on the most demanding workloads.

Enterprise Support »

Become HDP Certified

Real world training designed by the core architects of Hadoop. Scenario-based training courses are available in-classroom or online from anywhere in the world

Training »

Hortonworks Data Platform
The Hortonworks Data Platform is a 100% open source distribution of Apache Hadoop that is truly enterprise grade having been built, tested and hardened with enterprise rigor.
Get started with Sandbox
Hortonworks Sandbox is a self-contained virtual machine with Apache Hadoop pre-configured alongside a set of hands-on, step-by-step Hadoop tutorials.
Modern Data Architecture
Tackle the challenges of big data. Hadoop integrates with existing EDW, RDBMS and MPP systems to deliver lower cost, higher capacity infrastructure.