HDP on Windows – Installation Forum

Testing Mahout

  • #39516

    I tried to test our HDP deployment with Mahout by following https://cwiki.apache.org/confluence/display/MAHOUT/Wikipedia+Bayes+Example
    I was able to download the Wikipedia data set and load it into HDFS, but when I try step 4 I get:
    D:\user\mahout->bin\mahout wikipediaXMLSPlitter -d D:\user\enwiki-latest-pages-articles.xml -o wikipedia/chunks -c 64
    "Mahout home set C:\hadoop\\mahout-"
    MAHOUT_JOB: C:\hadoop\\mahout-\examples\target\mahout-examples
    13/10/07 18:23:07 WARN driver.MahoutDriver: Unable to add class: wikipediaXMLSPlitter
    13/10/07 18:23:08 WARN driver.MahoutDriver: No wikipediaXMLSPlitter.props found on classpath, will use command-line arguments only
    Unknown program 'wikipediaXMLSPlitter' chosen.
    It looks like another jar file is needed.
