I’ve got a working three-node cluster running on three Windows Server 2012 VMs, and the smoke test runs just fine.
I’ve spent the day fighting to run a MapReduce job written in Python using the hadoop-streaming approach. I have been passing the mapper and the reducer (mapper.py and reduce.py) to the streaming JAR with the -file option, but I kept getting an error that mapper.py could not be found.
I then tried passing python.exe into the JAR as well and setting the mapper to “python.exe mapper.py”. This improves things to a certain extent, but now the job fails with a syntax error in the mapper, which I’m pretty sure is because python.exe can’t find its dependencies.
This all looks very much like an environment issue with Python and the path. I have c:\python27 in the PATH system variable, and running
echo "foo bar foo bar" | mapper.py
from a command prompt works as expected, so the path feels okay.
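For reference, here is a minimal sketch of what a streaming mapper of this kind might look like. It assumes a simple word-count job; the actual mapper.py is not shown in this post, and the function name map_words is hypothetical. It uses only the standard library, so a mapper like this should not depend on anything outside the Python install itself.

```python
import sys


def map_words(lines):
    """Yield one 'word<TAB>1' record per whitespace-separated token."""
    for line in lines:
        for word in line.split():
            yield "%s\t1" % word


# Hadoop streaming feeds input splits on stdin and reads records from stdout.
# The isatty() check only skips the loop when the script is started
# interactively with nothing piped in.
if __name__ == "__main__" and not sys.stdin.isatty():
    for record in map_words(sys.stdin):
        sys.stdout.write(record + "\n")
```

Piping a line of text into a script like this from a command prompt, as in the echo test above, should print one tab-separated record per word.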
Has anyone experienced anything like this or have any ideas about how I can get python.
Thanks in advance.