I am trying to evaluate different distro’s of hadoop so i can use it for my production cluster. Hortonworks distro is one of the choice that i have. There are few problems that make Admins feel bad about HDP is that your RPM build process is broken, i am sure that there are lot of people that are using RPM as a standarad package manager and would like to use rpm installation for Hadoop too. There are rpms that i can download directly but i wanted to apply my own patches or pull some upstream patches to hadoop and compile the code again. The usual process that i use is build from SRPM which is broken from long time .
so If i have to build rpms from HDP its not easy,
1) rpm build using SRPM is broken for long time and its shocking that hortonworks is not fixing it.
2) There is no clear documentation on how you can build rpms ( if not by srpm )
3) i have to write my own scripts to use external software like FPM to build rpms from the binary tar balls which is painful.
Issues like this highly discourage administrators to use HDP on production clusters especially when the other vendors made these things super easy with working builds , clear documentation.
Note: I doesn’t belong to any of your competitor , i am a custumer trying to compare 2 major distro’s for my production cluster use.
If anybody else had gone through the similar pains and created workarounds, please do share. Thanks in advance.