Replies: 0
I am having a problem running python UDFs in pig on a hdp 2.1 cluster.
A simple query (like this http://pig.apache.org/docs/r0.12.1/udf.html#python-udfs) gives the error:
ERROR 1066: Unable to open iterator for alias udfoutput. Backend error : Exception while executing [POUserFunc (Name: POUserFunc(org.apache.pig.impl.builtin.StreamingUDF)[chararray] – scope-4 Operator Key: scope-4) children: null at []]: java.lang.NullPointerException
This seems to relate to issue:
https://issues.apache.org/jira/browse/PIG-3478 which is fixed for pig 0.14.
Is there a way to circumvent the issue with HDP 2.1 or should i try to upgrade to pig 0.14? Or should i wait for HDP 2.2?