1. 程式人生 > >hive1.2以前版本的where條件NullPointerException

hive1.2以前版本的where條件NullPointerException

1、異常背景:

hive版本1.1.0,表是orc格式,使用條件where name in ('支付金額','訂單量','客單價','毛利率','全鏈路達成率','貓超重點商品在架率','基準價毛利率','商品缺貨率')

2、日誌如下:

Diagnostic Messages for this Task:
Error: java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while processing row 
    at org.apache.hadoop.hive.ql.exec.mr.ExecMapper.map(ExecMapper.java:179)
    at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:54)
    at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:453)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:343)
    at org.apache.hadoop.mapred.YarnChild$2.run(YarnChild.java:164)
    at java.security.AccessController.doPrivileged(Native Method)
    at javax.security.auth.Subject.doAs(Subject.java:415)
    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1693)
    at org.apache.hadoop.mapred.YarnChild.main(YarnChild.java:158)
Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while processing row 
    at org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:52)
    at org.apache.hadoop.hive.ql.exec.mr.ExecMapper.map(ExecMapper.java:170)
    ... 8 more
Caused by: java.lang.NullPointerException
    at org.apache.hadoop.hive.ql.exec.vector.expressions.CuckooSetBytes.rehash(CuckooSetBytes.java:222)
    at org.apache.hadoop.hive.ql.exec.vector.expressions.CuckooSetBytes.insert(CuckooSetBytes.java:118)
    at org.apache.hadoop.hive.ql.exec.vector.expressions.CuckooSetBytes.load(CuckooSetBytes.java:127)
    at org.apache.hadoop.hive.ql.exec.vector.expressions.FilterStringColumnInList.evaluate(FilterStringColumnInList.java:71)
    at org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.processOp(VectorFilterOperator.java:100)
    at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:815)
    at org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:95)
    at org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOperator.java:157)
    at org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
    ... 9 more

3、檢視1.1.0版本原始碼:

    if (prev1 == null) {
      prev1 = t1;
      prev1 = t2;
    }
    t1 = new byte[n][];
    t2 = new byte[n][];
    for (byte[] v  : prev1) {
      if (v != null) {
        byte[] x = tryInsert(v);
        if (x != null) {
          rehash();
          return;
        }
      }
    }
    for (byte[] v  : prev2
) { if (v != null) {

發現prev2沒有初始化,而prev1初始化兩次,應該是bug

4、發現官網咋1.2版本fix了,參照https://issues.apache.org/jira/browse/HIVE-9950