[Hadoop]Hadoop單元測試MRUnit

阿新 • • 發佈：2019-01-17

在MapReduce中，map函式和reduce函式的獨立測試是非常方便的，這是由函式風格決定的。MRUnit是一個測試庫，它便於將已知的輸入傳遞給mapper或者檢查reducer的輸出是否符合預期。MRUnit與標準的執行框架（JUnit）一起使用。

1. 設定開發環境

mrunit-x.x.x-incubating-hadoop2.jar。同時還需要下載JUnit最新版本jar。

如果使用Maven方式則使用如下方式：

<junit.version>4.12</junit.version>
<mrunit.version>1.1.0</mrunit.version 
>
<!-- junit -->
<dependency>
    <groupId>junit</groupId>
    <artifactId>junit</artifactId>
    <version>${junit.version}</version>
    <scope>test</scope>
</dependency>
<!-- mrunit -->
<dependency>
   <groupId>org.apache.mrunit</groupId 
>
   <artifactId>mrunit</artifactId>
   <version>${mrunit.version}</version>
   <classifier>hadoop2</classifier>
   <scope>test</scope>
</dependency>

備註：

如果你使用的是hadoop 2.x版本，classifier設定為hadoop2

2. MRUnit 測試用例

MRUnit測試框架基於Junit，可以測試hadoop版本為0.20，0.23.x，1.0.x，2.x的map reduce程式。

下面是一個使用MRUnit對統計一年最高氣溫的Map Reduce程式進行單元測試。

測試資料如下：

0096007026999992016062218244+00000+000000FM-15+702699999V0209999C000019999999N999999999+03401+01801999999ADDMA1101731999999REMMET069MOBOB0 METAR 7026 //008 000000 221824Z AUTO 00000KT //// 34/18 A3004=

這隻有一天的資料，氣溫是340，Mapper輸出為該天氣溫340

下面是相應的Mapper和Reducer：

MaxTemperatureMapper：

package com.sjf.open.maxTemperature;
import java.io.IOException;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;
import com.google.common.base.Objects;
/**
 * Created by xiaosi on 16-7-27.
 */
public class MaxTemperatureMapper extends Mapper<LongWritable, Text, Text, IntWritable> {
    private static final int MISSING = 9999;
    public void map(LongWritable key, Text value, Context context) throws IOException, InterruptedException {
        String line = value.toString();
        // 年份
        String year = line.substring(15, 19);
        // 溫度
        int airTemperature;
        if(Objects.equal(line.charAt(87),"+")){
            airTemperature = Integer.parseInt(line.substring(88,92));
        }
        else{
            airTemperature = Integer.parseInt(line.substring(87,92));
        }
        // 空氣質量
        String quality = line.substring(92, 93);
        if(!Objects.equal(airTemperature, MISSING) && quality.matches("[01459]")){
            context.write(new Text(year), new IntWritable(airTemperature));
        }
    }
}

MaxTemperatureReducer：

package com.sjf.open.maxTemperature;
import java.io.IOException;
import java.util.Iterator;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;
/**
 * Created by xiaosi on 16-7-27.
 */
public class MaxTemperatureReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
    @Override
    protected void reduce(Text key, Iterable<IntWritable> values, Context context) throws IOException, InterruptedException {
        // 一年最高氣溫
        int maxValue = Integer.MIN_VALUE;
        for(IntWritable value : values){
            maxValue = Math.max(maxValue, value.get());
        }//for
        // 輸出
        context.write(key, new IntWritable(maxValue));
    }
}

下面是MRUnit測試類：

package com.sjf.open.maxTemperature;
import com.google.common.collect.Lists;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mrunit.mapreduce.MapDriver;
import org.apache.hadoop.mrunit.mapreduce.MapReduceDriver;
import org.apache.hadoop.mrunit.mapreduce.ReduceDriver;
import org.apache.hadoop.mrunit.types.Pair;
import org.junit.Before;
import org.junit.Test;
import java.io.IOException;
import java.util.List;
/**
 * Created by xiaosi on 16-12-8.
 */
public class MaxTemperatureTest {
    private MapDriver mapDriver;
    private ReduceDriver reduceDriver;
    private MapReduceDriver mapReduceDriver;
    @Before
    public void setUp(){
        MaxTemperatureMapper mapper = new MaxTemperatureMapper();
        mapDriver = MapDriver.newMapDriver(mapper);
        MaxTemperatureReducer reducer = new MaxTemperatureReducer();
        reduceDriver = ReduceDriver.newReduceDriver();
        reduceDriver.withReducer(reducer);
        mapReduceDriver = MapReduceDriver.newMapReduceDriver(mapper, reducer);
    }
    @Test
    public void testMapper() throws IOException {
        Text text = new Text("0096007026999992016062218244+00000+000000FM-15+702699999V0209999C000019999999N999999999+03401+01801999999ADDMA1101731999999REMMET069MOBOB0 METAR 7026 //008 000000 221824Z AUTO 00000KT //// 34/18 A3004=");
        mapDriver.withInput(new LongWritable(), text);
        mapDriver.withOutput(new Text("2016"), new IntWritable(340));
        mapDriver.runTest();
        // 輸出
        List<Pair> expectedOutputList = mapDriver.getExpectedOutputs();
        for(Pair pair : expectedOutputList){
            System.out.println(pair.getFirst() + " --- " + pair.getSecond()); // 2016 --- 340
        }
    }
    @Test
    public void testReducer() throws IOException {
        List<IntWritable> IntWritableList = Lists.newArrayList();
        IntWritableList.add(new IntWritable(340));
        IntWritableList.add(new IntWritable(240));
        IntWritableList.add(new IntWritable(320));
        IntWritableList.add(new IntWritable(330));
        IntWritableList.add(new IntWritable(310));
        reduceDriver.withInput(new Text("2016"), IntWritableList);
        reduceDriver.withOutput(new Text("2016"), new IntWritable(340));
        reduceDriver.runTest();
        // 輸出
        List<Pair> expectedOutputList = reduceDriver.getExpectedOutputs();
        for(Pair pair : expectedOutputList){
            System.out.println(pair.getFirst() + " --- " + pair.getSecond());
        }
    }
    @Test
    public void testMapperAndReducer() throws IOException {
        Text text = new Text("0089010010999992014010114004+70933-008667FM-12+000999999V0201201N006019999999N999999999+00121-00361100681ADDMA1999990100561MD1810171+9990REMSYN04801001 46/// /1206 10012 21036 30056 40068 58017=");
        mapReduceDriver.withInput(new LongWritable(), text);
        mapReduceDriver.withOutput(new Text("2014"), new IntWritable(12));
        mapReduceDriver.runTest();
        // 輸出
        List<Pair> expectedOutputList = mapReduceDriver.getExpectedOutputs();
        for(Pair pair : expectedOutputList){
            System.out.println(pair.getFirst() + " --- " + pair.getSecond()); // 2014 --- 12
        }
    }
}

如果測試的是Mapper，使用MRUnit的MapDiver，如果測試Reducer，使用ReduceDriver，如果測試整個MapReduce程式，則需要使用MapReduceDriver。在呼叫runTest()方法之前，需要配置mapper（或者Reducer），輸入值，期望的輸出key，期望的輸出值等。如果與期望的輸出值不匹配，MRUnit測試失敗。根據withOutput()被呼叫的次數，MapDiver（ReduceDriver，MapReduceDriver）能來檢查0，1,或者多個輸出記錄。

備註：

注意 MapDriver，ReduceDriver，MapReduceDriver 引入的jar包版本：

import org.apache.hadoop.mrunit.mapreduce.MapDriver;
import org.apache.hadoop.mrunit.mapreduce.MapReduceDriver;
import org.apache.hadoop.mrunit.mapreduce.ReduceDriver;

而不是：

import org.apache.hadoop.mrunit.MapDriver;
import org.apache.hadoop.mrunit.MapReduceDriver;
import org.apache.hadoop.mrunit.ReduceDriver;

這分別對應Hadoop新老版本API，第一類對應新版本的Mapper和Reducer：

import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;

第二類對應老版本的Mapper和Reducer：

import org.apache.hadoop.mapred.Mapper;
import org.apache.hadoop.mapred.Reducer;

[Hadoop]Hadoop單元測試MRUnit

1. 設定開發環境

2. MRUnit 測試用例

[Hadoop]Hadoop單元測試MRUnit

在HADOOP中使用MRUNIT進行單元測試

hadoop單元測試方法--使用和增強MRUnit[1]

Hadoop-使用MRUnit來寫單元測試

Hadoop學習筆記之三：用MRUnit做單元測試

hadoop中關於mapreduce的單元測試

使用mrunit對hadoop進行單元除錯

2018-08-06 期 MapReduce MRUnit安裝及單元測試

10分鐘從無到有搭建hadoop環境並測試mapreduce

Hadoop第一個測試例項WordCount的執行

怎樣選擇Hadoop的基準測試

hadoop學習筆記(一)——hadoop安裝及測試

淺析MapReduce單元測試框架—MRUnit

使用Docker快速搭建Hadoop，Spark測試環境

hadoop學習之HDFS（2.5）：windows下eclipse遠端連線linux下的hadoop叢集並測試wordcount例子

MapReduce的單元測試框架MRUnit

MapReduce 單元測試工具 MRUnit 使用

NUnit.Framework在VS2015中如何進行單元測試

Spring Boot的單元測試(Unit Test)

ASP.NET Zero--單元測試

[Hadoop]Hadoop單元測試MRUnit

1. 設定開發環境

2. MRUnit 測試用例

相關推薦