MapReduce寫程式碼的流程,以及需要繼承的超類

阿新 • • 發佈：2018-12-01

package tq;

import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hbase.ScanPerformanceEvaluation.MyMapper;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapred.FileInputFormat;
import org.apache.hadoop.mapred.FileOutputFormat;
import org.apache.hadoop.mapred.TestMiniMRClientCluster.MyReducer;
import org.apache.hadoop.mapreduce.Job;

import wordcount.MyCombiner;

public class TianQi {
	public static void main(String[] args) throws IOException {
		//設定配置項
		Configuration conf = new Configuration();
		Job job = Job.getInstance(conf);
		
		//設定
		job.setJarByClass(TianQi.class);
		job.setJobName("sdfjsk");
		
		//設定讀取檔案的路徑
		Path filein = new Path("jk");
		FileInputFormat.addInputPath(job, filein);
		
		
		//設定檔案的輸出的路徑
		Path fileout = new Path("fdsd");
		if(fileout.getFileSystem(conf).exists(fileout)) {
			fileout.getFileSystem(conf).delete(fileout,true);
		}
		FileOutputFormat.setOutputPath(job, fileout);
		
		//設定檔案的讀入的格式 MyInputFormat extends InputFormat.class
		job.setInputFormatClass(MyInputFormat.class);
//		Multiple markers at this line
//		- The method setInputFormatClass(Class<? extends InputFormat>) in the type Job is not applicable for the arguments 
//		 (Class<MyInputFormat>)
//		- MyInputFormat cannot be resolved to a type
		
		//設定檔案讀出的格式 MyOutFormat extends OutputFormat.class
		job.setOutputFormatClass(MyOutFormat.class);
//		Multiple markers at this line
//		- The method setOutputFormatClass(Class<? extends OutputFormat>) in the type Job is not applicable for the arguments 
//		 (Class<MyOutFormat>)
//		- MyOutFormat cannot be resolved to a type
		
		//設定map端 mymapper.class extends mapper.class
		job.setMapperClass(MyMapper.class);
		
		//設定map端輸出的格式
		job.setMapOutputKeyClass(Text.class);
		job.setMapOutputValueClass(IntWritable);
		
		//設定comparator排序規則
		job.setSortComparatorClass(MySortComparator.class);
//		Multiple markers at this line
//		- The method setSortComparatorClass(Class<? extends RawComparator>) in the type Job is not applicable for the 
//		 arguments (Class<MySortComparator>)
//		- MySortComparator cannot be resolved to a type
		
		//設定partition分割槽  Mypartition extends Partitoner
		job.setPartitionerClass(MyPartition.class);
//		Multiple markers at this line
//		- MyPartition cannot be resolved to a type
//		- The method setPartitionerClass(Class<? extends Partitioner>) in the type Job is not applicable for the arguments 
//		 (Class<MyPartition>)
		
		
		//設定map端的預聚合 MyCombiner.class extends Reducer.class
		job.setCombinerClass(MyCombiner.class);
//		The method setCombinerClass(Class<? extends Reducer>) in the type Job is not applicable for the arguments (Class<MyCombiner>)
		
		//設定
		job.setGroupingComparatorClass(MyGroup.class);
//		Multiple markers at this line
//		- The method setGroupingComparatorClass(Class<? extends RawComparator>) in the type Job is not applicable for the 
//		 arguments (Class<MyGroup>)
//		- MyGroup cannot be resolved to a type
		
		//設定reduce端
		job.setReducerClass(MyReducer.class);
		
		//設定reduce端的輸出的key
		job.setOutputKeyClass(Text.class);
		
		//設定reduce端的輸出的value
		job.setOutputValueClass(IntWritable);
		
		//設定map端的task的個數
		job.setNumReduceTasks(2);
		
		/**
		 * 總結：
		 * 一、設定conf configuration conf = new configuration
		 * 		Job job = Job.getInstance(conf)
		 * 二、設定檔名和jobName
		 * 		job.getJarbyclass()
		 * 		job.setJobName()
		 * 三、設定檔案的輸入路徑和輸出路徑
		 * 		FileInputFormat.addInputPATH
		 * 		FileOutputFormat.setoutputpath
		 * 			if(filleout.getfilesystem(conf).exists(fileout))
		 * 				fileout.getfileoutsystem(conf).delete（fileout）
		 * 四、設定檔案讀入的型別
		 * 		job.setFileInputFormat extends inputFormat.class
		 * 五、設定檔案的讀出的型別
		 * 		job.setFileOutFormatclass extends outputFormat.class
		 * 
		 * 六、設定檔案map端
		 * 		job.setMapperclass extends mapper
		 * 七、設定map端的輸出的key
		 * 		job.setmapoutputkeyclass		 		
		 * 八、設定map端的輸出的value
		 * 		job.setmapoutputvalueclass
		 * 九、設定排序sort
		 * 		job.setsortComparator() extends RawComparator()
		 * 十、設定排序
		 * 		job.setGroupingComparatorclass extends RawComparator.class
		 * 十一、設定partition
		 * 		job.setpartitionclass extends partitioner
		 * 十二、設定reduce
		 * 		job.setReducerclass extends reducer.class
		 * 十三、設定reduce端的輸出的key
		 * 		job.setoutputkeyclass
		 * 十四、設定reduce端的輸出的value
		 * 		job.setoutputvalueclass
		 * 十五、設定reduce端task的個數
		 * 		job.setNumoofReduceTask()
		 * 十六、最終設定job.waitforcomplettion(true)
		 * 
		 */		
	}
}

總結：

/**
		     * 總結：
		 * 一、設定conf configuration conf = new configuration
		 * 		Job job = Job.getInstance(conf)
		 * 二、設定檔名和jobName
		 * 		job.getJarbyclass()
		 * 		job.setJobName()
		 * 三、設定檔案的輸入路徑和輸出路徑
		 * 		FileInputFormat.addInputPATH
		 * 		FileOutputFormat.setoutputpath
		 * 			if(filleout.getfilesystem(conf).exists(fileout))
		 * 				fileout.getfileoutsystem(conf).delete（fileout）
		 * 四、設定檔案讀入的型別
		 * 		job.setFileInputFormat extends inputFormat.class
		 * 五、設定檔案的讀出的型別
		 * 		job.setFileOutFormatclass extends outputFormat.class
		 * 
		 * 六、設定檔案map端
		 * 		job.setMapperclass extends mapper
		 * 七、設定map端的輸出的key
		 * 		job.setmapoutputkeyclass		 		
		 * 八、設定map端的輸出的value
		 * 		job.setmapoutputvalueclass
		 * 九、設定排序sort
		 * 		job.setsortComparator() extends RawComparator()
		 * 十、設定排序
		 * 		job.setGroupingComparatorclass extends RawComparator.class
		 * 十一、設定partition
		 * 		job.setpartitionclass extends partitioner
		 * 十二、設定reduce
		 * 		job.setReducerclass extends reducer.class
		 * 十三、設定reduce端的輸出的key
		 * 		job.setoutputkeyclass
		 * 十四、設定reduce端的輸出的value
		 * 		job.setoutputvalueclass
		 * 十五、設定reduce端task的個數
		 * 		job.setNumoofReduceTask()
		 * 十六、最終設定job.waitforcomplettion(true)
		 * 
		 */

MapReduce寫程式碼的流程：

分為以下幾個類“

一、公共設定(四種）：

1、設定conf

configuration conf = new configuration（）

Job job = Job.getInstance(conf);

2、設定類名

job.setJarByclass(tq.class)

job.setJobName("sdfds")

3、設定檔案的讀入路徑和讀出路徑

Path filein = new Path("sdfs")

FileinputFormat.addInputparh(job, filein)

Path fileout = new Path("dfdjs")

if(fileout.getFilesystem(conf).exists(fileout){
fileout.getfilesystem(conf).delete(fileout)

}

FileOutpuFormat.setOutPath(job,fileout);

4、設定檔案的讀入格式和讀出格式

job.setfileinputformatclass (fddsf) extends inputformat()

job.setfileoutputformat(dfd) extends inputformat.class

二、設定map端

1、設定map端

job.setmapperclass extends mapper

2、設定map端的輸出key和value的值

job.setmapoutputkeyclass

job.setmapoutputvalueclass

三、設定map端輸出之後

1、設定排序

job.setsortComparatorclass extends RawComparator.class

job.setGroupingComparatorclass extends RawComparator.class

2、設定分割槽

job.setpartitionerclass extends partitioner.class

3、設定map端的預聚合

job.setcombinerclass extends reducer

四、設定reduce端

1、job.setreducerclass extends Reducer.class

2、設定reduce端的輸出

job.setoutputkeyclass()

job.setoutputvalueclass()

五、所有的都結束之後

1、設定reduce端task的個數

job.setNumofReducetask(2)

2、job.waitforcompletition(true)

MapReduce寫程式碼的流程,以及需要繼承的超類

package tq; import java.io.IOException; import org.apache.hadoop.conf.Configuration; import org.apache.hadoop.fs.Path; import org.apache.hadoop.hbas

寫程式碼過程中需要注意的地方

1.使用日誌框架列印日誌：需要注意的點：輸出有級別區分的日誌輸出帶有有效資訊的日誌日誌中帶上上下

用Java，在這裡門簡單分為防盜門需要密碼、鑰匙；安全門需要密、鑰匙、虹膜。如果，不當進入會引起警報，警報有警車警報、煙霧警報，對要求寫程式碼

1建Door類 package Door; public abstract class Door { public abstract void open(); public abstract void close(); } 2、建The_police_car_a

全面詳細的微信支付思路流程以及專案程式碼分享

之前一直沒有接觸微信支付這方面的業務，現在因專案需要，需要用到此功能，開始各種百度，稍微瞭解了一下，微信支付分為：支付寶支付、APP支付、掃碼支付，但是對於H5支付和支付寶支付現在還是沒有徹底搞明白他兩的區別，希望大佬們可以稍微提點一二，小弟先在此謝過！大體思路如下： 1】.獲取co

少說話多寫程式碼之Python學習042——類04（超類）

來看看Python中類的繼承。被繼承的類稱作超類。先看一個類，定義了一個Student類，有兩個屬性和三個方法。 class Student: name='學生' school='學校' def init(

寫程式碼：假設一年期定期利率為3.25%，計算一下需要過多少年，一萬元的一年定期存款連本帶息能翻番？

# 假設一年期定期利率為3.25%，計算一下需要過多少年，一萬元的一年定期存款連本帶息能翻番？MONEY_RATE = 0.0325money = 10000year = 1while money <= 20000: money *= (1 + MONEY_RATE) year += 1print("w

少說話多寫程式碼之Python學習044——類06（多繼承）

關於繼承最麻煩的就是多繼承，而Python是支援多繼承的。也就是說一個子類可能有兩個以上的父類。比如，如下程式碼，子類繼承了兩個類，父類的方法在子類中都可以呼叫。 class Programer: language='二進位制' de

少說話多寫程式碼之Python學習043——類05（檢查繼承關係）

Python中還可以檢查類的繼承的關係。比如，如下兩個類，PrimaryBaLinghouStudent繼承了BaLinghouStudent。 class BaLinghouStudent: name='學生' school='學校

IT行業程式設計師需知：不止於寫程式碼，我們還需要提升自身的軟技能

作為一所專業的IT教育培訓類企業，我們叩丁狼教育在一開始都會這樣教育我們的學員，一定要把精力集中放在學習技能上，因為對於初學者來說，這是他們必定要邁出的第一步。而對於已經掌握了一定技術的軟體開發人員，在這裡建議你邁出第二步。大多數程式設計師追求與時俱進的時候會把時間花費在新的框架或新的程

程式設計師網咖寫程式碼挨頓打？網友：想笑死我繼承我的花唄？

不知道為什麼，作為程式設計師老是能夠碰到一些奇奇怪怪的倒黴事，特別是在網咖，就好像是幸運女神不站在自己這邊一樣，總能捱到一頓打，最近就有一名程式設計師就是如此。自己因為專案臨時改需求，但又快要上線了，所以就想在去公司加班。誰知道那天公司直接停電了，沒辦法，只好去網咖趕工了。

CNN卷積神經網路應用於人臉識別（詳細流程+程式碼實現)和相應的超引數解釋

DeepLearning tutorial（5）CNN卷積神經網路應用於人臉識別（詳細流程+程式碼實現） @author：wepon 本文主要講解將CNN應用於人臉識別的流程，程式基於Python+numpy+theano+PIL開發，採用類似LeNet5的

12.Scala中的繼承：超類的構造、重寫欄位、重寫方法程式碼實戰

object ExtendOverride_12 { def main(args: Array[String]): Unit = { val w = new Worker("Spark", 5, 100000) println("school:

git分支開發，分支(feature)同步主幹(master)程式碼，以及最終分支合併到主幹的操作流程

由於rebase執行速度慢，分支同步主幹程式碼時，分支的每次提交都可能和主幹產生衝突，需要解決的次數太多，影響提交效率。同時，為了保證主幹提交線乾淨(可以安全回溯)，所以採用下面所說的merge法。 merge法核心: (master) git merge feature --squash 意思是把fea

簡談用g++編譯執行c++程式碼流程，以及動態庫靜態庫的建立與使用

一 g++ 編譯執行hello world 1編寫hello world 程式碼 #include<iostream> using namespace std; int main() { cout << "hello

誰說設計師不會寫程式碼？超簡單PHOTOSHOP指令碼語言介紹

自動化對每個設計師的工作來說是很有用的。它可以在重複的任務上節省寶貴的時間，還能夠幫我們更快捷、更容易的解決一系列問題。你可以使用photoshop的動作來使工作流程自動化，這是很流行的，大多數人都知道並且已經在使用的方法。今天，我們將介紹給你一種高階的自動化技巧：指

文件驅動 —— 表單元件（五）：基於Ant Design Vue 的表單控制元件的demo，再也不需要寫程式碼了。

# 原始碼 [https://github.com/naturefwvue/nf-vue3-ant](https://github.com/naturefwvue/nf-vue3-ant) # 特點 * 只需要更改meta，既可以切換表單 * 可以統一修改樣式，統一升級，以最小的代價，應對UI的升級、切換

代碼上線流程以及版本發布小結

監測請求 log app 說明 process class 指定簡單之前的上線流程很簡單粗暴如圖：這簡直是災難性質的，上傳 SVN，在測試服務器上看看正在調試的接口沒問題，直接 sync 到線上服務器。代碼無法回滾，只能覆蓋。而客戶端的同學需要穩當的 api 作為

Struts 2 Spring Hibernate三大框架的執行流程以及原理

freemark 步驟二維 ring logs spa att spring 添加轉:http://www.cnblogs.com/System-out-println/p/5974113.html Struts2框架一、簡介 Struts2是一個相當強大的Ja

Fixed Partition Memory Management UVALive - 2238 建圖很巧妙 km算法左右頂點個數不等模板以及需要註意的問題求最小權匹配

-1 program push_back 訓練指南 const 完成 ons tin 方法 /** 題目: Fixed Partition Memory Management UVALive - 2238 鏈接：https://vjudge.net/problem/UVA

Hibernate的工作流程以及三種狀態（面試題）

數據庫 delet 垃圾打開 ron 工作流沒有 flush 行數據 Hibernate的工作流程以及三種狀態轉載自：http://www.cnblogs.com/fifiyong/p/6390699.html Hibernate的工作流程： 1. 讀取並解

MapReduce寫程式碼的流程,以及需要繼承的超類

相關推薦