hadoop-mapreduce-examples: the Hadoop example programs
[[email protected] hadoop-3.1.1]# bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-3.1.1.jar
An example program must be given as the first argument.
Valid program names are:
aggregatewordcount: An Aggregate based map/reduce program that counts the words in the input files.
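aggregatewordhist: An Aggregate based map/reduce program that computes the histogram of the words in the input files.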
bbp: A map/reduce program that uses Bailey-Borwein-Plouffe to compute exact digits of Pi.
dbcount: An example job that count the pageview counts from a database.
distbbp: A map/reduce program that uses a BBP-type formula to compute exact bits of Pi.
grep: A map/reduce program that counts the matches of a regex in the input.
join: A job that effects a join over sorted, equally partitioned datasets
multifilewc: A job that counts words from several files.
pentomino: A map/reduce tile laying program to find solutions to pentomino problems.
pi: A map/reduce program that estimates Pi using a quasi-Monte Carlo method.
randomtextwriter: A map/reduce program that writes 10GB of random textual data per node.
randomwriter: A map/reduce program that writes 10GB of random data per node.
secondarysort: An example defining a secondary sort to the reduce.
sort: A map/reduce program that sorts the data written by the random writer.
sudoku: A sudoku solver.
teragen: Generate data for the terasort
terasort: Run the terasort
teravalidate: Checking results of terasort
wordcount: A map/reduce program that counts the words in the input files.
wordmean: A map/reduce program that counts the average length of the words in the input files.
wordmedian: A map/reduce program that counts the median length of the words in the input files.
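The last three entries (wordcount, wordmean, wordmedian) all follow the same pattern: a map step emits key/value pairs per word and a reduce step sums the values per key. The following is a minimal local sketch of that pattern in Python, not the Java code shipped in the jar. The function names are hypothetical, splitting on whitespace is an assumption about tokenization, and the input is assumed to contain at least one word.

```python
from collections import Counter


def reduce_counts(pairs):
    """Reduce step: sum the values of all pairs that share a key."""
    totals = Counter()
    for key, value in pairs:
        totals[key] += value
    return dict(totals)


def word_count(lines):
    """wordcount: the map step emits (word, 1) for every token."""
    pairs = [(word, 1) for line in lines for word in line.split()]
    return reduce_counts(pairs)


def word_mean(lines):
    """wordmean: the map step emits ("count", 1) and ("length", len(word))."""
    pairs = []
    for line in lines:
        for word in line.split():
            pairs.append(("count", 1))
            pairs.append(("length", len(word)))
    totals = reduce_counts(pairs)
    return totals["length"] / totals["count"]


def word_median(lines):
    """wordmedian: the map step emits (len(word), 1); the median is read
    from the per-length totals without sorting the words themselves."""
    by_length = reduce_counts(
        (len(word), 1) for line in lines for word in line.split()
    )
    total = sum(by_length.values())
    # 0-based positions of the one or two middle elements.
    lo_idx, hi_idx = (total - 1) // 2, total // 2
    seen, lo, hi = 0, None, None
    for length in sorted(by_length):
        seen += by_length[length]
        if lo is None and seen > lo_idx:
            lo = length
        if seen > hi_idx:
            hi = length
            break
    return (lo + hi) / 2


if __name__ == "__main__":
    sample = ["the quick brown fox", "the lazy dog"]
    print(word_count(sample))
    print(word_mean(sample))
    print(word_median(sample))
```

Because word_median only needs the number of words per length, the reduce output stays small no matter how large the input is.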
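In plain terms, the valid program names are:
aggregatewordcount: an aggregate-based map/reduce program that counts the words in the input files.
aggregatewordhist: an aggregate-based map/reduce program that computes a histogram of the words in the input files.
bbp: a map/reduce program that uses the Bailey-Borwein-Plouffe formula to compute exact digits of Pi.
dbcount: an example job that counts page views from records stored in a database.
distbbp: a map/reduce program that uses a BBP-type formula to compute exact bits of Pi.
grep: a map/reduce program that counts the matches of a regular expression in the input.
join: a job that performs a join over sorted, equally partitioned datasets.
multifilewc: a job that counts words across several files.
pentomino: a map/reduce tile-laying program that searches for solutions to pentomino problems.
pi: a map/reduce program that estimates Pi using a quasi-Monte Carlo method.
randomtextwriter: a map/reduce program that writes 10GB of random text data per node.
randomwriter: a map/reduce program that writes 10GB of random data per node.
secondarysort: an example that defines a secondary sort for the reduce phase.
sort: a map/reduce program that sorts the data written by the random writer.
sudoku: a sudoku solver.
teragen: generates the input data for terasort.
terasort: runs the terasort.
teravalidate: checks the results of terasort.
wordcount: a map/reduce program that counts the words in the input files.
wordmean: a map/reduce program that computes the average length of the words in the input files.
wordmedian: a map/reduce program that computes the median length of the words in the input files.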
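The pi entry in the list above mentions a quasi-Monte Carlo method. The sketch below illustrates the idea in Python under stated assumptions: points come from a two-dimensional Halton sequence (bases 2 and 3), each simulated map task counts how many of its points fall inside a circle inscribed in the unit square, and the reduce step combines the counts. The function names and the way the sequence is split across tasks are hypothetical and may differ from the Java example in the jar.

```python
def halton(index, base):
    """Return element `index` (1-based) of the van der Corput sequence in `base`."""
    result, fraction = 0.0, 1.0 / base
    while index > 0:
        index, digit = divmod(index, base)
        result += digit * fraction
        fraction /= base
    return result


def map_count_inside(offset, n_samples):
    """Map step: test n_samples points, starting after `offset` in the sequence,
    against the circle of radius 0.5 centred in the unit square."""
    inside = 0
    for i in range(offset + 1, offset + n_samples + 1):
        x = halton(i, 2) - 0.5
        y = halton(i, 3) - 0.5
        if x * x + y * y <= 0.25:
            inside += 1
    return inside


def estimate_pi(n_maps, n_samples):
    """Reduce step: sum the per-map counts and scale by the area ratio.
    circle area / square area = pi / 4, so pi is about 4 * inside / total."""
    inside = sum(
        map_count_inside(m * n_samples, n_samples) for m in range(n_maps)
    )
    return 4.0 * inside / (n_maps * n_samples)


if __name__ == "__main__":
    print(estimate_pi(10, 1000))
```

Giving each simulated map task its own offset keeps the tasks from sampling the same points, so adding tasks improves the estimate instead of repeating work.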