hive 壓縮最終結果中間結果

阿新 • • 發佈：2019-02-06

1.hive壓縮

hive>set mapred.output.compress=true;

hive> set mapred.compress.map.output=true;
hive> set hive.exec.compress.output=true;
hive> set mapred.map.output.compression.codec=org.apache.hadoop.io.compress.BZip2Codec;
hive> set hive.exec.compress.intermediate=true;
hive> set io.compression.codecs=org.apache.hadoop.io.compress.BZip2Codec;
hive> SET io.seqfile.compression.type=BLOCK;

最後hive表資料是.bz2字尾

奇怪現象true false引數在sql指令碼中使用可以起作用，而mapred.map.output.compression.codec不起作用，需要在hive的xml中配置。

2.mapreduce壓縮

conf.setBoolean("mapred.output.compress", true);
conf.setClass("mapred.output.compression.codec", BZip2Codec.class, CompressionCodec.class);

壓縮後有字尾

3.hive壓縮後的表，可以用使用sql+python呼叫，資料會自動解壓。

說明：

最終的結果資料開啟壓縮：

<property>
<name>hive.exec.compress.output</name>
<value>true</value>
<description> This controls whether the final outputs of a query (to a local/hdfs file or a hive table) is compressed. The compression codec and other options are determined from hadoop config variables mapred.output.compress* </description>

</property>

中間的結果資料是否壓縮，當sql生成多個MR，最後mr輸出不壓縮，之前MR的結果資料壓縮。
<property>
<name>hive.exec.compress.intermediate</name>
<value>true</value>
<description> This controls whether intermediate files produced by hive between multiple map-reduce jobs are compressed. The compression codec and other options are determined from hadoop config variables mapred.output.compress* </description>
</property>
<property>
<name>hive.intermediate.compression.codec</name>
<value>org.apache.hadoop.io.compress.LzoCodec</value>
</property>

hive 壓縮最終結果中間結果

hive 壓縮最終結果中間結果

git 中間結果合併

【轉】提取caffe前饋的中間結果+逐層視覺化

使用sparkStreaming與Kafka直連方式WordCount,使用redis存放中間結果

mxnet下如何檢視中間結果

Keras框架下輸出模型中間結果

uva1599 bfs雙向遍歷利用陣列儲存中間結果

兩個工具輸出中間結果，計時函數

hive使用beeline將hql結果匯出為csv檔案

hive把hql查詢的結果匯出到本地或者HDFS上面

Hive中將多個查詢結果按行拼接成一張表

hive壓縮

Struts2 結果和結果類型

二維碼url中漢字傳參，導致查詢不到結果，結果為編碼所引起

hadoop hive 壓縮引數測試

php實現非同步方法之一(php對於curl或瀏覽器或ajax請求立即返回結果,返回結果後的php程式碼還能繼續執行)

Hive壓縮說明

Hive壓縮方式設定

Hive壓縮測試

Struts2-結果和結果型別

hive 壓縮 最終結果 中間結果

相關推薦

hive 壓縮最終結果中間結果