Importing Data into Hive Tables (Five Ways)
阿新 • Published: 2018-12-04
Summary:
Hive offers five ways to import data into a table:
①: load data: with the local keyword the source file is copied from the local filesystem; without it the file is moved from its HDFS location into the table directory. The overwrite keyword replaces the data already in the table; otherwise the rows are appended.
②: insert: either insert ... values(...) or insert ... select
③: as select / like: as select copies the data as well; like copies only the table structure
④: location: first upload the data to HDFS, then point the table at the directory containing the file with the location clause
⑤: import: loads data that was previously produced by export
load:
> load data [local] inpath '/opt/module/datas/student.txt' [overwrite] into table student [partition (partcol1=val1, …)];
(1) load data: loads data into the table
(2) local: load from the local filesystem; otherwise load from HDFS
(3) inpath: the path of the data to load
(4) overwrite: overwrite the data already in the table; otherwise append
(5) into table: which table to load into
(6) student: the specific table name
(7) partition: load into the specified partition
1. Create a table and load data into it from the local filesystem:

> create table if not exists stu4(id int, name string)
row format delimited fields terminated by '\t';
> select * from stu4;
+----------+------------+--+
| stu4.id  | stu4.name  |
+----------+------------+--+
+----------+------------+--+
> load data local inpath '/opt/module/hive/stu.txt' into table stu4;
> select * from stu4;
+----------+------------+--+
| stu4.id  | stu4.name  |
+----------+------------+--+
| 1001     | zhangfei   |
| 1002     | liubei     |
| 1003     | guanyu     |
| 1004     | zhaoyun    |
| 1005     | caocao     |
| 1006     | zhouyu     |
+----------+------------+--+

2. Create a table and load data into it from HDFS:

> create table if not exists stu5(id int, name string)
row format delimited fields terminated by '\t';
> !sh hadoop fs -put /opt/module/hive/stu.txt /stu.txt
> select * from stu5;
+----------+------------+--+
| stu5.id  | stu5.name  |
+----------+------------+--+
+----------+------------+--+
> load data inpath '/stu.txt' into table stu5;
> select * from stu5;
+----------+------------+--+
| stu5.id  | stu5.name  |
+----------+------------+--+
| 1001     | zhangfei   |
| 1002     | liubei     |
| 1003     | guanyu     |
| 1004     | zhaoyun    |
| 1005     | caocao     |
| 1006     | zhouyu     |
+----------+------------+--+

3. Load data, overwriting the data already in the table:

> select * from stu5;
+----------+------------+--+
| stu5.id  | stu5.name  |
+----------+------------+--+
| 1001     | zhangfei   |
| 1002     | liubei     |
| 1003     | guanyu     |
| 1004     | zhaoyun    |
| 1005     | caocao     |
| 1006     | zhouyu     |
+----------+------------+--+
> load data local inpath '/opt/module/hive/stu2.txt' overwrite into table stu5;
> select * from stu5;
+----------+------------+--+
| stu5.id  | stu5.name  |
+----------+------------+--+
| 1001     | zhangfei   |
| 1002     | liubei     |
| 1003     | guanyu     |
+----------+------------+--+
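The load syntax also accepts a partition clause, which none of the demos above use. A minimal sketch of loading a file straight into one partition (the table name stu_p and the paths are illustrative, reusing the data file from the demos):

```sql
-- hypothetical partitioned table; names and paths are illustrative
create table if not exists stu_p(id int, name string)
partitioned by (month string)
row format delimited fields terminated by '\t';

-- load the local file directly into the month='12' partition
load data local inpath '/opt/module/hive/stu.txt'
into table stu_p partition (month = '12');
```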
insert:
1. Create a partitioned table and insert some rows into it:
> create table stu6(id int,name string)
partitioned by (month string)
row format delimited
fields terminated by '\t';
> insert into table stu6 partition(month = '12') values(1001,'zhangfei'),(1002,'liubei');
0: jdbc:hive2://hadoop108:10000> select * from stu6;
+----------+------------+-------------+--+
| stu6.id | stu6.name | stu6.month |
+----------+------------+-------------+--+
| 1001 | zhangfei | 12 |
| 1002 | liubei | 12 |
+----------+------------+-------------+--+
2. Insert data based on the result of a select:
0: jdbc:hive2://hadoop108:10000> select * from stu6;
+----------+------------+-------------+--+
| stu6.id | stu6.name | stu6.month |
+----------+------------+-------------+--+
| 1001 | zhangfei | 12 |
| 1002 | liubei | 12 |
+----------+------------+-------------+--+
> insert overwrite table stu6 partition(month = '12') select id,name from stu_par1 where month = '12';
0: jdbc:hive2://hadoop108:10000> select * from stu6;
+----------+------------+-------------+--+
| stu6.id | stu6.name | stu6.month |
+----------+------------+-------------+--+
| 1001 | zhangfei | 12 |
| 1002 | liubei | 12 |
| 1003 | guanyu | 12 |
| 1004 | zhaoyun | 12 |
| 1005 | caocao | 12 |
| 1006 | zhouyu | 12 |
+----------+------------+-------------+--+
The overwrite keyword replaced the data previously in the month='12' partition:
3. Multi-insert mode (one scan of the source table feeds several inserts):
from stu_par1
insert overwrite table stu6 partition(month = '11')
select id,name where month = '11'
insert overwrite table stu6 partition(month = '10')
select id,name where month = '10';
0: jdbc:hive2://hadoop108:10000> select * from stu6;
+----------+------------+-------------+--+
| stu6.id | stu6.name | stu6.month |
+----------+------------+-------------+--+
| 1001 | zhangfei | 10 |
| 1002 | liubei | 10 |
| 1003 | guanyu | 10 |
| 1004 | zhaoyun | 10 |
| 1005 | caocao | 10 |
| 1006 | zhouyu | 10 |
| 1001 | zhangfei | 11 |
| 1002 | liubei | 11 |
| 1003 | guanyu | 11 |
| 1004 | zhaoyun | 11 |
| 1005 | caocao | 11 |
| 1006 | zhouyu | 11 |
| 1001 | zhangfei | 12 |
| 1002 | liubei | 12 |
| 1003 | guanyu | 12 |
| 1004 | zhaoyun | 12 |
| 1005 | caocao | 12 |
| 1006 | zhouyu | 12 |
+----------+------------+-------------+--+
Create a table and load data in one step (as select):
create table if not exists stu7
as select id,name from stu1;
0: jdbc:hive2://hadoop108:10000> select * from stu7;
+----------+------------+--+
| stu7.id | stu7.name |
+----------+------------+--+
| 1001 | zhangfei |
| 1002 | liubei |
| 1003 | guanyu |
| 1004 | zhaoyun |
| 1005 | caocao |
| 1006 | zhouyu |
+----------+------------+--+
6 rows selected (0.149 seconds)
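The summary also mentions like, which, unlike as select, copies only the table structure and no data. A minimal sketch (the table name stu7_like is illustrative):

```sql
-- copy only the schema of stu7; the new table starts out empty
create table if not exists stu7_like like stu7;
```

A select * from stu7_like afterwards would return no rows, since like carries over the column definitions but none of the data.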
location:
1. HDFS already contains the following: the directory /ex, which holds the file stu.txt
create external table stu_ex2(id int,name string)
row format delimited
fields terminated by '\t'
location '/ex';
0: jdbc:hive2://hadoop108:10000> select * from stu_ex2;
+-------------+---------------+--+
| stu_ex2.id | stu_ex2.name |
+-------------+---------------+--+
| 1001 | zhangfei |
| 1002 | liubei |
| 1003 | guanyu |
| 1004 | zhaoyun |
| 1005 | caocao |
| 1006 | zhouyu |
+-------------+---------------+--+
6 rows selected (0.093 seconds)
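For completeness, the upload step that places stu.txt under /ex before the table is created might look like this, using the same !sh beeline escape as the load section (paths are illustrative):

```sql
-- create the target directory on HDFS and upload the data file into it
!sh hadoop fs -mkdir -p /ex
!sh hadoop fs -put /opt/module/hive/stu.txt /ex
```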
import:
import can only load data that was previously exported with export:
1. Export the table's data to HDFS:
export table stu1 to '/export/data/stu1';
0: jdbc:hive2://hadoop108:10000> !sh hadoop fs -ls /export/data/stu1
Found 2 items
-rwxr-xr-x 3 isea supergroup 1329 2018-12-01 19:38 /export/data/stu1/_metadata
drwxr-xr-x - isea supergroup 0 2018-12-01 19:38 /export/data/stu1/data
The stu1 directory now contains two entries: the metadata file _metadata, and the actual data stored under the data directory.
2. Import the data on HDFS into stu8:
0: jdbc:hive2://hadoop108:10000> show tables;
+------------------------+--+
| tab_name |
+------------------------+--+
| stu1 |
| stu2 |
| stu3 |
| stu4 |
| stu5 |
| stu6 |
| stu7 |
| stu_ex1 |
| stu_ex2 |
| stu_par1 |
| stu_par2 |
| values__tmp__table__1 |
+------------------------+--+
0: jdbc:hive2://hadoop108:10000> import table stu8 from '/export/data/stu1';
0: jdbc:hive2://hadoop108:10000> select * from stu8;
+----------+------------+--+
| stu8.id | stu8.name |
+----------+------------+--+
| 1001 | zhangfei |
| 1002 | liubei |
| 1003 | guanyu |
| 1004 | zhaoyun |
| 1005 | caocao |
| 1006 | zhouyu |
+----------+------------+--+