
Partitioned Tables, Managed Tables

Create a partitioned table:

create table if not exists china_partition(
ProvinceID int,
ProvinceName string,
CityID int,
CityName string,
ZipCode int,
DistrictID int,
DistrictName string)
partitioned by ( Province string,City string )
row format delimited fields terminated by ','
;
Note: a partition column name must not be the same as a data column name, otherwise the following error is reported:
hive> create table if not exists china_partition(
    > ProvinceID int,
    > ProvinceName string,
    > CityID int,
    > CityName string,
    > ZipCode int,
    > DistrictID int,
    > DistrictName string)
    > partitioned by ( ProvinceName string,CityName string )
    > row format delimited fields terminated by ','
    > ;
FAILED: SemanticException [Error 10035]: Column repeated in partitioning columns
Load data into the partitioned table:
load data local inpath '/home/hadoop/china_data/beijing.txt' into table china_partition partition ( Province='beijing',city='beijing');
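Each province/city combination becomes its own partition, so further files are loaded the same way with a different partition spec. A sketch only; /home/hadoop/china_data/shanghai.txt is a hypothetical file, and only the beijing partition is used in the rest of this walkthrough:

load data local inpath '/home/hadoop/china_data/shanghai.txt' into table china_partition partition ( Province='shanghai',City='shanghai');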

The resulting directory structure on the HDFS file system:
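Hive creates one subdirectory per partition column value under the table's warehouse directory, and each loaded file lands in the matching leaf directory. A sketch of the expected layout (the base path is the Location reported by describe formatted further down):

/user/hive/warehouse/china_partition
└── province=beijing
    └── city=beijing
        └── beijing.txt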


You can also use show partitions to view the partitions:

hive> show partitions china_partition;
Using hive.mapred.mode (valid values: strict, nonstrict)

If hive.mapred.mode is set to strict, an HQL query against a partitioned table must include a WHERE clause with a partition filter, otherwise an error is reported:

hive> select * from china_partition;
FAILED: SemanticException Queries against partitioned tables without a partition filter are disabled for safety reasons. If you know what you are doing, please make sure that hive.strict.checks.large.query is set to false and that hive.mapred.mode is not set to 'strict' to enable them. No partition predicate for Alias "china_partition" Table "china_partition"
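Adding a filter on the partition columns in the WHERE clause satisfies the strict check; a quick sketch:

hive> set hive.mapred.mode=strict;
hive> select * from china_partition where province='beijing' and city='beijing';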

If hive.mapred.mode is set to nonstrict, a partitioned table can be queried without a WHERE clause:

hive> set hive.mapred.mode=nonstrict;
hive> select * from china_partition;
OK
china_partition.provinceid	china_partition.provincename	china_partition.cityid	china_partition.cityname	china_partition.zipcode	china_partition.districtid	china_partition.districtname	china_partition.province	china_partition.city
1	北京市	1	北京市	100000	1	東城區	beijing	beijing
1	北京市	1	北京市	100000	2	西城區	beijing	beijing
1	北京市	1	北京市	100000	3	崇文區	beijing	beijing
1	北京市	1	北京市	100000	4	宣武區	beijing	beijing
1	北京市	1	北京市	100000	5	朝陽區	beijing	beijing
1	北京市	1	北京市	100000	6	豐臺區	beijing	beijing
1	北京市	1	北京市	100000	7	石景山區	beijing	beijing
1	北京市	1	北京市	100000	8	海淀區	beijing	beijing
1	北京市	1	北京市	100000	9	門頭溝區	beijing	beijing
1	北京市	1	北京市	100000	10	房山區	beijing	beijing
1	北京市	1	北京市	100000	11	通州區	beijing	beijing
1	北京市	1	北京市	100000	12	順義區	beijing	beijing
1	北京市	1	北京市	100000	13	昌平區	beijing	beijing
1	北京市	1	北京市	100000	14	大興區	beijing	beijing
1	北京市	1	北京市	100000	15	懷柔區	beijing	beijing
1	北京市	1	北京市	100000	16	平谷區	beijing	beijing
1	北京市	1	北京市	100000	17	密雲縣	beijing	beijing
1	北京市	1	北京市	100000	18	延慶縣	beijing	beijing
Time taken: 0.125 seconds, Fetched: 18 row(s)

If there are a great many partitions, you can also list just part of them by adding a partition spec:
hive> show partitions china_partition partition (province='beijing');
OK
partition
province=beijing/city=beijing
Time taken: 0.18 seconds, Fetched: 1 row(s)
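Partitions can also be created explicitly, without loading any data, using alter table ... add partition; a sketch with a hypothetical tianjin partition:

hive> alter table china_partition add partition (province='tianjin', city='tianjin');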
Using describe formatted table_name also shows the partition information:
hive> describe formatted china_partition;
OK
col_name	data_type	comment
# col_name            	data_type           	comment             
	 	 
provinceid          	int                 	                    
provincename        	string              	                    
cityid              	int                 	                    
cityname            	string              	                    
zipcode             	int                 	                    
districtid          	int                 	                    
districtname        	string              	                    
	 	 
# Partition Information	 	 
# col_name            	data_type           	comment             
	 	 
province            	string              	                    
city                	string              	                    
	 	 
# Detailed Table Information	 	 
Database:           	default             	 
Owner:              	hadoop              	 
CreateTime:         	Tue Apr 25 16:05:55 CST 2017	 
LastAccessTime:     	UNKNOWN             	 
Retention:          	0                   	 
Location:           	hdfs://localhost:9000/user/hive/warehouse/china_partition	 
Table Type:         	MANAGED_TABLE       	 
Table Parameters:	 	 
	transient_lastDdlTime	1493107555          
	 	 
# Storage Information	 	 
SerDe Library:      	org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe	 
InputFormat:        	org.apache.hadoop.mapred.TextInputFormat	 
OutputFormat:       	org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat	 
Compressed:         	No                  	 
Num Buckets:        	-1                  	 
Bucket Columns:     	[]                  	 
Sort Columns:       	[]                  	 
Storage Desc Params:	 	 
	field.delim         	,                   
	serialization.format	,                   
Time taken: 0.065 seconds, Fetched: 38 row(s)
hive> 
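describe formatted can also be given a partition spec to show the details (location, parameters) of a single partition; a sketch:

hive> describe formatted china_partition partition (province='beijing', city='beijing');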
Hive partitioned tables also support many other operations, such as archive, touch, enable no_drop, enable offline, and so on.
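These take the form of alter table statements against a table or partition; a few sketches of what they look like (archive additionally requires hive.archive.enabled=true):

hive> alter table china_partition touch partition (province='beijing', city='beijing');
hive> alter table china_partition archive partition (province='beijing', city='beijing');
hive> alter table china_partition partition (province='beijing', city='beijing') enable no_drop;
hive> alter table china_partition partition (province='beijing', city='beijing') enable offline;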