hive 命令整理

阿新 • • 發佈：2019-02-17

啟動

hive

資料庫操作

create database database_name; -- 新建資料庫
creat database if not exists -- 新建資料庫 database_name;
show databases; -- 檢視資料庫
show databases like 'h.*'; -- 檢視資料庫
use default;    --使用哪個資料庫
create table test3 like test2; --只是複製了表結構，並不會複製內容
create table test2 as select name,addr from test1; 
--複製表結構的同時，把內容也複製過來了，需要執行mapreduce
show tables;  --檢視該資料庫中的所有表
show tables  ‘*t*’;    --支援模糊查詢
SHOW TABLES IN DbName; --檢視指定資料庫中的所有表
describe formatted(可選) tab_name;  --查看錶的結構及表的路徑
describe database database_name; --檢視資料庫的描述及路徑
creat database database_name location '路徑';   --修改資料庫的路徑
drop database if 
 exists database_name; --刪除空的資料庫
drop database if exists database_name cascade; --先刪除資料庫中的表再刪除資料庫
show partitions t1;   --查看錶有哪些分割槽 
alter table table_name rename to another_name;   --修改表名
drop table t1 CASCADE(可選，忽略錯誤);      --刪除表t1
drop table if exists CASCADE --刪除資料庫的時候，不允許刪除有資料的資料庫，如果資料庫裡面有資料則會報錯。如果要忽略這些內容，則在後面增加CASCADE 
關鍵字，則忽略報錯，刪除資料庫。 t1;--如果存在表t1，刪除表t1
load data inpath '/root/inner_table.dat' into table t1;   --移動hdfs中資料到t1表中
load data local inpath '/root/inner_table.dat' into table t1;  --上傳本地資料到hdfs中
!ls;  --查詢當前linux資料夾下的檔案
dfs -ls /; --查詢當前hdfs檔案系統下  '/'目錄下的檔案
set hive.cli.print.current.db=true;  --顯示地展示當前使用的資料庫
set hive.cli.print.header=true; --Hive顯示列頭

匯入

向管理表中載入資料：
Hive沒有行級別的插入、刪除、更新的操作，那麼往表裡面裝資料的唯一的途徑就是使用一種“大量”的資料裝載操作，或者僅僅將檔案寫入到正確的目錄下面。
overwrite關鍵字：
    load data local inpath '${env:HOME}/目錄'
    overwrite(可選) into table table_name
    partition (分割槽)；
-- 如果沒有使用overwrite，則會再拷貝一份資料，不會覆蓋原來的資料。

匯出

hadoop fs -cp source_path target_path
insert……directory……
e.g insert overwrite local directory '/tmp/目錄'     -- 這裡指定的路徑也可以是全URL路徑

退出

quit;     --退出hive
exit;    --exit會影響之前的使用，所以需要下一句kill掉hadoop的程序
hadoop job -kill jobid

檔案執行hive SQL

-- 控制檯執行
hive -f sql_path;
e.g hive -f /path/to/file/xxxx.hql;
-- hive shell 執行
source sql_path;
e.g source /path/to/file/test.sql;
-- 一次使用命令
hive -e "SQL語句"；
e.g.  $ hive -e "select * from mytable limit 3";

建表語句

CREATE [TEMPORARY] [EXTERNAL] TABLE [IF NOT EXISTS] [db_name.]table_name    -- (Note: TEMPORARY available in Hive 0.14.0 and later)
  [(col_name data_type [COMMENT col_comment], ...)]
  [COMMENT table_comment]
  [PARTITIONED BY (col_name data_type [COMMENT col_comment], ...)]
  [CLUSTERED BY (col_name, col_name, ...) [SORTED BY (col_name [ASC|DESC], ...)] INTO num_buckets BUCKETS]
  [SKEWED BY (col_name, col_name, ...)                  -- (Note: Available in Hive 0.10.0 and later)]
     ON ((col_value, col_value, ...), (col_value, col_value, ...), ...)
     [STORED AS DIRECTORIES]
  [
   [ROW FORMAT row_format] 
   [STORED AS file_format]
     | STORED BY 'storage.handler.class.name' [WITH SERDEPROPERTIES (...)]  -- (Note: Available in Hive 0.6.0 and later)
  ]
  [LOCATION hdfs_path]
  [TBLPROPERTIES (property_name=property_value, ...)]   -- (Note: Available in Hive 0.6.0 and later)
  [AS select_statement];   -- (Note: Available in Hive 0.5.0 and later; not supported for external tables)

查詢表資料

hive> select * from employees;
OK
tony    1338    ["a1","a2","a3"]        {"k1":1.0,"k2":2.0,"k3":3.0}    {"street":"s1","city":"s2","state":"s3","zip":4}
mark    5453    ["a4","a5","a6"]        {"k4":4.0,"k5":5.0,"k6":6.0}    {"street":"s4","city":"s5","state":"s6","zip":6}
ivy     323     ["a7","a8","a9"]        {"k7":7.0,"k8":8.0,"k9":9.0}    {"street":"s7","city":"s8","state":"s9","zip":9}
Time taken: 10.204 seconds, Fetched: 3 row(s)

查樹組
hive> select subordinates[1]  from employees;
Total MapReduce CPU Time Spent: 2 seconds 740 msec
OK
a2
a5
a8
查map
hive> select deductions["k2"]  from employees;

OK
2.0
NULL
NULL
Time taken: 75.812 seconds, Fetched: 3 row(s)

查結構體
hive> select address.city  from employees;
Total MapReduce CPU Time Spent: 2 seconds 200 msec
OK
s2
s5
s8
Time taken: 75.311 seconds, Fetched: 3 row(s)

select * 不執行mapreduce，只進行一個本地的查詢。
而select 某個欄位生成一個job，執行mapreduce。

執行

nohup hive -f insert.sql >log.log &

hive 命令整理

啟動 hive 資料庫操作 create database database_name; -- 新建資料庫 creat database if not exists -- 新建資料

Hive基本命令整理

建立表： hive> CREATE TABLE pokes (foo INT, bar STRING); Creates a table called pokes with two columns, the first being an intege

HIVE與mysql的關係 hive常用命令整理 hive與hdfs整合過程

轉：https://my.oschina.net/winHerson/blog/190131 二、hive常用命令 1. 開啟行轉列功能之後: set hive.cli.print.header=true; // 列印列名 set hive.cli.print.row.to.vertical=true; /

Git使用：安裝，使用及常用命令整理

reset short 配置文件 res 命名 nbsp class 名詞如果對於程序猿而言，git是最常接觸的工具之一，因此需要熟練快速掌握其技巧。 git安裝： windwos：【原創】Windows平臺下Git的安裝與配置 Ubuntu：git與github在

git常用命令整理

align enter style git常用命令 com branch commit ast 添加 git常用命令整理查看當前分支：git branch 切換分支：git checkout ****（分支名）創建分支：git branch ****（分支名）刪

salt 常用命令整理

test rm -rf source zip 表達執行cmd root function ons salt 常用命令整理 ***********模塊*********** 查看模塊列表module salt ‘minion‘ sys.list_modules

linux的網絡命令整理更新中

net-tools 與 iproute包linux的網絡命令整理更新中1.安裝包：net-tools 主要命令: netstat , ifconfig , route , iptunneliproute 主要命令: ss , ip addr , ip route , ip tunnel 2.net-t

Redis學習筆記（三）常用命令整理

mes ember nbsp end 插入學習筆記頻道 hash value Redis 常用命令 1.DEL key 刪除key2.EXISTS key 檢查key是否存在3.KEYS * 查看所有的key4.EXPIRE key seconds 設置key的過期時

linux常用命令整理（五）：shell基礎

程序猿逆向多條希望正則表達 group 運行 ls命令交互式大家好，我是會唱歌的程序猿～～～～～～最近在學習linux，閑暇之余就把這些基本的命令進行了整理，希望大家能用的上，整理的的目的是在忘了的時候翻出來看看^?_?^，前後一共分為五個部分

ADB 基本命令整理

ips mman rip fault radio content rtt removes indent What Is ADB Android debug bridge is a command line tool that lets you communicate

安裝atlas後執行hive命令報錯

repeat log color bug mage client img sof atl 在集群中安裝atlas，在安裝atlas的節點上執行hive -e "show databases;" 正常，但是在集群中其他節點上執行hive -e "show database

Linux常用命令整理

remove 開頭容量 mina 顯示刪除目錄用戶移動文件 dir 　　這裏的常用命令指的是編程c/c++與shell程序常用到的linux命令。　　8/24/2017 整理一遍常用命令，希望提高Linux編程的效率正文如下： cd指令切換文件夾到指定

Linux監控命令整理（top,free,vmstat,iostat,mpstat,sar,netstat）

指令 res 時間信息 bin 禁止 1.3 硬盤 bre 核心 1.1 top 1.1.1 命令說明 Top 命令能夠實時監控系統的運行狀態，並且可以按照cpu、內存和執行時間進行排序 1.1.2 參數說明命令行啟動參數：用法: top -hv | -bcis

redis常用命令整理

key hello eight 不能時間否則 round spa 是否 key： DEL：刪除給定的一個或多個 key ，返回值：被刪除 key 的數量。 EXISTS：檢查給定 key 是否存在，返回值：若 key 存在，

linux系統配置常用命令整理

sta 字母 port 內存大小查看內存四十七 mes memfree 監聽一、 cat /proc/cpuinfo |grep "model name" && cat /proc/cpuinfo |grep "phys

git命令整理備忘

git命令 ant xxx over set data- pan jad 回滾 git命令整理備忘參考https://www.liaoxuefeng.com/wiki/0013739516305929606dd18361248578c67b8067c8c017b000

git 命令整理

文件管理 commit 文件名 nbsp 管理需要推送多個 new 一、git branch:1、創建本地分支 local_branch git branch local_branch2、切換到分支local_branch git checkout lo

Hive筆記整理（一）

大數據 Hive [TOC] Hive筆記整理（一） Hive Hive由facebook貢獻給Apache，是一款建立在Hadoop之上的數據倉庫的基礎框架。數據倉庫特點——關於存放在數據倉庫中的數據的說明：是能夠為企業的各個級別的決策提供數據支撐的數據其實說白了，就是一個存放數據

Hive筆記整理（二）

大數據 Hive [TOC] Hive筆記整理（二） Hive中表的分類 managed_table—受控表、管理表、內部表表中的數據的生命周期/存在與否，受到了表結構的影響，當表結構被刪除的，表中的數據隨之一並被刪除。默認創建的表就是這種表。可以在cli中通過desc extended t

Hive筆記整理（三）

大數據 Hive [TOC] Hive筆記整理（三） Hive的函數 Hive函數分類函數的定義和java、mysql一樣，有三種。 UDF（User Definition Function 用戶定義函數）一路輸入，一路輸出 sin(30°)=1/2 UDAF（User Definition A

hive 命令整理

相關推薦