[Big Data] Series No. 19: HBase Shell and API CRUD, plus Reading and Writing HBase from MapReduce
By 阿新 · Published 2019-01-01
This post covers:
1. Common HBase shell commands
2. CRUD operations on HBase through the Java API
3. Reading data from HBase in MapReduce, counting value occurrences, and writing the counts back to HBase
Launching the HBase shell
[[email protected] ~]#/home/softs/hbase-0.98.12.1-hadoop2/bin/hbase shell
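Once inside the shell, status and version are standard built-in commands worth running first to confirm the client can actually reach the cluster:

hbase(main):001:0> status    # live/dead region servers and average load
hbase(main):002:0> version   # HBase version the cluster is running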
Create a table:   create 'test', 'cf'                            # 'test' is the table name, 'cf' the column family
Scan a table:     scan 'test'
Insert data:      put 'test', 'row1', 'cf:username', 'value1'    # 'row1' is the unique row identifier, 'username' the column qualifier
List tables:      list                                           # lists table names; use get/scan to read data
Get by rowkey:    get 'test', 'row1'
Drop a table:     disable 'test' then drop 'test'                # a table must be disabled before it can be dropped
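The list above removes data only by dropping the whole table; for deleting individual data the standard shell commands are delete (one cell) and deleteall (a whole row):

delete 'test', 'row1', 'cf:username'    # remove a single cell
deleteall 'test', 'row1'                # remove the entire row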
hbase(main):001:0> scan 'user'
ROW                COLUMN+CELL
 2                 column=col1:count, timestamp=1530214622282, value=1
 userId1           column=col1:age, timestamp=1530201359347, value=2
 userId1           column=col1:name, timestamp=1530203162657, value=xiaohong
 userId1           column=col2:age, timestamp=1530203197562, value=33
 userId1           column=col2:name, timestamp=1530201359347, value=\xE5\xB0\x8F\xE7\xBA\xA2
2 row(s) in 0.3550 seconds

(The value \xE5\xB0\x8F\xE7\xBA\xA2 is the UTF-8 encoding of 小紅; the shell prints non-ASCII bytes as hex escapes.)
Cluster information can also be checked in the HBase master Web UI (on HBase 0.98 it listens on port 60010 of the master by default).
Operating HBase through the Java API
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.Cell;
import org.apache.hadoop.hbase.CellUtil;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.HColumnDescriptor;
import org.apache.hadoop.hbase.HTableDescriptor;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Delete;
import org.apache.hadoop.hbase.client.Get;
import org.apache.hadoop.hbase.client.HBaseAdmin;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.ResultScanner;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.filter.CompareFilter;
import org.apache.hadoop.hbase.filter.SingleColumnValueFilter;
import org.junit.After;
import org.junit.Before;
import org.junit.Test;

public class HbaseCrud {

    HBaseAdmin hbase;
    HTable table;
    String user = "user";
    String col1 = "col1";
    String col2 = "col2";

    @Before
    public void before() throws Exception {
        /* HBaseConfiguration.create() loads hbase-default.xml and hbase-site.xml;
         * a bare new Configuration() would miss the HBase defaults. */
        Configuration configuration = HBaseConfiguration.create();
        /* Point the client at the ZooKeeper quorum so it can locate the cluster. */
        configuration.set("hbase.zookeeper.quorum", "master,node1,node2");
        /* Open the admin and table handles. (String.getBytes() uses the platform
         * default charset; Bytes.toBytes() is the charset-safe HBase idiom.) */
        hbase = new HBaseAdmin(configuration);
        table = new HTable(configuration, user.getBytes());
    }

    @After
    public void end() throws Exception {
        if (hbase != null) {
            hbase.close();
        }
        if (table != null) {
            table.close();
        }
    }

    @Test
    public void createTable() throws Exception {
        if (hbase.tableExists(user.getBytes())) {
            /* A table must be disabled before it can be deleted. */
            hbase.disableTable(user.getBytes());
            hbase.deleteTable(user.getBytes());
        }
        HTableDescriptor descriptor = new HTableDescriptor(TableName.valueOf(user));
        /* Column families must be declared when the table is created. */
        HColumnDescriptor columnDescriptor = new HColumnDescriptor(col1.getBytes());
        /* Give this family in-memory priority in the block cache. */
        columnDescriptor.setInMemory(true);
        descriptor.addFamily(columnDescriptor);
        HColumnDescriptor columnDescriptor2 = new HColumnDescriptor(col2.getBytes());
        /* This family stays at normal block-cache priority. */
        columnDescriptor2.setInMemory(false);
        descriptor.addFamily(columnDescriptor2);
        hbase.createTable(descriptor);
    }

    @Test
    public void insertUser() throws Exception {
        /* The rowkey uniquely identifies the row. */
        String rowKey = "userId1";
        Put put = new Put(rowKey.getBytes());
        put.add(col1.getBytes(), "name".getBytes(), "小石頭".getBytes());
        put.add(col1.getBytes(), "age".getBytes(), "2".getBytes());
        put.add(col2.getBytes(), "name".getBytes(), "小紅".getBytes());
        table.put(put);
    }

    @Test
    public void deleteUser() throws Exception {
        /* Delete a single cell of the given rowkey. */
        Delete delete = new Delete("userId1".getBytes());
        delete.deleteColumn(col1.getBytes(), "name".getBytes());
        table.delete(delete);
    }

    @Test
    public void getByUserId() throws Exception {
        /* Point lookup by rowkey. */
        Get get = new Get("userId1".getBytes());
        /* Restrict which columns come back. */
        get.addColumn(col1.getBytes(), "age".getBytes());
        Result result = table.get(get); // a single row
        Cell cell = result.getColumnLatestCell(col1.getBytes(), "age".getBytes());
        System.out.println(new String(CellUtil.cloneValue(cell)));
    }

    @Test
    public void listUsers() throws Exception {
        /*
         * Scan returns multiple rows. Avoid full-table scans where possible:
         * 1. bound the scan with a start and stop rowkey
         * 2. use filters sparingly -- they still read every row in the range
         */
        Scan scan = new Scan();
        scan.setStartRow("userId0".getBytes());
        scan.setStopRow("userId3".getBytes());
        // Filter condition: col1:age == "2"
        SingleColumnValueFilter filter1 = new SingleColumnValueFilter(
                col1.getBytes(), "age".getBytes(),
                CompareFilter.CompareOp.EQUAL, "2".getBytes());
        scan.setFilter(filter1);
        ResultScanner results = table.getScanner(scan);
        results.forEach(result -> {
            System.out.print(new String(result.getValue(col1.getBytes(), "name".getBytes())) + "\t");
            System.out.println(new String(result.getValue(col1.getBytes(), "age".getBytes())));
        });
        results.close(); // release the scanner's server-side resources
    }
}
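The class above uses the 0.98-era client API (HBaseAdmin and the HTable constructor), matching the cluster version in this series; from HBase 1.0 on those entry points are deprecated in favor of ConnectionFactory. A minimal sketch of the newer idiom, assuming the same 'user' table and ZooKeeper quorum (the class name HbaseConnectionSketch is just for illustration):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Admin;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;
import org.apache.hadoop.hbase.client.Table;

public class HbaseConnectionSketch {
    public static void main(String[] args) throws Exception {
        Configuration conf = HBaseConfiguration.create();
        conf.set("hbase.zookeeper.quorum", "master,node1,node2");
        /* try-with-resources closes the table, admin and connection handles. */
        try (Connection connection = ConnectionFactory.createConnection(conf);
             Admin admin = connection.getAdmin();
             Table table = connection.getTable(TableName.valueOf("user"))) {
            System.out.println("user table exists: "
                    + admin.tableExists(TableName.valueOf("user")));
        }
    }
}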
Integrating HBase with MapReduce: read rows from HBase, count how often each value occurs, and write the counts back
The job class
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;

public class WCJob {
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        /* Running locally against the remote cluster. */
        conf.set("fs.defaultFS", "hdfs://master:8020");
        conf.set("hbase.zookeeper.quorum", "master,node1,node2");
        Job job = Job.getInstance(conf);
        job.setJarByClass(WCJob.class);
        /* The Scan defines which HBase rows feed the mapper. */
        Scan scan = new Scan();
        TableMapReduceUtil.initTableMapperJob("user", scan, WCMapper.class,
                Text.class, IntWritable.class, job, false);
        /*
         * The last argument (addDependencyJars) is false because the job runs
         * locally; set it to true when submitting to the cluster.
         */
        TableMapReduceUtil.initTableReducerJob("user", WCReducer.class, job,
                null, null, null, null, false);
        job.waitForCompletion(true);
    }
}
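One tuning note: the empty Scan above uses the client defaults, which are meant for interactive reads rather than batch jobs. The HBase reference guide's usual advice for MapReduce scans looks like this (the caching value of 500 is illustrative, not from the original post):

Scan scan = new Scan();
scan.setCaching(500);        // rows fetched per RPC; larger batches cut round-trips on a full scan
scan.setCacheBlocks(false);  // an MR scan reads each block once, so don't fill the region server block cache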
The mapper class: read each row's col1:age value and emit it with a count of 1
import java.io.IOException;

import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.TableMapper;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;

public class WCMapper extends TableMapper<Text, IntWritable> {
    /* map() is called once per row returned by the scan. */
    @Override
    protected void map(ImmutableBytesWritable key, Result value, Context context)
            throws IOException, InterruptedException {
        byte[] age = value.getValue("col1".getBytes(), "age".getBytes());
        if (age == null) {
            /* Skip rows without a col1:age cell, such as the count rows the
             * reducer writes back into this same table. */
            return;
        }
        context.write(new Text(new String(age)), new IntWritable(1));
    }
}

The reducer class: sum the counts and write the result back to HBase
import java.io.IOException;

import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.TableReducer;
import org.apache.hadoop.hbase.util.Bytes;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.Text;

/*
 * Text and IntWritable must match the mapper's output key/value types.
 */
public class WCReducer extends TableReducer<Text, IntWritable, ImmutableBytesWritable> {
    @Override
    protected void reduce(Text key, Iterable<IntWritable> values, Context context)
            throws IOException, InterruptedException {
        int num = 0;
        for (IntWritable in : values) {
            num++;
        }
        /* Use the age value as the rowkey and write the count back to HBase.
         * Text.getBytes() may return extra bytes past getLength(), so convert
         * through the String instead. */
        Put put = new Put(Bytes.toBytes(key.toString()));
        put.add("col1".getBytes(), "count".getBytes(), Bytes.toBytes(String.valueOf(num)));
        context.write(null, put);
    }
}
Checking the results: scanning the table again shows the counts were written back correctly.
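For a quick check from the shell, fetch the count row directly. In the scan output shown earlier, the single userId1 row has col1:age=2, so the job writes a count of 1 under rowkey '2':

hbase(main):002:0> get 'user', '2'
COLUMN                CELL
 col1:count           timestamp=1530214622282, value=1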