使用Spark/Java讀取已開啟Kerberos認證的HBase

阿新 • • 發佈：2019-01-16

1.賦予drguo使用者相應的許可權

2.KDC中建立drguo使用者並匯出相應的keytab檔案

[root@bigdata28 ~]# kadmin.local 
Authenticating as principal drguo/admin@AISINO.COM with password.
kadmin.local:  addprinc drguo/bigdata28
WARNING: no policy specified for drguo/bigdata28@AISINO.COM; defaulting to no policy
Enter password for principal "drguo/ 
[email protected]": 
Re-enter password for principal "drguo/[email protected]": 
Principal "drguo/[email protected]" created.
kadmin.local:  xst -norandkey -k /home/drguo/drguo_bigdata28.keytab drguo/bigdata28@AISINO.COM
Entry for principal drguo/bigdata28@AISINO.COM with kvno 1, encryption type aes256-cts-hmac-sha1-96 
 added to keytab WRFILE:/home/drguo/drguo_bigdata28.keytab.
Entry for principal drguo/bigdata28@AISINO.COM with kvno 1, encryption type aes128-cts-hmac-sha1-96 added to keytab WRFILE:/home/drguo/drguo_bigdata28.keytab.
Entry for principal drguo/bigdata28@AISINO.COM with kvno 1, encryption type des3-cbc-sha1 added to keytab WRFILE 
:/home/drguo/drguo_bigdata28.keytab.
Entry for principal drguo/bigdata28@AISINO.COM with kvno 1, encryption type arcfour-hmac added to keytab WRFILE:/home/drguo/drguo_bigdata28.keytab.
Entry for principal drguo/bigdata28@AISINO.COM with kvno 1, encryption type des-hmac-sha1 added to keytab WRFILE:/home/drguo/drguo_bigdata28.keytab.
Entry for principal drguo/bigdata28@AISINO.COM with kvno 1, encryption type des-cbc-md5 added to keytab WRFILE:/home/drguo/drguo_bigdata28.keytab.
kadmin.local:  q

3.將krb5.conf與keytab檔案拷到本地，方便測試

4.使用Spark讀取HBase

package drguo.test

import java.io.IOException

import com.google.protobuf.ServiceException
import dguo.test.HBaseKerb
import org.apache.hadoop.hbase.HBaseConfiguration
import org.apache.hadoop.hbase.client.{HBaseAdmin, HTable}
import org.apache.hadoop.hbase.mapreduce.{TableInputFormat}
import org.apache.hadoop.hbase.util.Bytes
import org.apache.hadoop.security.UserGroupInformation
import org.apache.spark.{SparkConf, SparkContext}

/**
  * Created by drguo on 2018/7/18.
  */
object SparkExecHBase {


  def main(args: Array[String]): Unit = {
//    HBaseKerb.getAllRows("XMJZ")
    System.setProperty("java.security.krb5.conf", "d:/krb5.conf")
    val sparkConf = new SparkConf().setAppName("SparkExecHBase").setMaster("local")
    val sc = new SparkContext(sparkConf)

    val conf = HBaseConfiguration.create()
    conf.set(TableInputFormat.INPUT_TABLE, "XMJZ")
    conf.set("hbase.zookeeper.quorum","172.19.6.28,172.19.6.29,172.19.6.30")
    conf.set("hbase.zookeeper.property.clientPort", "2181")
    conf.set("hadoop.security.authentication", "Kerberos")

    UserGroupInformation.setConfiguration(conf)
    try {
      UserGroupInformation.loginUserFromKeytab("drguo/[email protected]", "d:/drguo_bigdata28.keytab")
      HBaseAdmin.checkHBaseAvailable(conf)
    } catch {
      case e: IOException =>
        e.printStackTrace()
      case e: ServiceException =>
        e.printStackTrace()
    }

    val hbaseRdd = sc.newAPIHadoopRDD(conf, classOf[TableInputFormat], classOf[org.apache.hadoop.hbase.io.ImmutableBytesWritable], classOf[org.apache.hadoop.hbase.client.Result])
//    println(hbaseRdd.toString())
    hbaseRdd.map( x=>x._2).map{result => (result.getRow,result.getValue(Bytes.toBytes("Info"),Bytes.toBytes("ADDTIME")))}.map(row => (new String(row._1),new String(row._2))).collect.foreach(r => (println(r._1+":"+r._2)))

  }

}

5.使用Java讀取（網上也有不少例子，但大部分都有一些重複、多餘的程式碼）

package dguo.test;

import com.google.protobuf.ServiceException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.Cell;
import org.apache.hadoop.hbase.CellUtil;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.*;
import org.apache.hadoop.hbase.util.Bytes;
import org.apache.hadoop.security.UserGroupInformation;

import java.io.IOException;

/**
 * Created by drguo on 2018/7/18.
 */
public class HBaseKerb {

    private static Configuration conf = null;
    static {
        System.setProperty("java.security.krb5.conf", "d:/krb5.conf" );
        //使用HBaseConfiguration的單例方法例項化
        conf = HBaseConfiguration.create();
        conf.set("hbase.zookeeper.quorum", "172.19.6.28,172.19.6.29,172.19.6.30");
        conf.set("hbase.zookeeper.property.clientPort", "2181");
        conf.set("hadoop.security.authentication" , "Kerberos" );

        UserGroupInformation.setConfiguration(conf);

        try {
            UserGroupInformation.loginUserFromKeytab("drguo/[email protected]", "d:/drguo_bigdata28.keytab");
            HBaseAdmin.checkHBaseAvailable(conf);
        } catch (IOException e) {
            e.printStackTrace();
        } catch (ServiceException e) {
            e.printStackTrace();
        }

    }

    public static void getAllRows(String tableName) throws IOException{
        HTable hTable = new HTable(conf, tableName);
        //得到用於掃描region的物件
        Scan scan = new Scan();
        //使用HTable得到resultcanner實現類的物件
        ResultScanner resultScanner = hTable.getScanner(scan);
        for(Result result : resultScanner){
            Cell[] cells = result.rawCells();
            for(Cell cell : cells){
                //得到rowkey
                System.out.println("行鍵:" + Bytes.toString(CellUtil.cloneRow(cell)));
                //得到列族
                System.out.println("列族" + Bytes.toString(CellUtil.cloneFamily(cell)));
                System.out.println("列:" + Bytes.toString(CellUtil.cloneQualifier(cell)));
                System.out.println("值:" + Bytes.toString(CellUtil.cloneValue(cell)));
            }
        }
    }

    public static void main(String[] args) throws IOException{
        getAllRows("XMJZ");
    }
}

PS:

出現下述錯誤往往是因為System.setProperty(“java.security.krb5.conf”, “d:/krb5.conf”)中的krb5.conf檔案沒有找到（比如路徑錯誤）或是裡面配置的kdc、admin_server地址錯誤。

Exception in thread “main” java.lang.IllegalArgumentException: Can’t get Kerberos realm
at org.apache.hadoop.security.HadoopKerberosName.setConfiguration(HadoopKerberosName.java:65)
at org.apache.hadoop.security.UserGroupInformation.initialize(UserGroupInformation.java:319)
at org.apache.hadoop.security.UserGroupInformation.setConfiguration(UserGroupInformation.java:374)
at drguo.test.SparkExecHBase$.main(SparkExecHBase.scala:32)
at drguo.test.SparkExecHBase.main(SparkExecHBase.scala)
Caused by: java.lang.reflect.InvocationTargetException
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at org.apache.hadoop.security.authentication.util.KerberosUtil.getDefaultRealm(KerberosUtil.java:84)
at org.apache.hadoop.security.HadoopKerberosName.setConfiguration(HadoopKerberosName.java:63)
… 4 more
Caused by: KrbException: Cannot locate default realm
at sun.security.krb5.Config.getDefaultRealm(Config.java:1029)
… 10 more

使用Spark/Java讀取已開啟Kerberos認證的HBase

1.賦予drguo使用者相應的許可權

2.KDC中建立drguo使用者並匯出相應的keytab檔案

3.將krb5.conf與keytab檔案拷到本地，方便測試

4.使用Spark讀取HBase

5.使用Java讀取（網上也有不少例子，但大部分都有一些重複、多餘的程式碼）

PS:

使用Spark/Java讀取已開啟Kerberos認證的HBase

有kerberos認證hbase在spark環境下的使用

cdh5.12.2 開啟kerberos認證

Spark連線需Kerberos認證的HBase

java在線聊天項目0.6版解決客戶端關閉後異常問題 dis.readUTF()循環讀取已關閉的socket

HBase實操 | 如何使用Java連線Kerberos的HBase

spark從mysql讀取資料（redis/mongdb/hbase等類似，換成各自RDD即可）

大資料Spark優化讀取Hbase--region 提高並行數過程詳細解析

VBA 從一個未開啟的Excel檔案中讀取資料到，已開啟的檔案中.

StreamSets 從Mysql到Hbase(帶kerberos認證)的實時資料採集

spark on yarn模式下掃描帶有kerberos的hbase

java程式碼連線Hive(開啟Kerberos和sentry)

spark1.4 讀取hbase 0.96 報錯 java.io.NotSerializableException: org.apache.hadoop.hbase.io.ImmutableBytes

Spark如何讀取Hbase特定查詢的資料

spark操作讀取hbase例項

spark DataFrame 使用Java讀取mysql和寫入mysql的例子

Java Api Consumer 連線啟用Kerberos認證的Kafka

通過BulkLoad快速將海量數據導入到Hbase（TDH，kerberos認證）

java讀取網頁圖片路徑並下載到本地

Java讀取Properties配置文件

使用Spark/Java讀取已開啟Kerberos認證的HBase

1.賦予drguo使用者相應的許可權

2.KDC中建立drguo使用者並匯出相應的keytab檔案

3.將krb5.conf與keytab檔案拷到本地，方便測試

4.使用Spark讀取HBase

5.使用Java讀取（網上也有不少例子，但大部分都有一些重複、多餘的程式碼）

PS:

相關推薦