HBase in Practice: Inserting Data with the Java API
I. Implementation approach
1. A mapper reads the comma-separated input file and passes each record on.
2. A utility class handles connecting to HBase, creating the table, and inserting the rows.
3. The reducer calls this HBase utility.
4. A main class configures and submits the job.
II. Code
1. Mapper
package com;

import java.io.IOException;

import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Mapper;

// Parses each input line into a User and passes it on to the reducer
public class mapper extends Mapper<LongWritable, Text, Text, User> {

    @Override
    protected void map(LongWritable key, Text value, Context context)
            throws IOException, InterruptedException {
        String data = value.toString();
        String[] s = data.split(",");
        System.out.println(data);
        // All records share the same key, so a single reduce call collects them all
        context.write(new Text("1"), new User(s[0], s[1], s[2], s[3], s[4]));
    }
}
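Both the mapper above and the reducer below pass records around as a User bean, which this post does not show. Because it is used as a map output value it has to implement Writable, and BeanUtils.copyProperties additionally needs a public no-argument constructor plus getters and setters. The following is only a sketch reconstructed from how the class is used elsewhere in the code; the five String fields and their order (id, name, age, sex, part) are assumptions, so adjust them to your actual data.

package com;

import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;

import org.apache.hadoop.io.Writable;

// Hypothetical User bean reconstructed from how it is used in the mapper,
// reducer and HbaseUtils; the original post does not include it.
public class User implements Writable {

    private String id;
    private String name;
    private String age;
    private String sex;
    private String part;

    // No-arg constructor required by Hadoop serialization and BeanUtils
    public User() {}

    public User(String id, String name, String age, String sex, String part) {
        this.id = id;
        this.name = name;
        this.age = age;
        this.sex = sex;
        this.part = part;
    }

    // Serialize all fields in a fixed order
    @Override
    public void write(DataOutput out) throws IOException {
        out.writeUTF(id);
        out.writeUTF(name);
        out.writeUTF(age);
        out.writeUTF(sex);
        out.writeUTF(part);
    }

    // Deserialize in the same order as write()
    @Override
    public void readFields(DataInput in) throws IOException {
        id = in.readUTF();
        name = in.readUTF();
        age = in.readUTF();
        sex = in.readUTF();
        part = in.readUTF();
    }

    public String getId() { return id; }
    public void setId(String id) { this.id = id; }
    public String getName() { return name; }
    public void setName(String name) { this.name = name; }
    public String getAge() { return age; }
    public void setAge(String age) { this.age = age; }
    public String getSex() { return sex; }
    public void setSex(String sex) { this.sex = sex; }
    public String getPart() { return part; }
    public void setPart(String part) { this.part = part; }

    @Override
    public String toString() {
        return id + "," + name + "," + age + "," + sex + "," + part;
    }
}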
2. HBase utility class
package com;

import java.util.List;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.HColumnDescriptor;
import org.apache.hadoop.hbase.HTableDescriptor;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Admin;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.client.Table;

public class HbaseUtils {

    public static final String c = "info";   // column family name

    // Entry point called by the reducer: creates the table if needed, then inserts the list
    public static void insertinfo(String ip, String port, String tableName, List<User> list) throws Exception {
        Connection con = getConnection(ip, port);
        Admin admin = con.getAdmin();
        boolean b = admin.tableExists(TableName.valueOf(tableName));
        if (!b) {
            createTable(admin, tableName);
        }
        Table table = con.getTable(TableName.valueOf(tableName));
        insertList(table, list);
        table.close();
        admin.close();
        con.close();
    }

    // Insert one Put per User, using the id as the row key
    private static void insertList(Table table, List<User> list) throws Exception {
        for (User user : list) {
            Put put = new Put(user.getId().getBytes());
            put.addColumn(c.getBytes(), "name".getBytes(), user.getName().getBytes());
            put.addColumn(c.getBytes(), "Age".getBytes(), user.getAge().getBytes());
            put.addColumn(c.getBytes(), "Sex".getBytes(), user.getSex().getBytes());
            put.addColumn(c.getBytes(), "Part".getBytes(), user.getPart().getBytes());
            table.put(put);
        }
    }

    // Create the table with the single column family "info"
    private static void createTable(Admin admin, String tableName) throws Exception {
        HTableDescriptor descriptor = new HTableDescriptor(TableName.valueOf(tableName));
        HColumnDescriptor descriptor2 = new HColumnDescriptor(c);
        descriptor.addFamily(descriptor2);
        admin.createTable(descriptor);
    }

    // Open a connection to HBase via ZooKeeper
    private static Connection getConnection(String ip, String port) throws Exception {
        Configuration configuration = HBaseConfiguration.create();
        configuration.set("hbase.zookeeper.quorum", ip);
        configuration.set("hbase.zookeeper.property.clientPort", port);
        Connection connection = ConnectionFactory.createConnection(configuration);
        return connection;
    }
}
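Once the job has run, it can be handy to read a row back to confirm the inserts reached HBase. The following is a minimal sketch, assuming the same ZooKeeper address, port, table name and column family that the reducer below passes to insertinfo; the class name ReadBack and the row key "1" are placeholders.

package com;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.TableName;
import org.apache.hadoop.hbase.client.Connection;
import org.apache.hadoop.hbase.client.ConnectionFactory;
import org.apache.hadoop.hbase.client.Get;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.Table;

// Hypothetical verification helper: reads one row back from the "sw" table
public class ReadBack {
    public static void main(String[] args) throws Exception {
        Configuration conf = HBaseConfiguration.create();
        conf.set("hbase.zookeeper.quorum", "192.168.184.131");
        conf.set("hbase.zookeeper.property.clientPort", "2181");
        try (Connection con = ConnectionFactory.createConnection(conf);
             Table table = con.getTable(TableName.valueOf("sw"))) {
            // "1" is a placeholder row key; use an id that exists in your input data
            Result result = table.get(new Get("1".getBytes()));
            byte[] name = result.getValue("info".getBytes(), "name".getBytes());
            System.out.println(name == null ? "row not found" : new String(name));
        }
    }
}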
3. Reducer
package com;

import java.io.IOException;
import java.util.ArrayList;

import org.apache.commons.beanutils.BeanUtils;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;

public class reducer extends Reducer<Text, User, Text, Text> {

    @Override
    protected void reduce(Text keyin, Iterable<User> value, Context context)
            throws IOException, InterruptedException {
        ArrayList<User> list = new ArrayList<User>();
        // Copy each value out of the iterator: Hadoop reuses the same User
        // instance while iterating, so the objects must be cloned before being kept
        for (User user : value) {
            User user1 = new User();
            System.out.println(user);
            try {
                BeanUtils.copyProperties(user1, user);
                list.add(user1);
            } catch (Exception e) {
                e.printStackTrace();
            }
        }
        System.out.println("list+++++++++++++++" + list);
        // Call the HBase utility to insert the collected records
        try {
            HbaseUtils.insertinfo("192.168.184.131", "2181", "sw", list);
        } catch (Exception e) {
            e.printStackTrace();
        }
        context.write(new Text("status"), new Text(":success"));
    }
}
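The reducer above hard-codes the ZooKeeper address, port and table name. A common alternative is to put those values into the job Configuration in main and read them back in setup(). The sketch below shows this variant under the assumption that HbaseUtils keeps the same signature; the class name ConfiguredReducer and the insert.* property names are made up for illustration.

package com;

import java.io.IOException;
import java.util.ArrayList;

import org.apache.commons.beanutils.BeanUtils;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Reducer;

// Variant of the reducer that reads its HBase settings from the job configuration
public class ConfiguredReducer extends Reducer<Text, User, Text, Text> {

    private String quorum;
    private String port;
    private String table;

    // Runs once per task before reduce(); falls back to the original hard-coded values
    @Override
    protected void setup(Context context) {
        Configuration conf = context.getConfiguration();
        quorum = conf.get("insert.quorum", "192.168.184.131");
        port = conf.get("insert.port", "2181");
        table = conf.get("insert.table", "sw");
    }

    @Override
    protected void reduce(Text key, Iterable<User> values, Context context)
            throws IOException, InterruptedException {
        ArrayList<User> list = new ArrayList<User>();
        for (User user : values) {
            User copy = new User();
            try {
                BeanUtils.copyProperties(copy, user);
                list.add(copy);
            } catch (Exception e) {
                e.printStackTrace();
            }
        }
        try {
            HbaseUtils.insertinfo(quorum, port, table, list);
        } catch (Exception e) {
            e.printStackTrace();
        }
        context.write(new Text("status"), new Text(":success"));
    }
}

main would then call conf.set("insert.quorum", ...), conf.set("insert.port", ...) and conf.set("insert.table", ...) before creating the job, and register ConfiguredReducer with setReducerClass.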
4. Main
package com;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

public class main {

    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // Run the job locally against the local file system
        conf.set("mapreduce.framework.name", "local");
        conf.set("fs.defaultFS", "file:///");

        Job wordCountJob = Job.getInstance(conf);
        // Important: specify the jar that contains this job
        wordCountJob.setJarByClass(main.class);

        // Set the mapper and reducer classes for the job
        wordCountJob.setMapperClass(mapper.class);
        wordCountJob.setReducerClass(reducer.class);

        // Key/value types emitted by the map phase
        wordCountJob.setMapOutputKeyClass(Text.class);
        wordCountJob.setMapOutputValueClass(User.class);

        // Key/value types of the final output
        wordCountJob.setOutputKeyClass(Text.class);
        wordCountJob.setOutputValueClass(Text.class);

        // Input file and output directory
        FileInputFormat.setInputPaths(wordCountJob, "C:\\test\\in6\\data.txt");
        FileOutputFormat.setOutputPath(wordCountJob, new Path("C:\\test\\out6"));

        // Submit the job and wait for it to finish
        wordCountJob.waitForCompletion(true);
    }
}