Accessing a PostgreSQL database from Spark via JDBC
1. First, you need a usable PostgreSQL JDBC driver
$ locate jdbc | grep postgres
/mnt/hd01/www/html/deltasql/clients/java/dbredactor/lib/postgresql-8.2-507.jdbc4.jar
/usr/lib/ruby/gems/1.8/gems/railties-3.2.13/lib/rails/generators/rails/app/templates/config/databases/jdbcpostgresql.yml
/usr/src/postgis-2.0.0/java/jdbc/src/org/postgresql
/usr/src/postgis-2.0.0/java/jdbc/src/org/postgresql/driverconfig.properties
/usr/src/postgis-2.0.0/java/jdbc/stubs/org/postgresql
/usr/src/postgis-2.0.0/java/jdbc/stubs/org/postgresql/Connection.java
/usr/src/postgis-2.0.0/java/jdbc/stubs/org/postgresql/PGConnection.java
/usr/src/postgis-2.1.0/java/jdbc/src/org/postgresql
/usr/src/postgis-2.1.0/java/jdbc/src/org/postgresql/driverconfig.properties
/usr/src/postgis-2.1.0/java/jdbc/stubs/org/postgresql
/usr/src/postgis-2.1.0/java/jdbc/stubs/org/postgresql/Connection.java
/usr/src/postgis-2.1.0/java/jdbc/stubs/org/postgresql/PGConnection.java

None of these is suitable, so download one from the official site: https://jdbc.postgresql.org/download/postgresql-9.4-1205.jdbc4.jar
2. Put the downloaded jar file under $SPARK_HOME/lib
3. Start the spark-shell
$ SPARK_CLASSPATH=$SPARK_HOME/lib/postgresql-9.4-1205.jdbc4.jar $SPARK_HOME/bin/spark-shell
. . .
Please instead use:
 - ./spark-submit with --driver-class-path to augment the driver classpath
 - spark.executor.extraClassPath to augment the executor classpath
15/11/04 17:53:05 WARN SparkConf: Setting 'spark.executor.extraClassPath' to '/usr/src/data-integration/lib/postgresql-9.3-1102-jdbc4.jar' as a work-around.
15/11/04 17:53:05 WARN SparkConf: Setting 'spark.driver.extraClassPath' to '/usr/src/data-integration/lib/postgresql-9.3-1102-jdbc4.jar' as a work-around.
15/11/04 17:53:06 WARN Utils: Service 'SparkUI' could not bind on port 4040. Attempting port 4041.
15/11/04 17:53:07 WARN MetricsSystem: Using default name DAGScheduler for source because spark.app.id is not set.
Spark context available as sc.
15/11/04 17:53:09 WARN Connection: BoneCP specified but not present in CLASSPATH (or one of dependencies)
15/11/04 17:53:09 WARN Connection: BoneCP specified but not present in CLASSPATH (or one of dependencies)
15/11/04 17:53:25 WARN ObjectStore: Version information not found in metastore. hive.metastore.schema.verification is not enabled so recording the schema version 1.2.0
15/11/04 17:53:25 WARN ObjectStore: Failed to get database default, returning NoSuchObjectException
15/11/04 17:53:28 WARN NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
15/11/04 17:53:29 WARN Connection: BoneCP specified but not present in CLASSPATH (or one of dependencies)
15/11/04 17:53:29 WARN Connection: BoneCP specified but not present in CLASSPATH (or one of dependencies)
SQL context available as sqlContext.
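Before going further, it is worth confirming that the driver really is visible to the shell. A minimal check (not part of the original session, added for illustration; org.postgresql.Driver is the standard class name shipped in the PostgreSQL JDBC jar):

scala> Class.forName("org.postgresql.Driver")   // throws ClassNotFoundException if the jar was not picked up
res0: Class[_] = class org.postgresql.Driver

If this throws a ClassNotFoundException, the classpath setting above did not take effect and the DataFrame step below will fail when it tries to open the connection.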
4. Create a DataFrame
scala> val df = sqlContext.load("jdbc", Map("url" -> "jdbc:postgresql://localhost:5434/cd03?user=cd03&password=cd03", "dbtable" -> "test_trans"))
warning: there were 1 deprecation warning(s); re-run with -deprecation for details
df: org.apache.spark.sql.DataFrame = [trans_date: string, trans_prd: int, trans_cust: int]

The standard (non-deprecated) form since Spark 1.4 is:

val jdbcDF = sqlContext.read.format("jdbc").options(
  Map("url" -> "jdbc:postgresql:dbserver",
      "dbtable" -> "schema.tablename")).load()
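Applied to this session's database, the reader API looks like the sketch below. This is a sketch only: the host, port, database, user, and table are the ones from the example above, and read.jdbc with a java.util.Properties object is simply an alternative spelling of the same datasource call available since Spark 1.4:

// credentials passed as connection properties instead of in the URL
val props = new java.util.Properties()
props.setProperty("user", "cd03")
props.setProperty("password", "cd03")

// builds the same DataFrame as df above
val jdbcDF2 = sqlContext.read.jdbc("jdbc:postgresql://localhost:5434/cd03", "test_trans", props)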
5. Inspect the schema
scala> df.printSchema()
root
 |-- trans_date: string (nullable = true)
 |-- trans_prd: integer (nullable = true)
 |-- trans_cust: integer (nullable = true)
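Note that trans_date comes back as a plain string, not a date. If date semantics are needed, the column can be cast after loading. A minimal sketch, assuming Spark 1.5+ (where org.apache.spark.sql.functions.to_date is available) and that strings like 2015-5-20 parse under the default string-to-date cast rules:

scala> import org.apache.spark.sql.functions.{col, to_date}
import org.apache.spark.sql.functions.{col, to_date}

scala> val typed = df.withColumn("trans_date", to_date(col("trans_date")))
typed: org.apache.spark.sql.DataFrame = [trans_date: date, trans_prd: int, trans_cust: int]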
6. A simple computation
scala> df.filter(df("trans_cust") > 9999999).select("trans_date", "trans_prd").show
+----------+---------+
|trans_date|trans_prd|
+----------+---------+
| 2015-5-20|     2007|
| 2015-7-24|     5638|
| 2015-5-19|     8182|
| 2015-2-24|    11391|
| 2015-8-13|    17341|
| 2015-2-22|    10996|
| 2015-1-17|    15284|
|  2015-1-8|    16090|
| 2015-1-25|    13528|
| 2015-1-17|     9498|
| 2015-9-25|     7235|
| 2015-8-19|     4084|
| 2015-4-24|    16637|
| 2015-5-27|    13829|
| 2015-0-13|    13956|
| 2015-3-19|    11974|
| 2015-10-5|     1185|
| 2015-3-28|     9412|
| 2015-6-13|    15203|
| 2015-2-14|    10087|
+----------+---------+
only showing top 20 rows
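From here the whole DataFrame API is available, and results can be written back to PostgreSQL over the same connection. A sketch for illustration only (the aggregation and the target table name trans_by_date are assumptions, not part of the original session; props is the java.util.Properties object built in step 4):

import org.apache.spark.sql.functions.count

// transactions per day, for the large-customer subset shown above
val byDate = df.filter(df("trans_cust") > 9999999)
               .groupBy("trans_date")
               .agg(count("trans_prd").as("n_trans"))

// write the result back to a new table over the same JDBC connection
byDate.write.jdbc("jdbc:postgresql://localhost:5434/cd03", "trans_by_date", props)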