Spark Structured Streaming Framework (5): Process Management

阿新 • Published: 2017-09-03


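Structured Streaming provides a set of APIs for managing streaming queries. With these APIs, users can manually manage queries that have already been started, so that the streams in the system run in an orderly fashion.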

1. StreamingQuery

Calling the start method of DataStreamWriter launches the stream and returns a StreamingQuery object, which the user can then use to manage the stream.

For example:

val query = df.writeStream.format("console").start() // get the query object

query.id // get the unique identifier of the running query that persists across restarts from checkpoint data

query.runId // get the unique id of this run of the query, which will be generated at every start/restart

query.name // get the auto-generated or user-specified name of the query

query.explain() // print detailed explanations of the query

query.stop() // stop the query

query.awaitTermination() // block until query is terminated, with stop() or with error

query.exception // the exception if the query has been terminated with error

query.recentProgress // an array of the most recent progress updates for this query

query.lastProgress // the most recent progress update of this streaming query
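The calls above can be combined into a small local program. The following is a minimal sketch, not the original author's code: it assumes Spark 2.2 or later running in local mode, uses the built-in "rate" test source in place of a real input, and the object name StreamingQueryDemo, the helper startDemoQuery, and the query name rate_to_console are made up for illustration.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.streaming.StreamingQuery

object StreamingQueryDemo {
  // Hypothetical helper: starts a console query on the built-in "rate" test source,
  // which generates (timestamp, value) rows at a fixed rate.
  def startDemoQuery(spark: SparkSession): StreamingQuery = {
    val df = spark.readStream
      .format("rate")
      .option("rowsPerSecond", "5")
      .load()

    df.writeStream
      .format("console")
      .queryName("rate_to_console") // user-specified name, returned by query.name
      .start()
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("StreamingQueryDemo")
      .master("local[2]") // assumption: local run for experimentation
      .getOrCreate()

    val query = startDemoQuery(spark)
    println(s"id=${query.id} runId=${query.runId} name=${query.name}")

    // Block for at most 10 seconds; the result is true only if the query
    // terminated within that time.
    val terminated = query.awaitTermination(10000L)
    println(s"terminated within timeout: $terminated")

    // lastProgress is null until the first progress update has been reported.
    Option(query.lastProgress).foreach(p => println(p.prettyJson))

    query.stop()
    println(s"active after stop: ${query.isActive}")
    spark.stop()
  }
}
```

The timed variant of awaitTermination is used here so that the demo returns control to the caller; the no-argument form shown above blocks until the query is stopped or fails.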

2. StreamingQueryManager

Structured Streaming provides a second interface for managing streams: StreamingQueryManager. It is obtained through the streams method of the SparkSession object.

For example:

val spark: SparkSession = ...

val streamManager = spark.streams // in Scala, streams is declared without parentheses

streamManager.active // get the list of currently active streaming queries

streamManager.get(id) // get a query object by its unique id

streamManager.awaitAnyTermination() // block until any one of them terminates
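As a rough illustration of how the manager sees every query started from one session, the sketch below starts two queries and then lists, looks up, and stops them through spark.streams. It assumes Spark 2.2 or later in local mode and the built-in "rate" test source; the object name StreamingQueryManagerDemo, the helper startNamed, and the query names q1 and q2 are hypothetical.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.streaming.StreamingQuery

object StreamingQueryManagerDemo {
  // Hypothetical helper: starts a console query with the given name on the "rate" source.
  // Names of active queries must be unique within a session.
  def startNamed(spark: SparkSession, name: String): StreamingQuery =
    spark.readStream
      .format("rate")
      .load()
      .writeStream
      .format("console")
      .queryName(name)
      .start()

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("StreamingQueryManagerDemo")
      .master("local[2]") // assumption: local run for experimentation
      .getOrCreate()

    val q1 = startNamed(spark, "q1")
    val q2 = startNamed(spark, "q2")

    val manager = spark.streams

    // List every query that is currently active in this session.
    manager.active.foreach(q => println(s"${q.name} -> ${q.id}"))

    // Look up a query by its unique id (a java.util.UUID).
    val found = manager.get(q1.id)
    println(s"looked up: ${found.name}")

    // Stop one query, then wait up to 5 seconds for any query to have terminated.
    q1.stop()
    val anyTerminated = manager.awaitAnyTermination(5000L)
    println(s"some query terminated: $anyTerminated")

    // Stop whatever is still running before shutting the session down.
    manager.active.foreach(_.stop())
    spark.stop()
  }
}
```

Going through the manager is convenient when the code that stops or monitors queries does not hold the StreamingQuery references returned by start.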

3. References

[1]. Structured Streaming Programming Guide.

[2]. Kafka Integration Guide.
