Spark Master Resource Scheduling: SparkContext Registers with All Masters

阿新 • Published: 2018-12-05

YouTube video

Spark Master resource scheduling (SparkContext registers with all masters): https://youtu.be/AXxCnCc5Mh0

Bilibili video

Spark Master resource scheduling (SparkContext registers with all masters): https://www.bilibili.com/video/av37442295/

SparkContext sends a message to the master on startup

ClientEndpoint sends a message to every master: RegisterApplication

    /**
     *  Register with all masters asynchronously and returns an array `Future`s for cancellation.
     */
    private def tryRegisterAllMasters(): Array[JFuture[_]] = {
      for (masterAddress <- masterRpcAddresses) yield {
        registerMasterThreadPool.submit(new Runnable {
          override def run(): Unit = try {
            if (registered.get) {
              return
            }
            logInfo("Connecting to master " + masterAddress.toSparkURL + "...")
            val masterRef =
              rpcEnv.setupEndpointRef(Master.SYSTEM_NAME, masterAddress, Master.ENDPOINT_NAME)
            masterRef.send(RegisterApplication(appDescription, self))
          } catch {
            case ie: InterruptedException => // Cancelled
            case NonFatal(e) => logWarning(s"Failed to connect to master $masterAddress", e)
          }
        })
      }
    }
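
The pattern above (one task per master submitted to a thread pool, with the futures kept so they can be cancelled later) can be sketched outside Spark. This is a minimal sketch, not Spark's implementation: `RegisterSketch`, `registerWithAll`, and the `send` callback are hypothetical names, and `send` merely stands in for sending `RegisterApplication` to one master.

```scala
import java.util.concurrent.{Executors, Future => JFuture}
import java.util.concurrent.atomic.AtomicBoolean

object RegisterSketch {
  // Hypothetical helper: submits one registration task per master address
  // and returns the futures so that a caller could cancel them.
  def registerWithAll(masters: Seq[String], registered: AtomicBoolean)
                     (send: String => Unit): Seq[JFuture[_]] = {
    val pool = Executors.newFixedThreadPool(math.max(1, masters.size))
    val futures = masters.map { master =>
      pool.submit(new Runnable {
        // Skip the send if some master has already acknowledged us.
        override def run(): Unit = if (!registered.get) send(master)
      })
    }
    pool.shutdown() // already-submitted tasks still run to completion
    futures
  }
}
```

Because every master receives the message, only the active one answers; a STANDBY master ignores it, as the handler in the next section shows.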

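The master handles the RegisterApplication message

Create the Application and register it with the master
Save the Application in the master's persistence engine
Send the driver a message confirming the registration: RegisteredApplication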

    case RegisterApplication(description, driver) => {
      // TODO Prevent repeated registrations from some driver
      if (state == RecoveryState.STANDBY) {
        // ignore, don't send response
      } else {
        logInfo("Registering app " + description.name)
        val app = createApplication(description, driver)
        registerApplication(app)
        logInfo("Registered app " + description.name + " with ID " + app.id)
        persistenceEngine.addApplication(app)
        driver.send(RegisteredApplication(app.id, self))
        schedule()
      }
    }

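Filter the registered workers down to those in the ALIVE state
Iterate over waitingDrivers; for each waiting driver, send a worker the message that launches it: LaunchDriver
Call the method that starts executors on the workers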

  /**
   * Schedule the currently available resources among waiting apps. This method will be called
   * every time a new app joins or resource availability changes.
   */
  private def schedule(): Unit = {
    if (state != RecoveryState.ALIVE) {
      return
    }
    // Drivers take strict precedence over executors
    val shuffledAliveWorkers = Random.shuffle(workers.toSeq.filter(_.state == WorkerState.ALIVE))
    val numWorkersAlive = shuffledAliveWorkers.size
    var curPos = 0
    for (driver <- waitingDrivers.toList) { // iterate over a copy of waitingDrivers
      // We assign workers to each waiting driver in a round-robin fashion. For each driver, we
      // start from the last worker that was assigned a driver, and continue onwards until we have
      // explored all alive workers.
      var launched = false
      var numWorkersVisited = 0
      while (numWorkersVisited < numWorkersAlive && !launched) {
        val worker = shuffledAliveWorkers(curPos)
        numWorkersVisited += 1
        if (worker.memoryFree >= driver.desc.mem && worker.coresFree >= driver.desc.cores) {
          launchDriver(worker, driver)
          waitingDrivers -= driver
          launched = true
        }
        curPos = (curPos + 1) % numWorkersAlive
      }
    }
    startExecutorsOnWorkers()
  }
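
The round-robin driver placement in schedule() can be tried in isolation. The sketch below is a simplified model under stated assumptions: `Worker`, `Driver`, and `place` are hypothetical names, the worker order is taken as given instead of shuffled so the result is deterministic, and resources are deducted directly to model what launching a driver consumes.

```scala
object DriverPlacementSketch {
  final case class Worker(id: String, var memFree: Int, var coresFree: Int)
  final case class Driver(id: String, mem: Int, cores: Int)

  /** Returns driver id -> worker id for every driver that could be placed. */
  def place(workers: IndexedSeq[Worker], drivers: Seq[Driver]): Map[String, String] = {
    var curPos = 0 // carried across drivers, as in schedule()
    var placed = Map.empty[String, String]
    for (d <- drivers) {
      var launched = false
      var visited = 0
      while (visited < workers.size && !launched) {
        val w = workers(curPos)
        visited += 1
        if (w.memFree >= d.mem && w.coresFree >= d.cores) {
          w.memFree -= d.mem      // assumption: model the resources a launch uses
          w.coresFree -= d.cores
          placed += (d.id -> w.id)
          launched = true
        }
        curPos = (curPos + 1) % workers.size
      }
    }
    placed
  }
}
```

A driver that fits on no alive worker stays unplaced after one full pass, which corresponds to it remaining in waitingDrivers.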

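Iterate over waitingApps; the Application registered above is already in this ArrayBuffer
Filter the registered workers
Filter conditions: the state is ALIVE, the free CPU cores are at least the cores per executor, and the free memory is at least the memory the Application needs per executor
Sort the usable workers by free cores, largest first
Call scheduleExecutorsOnWorkers to decide how many CPU cores each worker gives to executors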


  /**
   * Schedule and launch executors on workers
   */
  private def startExecutorsOnWorkers(): Unit = {
    // Right now this is a very simple FIFO scheduler. We keep trying to fit in the first app
    // in the queue, then the second app, etc.
    for (app <- waitingApps if app.coresLeft > 0) {
      val coresPerExecutor: Option[Int] = app.desc.coresPerExecutor
      // Filter out workers that don't have enough resources to launch an executor
      val usableWorkers = workers.toArray.filter(_.state == WorkerState.ALIVE)
        .filter(worker => worker.memoryFree >= app.desc.memoryPerExecutorMB &&
          worker.coresFree >= coresPerExecutor.getOrElse(1))
        .sortBy(_.coresFree).reverse
      val assignedCores = scheduleExecutorsOnWorkers(app, usableWorkers, spreadOutApps)

      // Now that we've decided how many cores to allocate on each worker, let's allocate them
      for (pos <- 0 until usableWorkers.length if assignedCores(pos) > 0) {
        allocateWorkerResourceToExecutors(
          app, assignedCores(pos), coresPerExecutor, usableWorkers(pos))
      }
    }
  }
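
The filter-and-sort step can be checked on a small data set. This is a minimal sketch, assuming a hypothetical `Worker` case class with only the fields the filter reads; it mirrors the `usableWorkers` expression above rather than reproducing Spark's `WorkerInfo`.

```scala
object UsableWorkersSketch {
  final case class Worker(id: String, alive: Boolean, memFreeMB: Int, coresFree: Int)

  // Keep alive workers that can fit at least one executor, most free cores first.
  def usable(workers: Seq[Worker],
             memPerExecutorMB: Int,
             coresPerExecutor: Option[Int]): Seq[Worker] =
    workers
      .filter(_.alive)
      .filter(w => w.memFreeMB >= memPerExecutorMB &&
        w.coresFree >= coresPerExecutor.getOrElse(1))
      .sortBy(_.coresFree).reverse
}
```

When coresPerExecutor is not set, a worker only needs one free core to qualify, because the single executor on that worker takes whatever cores are assigned later.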

Work out how many CPU cores each worker gives to executors of the current Application

  /**
   * Schedule executors to be launched on the workers.
   * Returns an array containing number of cores assigned to each worker.
   *
   * There are two modes of launching executors. The first attempts to spread out an application's
   * executors on as many workers as possible, while the second does the opposite (i.e. launch them
   * on as few workers as possible). The former is usually better for data locality purposes and is
   * the default.
   *
   * The number of cores assigned to each executor is configurable. When this is explicitly set,
   * multiple executors from the same application may be launched on the same worker if the worker
   * has enough cores and memory. Otherwise, each executor grabs all the cores available on the
   * worker by default, in which case only one executor may be launched on each worker.
   *
   * It is important to allocate coresPerExecutor on each worker at a time (instead of 1 core
   * at a time). Consider the following example: cluster has 4 workers with 16 cores each.
   * User requests 3 executors (spark.cores.max = 48, spark.executor.cores = 16). If 1 core is
   * allocated at a time, 12 cores from each worker would be assigned to each executor.
   * Since 12 < 16, no executors would launch [SPARK-8881].
   */
  private def scheduleExecutorsOnWorkers(
      app: ApplicationInfo,
      usableWorkers: Array[WorkerInfo],
      spreadOutApps: Boolean): Array[Int] = {
    val coresPerExecutor = app.desc.coresPerExecutor
    val minCoresPerExecutor = coresPerExecutor.getOrElse(1)
    val oneExecutorPerWorker = coresPerExecutor.isEmpty
    val memoryPerExecutor = app.desc.memoryPerExecutorMB
    val numUsable = usableWorkers.length
    val assignedCores = new Array[Int](numUsable) // Number of cores to give to each worker
    val assignedExecutors = new Array[Int](numUsable) // Number of new executors on each worker
    var coresToAssign = math.min(app.coresLeft, usableWorkers.map(_.coresFree).sum)

    /** Return whether the specified worker can launch an executor for this app. */
    def canLaunchExecutor(pos: Int): Boolean = {
      val keepScheduling = coresToAssign >= minCoresPerExecutor
      val enoughCores = usableWorkers(pos).coresFree - assignedCores(pos) >= minCoresPerExecutor

      // If we allow multiple executors per worker, then we can always launch new executors.
      // Otherwise, if there is already an executor on this worker, just give it more cores.
      val launchingNewExecutor = !oneExecutorPerWorker || assignedExecutors(pos) == 0
      if (launchingNewExecutor) {
        val assignedMemory = assignedExecutors(pos) * memoryPerExecutor
        val enoughMemory = usableWorkers(pos).memoryFree - assignedMemory >= memoryPerExecutor
        val underLimit = assignedExecutors.sum + app.executors.size < app.executorLimit
        keepScheduling && enoughCores && enoughMemory && underLimit
      } else {
        // We're adding cores to an existing executor, so no need
        // to check memory and executor limits
        keepScheduling && enoughCores
      }
    }

    // Keep launching executors until no more workers can accommodate any
    // more executors, or if we have reached this application's limits
    var freeWorkers = (0 until numUsable).filter(canLaunchExecutor)
    while (freeWorkers.nonEmpty) {
      freeWorkers.foreach { pos =>
        var keepScheduling = true
        while (keepScheduling && canLaunchExecutor(pos)) {
          coresToAssign -= minCoresPerExecutor
          assignedCores(pos) += minCoresPerExecutor

          // If we are launching one executor per worker, then every iteration assigns 1 core
          // to the executor. Otherwise, every iteration assigns cores to a new executor.
          if (oneExecutorPerWorker) {
            assignedExecutors(pos) = 1
          } else {
            assignedExecutors(pos) += 1
          }

          // Spreading out an application means spreading out its executors across as
          // many workers as possible. If we are not spreading out, then we should keep
          // scheduling executors on this worker until we use all of its resources.
          // Otherwise, just move on to the next worker.
          if (spreadOutApps) {
            keepScheduling = false
          }
        }
      }
      freeWorkers = freeWorkers.filter(canLaunchExecutor)
    }
    assignedCores
  }
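
To see the effect of spreadOutApps and of allocating coresPerExecutor at a time, the loop can be reduced to cores only. This is a simplified sketch, not the Spark method: `CoreAssignmentSketch.assign` is a hypothetical name, and the memory check, the executor limit, and the one-executor-per-worker case are deliberately left out.

```scala
object CoreAssignmentSketch {
  /** Returns the number of cores assigned on each worker (cores-only model). */
  def assign(coresFree: Array[Int],
             coresWanted: Int,
             coresPerExecutor: Int,
             spreadOut: Boolean): Array[Int] = {
    val assigned = new Array[Int](coresFree.length)
    var toAssign = math.min(coresWanted, coresFree.sum)

    def canLaunch(pos: Int): Boolean =
      toAssign >= coresPerExecutor &&
        coresFree(pos) - assigned(pos) >= coresPerExecutor

    var free = coresFree.indices.filter(canLaunch)
    while (free.nonEmpty) {
      free.foreach { pos =>
        var keep = true
        while (keep && canLaunch(pos)) {
          toAssign -= coresPerExecutor
          assigned(pos) += coresPerExecutor
          if (spreadOut) keep = false // one executor's worth, then next worker
        }
      }
      free = free.filter(canLaunch)
    }
    assigned
  }
}
```

With three workers of 8 free cores, 8 cores wanted, and 2 cores per step, spreading out yields 4, 2, 2 while consolidating yields 8, 0, 0. For the SPARK-8881 case in the comment above (four workers of 16 cores, 48 cores wanted), stepping by 16 yields 16, 16, 16, 0, whereas stepping by 1 would yield 12 on each worker, too few for any 16-core executor.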

Allocate the worker's resources to executors
Send the worker the message that launches an executor: LaunchExecutor
Send the driver the message that an executor has been added: ExecutorAdded

  /**
   * Allocate a worker's resources to one or more executors.
   * @param app the info of the application which the executors belong to
   * @param assignedCores number of cores on this worker for this application
   * @param coresPerExecutor number of cores per executor
   * @param worker the worker info
   */
  private def allocateWorkerResourceToExecutors(
      app: ApplicationInfo,
      assignedCores: Int,
      coresPerExecutor: Option[Int],
      worker: WorkerInfo): Unit = {
    // If the number of cores per executor is specified, we divide the cores assigned
    // to this worker evenly among the executors with no remainder.
    // Otherwise, we launch a single executor that grabs all the assignedCores on this worker.
    val numExecutors = coresPerExecutor.map { assignedCores / _ }.getOrElse(1)
    val coresToAssign = coresPerExecutor.getOrElse(assignedCores)
    for (i <- 1 to numExecutors) {
      val exec = app.addExecutor(worker, coresToAssign)
      launchExecutor(worker, exec)
      app.state = ApplicationState.RUNNING
    }
  }
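
The arithmetic that turns a worker's assigned cores into executors is small enough to check by hand. A minimal sketch, assuming a hypothetical helper `split` that returns the executor count and the cores per executor in place of actually launching anything:

```scala
object ExecutorSplitSketch {
  /** Returns (number of executors, cores given to each) for one worker. */
  def split(assignedCores: Int, coresPerExecutor: Option[Int]): (Int, Int) = {
    // With an explicit size, divide the assigned cores into equal executors;
    // otherwise a single executor takes all assigned cores.
    val numExecutors = coresPerExecutor.map(assignedCores / _).getOrElse(1)
    val coresEach = coresPerExecutor.getOrElse(assignedCores)
    (numExecutors, coresEach)
  }
}
```

For 12 assigned cores, `split(12, Some(4))` gives three executors of 4 cores each, and `split(12, None)` gives one executor with all 12 cores.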
