Kubernetes 32 -- ReplicationController Source Code -- Replica Count Control
Published: 2018-12-26
After the controller starts and runs through a series of configuration steps, it enters the core sync method:
func (rsc *ReplicaSetController) syncReplicaSet(key string) error {
Get the ReplicaSet object:
rs, err := rsc.rsLister.ReplicaSets(namespace).Get(name)
Check whether the RS needs to be synced:
rsNeedsSync := rsc.expectations.SatisfiedExpectations(key)
Build the label selector:
selector, err := metav1.LabelSelectorAsSelector(rs.Spec.Selector)
List all Pods in the RS's namespace:
allPods, err := rsc.podLister.Pods(rs.Namespace).List(labels.Everything())
Filter out all inactive Pods:
var filteredPods []*v1.Pod
for _, pod := range allPods {
if controller.IsPodActive(pod) {
filteredPods = append(filteredPods, pod)
}
}
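For reference, a Pod counts as active when it has neither succeeded nor failed and is not being deleted; controller.IsPodActive is roughly the following:
func IsPodActive(p *v1.Pod) bool {
    return v1.PodSucceeded != p.Status.Phase &&
        v1.PodFailed != p.Status.Phase &&
        p.DeletionTimestamp == nil
}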
Adopt or release Pods according to the label selector:
filteredPods, err = rsc.claimPods(rs, selector, filteredPods)
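As a standalone illustration (a sketch assuming the k8s.io/apimachinery module is on the module path, not part of the controller itself), this is how a spec.selector becomes a labels.Selector and how it decides whether a Pod would be adopted or released:
package main

import (
    "fmt"

    metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
    "k8s.io/apimachinery/pkg/labels"
)

func main() {
    // Equivalent of rs.Spec.Selector with matchLabels: {app: nginx}.
    sel, err := metav1.LabelSelectorAsSelector(&metav1.LabelSelector{
        MatchLabels: map[string]string{"app": "nginx"},
    })
    if err != nil {
        panic(err)
    }
    // A Pod carrying the selected labels would be adopted by the ReplicaSet.
    fmt.Println(sel.Matches(labels.Set{"app": "nginx", "pod-template-hash": "abc"})) // true
    // A Pod without them would be released (orphaned).
    fmt.Println(sel.Matches(labels.Set{"app": "redis"})) // false
}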
If the RS needs syncing and has not been deleted, call the method that adjusts the number of replicas:
if rsNeedsSync && rs.DeletionTimestamp == nil {
manageReplicasErr = rsc.manageReplicas(filteredPods, rs)
}
After the adjustment, update the RS status:
newStatus := calculateStatus(rs, filteredPods, manageReplicasErr)
// Always updates status as pods come up or die.
updatedRS, err := updateReplicaSetStatus(rsc.kubeClient.AppsV1().ReplicaSets(rs.Namespace), rs, newStatus)
After a delay, requeue the RS so that it runs through the sync loop again:
if manageReplicasErr == nil && updatedRS.Spec.MinReadySeconds > 0 &&
updatedRS.Status.ReadyReplicas == *(updatedRS.Spec.Replicas) &&
updatedRS.Status.AvailableReplicas != *(updatedRS.Spec.Replicas) {
rsc.enqueueReplicaSetAfter(updatedRS, time.Duration(updatedRS.Spec.MinReadySeconds)*time.Second)
}
The core method: manageReplicas
// manageReplicas checks and updates replicas for the given ReplicaSet.
// Does NOT modify <filteredPods>.
// It will requeue the replica set in case of an error while creating/deleting pods.
func (rsc *ReplicaSetController) manageReplicas(filteredPods []*v1.Pod, rs *apps.ReplicaSet) error {
Compute the difference between the current number of Pods and the desired replica count:
diff := len(filteredPods) - int(*(rs.Spec.Replicas))
If the difference is negative, more Pods need to be created:
if diff < 0 {
diff *= -1
if diff > rsc.burstReplicas {
diff = rsc.burstReplicas
}
rsc.expectations.ExpectCreations(rsKey, diff)
slowStartBatch processes the creations in batches, starting slowly and speeding up as calls succeed:
successfulCreations, err := slowStartBatch(diff, controller.SlowStartInitialBatchSize, func() error {
    // ... the closure creates one Pod via CreatePodsWithControllerRef (shown below)
})
// slowStartBatch tries to call the provided function a total of 'count' times,
// starting slow to check for errors, then speeding up if calls succeed.
//
// It groups the calls into batches, starting with a group of initialBatchSize.
// Within each batch, it may call the function multiple times concurrently.
//
// If a whole batch succeeds, the next batch may get exponentially larger.
// If there are any failures in a batch, all remaining batches are skipped
// after waiting for the current batch to complete.
//
// It returns the number of successful calls to the function.
The slow-start dispatch function is implemented as follows:
func slowStartBatch(count int, initialBatchSize int, fn func() error) (int, error) {
remaining := count
successes := 0
for batchSize := integer.IntMin(remaining, initialBatchSize); batchSize > 0; batchSize = integer.IntMin(2*batchSize, remaining) {
errCh := make(chan error, batchSize)
var wg sync.WaitGroup
wg.Add(batchSize)
for i := 0; i < batchSize; i++ {
go func() {
defer wg.Done()
if err := fn(); err != nil {
errCh <- err
}
}()
}
wg.Wait()
curSuccesses := batchSize - len(errCh)
successes += curSuccesses
if len(errCh) > 0 {
return successes, <-errCh
}
remaining -= batchSize
}
return successes, nil
}
batchSize is the number of calls made in each loop iteration; it starts at initialBatchSize and, if the batch succeeds, doubles on the next iteration (capped at remaining):
for batchSize := integer.IntMin(remaining, initialBatchSize); batchSize > 0; batchSize = integer.IntMin(2*batchSize, remaining)
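To make the growth pattern concrete, here is a minimal standalone sketch (not the Kubernetes code itself) that prints the batch sizes slowStartBatch would use for 20 creations with an initial batch size of 1, assuming every call succeeds:
package main

import "fmt"

func main() {
    remaining, batchSize := 20, 1
    for remaining > 0 {
        if batchSize > remaining {
            batchSize = remaining
        }
        fmt.Printf("batch of %d concurrent creations, %d left afterwards\n", batchSize, remaining-batchSize)
        remaining -= batchSize
        batchSize *= 2
    }
    // Prints batches of 1, 2, 4, 8 and finally 5 creations.
}
If any call in a batch fails, the remaining batches are skipped and the missing creations are picked up on a later sync.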
Goroutines within each batch are synchronized with a WaitGroup:
var wg sync.WaitGroup
wg.Add(batchSize)
for i := 0; i < batchSize; i++ {
go func() {
defer wg.Done()
if err := fn(); err != nil {
errCh <- err
}
}()
}
wg.Wait()
If not all creations could be attempted, the rest are left for the next resync cycle to retry:
// Any skipped pods that we never attempted to start shouldn't be expected.
// The skipped pods will be retried later. The next controller resync will
// retry the slow start process.
if skippedPods := diff - successfulCreations; skippedPods > 0 {
klog.V(2).Infof("Slow-start failure. Skipping creation of %d pods, decrementing expectations for %v %v/%v", skippedPods, rsc.Kind, rs.Namespace, rs.Name)
for i := 0; i < skippedPods; i++ {
// Decrement the expected number of creates because the informer won't observe this pod
rsc.expectations.CreationObserved(rsKey)
}
}
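The bookkeeping this relies on can be reduced to a toy sketch (the real type, controller.UIDTrackingControllerExpectations, also tracks UIDs and expires after a timeout): before creating N Pods the controller records N expected additions, every observed creation (or explicitly dropped expectation, as above) decrements the counter, and SatisfiedExpectations only returns true once nothing is outstanding.
package main

import (
    "fmt"
    "sync/atomic"
)

// expectations is a toy stand-in for the controller's expectations object: it
// only tracks how many Pod creations are still waiting to be observed.
type expectations struct{ pendingAdds int64 }

func (e *expectations) ExpectCreations(n int) { atomic.AddInt64(&e.pendingAdds, int64(n)) }
func (e *expectations) CreationObserved()     { atomic.AddInt64(&e.pendingAdds, -1) }
func (e *expectations) Satisfied() bool       { return atomic.LoadInt64(&e.pendingAdds) <= 0 }

func main() {
    e := &expectations{}
    e.ExpectCreations(3)       // manageReplicas is about to create 3 Pods
    e.CreationObserved()       // the informer sees Pod 1 appear
    e.CreationObserved()       // the informer sees Pod 2 appear
    fmt.Println(e.Satisfied()) // false: one creation still outstanding, skip manageReplicas
    e.CreationObserved()       // Pod 3 is observed, or its expectation is dropped as a skipped pod
    fmt.Println(e.Satisfied()) // true: the next sync may run manageReplicas again
}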
If diff > 0, Pods need to be deleted:
else if diff > 0 {
if diff > rsc.burstReplicas {
diff = rsc.burstReplicas
}
Get the list of Pods to delete: sort them by state and take the first diff of them:
func getPodsToDelete(filteredPods []*v1.Pod, diff int) []*v1.Pod {
// No need to sort pods if we are about to delete all of them.
// diff will always be <= len(filteredPods), so not need to handle > case.
if diff < len(filteredPods) {
// Sort the pods in the order such that not-ready < ready, unscheduled
// < scheduled, and pending < running. This ensures that we delete pods
// in the earlier stages whenever possible.
sort.Sort(controller.ActivePods(filteredPods))
}
return filteredPods[:diff]
}
The Pod ordering rules:
func (s ActivePods) Less(i, j int) bool {
// 1. Unassigned < assigned
// If only one of the pods is unassigned, the unassigned one is smaller
if s[i].Spec.NodeName != s[j].Spec.NodeName && (len(s[i].Spec.NodeName) == 0 || len(s[j].Spec.NodeName) == 0) {
return len(s[i].Spec.NodeName) == 0
}
// 2. PodPending < PodUnknown < PodRunning
m := map[v1.PodPhase]int{v1.PodPending: 0, v1.PodUnknown: 1, v1.PodRunning: 2}
if m[s[i].Status.Phase] != m[s[j].Status.Phase] {
return m[s[i].Status.Phase] < m[s[j].Status.Phase]
}
// 3. Not ready < ready
// If only one of the pods is not ready, the not ready one is smaller
if podutil.IsPodReady(s[i]) != podutil.IsPodReady(s[j]) {
return !podutil.IsPodReady(s[i])
}
// TODO: take availability into account when we push minReadySeconds information from deployment into pods,
// see https://github.com/kubernetes/kubernetes/issues/22065
// 4. Been ready for empty time < less time < more time
// If both pods are ready, the latest ready one is smaller
if podutil.IsPodReady(s[i]) && podutil.IsPodReady(s[j]) && !podReadyTime(s[i]).Equal(podReadyTime(s[j])) {
return afterOrZero(podReadyTime(s[i]), podReadyTime(s[j]))
}
// 5. Pods with containers with higher restart counts < lower restart counts
if maxContainerRestarts(s[i]) != maxContainerRestarts(s[j]) {
return maxContainerRestarts(s[i]) > maxContainerRestarts(s[j])
}
// 6. Empty creation time pods < newer pods < older pods
if !s[i].CreationTimestamp.Equal(&s[j].CreationTimestamp) {
return afterOrZero(&s[i].CreationTimestamp, &s[j].CreationTimestamp)
}
return false
}
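A standalone illustration of the first three rules, using a simplified pod struct instead of the real k8s types: pods that sort first are deleted first, so unscheduled, pending and not-ready pods are removed before healthy ones:
package main

import (
    "fmt"
    "sort"
)

type fakePod struct {
    Name     string
    NodeName string // empty means not yet scheduled
    Phase    string // "Pending", "Unknown" or "Running"
    Ready    bool
}

var phaseRank = map[string]int{"Pending": 0, "Unknown": 1, "Running": 2}

type byDeletionPriority []fakePod

func (s byDeletionPriority) Len() int      { return len(s) }
func (s byDeletionPriority) Swap(i, j int) { s[i], s[j] = s[j], s[i] }
func (s byDeletionPriority) Less(i, j int) bool {
    // 1. Unassigned < assigned
    if (s[i].NodeName == "") != (s[j].NodeName == "") {
        return s[i].NodeName == ""
    }
    // 2. PodPending < PodUnknown < PodRunning
    if phaseRank[s[i].Phase] != phaseRank[s[j].Phase] {
        return phaseRank[s[i].Phase] < phaseRank[s[j].Phase]
    }
    // 3. Not ready < ready
    if s[i].Ready != s[j].Ready {
        return !s[i].Ready
    }
    return false
}

func main() {
    pods := []fakePod{
        {Name: "a", NodeName: "node1", Phase: "Running", Ready: true},
        {Name: "b", NodeName: "", Phase: "Pending", Ready: false},
        {Name: "c", NodeName: "node2", Phase: "Running", Ready: false},
    }
    sort.Sort(byDeletionPriority(pods))
    fmt.Println(pods[0].Name) // "b": unscheduled and pending, so it is deleted first
}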
The method that creates Pods:
err := rsc.podControl.CreatePodsWithControllerRef(rs.Namespace, &rs.Spec.Template, rs, controllerRef)
// CreatePodsWithControllerRef creates new pods according to the spec, and sets object as the pod's controller.
CreatePodsWithControllerRef(namespace string, template *v1.PodTemplateSpec, object runtime.Object, controllerRef *metav1.OwnerReference) error
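The controllerRef argument is the owner reference that ends up in every created Pod's metadata.ownerReferences and lets later syncs recognize their Pods. A sketch of how such a reference is built, assuming the k8s.io/api and k8s.io/apimachinery modules are available:
package main

import (
    "fmt"

    apps "k8s.io/api/apps/v1"
    metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

func main() {
    rs := &apps.ReplicaSet{
        ObjectMeta: metav1.ObjectMeta{Name: "my-rs", Namespace: "default", UID: "1234-abcd"},
    }
    // The ReplicaSet controller builds the same kind of reference via
    // metav1.NewControllerRef(rs, controllerKind).
    ref := metav1.NewControllerRef(rs, apps.SchemeGroupVersion.WithKind("ReplicaSet"))
    fmt.Printf("kind=%s name=%s controller=%v blockOwnerDeletion=%v\n",
        ref.Kind, ref.Name, *ref.Controller, *ref.BlockOwnerDeletion)
    // kind=ReplicaSet name=my-rs controller=true blockOwnerDeletion=true
}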
The method that deletes Pods:
err := rsc.podControl.DeletePod(rs.Namespace, targetPod.Name, rs)
// DeletePod deletes the pod identified by podID.
DeletePod(namespace string, podID string, object runtime.Object) error
Calculating and updating the RS status
Compute the new RS status:
func calculateStatus(rs *apps.ReplicaSet, filteredPods []*v1.Pod, manageReplicasErr error) apps.ReplicaSetStatus {
Count the replicas whose labels fully match the template labels:
if templateLabel.Matches(labels.Set(pod.Labels)) {
fullyLabeledReplicasCount++
}
Count the replicas whose Pods are ready:
if podutil.IsPodReady(pod) {
readyReplicasCount++
}
// PodReady means the pod is able to service requests and should be added to the
// load balancing pools of all matching services.
PodReady PodConditionType = "Ready"
Count the available Pods:
if podutil.IsPodAvailable(pod, rs.Spec.MinReadySeconds, metav1.Now()) {
availableReplicasCount++
}
// IsPodAvailable returns true if a pod is available; false otherwise.
// Precondition for an available pod is that it must be ready. On top
// of that, there are two cases when a pod can be considered available:
// 1. minReadySeconds == 0, or
// 2. LastTransitionTime (is set) + minReadySeconds < current time
func IsPodAvailable(pod *v1.Pod, minReadySeconds int32, now metav1.Time) bool {
if !IsPodReady(pod) {
return false
}
c := GetPodReadyCondition(pod.Status)
minReadySecondsDuration := time.Duration(minReadySeconds) * time.Second
if minReadySeconds == 0 || !c.LastTransitionTime.IsZero() && c.LastTransitionTime.Add(minReadySecondsDuration).Before(now.Time) {
return true
}
return false
}
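The same check as a minimal, dependency-free sketch, with the Pod's Ready condition reduced to a bool plus a timestamp: a ready Pod only counts as available once minReadySeconds have passed since the Ready condition's LastTransitionTime, or immediately if minReadySeconds is 0:
package main

import (
    "fmt"
    "time"
)

func isAvailable(ready bool, lastTransition time.Time, minReadySeconds int32, now time.Time) bool {
    if !ready {
        return false
    }
    minReady := time.Duration(minReadySeconds) * time.Second
    return minReadySeconds == 0 ||
        (!lastTransition.IsZero() && lastTransition.Add(minReady).Before(now))
}

func main() {
    now := time.Now()
    fmt.Println(isAvailable(true, now.Add(-5*time.Second), 10, now))  // false: ready for only 5s
    fmt.Println(isAvailable(true, now.Add(-30*time.Second), 10, now)) // true: ready for 30s
}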
Update the RS status:
// Always updates status as pods come up or die.
updatedRS, err := updateReplicaSetStatus(rsc.kubeClient.AppsV1().ReplicaSets(rs.Namespace), rs, newStatus)
// updateReplicaSetStatus attempts to update the Status.Replicas of the given ReplicaSet, with a single GET/PUT retry.
func updateReplicaSetStatus(c appsclient.ReplicaSetInterface, rs *apps.ReplicaSet, newStatus apps.ReplicaSetStatus) (*apps.ReplicaSet, error) {
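The "single GET/PUT retry" can be sketched without any Kubernetes dependencies; the function and callbacks below are hypothetical stand-ins, not the real client-go API. The idea: attempt the status write, and on a conflict re-GET the object once to refresh its resourceVersion before trying again:
package main

import (
    "errors"
    "fmt"
)

var errConflict = errors.New("conflict: the object has been modified")

// updateStatusWithOneRetry tries the status update at most twice: the original
// attempt plus one retry after refreshing the object on a conflict.
func updateStatusWithOneRetry(update func() error, refetch func() error) error {
    var err error
    for attempt := 0; attempt <= 1; attempt++ {
        if err = update(); err == nil || !errors.Is(err, errConflict) {
            return err
        }
        // Conflict: someone else wrote the object first; re-GET it to pick up
        // the latest resourceVersion, then try once more.
        if refErr := refetch(); refErr != nil {
            return refErr
        }
    }
    return err
}

func main() {
    calls := 0
    update := func() error {
        calls++
        if calls == 1 {
            return errConflict // first PUT loses the race
        }
        return nil
    }
    refetch := func() error { return nil }
    fmt.Println(updateStatusWithOneRetry(update, refetch)) // <nil>: the retry succeeds
}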
To summarize, the key pieces are:
1. The core object interface:
// A TTLCache of pod creates/deletes each rc expects to see.
expectations *controller.UIDTrackingControllerExpectations
2. The slow-start dispatch mechanism
3. Pod ordering rules for deletion
4. Pod operations: creation and deletion