Kafka Source Code Deep Dive - Part 5 - Producer - RecordAccumulator Queue Analysis

阿新 • Published: 2019-02-12

In Part 2 of this series we showed the architecture diagram of the whole Producer client:

[Figure: architecture of the Producer client]

The other components were covered in earlier parts. This part covers the last one, RecordAccumulator.

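Batch sending

In the older Kafka client each message was called a "Message", while the Java client calls it a "Record". Because this component also accumulates records so they can be sent in batches, it is named RecordAccumulator.

The defining feature of RecordAccumulator is batching: several messages put into the queue may be grouped into one RecordBatch, which the Sender then sends out in one go.

One queue per TopicPartition

Below is the internal structure of RecordAccumulator. Each TopicPartition has its own message queue, and only messages for the same TopicPartition can be batched together.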

public final class RecordAccumulator {
    private final ConcurrentMap<TopicPartition, Deque<RecordBatch>> batches;

   ...
}
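
The layout above can be sketched with nothing but the JDK. This is only an illustration, not Kafka code: `PartitionQueues` is a hypothetical class, a `"topic-partition"` string stands in for TopicPartition, and plain strings stand in for RecordBatch. It shows that items for different partitions never share a queue.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;

// Hypothetical stand-in for the accumulator's layout; not a Kafka class.
final class PartitionQueues {
    // One deque per "topic-partition" key; strings stand in for RecordBatch.
    private final ConcurrentMap<String, Deque<String>> batches = new ConcurrentHashMap<>();

    private static String key(String topic, int partition) {
        return topic + "-" + partition;
    }

    // Appends an item to the tail of the deque that belongs to the given partition.
    void add(String topic, int partition, String item) {
        Deque<String> dq = batches.computeIfAbsent(key(topic, partition), k -> new ArrayDeque<>());
        synchronized (dq) { // ArrayDeque is not thread-safe, so lock per deque
            dq.addLast(item);
        }
    }

    // Number of items currently waiting for the given partition.
    int pending(String topic, int partition) {
        Deque<String> dq = batches.get(key(topic, partition));
        if (dq == null)
            return 0;
        synchronized (dq) {
            return dq.size();
        }
    }

    public static void main(String[] args) {
        PartitionQueues q = new PartitionQueues();
        q.add("orders", 0, "a");
        q.add("orders", 0, "b");
        q.add("orders", 1, "c");
        System.out.println(q.pending("orders", 0)); // prints 2
        System.out.println(q.pending("orders", 1)); // prints 1
    }
}
```

Locking each deque separately, rather than the whole map, means producers writing to different partitions do not block each other.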

Batching strategy

So when are messages batched, and when are they not? Let's start from the send method of KafkaProducer:

//KafkaProducer
    public Future<RecordMetadata> send(ProducerRecord<K, V> record, Callback callback) {
        try {
            // first make sure the metadata for the topic is available
            long waitedOnMetadataMs = waitOnMetadata(record.topic(), this.maxBlockTimeMs);

            ...

            // core call: put the message into the queue
            RecordAccumulator.RecordAppendResult result = accumulator.append(tp, serializedKey, serializedValue, callback, remainingWaitMs);

            if (result.batchIsFull || result.newBatchCreated) {
                log.trace("Waking up the sender since topic {} partition {} is either full or getting a new batch", record.topic(), partition);
                this.sender.wakeup();
            }
            return result.future;
            ...

As the code above shows, the batching logic all lives in accumulator.append:

    public RecordAppendResult append(TopicPartition tp, byte[] key, byte[] value, Callback callback, long maxTimeToBlock) throws InterruptedException {
        appendsInProgress.incrementAndGet();
        try {
            if (closed)
                throw new IllegalStateException("Cannot send after the producer is closed.");
            Deque<RecordBatch> dq = dequeFor(tp);  // find the message queue for this TopicPartition
            synchronized (dq) {
                RecordBatch last = dq.peekLast(); // look at the last element of the queue
                if (last != null) {
                    // the last RecordBatch exists, so try to append the Record to it
                    FutureRecordMetadata future = last.tryAppend(key, value, callback, time.milliseconds());
                    if (future != null)
                        return new RecordAppendResult(future, dq.size() > 1 || last.records.isFull(), false);
                }
            }

            int size = Math.max(this.batchSize, Records.LOG_OVERHEAD + Record.recordSize(key, value));
            log.trace("Allocating a new {} byte message buffer for topic {} partition {}", size, tp.topic(), tp.partition());
            ByteBuffer buffer = free.allocate(size, maxTimeToBlock);
            synchronized (dq) {
                // Need to check if producer is closed again after grabbing the dequeue lock.
                if (closed)
                    throw new IllegalStateException("Cannot send after the producer is closed.");
                RecordBatch last = dq.peekLast();
                if (last != null) {
                    FutureRecordMetadata future = last.tryAppend(key, value, callback, time.milliseconds());
                    if (future != null) {
                        // Somebody else found us a batch, return the one we waited for! Hopefully this doesn't happen often...
                        free.deallocate(buffer);
                        return new RecordAppendResult(future, dq.size() > 1 || last.records.isFull(), false);
                    }
                }

                // No RecordBatch in the queue (or the last one has no room left): create a new one and put the Record in it
                MemoryRecords records = MemoryRecords.emptyRecords(buffer, compression, this.batchSize);
                RecordBatch batch = new RecordBatch(tp, records, time.milliseconds());
                FutureRecordMetadata future = Utils.notNull(batch.tryAppend(key, value, callback, time.milliseconds()));

                dq.addLast(batch);
                incomplete.add(batch);
                return new RecordAppendResult(future, dq.size() > 1 || batch.records.isFull(), true);
            }
        } finally {
            appendsInProgress.decrementAndGet();
        }
    }

    private Deque<RecordBatch> dequeFor(TopicPartition tp) {
        Deque<RecordBatch> d = this.batches.get(tp);
        if (d != null)
            return d;
        d = new ArrayDeque<>();
        Deque<RecordBatch> previous = this.batches.putIfAbsent(tp, d);
        if (previous == null)
            return d;
        else
            return previous;
    }
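
The shape of append (try the last batch under the lock, allocate outside the lock, then re-check under the lock) can be sketched as follows. `MiniBatch` and `MiniAccumulator` are hypothetical simplifications: a batch is capped by record count instead of bytes, there is a single queue, and constructing a `MiniBatch` stands in for `free.allocate(...)`. The sketch assumes a batch capacity of at least 1.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical, simplified batch: holds at most `capacity` records.
final class MiniBatch {
    private final int capacity;
    final List<String> records = new ArrayList<>();

    MiniBatch(int capacity) {
        this.capacity = capacity;
    }

    // Returns true if the record fit, false if the batch is already full.
    boolean tryAppend(String record) {
        if (records.size() >= capacity)
            return false;
        records.add(record);
        return true;
    }
}

// Hypothetical single-queue accumulator that mirrors the two-phase append.
final class MiniAccumulator {
    private final Deque<MiniBatch> dq = new ArrayDeque<>();
    private final int batchCapacity; // assumed to be >= 1

    MiniAccumulator(int batchCapacity) {
        this.batchCapacity = batchCapacity;
    }

    // Returns true if a new batch had to be created for this record.
    boolean append(String record) {
        synchronized (dq) { // first attempt: reuse the last batch
            MiniBatch last = dq.peekLast();
            if (last != null && last.tryAppend(record))
                return false;
        }
        // "Allocation" happens outside the lock so a slow allocation does not block other threads.
        MiniBatch fresh = new MiniBatch(batchCapacity);
        synchronized (dq) { // second attempt: another thread may have added a batch meanwhile
            MiniBatch last = dq.peekLast();
            if (last != null && last.tryAppend(record))
                return false; // `fresh` is dropped; the real code returns its buffer to the pool
            fresh.tryAppend(record);
            dq.addLast(fresh);
            return true;
        }
    }

    int batchCount() {
        synchronized (dq) {
            return dq.size();
        }
    }

    public static void main(String[] args) {
        MiniAccumulator acc = new MiniAccumulator(2);
        System.out.println(acc.append("a")); // prints true: queue was empty
        System.out.println(acc.append("b")); // prints false: fits into the first batch
        System.out.println(acc.append("c")); // prints true: first batch is full
    }
}
```

The second check is what makes it safe to release the lock during allocation: if another thread created a batch in the meantime, the record joins that batch instead of opening a redundant one.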

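From the code above we can read off the batching strategy:

1. With synchronous sending (the caller waits for each send to complete before issuing the next), the queue is empty every time append looks at it, so nothing is batched: each Record becomes its own RecordBatch.

2. If the Producer's enqueue rate < the Sender's dequeue rate && lingerMs = 0, messages are not batched either.

3. If the Producer's enqueue rate > the Sender's dequeue rate, messages are batched.

4. If lingerMs > 0, the Sender waits until the first RecordBatch has waited at least lingerMs, or the queue holds more than one RecordBatch, or that RecordBatch is full, and then sends. Buffer pool exhaustion, closing the producer, and an in-progress flush also make a batch sendable. This logic lives in the ready method of RecordAccumulator: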

    public ReadyCheckResult ready(Cluster cluster, long nowMs) {
        Set<Node> readyNodes = new HashSet<Node>();
        long nextReadyCheckDelayMs = Long.MAX_VALUE;
        boolean unknownLeadersExist = false;

        boolean exhausted = this.free.queued() > 0;
        for (Map.Entry<TopicPartition, Deque<RecordBatch>> entry : this.batches.entrySet()) {
            TopicPartition part = entry.getKey();
            Deque<RecordBatch> deque = entry.getValue();

            Node leader = cluster.leaderFor(part);
            if (leader == null) {
                unknownLeadersExist = true;
            } else if (!readyNodes.contains(leader)) {
                synchronized (deque) {
                    RecordBatch batch = deque.peekFirst();
                    if (batch != null) {
                        boolean backingOff = batch.attempts > 0 && batch.lastAttemptMs + retryBackoffMs > nowMs;
                        long waitedTimeMs = nowMs - batch.lastAttemptMs;
                        long timeToWaitMs = backingOff ? retryBackoffMs : lingerMs;
                        long timeLeftMs = Math.max(timeToWaitMs - waitedTimeMs, 0);
                        boolean full = deque.size() > 1 || batch.records.isFull();
                        boolean expired = waitedTimeMs >= timeToWaitMs;
                        boolean sendable = full || expired || exhausted || closed || flushInProgress();  // the key line
                        if (sendable && !backingOff) {
                            readyNodes.add(leader);
                        } else {
                            nextReadyCheckDelayMs = Math.min(timeLeftMs, nextReadyCheckDelayMs);
                        }
                    }
                }
            }
        }

        return new ReadyCheckResult(readyNodes, nextReadyCheckDelayMs, unknownLeadersExist);
    }
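
To see how the conditions in ready interact, here is a small sketch that copies only the boolean logic into a standalone helper. `ReadyCheck.sendable` is a hypothetical function, not part of Kafka; it leaves out `exhausted`, `closed` and `flushInProgress()`, and it assumes that a fresh batch's `lastAttemptMs` starts at its creation time.

```java
// Hypothetical helper that mirrors the boolean logic of ready(); not a Kafka class.
final class ReadyCheck {
    static boolean sendable(int attempts, long lastAttemptMs, long nowMs,
                            long lingerMs, long retryBackoffMs,
                            int dequeSize, boolean firstBatchFull) {
        // A batch that has already failed must sit out the retry backoff.
        boolean backingOff = attempts > 0 && lastAttemptMs + retryBackoffMs > nowMs;
        long waitedTimeMs = nowMs - lastAttemptMs;
        long timeToWaitMs = backingOff ? retryBackoffMs : lingerMs;
        boolean full = dequeSize > 1 || firstBatchFull;
        boolean expired = waitedTimeMs >= timeToWaitMs;
        // exhausted, closed and flushInProgress() are deliberately left out of this sketch
        return (full || expired) && !backingOff;
    }

    public static void main(String[] args) {
        // lingerMs = 0: a brand-new batch is sendable immediately
        System.out.println(sendable(0, 1000, 1000, 0, 100, 1, false));
        // lingerMs = 5, only 2 ms waited, not full: keep waiting
        System.out.println(sendable(0, 1000, 1002, 5, 100, 1, false));
        // lingerMs = 5, but a second batch is already queued: send now
        System.out.println(sendable(0, 1000, 1002, 5, 100, 2, false));
        // a retried batch inside its backoff window is never sendable, even if full
        System.out.println(sendable(1, 1000, 1050, 5, 100, 2, true));
    }
}
```

With lingerMs = 0 the `expired` condition holds as soon as the batch exists, which is why strategy 2 produces no batching when the Sender keeps up with the Producer.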

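Why a Deque?

As we saw above, the message queue is a double-ended queue rather than an ordinary queue.
One end produces and the other consumes, so wouldn't an ordinary queue do? Why double-ended?

The reason is retry after a failed send: when a batch fails and has to be resent, it should go back to the head of the queue so that it is resent first. That requires a deque, which allows insertion at the head and not only at the tail.

Even so, the order in which messages actually go out can end up different from the order in which the Producer enqueued them.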
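
The reordering can be shown with a plain `ArrayDeque`. This is a hand-built scenario, not Kafka code: `RetryOrderDemo` is hypothetical, strings stand in for batches, and it assumes two batches may be in flight at once (in Kafka this corresponds to `max.in.flight.requests.per.connection` being greater than 1), with the first failing and the second succeeding.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.List;

// Hypothetical scenario showing head re-insertion on retry; not a Kafka class.
final class RetryOrderDemo {
    // Returns the order in which the batches are finally delivered.
    static List<String> run() {
        Deque<String> dq = new ArrayDeque<>();
        dq.addLast("B1"); // the Producer enqueues at the tail
        dq.addLast("B2");
        dq.addLast("B3");
        List<String> delivered = new ArrayList<>();

        String first = dq.pollFirst();  // the Sender takes B1 from the head
        String second = dq.pollFirst(); // B2 is sent while B1 is still in flight

        // Assumed outcome: B2 succeeds, B1 fails and must be retried.
        delivered.add(second);
        dq.addFirst(first); // the retry goes to the head, ahead of B3

        while (!dq.isEmpty())
            delivered.add(dq.pollFirst());
        return delivered;
    }

    public static void main(String[] args) {
        System.out.println(run()); // prints [B2, B1, B3]
    }
}
```

Putting B1 back at the head keeps it ahead of B3, but it cannot undo the fact that B2 was already delivered, which is the ordering gap described above.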
