kafkaChannel實現一個source下，不同日誌採集到kafka不同主題中

阿新 • • 發佈：2018-11-12

1.需求

使用flume採集資料，在使用一個source情況下，將不同的日誌採集到指定的kafka的主題中。

例如：有兩個日誌檔案：error.log和info.log

error.log採集到kafka的kafka_channel主題

info.log採集到kafka的kafka_channel2主題

2.解決方案

我們使用tailDir source 和kafkaChannel

思路：

使用a0.sources.r1.headers.f1.headerKey = error，a0.sources.r1.headers.f2.headerKey = info。去設定event的一個header值，不同檔案設定不同的header值，用於區分，其中headerKey可以隨便設定，就是header中的一個key而已，

在原始碼中找到kafka-channel，在都doPut()方法中，去獲去每一個event的header，我們知道event的hader一個map。然後header.get(headerKey)獲取我們設定的頭標記，如果是error，kafka的主題設定為kafka_channel如果是info，則kafka的主題設定為kafka_channel2，也就是如下程式碼邏輯。

String type=headers.get("headerKey");
if(type.equals("info")){
  topicStr="kafka_channel2";
}else if(type.equals("error")){
  topicStr="kafka_channel";
}

原始碼更改

更改前：

 protected void doPut(Event event) throws InterruptedException {
      type = TransactionType.PUT;
      if (!producerRecords.isPresent()) {
        producerRecords = Optional.of(new LinkedList<ProducerRecord<String, byte[]>>());
      }
      String key = event.getHeaders().get(KEY_HEADER);
      //get header
      Map<String, String> headers = event.getHeaders();
      String  topicStr=null;
      Integer partitionId = null;
     
      try {
      if (staticPartitionId != null) {
          partitionId = staticPartitionId;
        }
        if (partitionHeader != null) {
          String headerVal = event.getHeaders().get(partitionHeader);
          if (headerVal != null) {
            partitionId = Integer.parseInt(headerVal);
          }
        }
       
        if (partitionId != null) {
          producerRecords.get().add(
 new ProducerRecord<String, byte[]>(topic.get(), partitionId, key,
                                       serializeValue(event, parseAsFlumeEvent)));
        } else {
          producerRecords.get().add(
            new ProducerRecord<String, byte[]>(topic.get(), key,
                                serializeValue(event, parseAsFlumeEvent)));
        }
      } catch (NumberFormatException e) {
        throw new ChannelException("Non integer partition id specified", e);
      } catch (Exception e) {
        throw new ChannelException("Error while serializing event", e);
      }
    }

更改後：

 protected void doPut(Event event) throws InterruptedException {
      type = TransactionType.PUT;
      if (!producerRecords.isPresent()) {
        producerRecords = Optional.of(new LinkedList<ProducerRecord<String, byte[]>>());
      }
      String key = event.getHeaders().get(KEY_HEADER);
      //get header
      Map<String, String> headers = event.getHeaders();
      String  topicStr=null;
      Integer partitionId = null;
      /**
       * 在這可以更改程式碼邏輯，實現：資料傳送到指定的kafka分割槽中
       */
      try {
      if (staticPartitionId != null) {
          partitionId = staticPartitionId;
        }
        if (partitionHeader != null) {
          String headerVal = event.getHeaders().get(partitionHeader);
          if (headerVal != null) {
            partitionId = Integer.parseInt(headerVal);
          }
        }
        /**
         *新增的邏輯
         */
        String type=headers.get("headerKey");
        if(type.equals("info")){
          topicStr="kafka_channel2";
        }else if(type.equals("error")){
          topicStr="kafka_channel";
        }
        if (partitionId != null) {
          producerRecords.get().add(
              new ProducerRecord<String, byte[]>(topicStr, partitionId, key,
                                                 serializeValue(event, parseAsFlumeEvent)));
        } else {
          producerRecords.get().add(
              new ProducerRecord<String, byte[]>(topicStr, key,
                                                 serializeValue(event, parseAsFlumeEvent)));
        }
      } catch (NumberFormatException e) {
        throw new ChannelException("Non integer partition id specified", e);
      } catch (Exception e) {
        throw new ChannelException("Error while serializing event", e);
      }
    }

採集方法

注意：

更改原始碼後，不需要在配置檔案中指定kafka的主題，當然指定主題也不錯，但是已經沒作用了，已經在程式碼中更改了。如果你有精力還可以把不同的kafka主題寫到properties配置檔案中，把程式寫活一點。在相同的思路下你還可以做到顆粒更細：就是指定主題和分割槽，通過條件判斷更改topic和partitionId。最後kafkaSink要想實現這些功能更改原始碼的思路是一樣的。

a0.sources = r1 
a0.channels = c1  

a0.sources.r1.type = TAILDIR
#通過 json 格式存下每個檔案消費的偏移量，避免從頭消費
a0.sources.r1.positionFile = /data/server/flume-1.8.0/conf/taildir_position.json
a0.sources.r1.filegroups = f1 f2
#配置f1資訊
a0.sources.r1.headers.f1.headerKey = error
a0.sources.r1.filegroups.f1 = /data/access/error.log
#配置f1資訊
a0.sources.r1.headers.f2.headerKey = info
a0.sources.r1.filegroups.f2 = /data/access/info.log
#是否新增一個儲存的絕對路徑名的標頭檔案
#a0.sources.r1.fileHeader = true

#攔截器獲取伺服器的主機名
a0.sources.r1.interceptors = i1 i2 i3
#a0.sources.r1.interceptors.i1.type = org.apache.flume.interceptor.HostInterceptor$Builder
a0.sources.r1.interceptors.i1.type = org.apache.flume.host.MyHostInterceptor$Builder
a0.sources.r1.interceptors.i1.preserveExisting = false
#a0.sources.r1.interceptors.i1.useIP = false
a0.sources.r1.interceptors.i1.HeaderName= agentHost
#靜態過濾器新增指定的標誌
a0.sources.r1.interceptors.i2.type = org.apache.flume.interceptor.StaticInterceptor$Builder
a0.sources.r1.interceptors.i2.key = logType
a0.sources.r1.interceptors.i2.value= kafka_data
a0.sources.r1.interceptors.i2.preserveExisting = false
#新增時間戳
a0.sources.r1.interceptors.i3.type = timestamp
#定義channel
a0.channels.c1.type = org.apache.flume.channel.kafka.KafkaChannel
a0.channels.c1.kafka.bootstrap.servers = 10.2.40.10:9092,10.2.40.14:9092,10.2.40.15:9092
a0.channels.c1.parseAsFlumeEvent = false
#a0.channels.c1.kafka.producer.compression.type = lz4
a0.sources.r1.channels = c1

kafkaChannel實現一個source下，不同日誌採集到kafka不同主題中

1.需求

2.解決方案

原始碼更改

採集方法

kafkaChannel實現一個source下，不同日誌採集到kafka不同主題中

三種方法實現一個函數，可以左旋字符串中的k個字符

實現一個函數，可以左旋字符串中的k個字符。

用shell實現一個小指令碼，用來同來統計自己某個檔案下的程式碼，總的程式碼行數，總的註釋量，總的空行量？支援遍歷查詢，支援軟連結查詢

[Unity3D 版本5.X]實現一個跟隨攝像機，聚焦到客戶端主角身上

請實現一個函數，將一個字符串中的空格替換成“%20”。例如，當字符串為We Are Happy.則經過替換之後的字符串為We%20Are%20Happy。

請實現一個裝飾器，限制該函數被調用的頻率，如10秒一次

算法：用兩個棧來實現一個隊列，完成隊列的Push和Pop操作。隊列中的元素為int類型。《劍指offer》

面試題9-用兩個棧來實現一個隊列，完成隊列的Push和Pop操作

利用切片操作，實現一個trim()函式，去除字串首尾的空格，注意不要呼叫str的strip()方法：# 測試: if trim('hello ') != 'hello': print('測試失敗!') elif trim(' hello'

Swift：我的第二個Demo（textField實現一個登入介面，沒有完成點選空白鍵盤）

請實現一個函數，將一個字符串中的每個空格替換成“%20”。例如，當字符串為We Are Happy.則經過替換之後的字符串為We%20Are%20Happy

利用切片操作，實現一個trim()函式，去除字串首尾的空格，注意不要呼叫str的strip()方法：

springboot程式logback日誌基本配置，多個包不同日誌級別輸入到檔案中

利用切片操作，實現一個trim()函式，去除字串首尾的空格

實現一個算法，尋找字符串中出現次數最少的、並且首次出現位置最前的字符如"cbaacfdeaebb"，符合要求的是"f"，因為他只出現了一次（次數最少）。並且比其他只出現一次的字符（如"d"）首次出現的位置最靠前。

JavaScript：用原生js實現重力條件下，可拖拽小球的碰撞運動

用 Log4Net 三步實現 .Net Core 類庫分日誌等級（不同檔案目錄）存日誌

windows下，Kiwi_Syslog日誌伺服器的搭建

使用java來把一個目錄下的所有檔案拷貝到另外一個目錄下，並且重新命名

kafkaChannel實現一個source下，不同日誌採集到kafka不同主題中

1.需求

2.解決方案

原始碼更改

採集方法

相關推薦