
How to send logs to different Kafka topics in filebeat 6.0.0

In November 2017, Elastic released Elastic Stack 6.0.0. The release contains quite a few changes; see the official release notes and breaking changes for details. We upgraded our log analysis system to 6.0.0 right away, and of course ran into some pitfalls. Here I describe how the upgrade affects filebeat.

In filebeat 5.x, if you want a single filebeat agent to collect different logs and publish them to different Kafka topics, you can do the following:

  • Define multiple prospectors and set a different document_type for each prospector.
  • In the kafka output, use %{[type]} to read the document_type value and use it as the topic. (Note: do not use the topics setting that some posts online suggest; it is unnecessary here.)

For example:

filebeat.prospectors:
    # App logs - prospector
    - input_type: log
      paths:
        - /myapp/logs/myapp.log
      exclude_lines: [".+? INFO[^*].+", ".+? DEBUG[^*].+"]
      exclude_files: [".gz$", ".tmp"]
      fields:
        api: myappapi
        environment: STG
      ignore_older: 24h
      document_type: applog_myappapi
      scan_frequency: 1s
      # Multiline on Timestamp, YYYY-MM-DD
      # https://www.elastic.co/guide/en/beats/filebeat/master/multiline-examples.html
      multiline:
        pattern: '^[0-9]{4}-[0-9]{2}-[0-9]{2}'
        negate: true
        match: after
        max_lines: 500
        timeout: 5s

    # Server Stats - prospector
    - input_type: log
      paths:
        - /myapp/logs/serverstats.log
      # Exclude messages with log level
      exclude_lines: [".+? ERROR[^*].+", ".+? DEBUG[^*].+"]
      exclude_files: [".gz$", ".tmp"]
      fields:
        api: myappapi
        environment: STG
      ignore_older: 24h
      document_type: applog_myappapi_stats
      scan_frequency: 1s

    # ELB prospector
    - input_type: log
      paths:
        - /var/log/httpd/elasticbeanstalk-access_log
      document_type: elblog_myappapi
      fields:
        api: myappapi
        environment: STG
      exclude_lines: [".+? INFO[^*].+", ".+? DEBUG[^*].+"]
      exclude_files: [".gz$", ".tmp"]
      ignore_older: 24h
      # 0s, it is done as often as possible. Default: 10s
      scan_frequency: 1s

filebeat.registry_file: /var/lib/filebeat/registry

############################# Output ##########################################

# Configure what outputs to use when sending the data collected by the beat.
# Multiple outputs may be used.

#----------------------------- Kafka output --------------------------------
output.kafka:
  # initial brokers for reading cluster metadata
  hosts: ["broker.1.ip.address:9092", "broker.2.ip.address:9092", "broker.3.ip.address:9092"]

  # message topic selection + partitioning
  topic: '%{[type]}'
  partition.round_robin:
    reachable_only: false

  required_acks: 1
  compression: gzip
  max_message_bytes: 1000000
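With this configuration, each prospector's document_type value (applog_myappapi, applog_myappapi_stats, elblog_myappapi) becomes the Kafka topic for its events. To confirm that messages actually arrive on the expected topic, you can tail it with Kafka's console consumer (script name and flags vary slightly across Kafka versions):

# Read one of the per-type topics from the beginning to verify delivery
kafka-console-consumer.sh --bootstrap-server broker.1.ip.address:9092 \
    --topic applog_myappapi --from-beginning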

However, after upgrading to 6.0.0, the document_type option is deprecated, so its value can no longer be read via %{[type]}. With the same configuration, filebeat fails to resolve the topic and cannot publish any messages to Kafka. Worse, there is no error log, and CPU usage stays pinned at 100%…
Even more annoying, the 6.0.0 documentation was not updated for this change and still gives the following example:

output.kafka:
  # initial brokers for reading cluster metadata
  hosts: ["kafka1:9092", "kafka2:9092", "kafka3:9092"]

  # message topic selection + partitioning
  topic: '%{[type]}'
  partition.round_robin:
    reachable_only: false

  required_acks: 1
  compression: gzip
  max_message_bytes: 1000000
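Because filebeat fails silently here, one way to see what is actually going on is to run it in the foreground with debug selectors enabled (shown with the default Linux package config path; adjust to your setup):

# -e logs to stderr, -d "*" enables debug output for all selectors
filebeat -e -c /etc/filebeat/filebeat.yml -d "*"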

The correct approach is to define the topic name under fields, and then read it back with %{[fields][log_topics]}. Example:

filebeat.prospectors:

# Each - is a prospector. Most options can be set at the prospector level, so
# you can use different prospectors for various configurations.
# Below are the prospector specific configurations.

- type: log

  # Change to true to enable this prospector configuration.
  enabled: true

  # Paths that should be crawled and fetched. Glob based paths.
  paths:
    - /var/log/test1.log
    #- c:\programdata\elasticsearch\logs\*
  fields:
    log_topics: test1
- type: log
  enabled: true
  paths:
    - /var/log/test2.log
  fields:
    log_topics: test2
#----------------------------- kafka output --------------------------------
output.kafka:
# Boolean flag to enable or disable the output module.
  enabled: true

# The list of Kafka broker addresses from where to fetch the cluster metadata.
# The cluster metadata contain the actual Kafka brokers events are published
# to.
  hosts: [{{kafka_url}}]

# The Kafka topic used for produced events. The setting can be a format string
# using any event field. To set the topic from document type use `%{[type]}`.
  topic: '%{[fields][log_topics]}'
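If you would rather not carry the fields prefix around, filebeat also supports fields_under_root, which stores custom fields at the top level of the event; the topic format string then drops the [fields] segment. A minimal sketch of the same setup with that option:

- type: log
  enabled: true
  paths:
    - /var/log/test1.log
  fields:
    log_topics: test1
  # Store custom fields at the top level of the event instead of under "fields"
  fields_under_root: true

output.kafka:
  enabled: true
  hosts: [{{kafka_url}}]
  # log_topics is now a top-level field
  topic: '%{[log_topics]}'

Note that with fields_under_root, the Logstash conditionals below would test [log_topics] instead of [fields][log_topics].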

Correspondingly, on the Logstash side, if you want to parse each topic differently:

input {
  kafka{
        bootstrap_servers => "{{kafka_url}}"
        topics => ["test1","test2"]
        codec => "json"
        consumer_threads => 2
        enable_auto_commit => true
        auto_commit_interval_ms => "1000"
        group_id => "test"
  }
}
filter {
  if [fields][log_topics] == "test1" {
    grok {
      patterns_dir => ["./patterns"]
      match => {
        "message" => "%{PLATFORM_SYSLOG}"
      }
    }
  }
  if [fields][log_topics] == "test2" {
    grok {
      patterns_dir => ["./patterns"]
      match => {
        "message" => "%{IAM_SYSLOG}"
      }
    }
  }
}
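PLATFORM_SYSLOG and IAM_SYSLOG above are custom grok patterns loaded from the ./patterns directory; they are not part of grok's built-in set and are not shown in the original. A pattern file is plain text with one NAME regex pair per line; a purely hypothetical entry might look like:

# ./patterns/custom (hypothetical illustration)
PLATFORM_SYSLOG %{TIMESTAMP_ISO8601:timestamp} %{LOGLEVEL:level} %{GREEDYDATA:msg}

The same [fields][log_topics] conditional works in the output section as well, if each topic should end up in its own Elasticsearch index. A minimal sketch, assuming an {{es_url}} placeholder in the same style as {{kafka_url}} above:

output {
  if [fields][log_topics] == "test1" {
    elasticsearch {
      hosts => ["{{es_url}}"]
      index => "test1-%{+YYYY.MM.dd}"
    }
  } else if [fields][log_topics] == "test2" {
    elasticsearch {
      hosts => ["{{es_url}}"]
      index => "test2-%{+YYYY.MM.dd}"
    }
  }
}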