
First Experience: Capturing MySQL Changes with Debezium and Consuming Them with Flink

Environment Preparation

  • MySQL (with binlog enabled; a minimal my.cnf sketch follows this list)
  • Kafka (not needed if you use the embedded Debezium engine)
  • The Debezium connector
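
Enabling the binlog is the MySQL-side prerequisite. A minimal my.cnf sketch, assuming a self-managed server (the server-id value here is an arbitrary example):

[mysqld]
# Unique numeric id for this server in the replication topology (example value).
server-id        = 223344
log_bin          = mysql-bin
# Debezium requires row-based binlog with full row images.
binlog_format    = ROW
binlog_row_image = FULL

You can verify the setting with SHOW VARIABLES LIKE 'log_bin'; which should report ON.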

Official reference: https://debezium.io/documentation/reference/1.3/tutorial.html

Installing the Debezium Connector in a Kafka Environment

Upload the MySQL connector downloaded from the official site to the Kafka server and extract it. My extraction path is /opt/kafka/plugins/debezium-connector-mysql.

Then edit /opt/kafka/config/connect-distributed.properties:

 Note: because this is a distributed setup, we configure connect-distributed.properties (rather than connect-standalone.properties).

# Kafka Connect distributed worker configuration (Apache license header omitted).
# Some settings here differ from what a production system would use, especially
# bootstrap.servers and the replication factors.

# Host/port pairs used for the initial connection to the Kafka cluster.
bootstrap.servers=kafka1:9092,kafka2:9093,kafka3:9094

# Unique name for the Connect cluster group; must not conflict with consumer group IDs.
group.id=connect-cluster

# Converters specify the format of data in Kafka and how to translate it into Connect data.
key.converter=org.apache.kafka.connect.json.JsonConverter
value.converter=org.apache.kafka.connect.json.JsonConverter
# Converter-specific settings are prefixed with the converter they apply to.
# Enabling these makes every message carry its schema information.
key.converter.schemas.enable=true
value.converter.schemas.enable=true

# Topics for storing offsets, connector/task configurations, and statuses.
# Kafka Connect creates them automatically when needed. The replication factor
# is set to 1 so the example also runs on a single-broker cluster; most
# production systems will want the default of 3 or higher.
offset.storage.topic=connect-offsets
offset.storage.replication.factor=1
#offset.storage.partitions=25

# The config topic should be a single-partition, highly replicated, compacted topic.
config.storage.topic=connect-configs
config.storage.replication.factor=1

status.storage.topic=connect-status
status.storage.replication.factor=1
#status.storage.partitions=5

# Flush much faster than normal, which is useful for testing/debugging.
offset.flush.interval.ms=10000

# Hostname & port for the REST API to listen on.
#rest.host.name=
#rest.port=8083
#rest.advertised.host.name=
#rest.advertised.port=

# Comma-separated list of filesystem paths to enable class-loading isolation for
# plugins (connectors, converters, transformations). Entries should be top-level
# directories containing plugin jars or uber-jars.
plugin.path=/opt/kafka/plugins

The key setting is plugin.path. Note that it must point to the parent directory of the connector's extraction directory (here /opt/kafka/plugins, not /opt/kafka/plugins/debezium-connector-mysql).

Starting Kafka Connect

Just one shell command:

bin/connect-distributed.sh config/connect-distributed.properties
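
This runs the worker in the foreground. If you want it to survive your logout (an assumption about your deployment, not a requirement), one common approach is to background it:

nohup bin/connect-distributed.sh config/connect-distributed.properties > connect.log 2>&1 &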

The full Kafka Connect workflow is documented on the official site, but in short everything is driven through its REST API (port 8083 by default), mostly via POST requests. After startup, first check whether the worker is up:

curl -H "Accept:application/json" kafka1:8083/
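
If the worker is running, it answers with a small JSON document containing the Kafka version and commit id (exact values depend on your cluster). You can also confirm that the Debezium plugin was picked up from plugin.path by listing the installed connector plugins; io.debezium.connector.mysql.MySqlConnector should appear in the response:

curl -H "Accept:application/json" kafka1:8083/connector-plugins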

Registering the MySQL Connector

Here we simply copy a demo from the official site and tweak the parameters:

curl -i -X POST -H "Accept:application/json" -H "Content-Type:application/json" kafka1:8083/connectors/ -d '{
  "name": "connector_demo",
  "config": {
    "connector.class": "io.debezium.connector.mysql.MySqlConnector",
    "tasks.max": "1",
    "database.hostname": "host.docker.internal",
    "database.port": "3306",
    "database.user": "root",
    "database.password": "123",
    "database.server.id": "184054",
    "database.server.name": "dbserver1",
    "database.include.list": "sensor_offset",
    "database.history.kafka.bootstrap.servers": "kafka1:9092",
    "database.history.kafka.topic": "dbhistory.sensor_offset"
  }
}'

For the full list of configuration options, see the official docs: https://debezium.io/documentation/reference/1.3/connectors/mysql.html#configure-the-mysql-connector_debezium
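
To verify that the connector actually started, query its status through the standard Connect REST API; "state" should be RUNNING for both the connector and its task:

curl -H "Accept:application/json" kafka1:8083/connectors/connector_demo/status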

Afterwards you will see several new topics in your Kafka cluster: one per captured table, named <server.name>.<database>.<table> (e.g. dbserver1.sensor_offset.offset_manager), plus the database-history topic dbhistory.sensor_offset.
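
With a reasonably recent Kafka you can list them with the stock tooling:

bin/kafka-topics.sh --bootstrap-server kafka1:9092 --list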

Configuring the Flink Connector

As usual, copy from the official docs: https://ci.apache.org/projects/flink/flink-docs-release-1.11/zh/dev/table/connectors/formats/debezium.html
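
The debezium-json format ships with Flink's flink-json module, so the job needs that dependency on its classpath (the version shown matches the 1.11 docs linked above; adjust to your Flink version):

<dependency>
    <groupId>org.apache.flink</groupId>
    <artifactId>flink-json</artifactId>
    <version>1.11.2</version>
</dependency>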

Use the Table API to consume the change-log topic directly:

import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.table.api.SqlDialect;
import org.apache.flink.table.api.Table;
import org.apache.flink.table.api.bridge.java.StreamTableEnvironment;
import org.apache.flink.types.Row;
import org.junit.Test;

public class DebeziumConsumerTest {

    // Streaming environment plus its Table API wrapper.
    private final StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
    private final StreamTableEnvironment tableEnvironment = StreamTableEnvironment.create(env);

    @Test
    public void testDebezium() throws Exception {
        tableEnvironment.getConfig().setSqlDialect(SqlDialect.DEFAULT);
        // Map the Debezium change-log topic to a dynamic table; 'debezium-json'
        // turns each change event into insertions and retractions.
        tableEnvironment.executeSql("CREATE TABLE offset_manager (\n" +
                "  groupid STRING,\n" +
                "  topic STRING,\n" +
                "  `partition` INT,\n" +
                "  untiloffset INT\n" +
                ") WITH (\n" +
                " 'connector' = 'kafka',\n" +
                " 'topic' = 'dbserver1.sensor_offset.offset_manager',\n" +
                " 'properties.bootstrap.servers' = 'kafka1:9092',\n" +
                " 'properties.group.id' = 'testGroup',\n" +
                " 'format' = 'debezium-json',\n" +
                // The worker config above sets schemas.enable=true, so tell the
                // format to expect the schema envelope in each message.
                " 'debezium-json.schema-include' = 'true'\n" +
                ")");

        Table offset_manager = tableEnvironment.from("offset_manager");
        // Retract stream: the Boolean flag marks additions (true) vs. retractions (false).
        tableEnvironment.toRetractStream(offset_manager, Row.class).print();
        env.execute();
    }
}
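
Run the test, then change a row in the source table. An UPDATE arrives as a retraction of the old row followed by an addition of the new one, so the printed output looks roughly like this (field values are made up for illustration):

(true,groupA,topicA,0,100)     <- snapshot / INSERT
(false,groupA,topicA,0,100)    <- UPDATE retracts the old row
(true,groupA,topicA,0,200)     <- UPDATE adds the new row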