使用最新版flink tidb cdc 同步資料到StarRocks
阿新 • • 發佈:2022-04-19
Flink CDC 2.2 版本新增了 OceanBase CE,PolarDB-X,SqlServer,TiDB 四種資料來源接入。其中新增 OceanBase CDC,SqlServer CDC,TiDB CDC 三個聯結器,而 PolarDB-X 的支援則是通過對 MySQL CDC 聯結器進行相容適配實現。
釋出原文連結:https://blog.csdn.net/weixin_44904816/article/details/123836091
最近在測tidb原始的ticdc(PingCap提供)匯入資料到kafka,再使用flink StarRocks cdc匯入到StarRocks過程中碰到較多問題,剛好可以使用flink新發布的flink tidb cdc來對接,實現從tidb資料同步到StarRocks。
下載flink tidb cdc connector,放在flink lib下:
wget https://repo1.maven.org/maven2/com/ververica/flink-sql-connector-tidb-cdc/2.2.0/flink-sql-connector-tidb-cdc-2.2.0.jar
(客戶成功)sr@cs03:~/app/flink-1.13.3/lib$pwd
/home/disk1/sr/app/flink-1.13.3/lib
(客戶成功)sr@cs03:~/app/flink-1.13.3/lib$ll
total 353940
-rw-rw-r-- 1 sr sr 9404460 Mar 19 17:29 flink-connector-starrocks-1.2.1_flink-1.13_2.12.jar
-rw-r--r-- 1 sr sr 92313 Oct 13 2021 flink-csv-1.13.3.jar
-rw-r--r-- 1 sr sr 115418686 Oct 13 2021 flink-dist_2.11-1.13.3.jar
-rw-r--r-- 1 sr sr 19583 Nov 5 21:27 flink-format-changelog-json-1.4.0.jar
-rw-r--r-- 1 sr sr 148127 Oct 13 2021 flink-json-1.13.3.jar
-rw-rw-r-- 1 sr sr 41368997 Feb 12 2020 flink-shaded-hadoop-2-uber-2.7.5-10.0.jar.bak
-rwxrwxr-x 1 sr sr 7709740 Jun 8 2021 flink-shaded-zookeeper-3.4.14.jar
-rw-rw-r-- 1 sr sr 3674114 Oct 12 2021 flink-sql-connector-kafka_2.12-1.13.3.jar
-rw-rw-r-- 1 sr sr 19648014 Dec 16 23:36 flink-sql-connector-mysql-cdc-2.1.1.jar
-rw-rw-r-- 1 sr sr 85355380 Apr 19 17:03 flink-sql-connector-tidb-cdc-2.2.0.jar
-rw-r--r-- 1 sr sr 36453353 Oct 13 2021 flink-table_2.11-1.13.3.jar
-rw-r--r-- 1 sr sr 41061738 Oct 13 2021 flink-table-blink_2.11-1.13.3.jar
-rwxrwxr-x 1 sr sr 67114 Mar 31 2021 log4j-1.2-api-2.12.1.jar
-rwxrwxr-x 1 sr sr 276771 Mar 31 2021 log4j-api-2.12.1.jar
-rwxrwxr-x 1 sr sr 1674433 Mar 31 2021 log4j-core-2.12.1.jar
-rwxrwxr-x 1 sr sr 23518 Mar 31 2021 log4j-slf4j-impl-2.12.1.jar
(客戶成功)sr@cs03:~/app/flink-1.13.3/bin$./yarn-session.sh
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/home/disk1/sr/app/flink-1.13.3/lib/log4j-slf4j-impl-2.12.1.jar!/org/slf4j/impl/StaticLoggerBinder.class]
......
......
......
2022-04-19 17:23:37,134 INFO org.apache.flink.yarn.YarnClusterDescriptor [] - The configured TaskManager memory is 2728 MB. YARN will allocate 3072 MB to make up an integer multiple of its minimum allocation memory (1024 MB, configured via 'yarn.scheduler.minimum-allocation-mb'). The extra 344 MB may not be used by Flink.
2022-04-19 17:23:37,134 INFO org.apache.flink.yarn.YarnClusterDescriptor [] - Cluster specification: ClusterSpecification{masterMemoryMB=2600, taskManagerMemoryMB=2728, slotsPerTaskManager=2}
2022-04-19 17:23:39,039 INFO org.apache.flink.yarn.YarnClusterDescriptor [] - Submitting application master application_1641365150688_0057
2022-04-19 17:23:39,069 INFO org.apache.hadoop.yarn.client.api.impl.YarnClientImpl [] - Submitted application application_1641365150688_0057
2022-04-19 17:23:39,070 INFO org.apache.flink.yarn.YarnClusterDescriptor [] - Waiting for the cluster to be allocated
2022-04-19 17:23:39,071 INFO org.apache.flink.yarn.YarnClusterDescriptor [] - Deploying cluster, current state ACCEPTED
2022-04-19 17:23:44,097 INFO org.apache.flink.yarn.YarnClusterDescriptor [] - YARN application has been deployed successfully.
2022-04-19 17:23:44,159 INFO org.apache.flink.yarn.YarnClusterDescriptor [] - Found Web Interface cs03:38081 of application 'application_1641365150688_0057'.
JobManager Web Interface: http://cs03:38081
1.tidb環境部署
- 下載tidb部署安裝包tidb-community-server-v4.0.9-linux-amd64.tar.gz,下載地址
- 安裝環境
- 使用tidb playground模式啟動tidb單節點測試環境