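Building a Single-Node Kafka Cluster on Alibaba Cloud

阿新 • Published: 2019-01-25

Introduction

Building a single-node Kafka cluster on one Alibaba Cloud ECS server takes the following steps:

Server environment
Installing the JDK
Installing ZooKeeper
Installing Kafka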

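1. Server environment

CPU: 1 core
Memory: 2048 MB (I/O optimized), 1 Mbps bandwidth
Operating system: Ubuntu 14.04, 64-bit
The server performs quite well (and no, this is not an ad for Alibaba).
I sent a small amount of test data to Kafka; the performance chart looked like this:
[Figure: server performance chart]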

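2. Installing the JDK

Running Java programs requires a JDK. I used JDK 1.7.
The basic steps are:

1. Download the JDK tar.gz package from the official JDK site;
2. Upload the tar package to /opt/JDK on the server;
3. Extract the tar package;
4. Edit /etc/profile and append the lines shown below (note: on a Mac you need sudo su to do this with root privileges);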

 cd /etc
 vim profile
 # Then append the following lines to the end of the file:
 export JAVA_HOME=/opt/JDK/jdk1.7.0_79
 export PATH=$JAVA_HOME/bin:$PATH
 export CLASSPATH=.:$JAVA_HOME/lib/dt.jar:$JAVA_HOME/lib/tools.jar

After the change, the profile file looks like this:

# /etc/profile: system-wide .profile file for the Bourne shell (sh(1)) 

# and Bourne compatible shells (bash(1), ksh(1), ash(1), ...).

if [ "$PS1" ]; then
  if [ "$BASH" ] && [ "$BASH" != "/bin/sh" ]; then
    # The file bash.bashrc already sets the default PS1.
    # PS1='\h:\w\$ '
    if [ -f /etc/bash.bashrc ]; then
      . /etc/bash.bashrc
    fi
  else
    if [ "`id -u`" -eq 0 ]; then
      PS1='# '
    else
      PS1='$ '
    fi
  fi
fi

# The default umask is now handled by pam_umask.
# See pam_umask(8) and /etc/login.defs.

if [ -d /etc/profile.d ]; then
  for i in /etc/profile.d/*.sh; do
    if [ -r $i ]; then
      . $i
    fi
  done
  unset i
fi

export JAVA_HOME=/opt/JDK/jdk1.7.0_79
export PATH=$JAVA_HOME/bin:$PATH
export CLASSPATH=.:$JAVA_HOME/lib/dt.jar:$JAVA_HOME/lib/tools.jar

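5. Press ESC, type ":wq", and press Enter to save the file;
6. Run java -version to check that the settings took effect (if they did not, you may need to log in again).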
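
The profile edit can also be scripted. The following is a minimal sketch, assuming the JDK was unpacked to /opt/JDK/jdk1.7.0_79 as above. The function name append_java_env is hypothetical; it appends the three export lines only when the file does not already set JAVA_HOME, so running it twice does no harm.

```shell
#!/bin/sh
# append_java_env FILE JDK_DIR
# Hypothetical helper: appends the JDK exports to FILE unless FILE
# already contains an "export JAVA_HOME=" line.
append_java_env() {
    profile_file=$1
    jdk_dir=$2
    if grep -q '^export JAVA_HOME=' "$profile_file" 2>/dev/null; then
        return 0    # already configured; leave the file untouched
    fi
    {
        printf 'export JAVA_HOME=%s\n' "$jdk_dir"
        # Single quotes keep $JAVA_HOME and $PATH literal in the file.
        printf '%s\n' 'export PATH=$JAVA_HOME/bin:$PATH'
        printf '%s\n' 'export CLASSPATH=.:$JAVA_HOME/lib/dt.jar:$JAVA_HOME/lib/tools.jar'
    } >> "$profile_file"
}

# Intended use, as root:
#   append_java_env /etc/profile /opt/JDK/jdk1.7.0_79
#   . /etc/profile      # reload the settings in the current shell
#   java -version
```

Sourcing /etc/profile in the current shell is usually enough to pick up the new variables without logging in again.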

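3. Installing ZooKeeper

Kafka relies on ZooKeeper to elect leaders and to store topic metadata, so running Kafka also requires a ZooKeeper environment.
The steps to set up ZooKeeper are:

1. Download the tar.gz package from the official site;
2. Upload the tar.gz package to /opt/zookeeper on the Alibaba Cloud server;
3. Run tar -zxvf *.tar.gz to extract it;
4. Go into the conf directory under the extracted ZooKeeper directory;
5. Rename zoo_sample.cfg to zoo.cfg (or copy it, which keeps the original as a backup);
6. Edit zoo.cfg as needed; the defaults also work unchanged;
7. Start ZooKeeper.

The commands for steps 3-7 are: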

cd /opt/zookeeper
tar -zxvf zookeeper-3.4.6.tar.gz
cd zookeeper-3.4.6/conf
cp zoo_sample.cfg zoo.cfg
cd ..
# Start ZooKeeper
./bin/zkServer.sh start
# Stop ZooKeeper (when needed)
./bin/zkServer.sh stop

Afterwards, run ps -ef | grep zookeeper to check whether ZooKeeper started successfully.
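
The ps check only shows that a Java process exists. A slightly stronger check is zkServer.sh status, which asks the running server for its mode. The sketch below uses a hypothetical helper name, zk_mode, to extract that mode from the status output; for a single-node setup it should print standalone.

```shell
#!/bin/sh
# zk_mode: reads `zkServer.sh status` output on stdin and prints the
# value of its "Mode:" line (hypothetical helper name). Prints nothing
# if no such line is present, e.g. when the server is not running.
zk_mode() {
    sed -n 's/^Mode: *//p'
}

# Intended use, from the ZooKeeper directory:
#   ./bin/zkServer.sh status 2>/dev/null | zk_mode
```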

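4. Installing Kafka

After the ordeal of the three steps above, we can finally build our own single-node Kafka cluster. (Yes, I am calling a single machine a cluster. Fight me, QAQ.)
The steps for Kafka are:

1. Download the Kafka package. I used kafka_2.11-0.10.1.0.tgz, which is available from the official site;
2. Upload the package to /opt/kafka on the Alibaba Cloud server;
3. Extract the package;
4. Go into the config directory and edit server.properties;
The main changes are: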

# The id of the broker. This must be set to a unique integer for each broker.
broker.id=0
port=9092
host.name=<Alibaba Cloud internal IP address>
advertised.host.name=<Alibaba Cloud externally mapped (public) IP address>

The modified configuration file is shown below:

# Licensed to the Apache Software Foundation (ASF) under one or more
# contributor license agreements.  See the NOTICE file distributed with
# this work for additional information regarding copyright ownership.
# The ASF licenses this file to You under the Apache License, Version 2.0
# (the "License"); you may not use this file except in compliance with
# the License.  You may obtain a copy of the License at
#
#    http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

# see kafka.server.KafkaConfig for additional details and defaults

############################# Server Basics #############################

# The id of the broker. This must be set to a unique integer for each broker.
broker.id=0
port=9092
host.name=<Alibaba Cloud internal IP address>
advertised.host.name=<Alibaba Cloud externally mapped (public) IP address>

# Switch to enable topic deletion or not, default value is false
delete.topic.enable=true

############################# Socket Server Settings #############################

# The address the socket server listens on. It will get the value returned from
# java.net.InetAddress.getCanonicalHostName() if not configured.
#   FORMAT:
#     listeners = security_protocol://host_name:port
#   EXAMPLE:
#     listeners = PLAINTEXT://your.host.name:9092
#listeners=PLAINTEXT://:9092

# Hostname and port the broker will advertise to producers and consumers. If not set,
# it uses the value for "listeners" if configured.  Otherwise, it will use the value
# returned from java.net.InetAddress.getCanonicalHostName().
#advertised.listeners=PLAINTEXT://your.host.name:9092

# The number of threads handling network requests
num.network.threads=3

# The number of threads doing disk I/O
num.io.threads=8

# The send buffer (SO_SNDBUF) used by the socket server
socket.send.buffer.bytes=102400

# The receive buffer (SO_RCVBUF) used by the socket server
socket.receive.buffer.bytes=102400

# The maximum size of a request that the socket server will accept (protection against OOM)
socket.request.max.bytes=104857600


############################# Log Basics #############################

# A comma seperated list of directories under which to store log files
log.dirs=/tmp/kafka-logs

# The default number of log partitions per topic. More partitions allow greater
# parallelism for consumption, but this will also result in more files across
# the brokers.
num.partitions=1

# The number of threads per data directory to be used for log recovery at startup and flushing at shutdown.
# This value is recommended to be increased for installations with data dirs located in RAID array.
num.recovery.threads.per.data.dir=1

############################# Log Flush Policy #############################

# Messages are immediately written to the filesystem but by default we only fsync() to sync
# the OS cache lazily. The following configurations control the flush of data to disk.
# There are a few important trade-offs here:
#    1. Durability: Unflushed data may be lost if you are not using replication.
#    2. Latency: Very large flush intervals may lead to latency spikes when the flush does occur as there will be a lot of data to flush.
#    3. Throughput: The flush is generally the most expensive operation, and a small flush interval may lead to exceessive seeks.
# The settings below allow one to configure the flush policy to flush data after a period of time or
# every N messages (or both). This can be done globally and overridden on a per-topic basis.

# The number of messages to accept before forcing a flush of data to disk
#log.flush.interval.messages=10000

# The maximum amount of time a message can sit in a log before we force a flush
#log.flush.interval.ms=1000

############################# Log Retention Policy #############################

# The following configurations control the disposal of log segments. The policy can
# be set to delete segments after a period of time, or after a given size has accumulated.
# A segment will be deleted whenever *either* of these criteria are met. Deletion always happens
# from the end of the log.

# The minimum age of a log file to be eligible for deletion
log.retention.hours=168

# A size-based retention policy for logs. Segments are pruned from the log as long as the remaining
# segments don't drop below log.retention.bytes.
#log.retention.bytes=1073741824

# The maximum size of a log segment file. When this size is reached a new log segment will be created.
log.segment.bytes=1073741824

# The interval at which log segments are checked to see if they can be deleted according
# to the retention policies
log.retention.check.interval.ms=300000

############################# Zookeeper #############################

# Zookeeper connection string (see zookeeper docs for details).
# This is a comma separated host:port pairs, each corresponding to a zk
# server. e.g. "127.0.0.1:3000,127.0.0.1:3001,127.0.0.1:3002".
# You can also append an optional chroot string to the urls to specify the
# root directory for all kafka znodes.
zookeeper.connect=localhost:2181

# Timeout in ms for connecting to zookeeper
zookeeper.connection.timeout.ms=6000
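
In Kafka 0.10.x the port, host.name, and advertised.host.name settings still work but are marked deprecated in favor of listeners and advertised.listeners, which appear commented out in the file above. The sketch below is one way to set properties from a script rather than by hand. The helper name set_prop is hypothetical, and the addresses in the usage comments are placeholders, not values from this setup.

```shell
#!/bin/sh
# set_prop FILE KEY VALUE
# Hypothetical helper: replaces the first uncommented "KEY=..." line in
# FILE with "KEY=VALUE", drops any further uncommented duplicates of KEY,
# and appends the line if KEY is not set. Commented lines are left alone.
set_prop() {
    file=$1 key=$2 value=$3
    tmp=$(mktemp) || return 1
    awk -v k="$key" -v v="$value" '
        index($0, k "=") == 1 { if (!done) print k "=" v; done = 1; next }
        { print }
        END { if (!done) print k "=" v }
    ' "$file" > "$tmp" && cat "$tmp" > "$file"
    status=$?
    rm -f "$tmp"
    return $status
}

# Intended use, from the Kafka directory (placeholder addresses):
#   set_prop config/server.properties listeners PLAINTEXT://<internal IP>:9092
#   set_prop config/server.properties advertised.listeners PLAINTEXT://<public IP>:9092
```

Writing the result back with cat rather than mv keeps the original file's owner and permissions.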

5. Start Kafka:

nohup ./bin/kafka-server-start.sh config/server.properties >  /dev/null 2>&1 &

6. Verify that Kafka started successfully:
Run jps and check that a process named Kafka is listed.
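
To make this check scriptable, the sketch below scans jps output for the broker's main class, Kafka, and for ZooKeeper's, QuorumPeerMain. The helper name has_java_process is hypothetical, and it reads the jps output from standard input.

```shell
#!/bin/sh
# has_java_process NAME: succeeds if the jps output on stdin contains a
# line of the form "<pid> NAME" (hypothetical helper name).
has_java_process() {
    awk -v name="$1" '$2 == name { found = 1 } END { exit !found }'
}

# Intended use on the server:
#   jps | has_java_process Kafka          && echo "Kafka broker is running"
#   jps | has_java_process QuorumPeerMain && echo "ZooKeeper is running"
```

Matching the second column exactly avoids false positives from other Java processes whose names merely contain "Kafka".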

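5. Pitfalls

1. The hostname, port, and other options must be configured
Bug: ERROR org.apache.kafka.common.errors.InvalidReplicationFactorException: replication factor: 1 larger than available brokers: 0
The message is clear: fewer than one broker is available, which means the Kafka process either never started or has crashed.
Fix: (1) run jps to check whether a Kafka process exists; (2) restart Kafka.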
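2. Cannot bind to the given address
Bug: Socket server failed to bind to xxx.xxx.xxx.xxx:9092: Cannot assign requested address.
When configuring Kafka's address on ECS, never use the external address as the bind address, for example 139.225.155.153 (a made-up value). The bind fails because, inside Alibaba Cloud, the broker looks for that address on the internal network and does not find it. Set the bind address (host.name) to 127.0.0.1, which is recognized as the local machine, or to the internal address, and use the externally mapped address only as the advertised address (advertised.host.name).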
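
One way to tell in advance whether the broker can bind to an address is to check that the address is actually assigned to a local network interface. The sketch below assumes the ip tool from iproute2 is installed; ip_is_local is a hypothetical helper that reads the output of ip -4 -o addr show from standard input.

```shell
#!/bin/sh
# ip_is_local ADDRESS: succeeds if ADDRESS appears as an
# "inet ADDRESS/<prefix>" entry in the `ip -4 -o addr show` output
# supplied on stdin (hypothetical helper name).
ip_is_local() {
    # -F treats the dots in ADDRESS literally instead of as regex wildcards.
    grep -qF "inet $1/"
}

# Intended use on the ECS instance (the address is a placeholder):
#   ip -4 -o addr show | ip_is_local 10.0.0.5 \
#       || echo "not assigned to this machine; do not use it as host.name"
```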
3. Host name misconfiguration
Bug: java.net.UnknownHostException: <hostname>: <hostname>

Caused by: java.net.UnknownHostException: iZuf6gsbgu35znsy7ve3s6x: iZuf6gsbgu35znsy7ve3s6x
    at java.net.InetAddress.getLocalHost(InetAddress.java:1475)
    at kafka.network.RequestChannel$.<init>(RequestChannel.scala:40)
    at kafka.network.RequestChannel$.<clinit>(RequestChannel.scala)
    ... 10 more
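
The trace shows InetAddress.getLocalHost() failing, which happens when the machine's own hostname cannot be resolved. The fix used here is not recorded; a common remedy, offered as an assumption, is to map the hostname to an address in /etc/hosts. The helper ensure_hosts_entry below is hypothetical and appends the mapping only if the hostname is not already listed.

```shell
#!/bin/sh
# ensure_hosts_entry FILE ADDRESS HOSTNAME
# Hypothetical helper: appends "ADDRESS HOSTNAME" to FILE unless some
# non-comment line already lists HOSTNAME as one of its names.
ensure_hosts_entry() {
    hosts_file=$1 address=$2 host=$3
    if awk -v h="$host" '
            /^[ \t]*#/ { next }
            { for (i = 2; i <= NF; i++) if ($i == h) found = 1 }
            END { exit !found }
        ' "$hosts_file"; then
        return 0    # hostname already mapped; nothing to do
    fi
    printf '%s %s\n' "$address" "$host" >> "$hosts_file"
}

# Intended use, as root (the address is a placeholder):
#   ensure_hosts_entry /etc/hosts <internal IP> "$(hostname)"
```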

4. External clients cannot consume from Kafka

21:45:58,162 DEBUG Selector:365 - Connection with /168.221.153.152 disconnected
java.net.ConnectException: Connection refused
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:739)
    at org.apache.kafka.common.network.PlaintextTransportLayer.finishConnect(PlaintextTransportLayer.java:51)
    at org.apache.kafka.common.network.KafkaChannel.finishConnect(KafkaChannel.java:73)
    at org.apache.kafka.common.network.Selector.pollSelectionKeys(Selector.java:323)
    at org.apache.kafka.common.network.Selector.poll(Selector.java:291)
    at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:260)
    at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:236)
    at org.apache.kafka.clients.producer.internals.Sender.run(Sender.java:135)
    at java.lang.Thread.run(Thread.java:745)
21:45:58,162 DEBUG NetworkClient:463 - Node -1 disconnected.

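6. Other

For the details of the Kafka configuration file, how to build a multi-node Kafka cluster, common Kafka commands, writing a simple Kafka demo, and writing a Kafka Streams example, see the other parts of this Kafka series.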
